Listing entries tagged with google
subtitles and the future of reading
02.22.2006, 6:57 PM
After enduring a weeks-long PR pummeling for its dealings in China, Google is hard at work to improve its image in the world, racking up some points for good after slipping briefly into evil. Recently they launched Google.org: a website for the Google Foundation, the corporation's philanthropic arm and central office of evil mitigation. Paying a visit to the site, the disillusioned among us will be pleased to find that the foundation is already sponsoring a handful of worthy initiatives, along with a grants program that donates free web advertising to nonprofit organizations. And just in case we were concerned that Google might not apply its techno-capitalist wizardry to altruism as zealously as to making profit, they just announced today they've named a new director for the foundation by the name of -- no joke -- Dr. Brilliant. So it seems the world is in capable hands.
One project in particular caught my eye in light of recent discussions about screen-based reading and genre-blending visions of the book. Planet Read is an organization that promotes literacy in India through Same Language Subtitling -- a simple but apparently effective technique for building basic reading skills, taking popular visual entertainment like Bollywood movies and adding subtitles in English and Hindi along the bottom of the screen. A number of samples (sadly no Bollywood, just videos or photo montages set to Indian folk songs) can be found on Google Video. Here's one that I particularly liked:
Watching the video -- managing the interplay between moving text and moving pictures -- I began to wonder whether there are possibly some clues to be mined here about the future of reading. Yes, Planet Read is designed first and foremost to train basic alphabetic literacy, turning a captive audience into a captive classroom. But in doing so, might it not also be nurturing another kind of literacy?
The problem with contemporary discussions about the future of the book is that they are mired -- for cultural and economic reasons -- in a highly inflexible conception of what a book can be. People who grew up with print tend to assume that going digital is simply a matter of switching containers (with a few enhancements thrown in the mix), failing to consider how the actual content of books might change, or how the act of reading -- which increasingly takes place in a dynamic visual context -- may eventually demand a more dynamic kind of text.
Blurring the lines between text and visual media naturally makes us uneasy because it points to a future that quite literally (for us dinosaurs at least) could be unreadable. But kids growing up today, in India or here in the States, are already highly accustomed to reading in screen-based environments, and so they probably have a somewhat different idea of what reading is. For them, text is likely just one ingredient in a complex combinatory medium.
Another example: Nochnoi Dozor (translated "Night Watch") is a film that has widely been credited as the first Russian blockbuster of the post-Soviet era -- an adrenaline-pumping, special effects-infused, sci-fi vampire epic made entirely by Russians, on Russian soil and on Russian themes (it's based on a popular trilogy of novels). When it was released about a year and a half ago it shattered domestic box office records previously held by Western hits like Titanic and Lord of the Rings. Just about a month ago, the sequel "Day Watch" shattered the records set by "Night Watch."
While highly derivative of western action movies, Nochnoi Dozor is moody, raucous and darkly gorgeous, giving a good, gritty feel of contemporary Moscow. Its plot grows rickety in places, and sometimes things are downright incomprehensible (even, I'm told, with fluent Russian), so I'm skeptical about its prospects on this side of the globe. But goshdarnit, Russians can't seem to get enough of it -- so in an effort to lure American audiences over to this uniquely Russian gothic thriller, start building a brand out of the projected trilogy (and presumably pave the way for the eventual crossover to Hollywood of director Timur Bekmambetov), Fox Searchlight just last week rolled the film out in the U.S. on a very limited release.
What could this possibly have to do with the future of reading? Well, naturally the film is subtitled, and we all know how subtitles are the kiss of death for a film in the U.S. market (Passion of the Christ notwithstanding). But the marketers at Fox are trying something new with Nochnoi Dozor. No, they weren't foolish enough to dub it, which would have robbed the film of the scratchy, smoke-scarred Moscow voices that give it so much of its texture. What they've done is play with the subtitles themselves, making them more active and responsive to the action in the film (sounds like some Flash programmer had a field day...). Here's a description from an article in the NY Times (unfortunately now behind a pay wall):
...[the words] change color and position on the screen, simulate dripping blood, stutter in emulation of a fearful query, or dissolve into red vapor to emulate a character's gasping breaths.
And this from Anthony Lane's review in the latest New Yorker:
...the subtitles, for instance, are the best I have encountered. Far from palely loitering at the foot of the screen, they lurk in odd corners of the frame and, at one point, glow scarlet and then spool away, like blood in water. I trust that this will start a technical trend and that, from here on, no respectable French actress will dream of removing her clothes unless at least three lines of dialogue can be made to unwind across her midriff.
It might seem strange to think of subtitling of foreign films as a harbinger of future reading practices. But then, with the increasing popularity of Asian cinema, and continued cross-pollination between comics and film, it's not crazy to suspect that we'll be seeing more of this kind of textual-visual fusion in the future.
Most significant is the idea that the text can itself be an actor in a performance: a frontier that has only barely been explored -- though typography enthusiasts will likely pillory me for saying so.
Posted by ben vershbow at 06:57 PM
tags: animation , books , cinema , digital_literature , ebooks , film , flash , google , google_video , india , language , literacy , reading , russia , subtitles , translation , typography , video
google gets mid-evil
01.30.2006, 3:46 PM
At the World Economic Forum in Davos last Friday, Google CEO Eric Schmidt assured a questioner in the audience that his company had in fact thoroughly searched its soul before deciding to roll out a politically sanitized search engine in China:
We concluded that although we weren't wild about the restrictions, it was even worse to not try to serve those users at all... We actually did an evil scale and decided not to serve at all was worse evil.
(via Ditherati)
Posted by ben vershbow at 03:46 PM
tags: Libraries, Search and the Web , Network_Freedom , censorship , china , evil , free_speech , google , internet , search , web
illusions of a borderless world
01.27.2006, 3:57 PM
A number of influential folks around the blogosphere are reluctantly endorsing Google's decision to play by China's censorship rules on its new Google.cn service -- what one local commentator calls a "eunuch version" of Google.com. Here's a sampler of opinions:
Ethan Zuckerman ("Google in China: Cause For Any Hope?"):
It’s a compromise that doesn’t make me happy, that probably doesn’t make most of the people who work for Google very happy, but which has been carefully thought through...In launching Google.cn, Google made an interesting decision - they did not launch versions of Gmail or Blogger, both services where users create content. This helps Google escape situations like the one Yahoo faced when the Chinese government asked for information on Shi Tao, or when MSN pulled Michael Anti’s blog. This suggests to me that Google’s willing to sacrifice revenue and market share in exchange for minimizing situations where they’re asked to put Chinese users at risk of arrest or detention... This, in turn, gives me some cause for hope.
Rebecca MacKinnon ("Google in China: Degrees of Evil"):
At the end of the day, this compromise puts Google a little lower on the evil scale than many other internet companies in China. But is this compromise something Google should be proud of? No. They have put a foot further into the mud. Now let's see whether they get sucked in deeper or whether they end up holding their ground.
David Weinberger ("Google in China"):
If forced to choose — as Google has been — I'd probably do what Google is doing. It sucks, it stinks, but how would an information embargo help? It wouldn't apply pressure on the Chinese government. Chinese citizens would not be any more likely to rise up against the government because they don't have access to Google. Staying out of China would not lead to a more free China.
Doc Searls ("Doing Less Evil, Possibly"):
I believe constant engagement — conversation, if you will — with the Chinese government, beats picking up one's very large marbles and going home. Which seems to be the alternative.
Much as I hate to say it, this does seem to be the sensible position -- not unlike opposing America's embargo of Cuba. The logic goes that isolating Castro only serves to further isolate the Cuban people, whereas exposure to the rest of the world -- even restricted and filtered -- might, over time, loosen the state's monopoly on civic life. Of course, you might say that trading Castro for globalization is merely an exchange of one tyranny for another. But what is perhaps more interesting to ponder right now, in the wake of Google's decision, is the palpable melancholy felt in the comments above. What does it reveal about what we assume -- or used to assume -- about the internet and its relationship to politics and geography?
A favorite "what if" of recent history is what might have happened in the Soviet Union had it lasted into the internet age. Would the Kremlin have managed to secure its virtual borders? Or censor and filter the net into a state-controlled intranet -- a Union of Soviet Socialist Networks? Or would the decentralized nature of the technology, mixed with the cultural stirrings of glasnost, have toppled the totalitarian state from beneath?
Ten years ago, in the heady early days of the internet, most would probably have placed their bets against the Soviets. The Cold War was over. Some even speculated that history itself had ended, that free-market capitalism and democracy, on the wings of the information revolution, would usher in a long era of prosperity and peace. No borders. No limits.

"Jingjing" and "Chacha." Internet police officers from the city of Shenzhen who float over web pages and monitor the cyber-traffic of local users.
It's interesting now to see how exactly the opposite has occurred. Bubbles burst. Towers fell. History, as we now realize, did not end, it was merely on vacation; while the utopian vision of the internet -- as a placeless place removed from the inequities of the physical world -- has all but evaporated. We realize now that geography matters. Concrete features have begun to crystallize on this massive information plain: ports, gateways and customs houses erected, borders drawn. With each passing year, the internet comes more and more to resemble a map of the world.
Those of us tickled by the "what if" of the Soviet net now have ourselves a plausible answer in China, who, through a stunning feat of pipe control -- a combination of censoring filters, on-the-ground enforcement, and general peering over the shoulders of its citizens -- has managed to create a heavily restricted local net in its own image. Barely a decade after the fall of the Iron Curtain, we have the Great Firewall of China.
And as we've seen this week, and in several highly publicized instances over the past year, the virtual hand of the Chinese government has been substantially strengthened by Western technology companies willing to play by local rules so as not to be shut out of the explosive Chinese market. Tech giants like Google, Yahoo!, and Cisco Systems have proved only too willing to abide by China's censorship policies, blocking certain search returns and politically sensitive terms like "Taiwanese democracy," "multi-party elections" or "Falun Gong". They also specialize in precision bombing, sometimes removing the pages of specific users at the government's bidding. The most recent incident came just after New Year's when Microsoft acquiesced to government requests to shut down the MSN Spaces blog of popular muckraking blogger Zhao Jing, aka Michael Anti.

One of many angry responses that circulated the non-Chinese net in the days that followed.
We tend to forget that the virtual is built of physical stuff: wires, cable, fiber -- the pipes. Whoever controls those pipes, be it governments or telecoms, has the potential to control what passes through them. The result is that the internet comes in many flavors, depending in large part on where you are logging in. As Jack Goldsmith and Timothy Wu explain in an excellent article in Legal Affairs (adapted from their forthcoming book Who Controls the Internet?: Illusions of a Borderless World), China, far from being the boxed-in exception to an otherwise borderless net, is actually just the uglier side of a global reality. The net has been mapped out geographically into "a collection of nation-state networks," each with its own politics, social mores, and consumer appetites. The very same technology that enables Chinese authorities to write the rules of their local net enables companies around the world to target advertising and gear services toward local markets. Goldsmith and Wu:
...information does not want to be free. It wants to be labeled, organized, and filtered so that it can be searched, cross-referenced, and consumed....Geography turns out to be one of the most important ways to organize information on this medium that was supposed to destroy geography.
Who knows? When networked devices truly are ubiquitous and can pinpoint our location wherever we roam, the internet could be censored or tailored right down to the individual level (like the empire in Borges' fable that commissions a one-to-one map of its territory that upon completion perfectly covers every corresponding inch of land like a quilt).
The case of Google, while by no means unique, serves well to illustrate how threadbare the illusion of the borderless world has become. The company's famous credo, "don't be evil," just doesn't hold up in the messy, complicated real world. "Choose the lesser evil" might be more appropriate. Also crumbling upon contact with air is Google's famous mission, "to make the world's information universally accessible and useful," since, as we've learned, Google will actually vary the world's information depending on where in the world it operates.
Google may be behaving responsibly for a corporation, but it's still a corporation, and corporations, in spite of well-intentioned employees, some of whom may go to great lengths to steer their company onto the righteous path, are still ultimately built to do one thing: get ahead. Last week in the States, the get-ahead impulse happened to be consonant with our values. Not wanting to spook American users, Google chose to refuse a Dept. of Justice request for search records to aid its anti-pornography crackdown. But this week, not wanting to ruffle the Chinese government, Google compromised and became an agent of political repression. "Degrees of evil," as Rebecca MacKinnon put it.
The great irony is that technologies we romanticized as inherently anti-tyrannical have turned out to be powerful instruments of control, highly adaptable to local political realities, be they state or market-driven. Not only does the Chinese government use these technologies to suppress democracy, it does so with the help of its former Cold War adversary, America -- or rather, the corporations that in a globalized world are the de facto co-authors of American foreign policy. The internet is coming of age and with that comes the inevitable fall from innocence. Part of us desperately wanted to believe Google's silly slogans because they said something about the utopian promise of the net. But the net is part of the world, and the world is not so simple.
Posted by ben vershbow at 03:57 PM
tags: ISP , Libraries, Search and the Web , Network_Freedom , broadband , capitalism , china , cyberspace , democracy , evil , falun_gong , free_speech , geography , globalization , glocalization , good , google , human_rights , search , spectrum , technology
cheney and google
01.21.2006, 6:27 PM
(this is a follow-up to ben's recent post "the book is reading you.")
i rarely read Maureen Dowd but the headline of her column in today's New York Times, "Googling past the Graveyard," caught my attention. Dowd calls Dick Cheney on the carpet for asking Google to release the search records of U.S. citizens. while i'm horrified that the govt. would even consider asking for such information, i'm concerned that the way this particular issue is playing out, Google is being portrayed as the poor beleaguered neutral entity caught between an over-reaching bureaucracy and its citizens. Cheney will expire eventually. in the meantime Google will collect even more data. Google is a very big corporation, whose power will grow over time. in the long run, why aren't people outraged that this information is in Google's hands in the first place? shouldn't we be?
Posted by bob stein at 06:27 PM
tags: Libraries, Search and the Web , cheney , google , government , privacy
the book is reading you
01.19.2006, 1:42 PM
I just noticed that Google Book Search requires users to be logged in on a Google account to view pages of copyrighted works.
They provide the following explanation:
Why do I have to log in to see certain pages? Because many of the books in Google Book Search are still under copyright, we limit the amount of a book that a user can see. In order to enforce these limits, we make some pages available only after you log in to an existing Google Account (such as a Gmail account) or create a new one. The aim of Google Book Search is to help you discover books, not read them cover to cover, so you may not be able to see every page you're interested in.
So they're tracking how much we've looked at and capping our number of page views. Presumably a bone tossed to publishers, who I'm sure will continue suing Google all the same (more on this here). There's also the possibility that publishers have requested information on who's looking at their books -- geographical breakdowns and stats on click-throughs to retailers and libraries. I doubt, though, that Google would share this sort of user data. Substantial privacy issues aside, that's valuable information they want to keep for themselves.
That's because "the aim of Google Book Search" is also to discover who you are. It's capturing your clickstreams, analyzing what you've searched and the terms you've used to get there. The book is reading you. Substantial privacy issues aside (it seems more and more that's where we'll be leaving them), Google will use this data to refine its search algorithms and, who knows, might even develop some sort of personalized recommendation system similar to Amazon's -- you know, where the computer lists other titles that might interest you based on what you've read, bought or browsed in the past (a system that works only if you are logged in). It's possible Google is thinking of Book Search as the cornerstone of a larger venture that could compete with Amazon.
There are many ways Google could eventually capitalize on its books database -- that is, beyond the contextual advertising that is currently its main source of revenue. It might turn the scanned texts into readable editions, hammer out licensing agreements with publishers, and become the world's biggest ebook store. It could start a print-on-demand service -- a Xerox machine on steroids (and the return of Google Print?). It could work out deals with publishers to sell access to complete online editions -- a searchable text to go along with the physical book -- as Amazon announced it will do with its Upgrade service. Or it could start selling sections of books -- individual pages, chapters etc. -- as Amazon has also planned to do with its Pages program.
Amazon has long served as a valuable research tool for books in print, so much so that some university library systems are now emulating it. Recent additions to the Search Inside the Book program such as concordances, interlinked citations, and statistically improbable phrases (where distinctive terms in the book act as machine-generated tags) are especially fun to play with. Although first and foremost a retailer, Amazon feels more and more like a search system every day (and its A9 engine, though seemingly always on the back burner, is also developing some interesting features). On the flip side Google, though a search system, could start feeling more like a retailer. In either case, you'll have to log in first.
Posted by ben vershbow at 01:42 PM
tags: Copyright and Copyleft , Libraries, Search and the Web , POD , amazon , books , e-commerce , e-publishing , ebooks , google , google_book_search , google_print , internet , print_on_demand , privacy , publishing , search , web
.tv
01.09.2006, 6:15 PM
People have been talking about internet television for a while now. But Google and Yahoo's unveiling of their new video search and subscription services last week at the Consumer Electronics Show in Las Vegas seemed to make it real.
Sifting through the predictions and prophecies that subsequently poured forth, I stumbled on something sort of interesting -- a small concrete discovery that helped put some of this in perspective. Over the weekend, Slate Magazine quietly announced its partnership with "meaningoflife.tv," a web-based interview series hosted by Robert Wright, author of Nonzero and The Moral Animal, dealing with big questions at the perilous intersection of science and religion.
Launched last fall (presumably in response to the intelligent design fracas), meaningoflife.tv is a web page featuring a playlist of video interviews with an intriguing roster of "cosmic thinkers" -- philosophers, scientists and religious types -- on such topics as "Direction in evolution," "Limits in science," and "The Godhead."
This is just one of several experiments in which Slate is fiddling with its text-to-media ratio. Today's Pictures, a collaboration with Magnum Photos, presents a daily gallery of images and audio-photo essays, recalling both the heyday of long-form photojournalism and a possible future of hybrid documentary forms. One problem is that it's not terribly easy to find these projects on Slate's site. The Magnum page has an ad tucked discreetly on the sidebar, but meaningoflife.tv seems to have disappeared from the front page after a brief splash this weekend. For a born-digital publication that has always thought of itself in terms of the web, Slate still suffers from a pretty appalling design, with its small headline area capping a more or less undifferentiated stream of headlines and teasers.
Still, I'm intrigued by these collaborations, especially in light of the forecast TV-net convergence. While internet TV seems to promise fragmentation, these projects provide a comforting dose of coherence -- a strong editorial hand and a conscious effort to grapple with big ideas and issues, like the reassuringly nutritious programming of PBS or the BBC. It's interesting to see text-based publications moving now into the realm of television. As Tivo, on demand, and now, the internet atomize TV beyond recognition, perhaps magazines and newspapers will fill part of the void left by channels.
Limited as it may now seem, traditional broadcast TV can provide us with valuable cultural touchstones, common frames of reference that help us speak a common language about our culture. That's one thing I worry we'll lose as the net blows broadcast media apart. Then again, even in the age of five gazillion cable channels, we still have our water-cooler shows, our mega-hits, our television "events." And we'll probably have them on the internet too, even when "by appointment" television is long gone. We'll just have more choice regarding where, when and how we get at them. Perhaps the difference is that in an age of fragmentation, we view these touchstone programs with a mildly ironic awareness of their mainstream status, through the multiple lenses of our more idiosyncratic and infinitely gratified niche affiliations. They are islands of commonality in seas of specialization. And maybe that makes them all the more refreshing. Shows like "24," "American Idol," or a Ken Burns documentary, or major sporting events like the World Cup or the Olympics that draw us like prairie dogs out of our niches. Coming up for air from deep submersion in our self-tailored, optional worlds.
Posted by ben vershbow at 06:15 PM
tags: Publishing, Broadcast, and the Press , TV , broadband , broadcast , documentary , google , internet , journalism , media , media_consumption , multimedia , network , photography , religion , science , slate , television , yahoo
useful rss
01.04.2006, 1:58 PM
Hi. I'm Jesse, the latest member to join the staff here at the Institute. I'm interested in network effects, online communities, and emergent behavior. Right now I'm interested in the tools we have available to control and manipulate RSS feeds. My goal is to collect a wide variety of feeds and tease out the threads that are important to me. In my experience, mechanical aggregation gives you quantity and diversity, but not quality and focus. So I did a quick investigation of the tools that exist to manage and manipulate feeds.
Sites like MetaFilter and Technorati skim the most popular topics in the blogosphere.
But what sort of tools exist to help us narrow our focus? There are two tools that we can use right now: tag searches/filtering, and keyword searching. Tag searches (on Technorati) and tag filtering (on MetaFilter) drill down to specific areas, like "books" or "books and publishing." A casual search on MetaFilter was a complete failure, but Technorati, with its combination of tags and keyword search results, produced good material.
There is also the Google Blog search. As Google puts it, you can 'find blogs on your favorite topics.' PageRank works, so PageRank applied to blogs should work too. Unfortunately it results in too many pages that, while higher ranked in the whole set of the Internet, either fail to be on topic or exist outside of the desired sub-spheres of a topic. For example, I searched for "gourmet food" and found one of the premier food blogs on the fourth page, just below Carpundit. Google blog search fails here because it can't get small enough to understand the relationships in the blogosphere, and relies more heavily on text retrieval algorithms that sabotage the results.
Finally, let's talk about aggregators. There is more human involvement in selecting sites you're interested in reading. This creates a personalized network of sites that are related, if only by your personal interest. The problem is, you get what they want to write about. Managing a large collection of feeds can be tiresome when you're looking for specific information. Bloglines has a search function that allows you to find keywords inside your subscriptions, then treat that as a feed. This neatly combines hand-picked sources with keyword or tag harvesting. The result: a slice from your trusted collection of authors about a specific topic.
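To make the "slice" idea concrete, here is a minimal sketch in Python. It is only an illustration of filtering a hand-picked set of entries by keyword or tag, not a description of how Bloglines actually works; the `Entry` class, the `slice_feed` function and the sample subscriptions are all hypothetical, and a real version would parse live RSS instead of a hard-coded list.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    """One item from a subscribed feed (hypothetical, simplified shape)."""
    source: str
    title: str
    summary: str
    tags: tuple = ()


def slice_feed(entries, keyword):
    """Return the entries whose title, summary, or tags mention the keyword."""
    needle = keyword.lower()
    return [
        e for e in entries
        if needle in e.title.lower()
        or needle in e.summary.lower()
        or any(needle == t.lower() for t in e.tags)
    ]


# Made-up sample data standing in for a hand-picked set of subscriptions.
subscriptions = [
    Entry("food-blog", "Gourmet food on a budget", "Cheap eats.", ("food",)),
    Entry("tech-blog", "RSS tools roundup", "Aggregators compared.", ("rss", "tools")),
    Entry("book-blog", "The future of the book", "Reading on screens.", ("books",)),
    Entry("tech-blog", "Books about RSS", "A short reading list.", ("books", "rss")),
]

# The filtered list can itself be treated as a new, narrower feed.
rss_slice = slice_feed(subscriptions, "rss")
for entry in rss_slice:
    print(entry.source, "-", entry.title)
```

Because the sources are chosen by hand and only the filtering is mechanical, the slice keeps the trust of the original collection while narrowing the topic.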
What can we envision for the future of RSS? Affinity mapping and personalized recommendation systems could augment the tag/keyword search functionality to automatically generate a slice from a small network of trusted blogs. Automatic harvesting of whole swaths of linked entries for offline reading in a bounded hypertext environment. Reposting and remixing feed content on the fly based on text-processing algorithms. And we'll have to deal with the dissolving identity and trust relationships that are a natural consequence of these innovations.
Posted by jesse wilbur at 01:58 PM
tags: RSS , aggregators , blog_search , bloglines , google , tools
Wikipedia to consider advertising
12.30.2005, 4:29 PM
The London Times just published an interview with Wikipedia founder Jimmy Wales in which he entertains the idea of carrying ads. This mention is likely to generate an avalanche of discussion about the commercialization of open-source resources. While i would love to see Wikipedia stay out of the commercial realm, it's just not likely. Yahoo, Google and other big companies are going to commercialize Wikipedia anyway so taking ads is likely to end up a no-brainer. As i mentioned in my comment on Lisa's earlier post, this is going to happen as long as the overall context is defined by capitalist relations. Presuming that the web can be developed in a cooperative, non-capitalist way without fierce competition and push-back from the corporations who control the web's infrastructure seems naive to me.
Posted by bob stein at 04:29 PM
tags: advertising , capitalism , google , open_content , open_source , wikipedia , yahoo
why google and yahoo love wikipedia
12.29.2005, 3:16 PM
From Dan Cohen's excellent Digital Humanities Blog comes a discussion of the Wikipedia story that Cohen claims no one seems to be writing about -- namely, the question of why Google and Yahoo give so much free server space and bandwidth to Wikipedia. Cohen points out that there's more going on here than just the open source ethos of these tech companies: in fact, the two companies are becoming increasingly dependent on Wikipedia as a resource, both as something to repackage for commercial use (in sites such as Answers.com), and as a major component in the programming of search algorithms. Cohen writes:
Let me provide a brief example that I hope will show the value of having such a free resource when you are trying to scan, sort, and mine enormous corpora of text. Let's say you have a billion unstructured, untagged, unsorted documents related to the American presidency in the last twenty years. How would you differentiate between documents that were about George H. W. Bush (Sr.) and George W. Bush (Jr.)? This is a tough information retrieval problem because both presidents are often referred to as just "George Bush" or "Bush." Using data-mining algorithms such as Yahoo's remarkable Term Extraction service, you could pull out of the Wikipedia entries for the two Bushes the most common words and phrases that were likely to show up in documents about each (e.g., "Berlin Wall" and "Barbara" vs. "September 11" and "Laura"). You would still run into some disambiguation problems ("Saddam Hussein," "Iraq," "Dick Cheney" would show up a lot for both), but this method is actually quite a powerful start to document categorization.
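The approach Cohen describes can be sketched in a few lines of Python. This is a toy illustration under loud assumptions: the two reference strings are invented stand-ins for the Wikipedia entries, the function names are hypothetical, single words replace the multi-word phrases in his example, and simple set arithmetic takes the place of a real term-extraction service such as the Yahoo one he mentions.

```python
import re


def tokens(text):
    """Lowercase the text and return its set of word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def distinctive(reference_a, reference_b):
    """Return the terms unique to each reference text (shared terms are dropped)."""
    a, b = tokens(reference_a), tokens(reference_b)
    return a - b, b - a


def classify(document, senior_terms, junior_terms):
    """Label a document by which set of distinctive terms it overlaps more."""
    words = tokens(document)
    senior_score = len(words & senior_terms)
    junior_score = len(words & junior_terms)
    if senior_score == junior_score:
        return "ambiguous"
    return "senior" if senior_score > junior_score else "junior"


# Invented stand-ins for the two encyclopedia entries (assumption, not real data).
SENIOR_ENTRY = "George Bush president Barbara Berlin Wall Iraq Saddam Hussein Dick Cheney"
JUNIOR_ENTRY = "George Bush president Laura September 11 Iraq Saddam Hussein Dick Cheney"

senior_terms, junior_terms = distinctive(SENIOR_ENTRY, JUNIOR_ENTRY)

print(classify("Bush spoke as the Berlin Wall came down", senior_terms, junior_terms))
print(classify("Bush and Laura visited Iraq", senior_terms, junior_terms))
print(classify("Bush met Dick Cheney to discuss Iraq", senior_terms, junior_terms))
```

The third document contains only terms shared by both entries, so it comes back as "ambiguous", which mirrors the disambiguation problem Cohen notes for "Saddam Hussein," "Iraq," and "Dick Cheney."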
Cohen's observation is a valuable reminder that all of the discussion of Wikipedia's accuracy and usefulness as an academic tool is really only skimming the surface of how and why the open-source encyclopedia is reshaping the way knowledge is made and accessed. Ultimately, the question of whether or not Wikipedia should be used in the classroom might be less important than whether -- or how -- it is used in the boardroom, by companies whose function is to repackage, reorganize and return "the people's knowledge" back to the people at a tidy profit.
Posted by lisa lynch at 03:16 PM
| Comments (7)
tags: Libraries, Search and the Web , google , wikipedia , yahoo
last week: wikipedia, r kelly, gaming and google panels, and more...
12.18.2005, 4:27 PM
Here's an overview of what we've been posting over the last week. A few of us have also been talking about ways to graphically represent text, so I thought I would include a mind map of this overview.
As a follow-up on the increasingly controversial Wikipedia front, Daniel Brandt uncovered that Brian Chase posted the false information about John Seigenthaler that was reported here last week. To add fuel to the fire, Nature weighed in with a study suggesting that Encyclopaedia Britannica may not be much more reliable than Wikipedia.
Business Week noted a possible future of pricing for data transfer. Currently, carriers such as phone and cable companies are developing technology to identify and control what types of media (voice, images, text or video) are being uploaded. This ability opens the door to charging for different uses of data transfer, which would have a huge impact on uploading content for personal creative use of the internet.
Liz Barry and Bill Wetzel shared some of their experiences from their "Talk to Me" Project. With their "talk to me" sign in tow, they travel around New York and the rest of the US looking for conversation. We were impressed that they have no specific agenda besides talking to people. In the mediated age, they are not motivated by external political/ religious/ documentary intentions. What they do document is available on their website, and we look forward to seeing what they come up with next.
The Google Book Search debate continues as well, via a panel discussion hosted by the American Bar Association. Interestingly, publishers spoke as if the wide-scale use of ebooks is imminent. More importantly, even if this particular case settles out of court, the courts have a pressing need to define copyright and fair use guidelines for these emerging uses.
With the protest of the WTO meetings in Hong Kong this past week, new journalism forms took one step forward. The website Curbside @ WTO covered the meetings with submissions from journalism students, bloggers and professional journalists.
McDonald's filed a patent which suggests that it intends to offer clips of movies instead of the traditional toys in its kid-oriented Happy Meals. Lisa pondered whether a video clip can successfully replace a toy and, if it does, what the effects on children's imaginations might be.
Dan looked at R. Kelly's experiments in form and the "serial song" through his Trapped in the Closet recordings. While R. Kelly has varying success in this endeavor, Dan compared the experience not only to the serial novel, but also to Julie Powell's foray into transferring her blog into book form, and considered what she might have learned from R. Kelly (it's hard to make unified pieces maintain an overall coherency).
The world of academic publishing was challenged with a proposal to create an electronic academic press. This segment seems especially ripe for the shift to digital publishing as many journals with small circulations face rising printing and production costs.
Sol and others from the institute attended "Making Games Matter," a panel with contributors from The Game Design Reader: A Rules of Play Anthology, edited by Katie Salen and Eric Zimmerman. The discussion covered among other things: involving the academy in creating a discourse for gaming and game design, obstacles in studying and creating games, and the game "industry" itself. The book and panel called out for games and gaming to undergo a formal study akin to the novel and the experience of reading. Also, in the gaming world, the class economics of the real and virtual began to emerge as a Chinese firm pays employees to build up characters in MMOGs to sell to affluent gamers.
Posted by ray cha at 04:27 PM
| Comments (0)
tags: Roundup , academia , broadband , e-publishing , fast_food , gaming , google , google_book_search , internet , mcdonalds , network_neutrality , publishing , r_kelly , video_games , wikipedia
google book search debated at american bar association
12.15.2005, 3:50 PM
Last night I attended a fascinating panel discussion at the American Bar Association on the legality of Google Book Search. In many ways, this was the debate made flesh. Making the case against Google were high-level representatives from the two entities that have brought suit, the Authors' Guild (Executive Director Paul Aiken) and the Association of American Publishers (VP for legal counsel Allan Adler). It would have been exciting if Google, in turn, had sent representatives to make their case, but instead we had two independent commentators, law professor and blogger Susan Crawford and Cameron Stracher, also a law professor and writer. The discussion was vigorous, at times heated -- in many ways a preview of arguments that could eventually be aired (albeit under a much stricter clock) in front of federal judges.
The lawsuits in question center around whether Google's scanning of books and presenting tiny snippet quotations online for keyword searches is, as they claim, fair use. As I understand it, the use in question is the initial scanning of full texts of copyrighted books held in the collections of partner libraries. The fair use defense hinges on this initial full scan being the necessary first step before the "transformative" use of the texts, namely unbundling the book into snippets generated on the fly in response to user search queries.

...in case you were wondering what snippets look like
At first, the conversation remained focused on this question, and during that time it seemed that Google was winning the debate. The plaintiffs' arguments seemed weak and a little desperate. Aiken used carefully scripted language about not being against online book search, just wanting it to be licensed, quipping "we're just throwing a little gravel in the gearbox of progress." Adler was a little more strident, calling Google "the master of misdirection," using the promise of technological dazzlement to turn public opinion against the legitimate grievances of publishers (of course, this will be settled by judges, not by public opinion). He did score one good point, though, saying Google has betrayed the weakness of its fair use claim in the way it has continually revised its description of the program.
Almost exactly one year ago, Google unveiled its "library initiative" only to re-brand it several months later as a "publisher program" following a wave of negative press. This, however, did little to ease tensions and eventually Google decided to halt all book scanning (until this past November) while they tried to smooth things over with the publishers. Even so, lawsuits were filed, despite Google's offer of an "opt-out" option for publishers, allowing them to request that certain titles not be included in the search index. This more or less created an analog to the "implied consent" principle that legitimates search engines caching web pages with "spider" programs that crawl the net looking for new material.
In that case, there is a machine-to-machine communication taking place and web page owners are free to insert programs that instruct spiders not to cache, or can simply place certain content behind a firewall. By offering an "opt-out" option to publishers, Google enables essentially the same sort of communication. Adler's point (and this was echoed more succinctly by a smart question from the audience) was that if Google's fair use claim is so air-tight, then why offer this middle ground? Why all these efforts to mollify publishers without actually negotiating a license? (I am definitely concerned that Google's efforts to quell what probably should have been an anticipated negative reaction from the publishing industry will end up undercutting its legal position.)
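For readers who haven't seen this machine-to-machine exchange up close, here is a minimal sketch using Python's standard `urllib.robotparser` module. The robots.txt contents, the `publisher.example` site and the `ExampleBot` crawler name are all made up for illustration; the point is only that a site owner publishes a small text file and a well-behaved spider checks it before fetching anything.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an imaginary publisher's site:
# every crawler is asked to stay out of the /full-text/ directory.
ROBOTS_TXT = """\
User-agent: *
Disallow: /full-text/
"""


def may_crawl(robots_txt, user_agent, url):
    """Return True if the given robots.txt rules allow user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for path in ("/catalog/some-title", "/full-text/some-title"):
        url = "https://publisher.example" + path
        print(path, may_crawl(ROBOTS_TXT, "ExampleBot", url))
```

Note that the file is a request, not a lock: it works only because crawlers choose to honor it, which is part of why the analogy to a publisher "opt-out" list is suggestive rather than exact.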
Crawford came back with some nice points, most significantly that the publishers were trying to make a pretty egregious "double dip" into the value of their books. Google, by creating a searchable digital index of book texts -- "a card catalogue on steroids," as she put it -- and even generating revenue by placing ads alongside search results, is making a transformative use of the published material and should not have to seek permission. Google had a good idea. And it is an eminently fair use.
And it's not Google's idea alone; they just had it first and are using it to gain a competitive advantage over their search engine rivals, who, in their turn, have tried to get in on the game with the Open Content Alliance (which, incidentally, has decided not to make a stand on fair use as Google has, and is doing all its scanning and indexing in the context of license agreements). Publishers, too, are welcome to build their own databases and to make them crawl-able by search engines. Earlier this week, HarperCollins announced it would be doing exactly that with about 20,000 of its titles. Aiken and Adler say that if anyone can scan books and make a search engine, then all hell will break loose and millions of digital copies will be leaked onto the web. Crawford shot back that this lawsuit is not about net security issues; it is about fair use.
But once the security cat was let out of the bag, the room turned noticeably against Google (perhaps due to a preponderance of publishing lawyers in the audience). Aiken and Adler worked hard to stir up anxiety about rampant ebook piracy, even as Crawford repeatedly tried to keep the discussion on course. It was very interesting to hear, right from the horse's mouth, that the Authors' Guild and AAP both are convinced that the ebook market, tiny as it currently is, is within a few years of exploding, pending the release of some sort of ipod-like gadget for text. At that point, they say, Google will have gained a huge strategic advantage off the back of appropriated content.
Their argument hinges on the fourth determining factor in the fair use exception, which evaluates "the effect of the use upon the potential market for or value of the copyrighted work." So the publishers are suing because Google might be cornering a potential market!!! (Crawford goes further into this in her wrap-up.) Of course, if Google wanted to go into the ebook business using the material in their database, there would have to be a licensing agreement, otherwise they really would be pirating. But the suits are not about a future market, they are about creating a search service, which should be ruled fair use. If publishers are so worried about the future ebook market, then they should start planning for business.
To echo Crawford, I sincerely hope these cases reach the court and are not settled beforehand. Larger concerns about Google's expansionist program aside, I think they have made a very brave stand on the principle of fair use, the essential breathing space carved out within our over-extended copyright laws. Crawford reminded the room that intellectual property is NOT like physical property, over which the owner has nearly unlimited rights. Copyright is a "temporary statutory monopoly" originally granted ("with hesitation," Crawford adds) in order to incentivize creative expression and the production of ideas. The internet scares the old-guard publishing industry because it poses so many threats to the security of their product. These threats are certainly significant, but they are not the subject of these lawsuits, nor are they Google's, or any search engine's, fault. The rise of the net should not become a pretext for limiting or abolishing fair use.
Posted by ben vershbow at 03:50 PM
| Comments (2)
tags: Copyright and Copyleft , Libraries, Search and the Web , copyright , ebooks , fair_use , google , google_book_search , publishing
where we've been, where we're going
12.09.2005, 12:54 PM

This past week at if:book we've been thinking a lot about the relationship between this weblog and the work we do. We decided that while if:book has done a fine job reflecting and provoking the conversations we have at the Institute, we wanted to make sure that it also seems as coherent to our readers as it does to us. With that in mind, we've decided to begin posting a weekly roundup of our blog posts, in which we synthesize (as much as possible) what we've been thinking and talking about from Monday to Friday.
So here goes. This week we spent a lot of time reflecting on simulation and virtuality. In part, this reflection grew out of our collective reading of Thomas de Zengotita's book Mediated, which discusses (among other things) the link between alienation from the "real" through digital mediation and increased solipsism. Bob seemed especially interested in the dialectic relationship between, on one hand, the opportunity for access afforded by ever-more sophisticated forms of simulation, and, on the other, the sense that something must be lost as the encounter with the "real" recedes entirely.
This, in turn, led to further conversation about what we might think of as the "loss of the real" in the transition from books on paper to books on a computer screen. On one hand, there seems to be a tremendous amount of anxiety that Google Book Search might somehow make actual books irrelevant and thus destroy reading and writing practices linked to the bound book. On the other hand, one could take the position of Cory Doctorow that books as objects are overrated, and challenge the idea that a book needs to be physically embodied to be "real."
As the debate over Google Book Search continually reminds us, one of the most challenging things in sifting through discussions of emerging media forms is learning to tell the difference between nostalgia and useful critical insight. Often the two are hopelessly intertwined; in this week's debates about Wikipedia, for example, discussion of how to make the open-source encyclopedia more useful was often tempered by the suggestion that encyclopedias of the past would always be superior to Wikipedia, an assertion easily challenged by a quick browse through some old encyclopedias.
Finally, I want to mention that we've gotten around to setting up a del.icio.us account. There will be a formal link on the blog soon, but you can take a look now. It will expand quickly.
Posted by lisa lynch at 12:54 PM
| Comments (0)
tags: Roundup , book , google , search , simulation , wikipedia
google libraries podcast now available
12.07.2005, 11:33 AM
In case you missed Open Source's Monday hour on Google Book Search... Listen here. Podcast RSS here. Show summary here.
Posted by ben vershbow at 11:33 AM
| Comments (1)
tags: Libraries, Search and the Web , ebook , google , google_book_search , google_print , library , podcast , publishing
google on the air
12.06.2005, 12:34 AM
Open Source's hour on the Googlization of libraries was refreshingly light on the copyright issue and heavier on questions about research, reading, the value of libraries, and the public interest. With its book-scanning project, Google is a private company taking on the responsibilities of a public utility, and Siva Vaidhyanathan came down hard on one of the company's chief legal reps for the mystery shrouding their operations (scanning technology, algorithms and ranking system are all kept secret). The rep reasonably replied that Google is not the only digitization project in town and that none of its library partnerships are exclusive. But most of his points were pretty obvious PR boilerplate about Google's altruism and gosh darn love of books. Hearing the counsel's slick defense, your gut tells you it's right to be suspicious of Google and to keep demanding more transparency, clearer privacy standards and so on. If we're going to let this much information come into the hands of one corporation, we need to be very active watchdogs.
Our friend Karen Schneider then joined the fray and as usual brought her sage librarian's perspective. She's thrilled by the possibilities of Google Book Search, seeing as it solves the fundamental problem of library science: that you can only search the metadata, not the texts themselves. But her enthusiasm is tempered by concerns about privatization similar to Siva's and a conviction that a research service like Google can never replace good librarianship and good physical libraries. She also took issue with the fact that Book Search doesn't link to other library-related search services like Open Worldcat. She has her own wrap-up of the show on her blog.
Rounding out the discussion was Matthew G. Kirschenbaum, a cybertext studies blogger and professor of English at the University of Maryland. Kirschenbaum addressed the question of how Google, and the web in general, might be changing, possibly eroding, our reading practices. He nicely put the question in perspective, suggesting that scattershot, inter-textual, "snippety" reading is in fact the older kind of reading, and that the idea of sustained, deeply immersed involvement with a single text is largely a romantic notion tied to the rise of the novel in the 18th century.
A satisfying hour, all in all, of the sort we should be having more often. It was fun brainstorming with Brendan Greeley, the Open Source "blogger-in-chief," on how to put the show together. Their whole bit about reaching out to the blogosphere for ideas and inspiration isn't just talk. They put their money where their mouth is. I'll link to the podcast when it becomes available.
image: Real Gabinete Português de Literatura, Rio de Janeiro - Claudio Lara via Flickr
Posted by ben vershbow at 12:34 AM
| Comments (2)
tags: Libraries, Search and the Web , copyright , digitization , ebook , google , google_book_search , google_print , library , literature , metadata , reading , search
thinking about google books: tonight at 7 on radio open source
12.05.2005, 4:58 PM
While visiting the Experimental Television Center in upstate New York this past weekend, Lisa found a wonderful relic in a used book shop in Owego, NY -- a small, leatherbound volume from 1962 entitled "Computers," which IBM used to give out as a complimentary item. An introductory note on the opening page reads:
The machines do not think -- but they are one of the greatest aids to the men who do think ever invented! Calculations which would take men thousands of hours -- sometimes thousands of years -- to perform can be handled in moments, freeing scientists, technicians, engineers, businessmen, and strategists to think about using the results.
This echoes Vannevar Bush's seminal 1945 essay on computing and networked knowledge, "As We May Think", which more or less prefigured the internet, web search, and now, the migration of print libraries to the world wide web. Google Book Search opens up fantastic possibilities for research and accessibility, enabling readers to find in seconds what before might have taken them hours, days or weeks. Yet it also promises to transform the very way we conceive of books and libraries, shaking the foundations of major institutions. Will making books searchable online give us more time to think about the results of our research, or will it change the entire way we think? By putting whole books online do we begin the steady process of disintegrating the idea of the book as a bounded whole and not just a sequence of text in a massive database?
The debate thus far has focused too much on the legal ramifications -- helped in part by a couple of high-profile lawsuits from authors and publishers -- failing to take into consideration the larger cognitive, cultural and institutional questions. Those questions will hopefully be given ample air time tonight on Radio Open Source.
Tune in at 7pm ET on local public radio or stream live over the web. The show will also be available later in the week as a podcast.
Posted by ben vershbow at 04:58 PM
| Comments (0)
tags: Libraries, Search and the Web , books , copyright , ebook , google , google_book_search , google_print , library , literature , radio , research , university
the role of note taking in the information age
12.03.2005, 3:19 PM
An article by Ann Blair in a recent issue of Critical Inquiry (vol 31 no 1) discusses the changing conceptions of the function of note-taking from about the sixth century to the present, and ends with a speculation on the way that textual searches (such as Google Book Search) might change practices of note-taking in the twenty-first century. Blair argues that "one of the most significant shifts in the history of note taking" occurred at the beginning of the twentieth century, when the use of notes as memorization aids gave way to the use of notes as an aid to replace the memorization of too-abundant information. With the advent of the net, she notes:
Today we delegate to sources that we consider authoritative the extraction of information on all but a few carefully specialized areas in which we cultivate direct experience and original research. New technologies increasingly enable us to delegate more tasks of remembering to the computer, in that shifting division of labor between human and thing. We have thus mechanized many research tasks. It is possible that further changes would affect even the existence of note taking. At a theoretical extreme, for example, if every text one wanted were constantly available for searching anew, perhaps the note itself, the selection made for later reuse, might play a less prominent role.
The result of this externalization, Blair notes, is that we come to think of long-term memory as something that is stored elsewhere, in "media outside the mind." At the same time, she writes, "notes must be rememorated or absorbed in the short-term memory at least enough to be intelligently integrated into an argument; judgment can only be applied to experiences that are present to the mind."
Blair's article doesn't say that this bifurcation between short-term and long-term memory is a problem: she simply observes it as a phenomenon. But there's a resonance between Blair's article and Naomi Baron's recent Los Angeles Times piece on Google Book Search: both point to the fact that what we commonly have defined as scholarly reflection has increasingly become more and more a process of database management. Baron seems to see reflection and database management as being in tension, though I'm not completely convinced by her argument. Blair, less apocalyptic than Baron, nonetheless gives me something to ponder. What happens to us if (or when) all of our efforts to make the contents of our extrasomatic memory "present to our mind" happen without the mediation of notes? Blair's piece focuses on the epistemology rather than the phenomenology of note taking — still, she leads me to wonder what happens if the mediating function of the note is lost, when the triangular relation between book, scholar and note becomes a relation between database and user.
Posted by lisa lynch at 03:19 PM
| Comments (1)
tags: Libraries, Search and the Web , book , google , internet , note_taking , search
google print on deck at radio open source
12.01.2005, 8:07 AM
Open Source, the excellent public radio program (not to be confused with "Open Source Media") that taps into the blogosphere to generate its shows, has been chatting with me about putting together an hour on the Google library project. Open Source is a unique hybrid, drawing on the best qualities of the blogosphere -- community, transparency, collective wisdom -- to produce an otherwise traditional program of smart talk radio. As host Christopher Lydon puts it, the show is "fused at the brain stem with the world wide web." Or better, it "uses the internet to be a show about the world."
The Google show is set to air live this evening at 7pm (ET) (they also podcast). It's been fun working with them behind the scenes, trying to figure out the right guests and questions for the ideal discussion on Google and its bookish ambitions. My exchange has been with Brendan Greeley, the Radio Open Source "blogger-in-chief" (he's kindly linked to us today on their site). We agreed that the show should avoid getting mired in the usual copyright-focused news peg -- publishers vs. Google etc. -- and focus instead on the bigger questions. At my suggestion, they've invited Siva Vaidhyanathan, who wrote the wonderful piece in the Chronicle of Higher Ed. that I talked about yesterday (see bigger questions). I've also recommended our favorite blogger-librarian, Karen Schneider (who has appeared on the show before), science historian George Dyson, who recently wrote a fascinating essay on Google and artificial intelligence, and a bunch of cybertext studies people: Matthew G. Kirschenbaum, N. Katherine Hayles, Jerome McGann and Johanna Drucker. If all goes well, this could end up being a very interesting hour of discussion. Stay tuned.
UPDATE: Open Source just got a hold of Nicholas Kristof to do an hour this evening on Genocide in Sudan, so the Google piece will be pushed to next week.
Posted by ben vershbow at 08:07 AM
| Comments (0)
tags: Libraries, Search and the Web , Online , copyright , google , google_book_search , google_print , library , open_source , podcast , publishing , radio , radio_open_source , search , web
sober thoughts on google: privatization and privacy
11.30.2005, 8:18 AM
Siva Vaidhyanathan has written an excellent essay for the Chronicle of Higher Education on the "risky gamble" of Google's book-scanning project -- some of the most measured, carefully considered comments I've yet seen on the issue. His concerns are not so much for the authors and publishers that have filed suit (on the contrary, he believes they are likely to benefit from Google's service), but for the general public and the future of libraries. Outsourcing to a private company the vital task of digitizing collections may prove to have been a grave mistake on the part of Google's partner libraries. Siva:
The long-term risk of privatization is simple: Companies change and fail. Libraries and universities last.....Libraries should not be relinquishing their core duties to private corporations for the sake of expediency. Whichever side wins in court, we as a culture have lost sight of the ways that human beings, archives, indexes, and institutions interact to generate, preserve, revise, and distribute knowledge. We have become obsessed with seeing everything in the universe as "information" to be linked and ranked. We have focused on quantity and convenience at the expense of the richness and serendipity of the full library experience. We are making a tremendous mistake.
This essay contains in abundance what has largely been missing from the Google books debate: intellectual courage. Vaidhyanathan, an intellectual property scholar and "avowed open-source, open-access advocate," easily could have gone the predictable route of scolding the copyright conservatives and spreading the Google gospel. But he manages to see the big picture beyond the intellectual property concerns. This is not just about economics, it's about knowledge and the public interest.
What irks me about the usual debate is that it forces you into a position of either resisting Google or being its apologist. But this fails to get at the real bind we all are in: the fact that Google provides invaluable services and yet is amassing too much power; that a private company is creating a monopoly on public information services. Sooner or later, there is bound to be a conflict of interest. That is where we, the Google-addicted public, are caught. It's more complicated than hip versus square, or good versus evil.
Here's another good piece on Google. On Monday, The New York Times ran an editorial by Adam Cohen that nicely lays out the privacy concerns:
Google says it needs the data it keeps to improve its technology, but it is doubtful it needs so much personally identifiable information. Of course, this sort of data is enormously valuable for marketing. The whole idea of "Don't be evil," though, is resisting lucrative business opportunities when they are wrong. Google should develop an overarching privacy theory that is as bold as its mission to make the world's information accessible - one that can become a model for the online world. Google is not necessarily worse than other Internet companies when it comes to privacy. But it should be doing better.
Two graduate students at Stanford in the mid-90s recognized that search engines would be the most important tools for dealing with the incredible flood of information that was then beginning to swell, so they started indexing web pages and working on algorithms. But as the company has grown, Google's admirable-sounding mission statement -- "to organize the world's information and make it universally accessible and useful" -- has become its manifest destiny, and "information" can now encompass the most private of territories.
At one point it simply meant search results -- the answers to our questions. But now it's the questions as well. Google is keeping a meticulous record of our clickstreams, piecing together an enormous database of queries, refining its search algorithms and, some say, even building a massive artificial brain (more on that later). What else might they do with all this personal information? To date, all of Google's services are free, but there may be a hidden cost.
"Don't be evil" may be the company motto, but with its IPO last year, Google adopted a new ideology: they are now a public corporation. If web advertising (their sole source of revenue) levels off, then investors currently high on $400+ shares will start clamoring for Google to maintain profits. "Don't be evil to us!" they will cry. And what will Google do then?
images: New York Public Library reading room by Kalloosh via Flickr; archive of the original Google page
Posted by ben vershbow at 08:18 AM
| Comments (7)
tags: Copyright and Copyleft , Libraries, Search and the Web , books , copyright , ethics , google , google_book_search , google_print , intellectual_property , libraries , library , literature , privacy , publishing , university
flushing the net down the tubes
11.29.2005, 8:11 AM
Grand theories about upheavals on the internet horizon are in ready supply. Singularities are near. Explosions can be expected in the next six to eight months. Or the whole thing might just get "flushed" down the tubes. This last scenario is described at length in a recent essay in Linux Journal by Doc Searls, which predicts the imminent hijacking of the net by phone and cable companies who will turn it into a top-down, one-way broadcast medium. In other words, the net's utopian moment, the "read/write" web, may be about to end. Reading Searls' piece, I couldn't help thinking about the story of radio and a wonderful essay Brecht wrote on the subject in 1932:
Here is a positive suggestion: change this apparatus over from distribution to communication. The radio would be the finest possible communication apparatus in public life, a vast network of pipes. That is to say, it would be if it knew how to receive as well as to transmit, how to let the listener speak as well as hear, how to bring him into a relationship instead of isolating him. On this principle the radio should step out of the supply business and organize its listeners as suppliers....turning the audience not only into pupils but into teachers.
Unless you're the military, law enforcement, or a short-wave hobbyist, two-way radio never happened. On the mainstream commercial front, radio has always been about broadcast: a one-way funnel. The big FM tower to the many receivers, "prettifying public life," as Brecht puts it. Radio as an agitation? As an invitation to a debate, rousing families from the dinner table into a critical encounter with their world? Well, that would have been neat.
Now there's the internet, a two-way, every-which-way medium -- a stage of stages -- that would have positively staggered a provocateur like Brecht. But although the net may be a virtual place, it's built on some pretty actual stuff. Copper wire, fiber optic cable, trunks, routers, packets -- "the vast network of pipes." The pipes are owned by the phone and cable companies -- the broadband providers -- and these guys expect a big return (far bigger than they're getting now) on the billions they've invested in laying down the plumbing. Searls:
The choke points are in the pipes, the permission is coming from the lawmakers and regulators, and the choking will be done....The carriers are going to lobby for the laws and regulations they need, and they're going to do the deals they need to do. The new system will be theirs, not ours....The new carrier-based Net will work in the same asymmetrical few-to-many, top-down pyramidal way made familiar by TV, radio, newspapers, books, magazines and other Industrial Age media now being sucked into Information Age pipes. Movement still will go from producers to consumers, just like it always did.
If Brecht were around today I'm sure he would have already written (or blogged) to this effect, no doubt reciting the sad fate of radio as a cautionary tale. Watch the pipes, he would say. If companies talk about "broad" as in "broadband," make sure they're talking about both ends of the pipe. The way broadband works today, the pipe running into your house dwarfs the one running out. That means more download and less upload, and it's paving the way for a content delivery platform every bit as powerful as cable on an infinitely broader band. Data storage, domain hosting -- anything you put up there -- will be increasingly costly, though there will likely remain plenty of chat space and web mail provided for free, anything that allows consumers to fire their enthusiasm for commodities through the synapse chain.
If the net goes the way of radio, that will be the difference (allow me to indulge in a little dystopia). Imagine a classic Philco cathedral radio but with a few little funnel-ended hoses extending from the side that connect you to other listeners. "Tune into this frequency!" "You gotta hear this!" You whisper recommendations through the tube. It's sending a link. Viral marketing. Yes, the net will remain two-way to the extent that it helps fuel the market. Web browsers, like the old Philco, would essentially be receivers, enabling participation only to the extent that it encouraged others to receive.
You might even get your blog hosted for free if you promote products -- a sports shoe with gelatinous heels or a music video that allows you to undress the dancing girls with your mouse. Throw in some political rants in between to blow off some steam, no problem. That's entrepreneurial consumerism. Make a living out of your appetites and your ability to make them infectious. Hip recommenders can build a cosy little livelihood out of their endorsements. But any non-consumer activity will be more like amateur short-wave radio: a mildly eccentric (and expensive) hobby (and they'll even make a saccharine movie about a guy communing with his dead firefighter dad through a ghost blog).
Searls sees it as above all a war of language and metaphor. The phone and cable companies will dominate as long as the internet is understood fundamentally as a network of pipes, a kind of information transport system. This places the carriers at the top of the hierarchy -- the highway authority setting the rules of the road and collecting the tolls. So far the carriers have managed, through various regulatory wrangling and court rulings, to ensure that the "transport metaphor" has prevailed.
But obviously the net is much more than the sum of its pipes. It's a public square. It's a community center. It's a market. And it's the biggest publishing system the world has ever known. Searls wants to promote "place metaphors" like these. Sure, unless you're a lobbyist for Verizon or SBC, you probably already think of it this way. But in the end it's the lobbyists that will make all the difference. Unless, that is, an enlightened citizens' lobby begins making some noise. So a broad, broad as in broadband, public conversation should be in order. Far broader than what goes on in the usual progressive online feedback loops -- the Linux and open source communities, the creative commies, and the techno-hip blogosphere, that I'm sure are already in agreement about this.
Google also seems to have an eye on the pipes, reportedly having bought thousands of miles of "dark fiber" -- pipe that has been laid but is not yet in use. Some predict a nationwide "Googlenet." But this can of worms is best saved for another post.
Posted by ben vershbow at 08:11 AM
| Comments (2)
tags: Network_Freedom , Publishing, Broadcast, and the Press , brecht , broadband , broadcast , cable , fiber , google , internet , linux , media , net , radio , short_wave , telecom , telephone , tubes , utopia , verizon , web
virtual libraries, real ones, empires
11.28.2005, 12:36 PM
Last Tuesday, a Washington Post editorial written by Library of Congress librarian James Billington outlined the possible benefits of a World Digital Library, a proposed LOC endeavor discussed last week in a post by Ben Vershbow. Billington seemed to imagine the library as sort of a United Nations of information: claiming that "deep conflict between cultures is fired up rather than cooled down by this revolution in communications," he argued that a US-sponsored, globally inclusive digital library could serve to promote harmony over conflict:
Libraries are inherently islands of freedom and antidotes to fanaticism. They are temples of pluralism where books that contradict one another stand peacefully side by side just as intellectual antagonists work peacefully next to each other in reading rooms. It is legitimate and in our nation's interest that the new technology be used internationally, both by the private sector to promote economic enterprise and by the public sector to promote democratic institutions. But it is also necessary that America have a more inclusive foreign cultural policy -- and not just to blunt charges that we are insensitive cultural imperialists. We have an opportunity and an obligation to form a private-public partnership to use this new technology to celebrate the cultural variety of the world.
What's interesting about this quote (among other things) is that Billington seems to be suggesting that a World Digital Library would function in much the same manner as a real-world library, and yet he's also arguing for the importance of actual physical proximity. He writes, after all, about books literally, not virtually, touching each other, and about researchers meeting up in a shared reading room. There seems to be a tension here, in other words, between Billington's embrace of the idea of a world digital library, and a real anxiety about what a "library" becomes when it goes online.
I also feel like there's some tension here — in Billington's editorial and in the whole World Digital Library project — between "inclusiveness" and "imperialism." Granted, if the United States provides Brazilians access to their own national literature online, this might be used by some as an argument against the idea that we are "insensitive cultural imperialists." But there are many varieties of empire: indeed, as many have noted, the sun stopped setting on Google's empire a while ago.
To be clear, I'm not attacking the idea of the World Digital Library. Having watched the Smithsonian invest in, and waffle on, some of their digital projects, I'm all for a sustained commitment to putting more material online. But there needs to be some careful consideration of the differences between virtual libraries and real ones -- as well as a bit more discussion of just what a privately funded digital library might eventually morph into.
Posted by lisa lynch at 12:36 PM
| Comments (0)
tags: Libraries, Search and the Web , cultural , digital , google , imperialism , internet , libraries
explosion
11.22.2005, 2:10 PM
A Nov. 18 post on Adam Green's Darwinian Web makes the claim that the web will "explode" (does he mean implode?) over the next year. According to Green, RSS feeds will render many websites obsolete:
The explosion I am talking about is the shifting of a website's content from internal to external. Instead of a website being a "place" where data "is" and other sites "point" to, a website will be a source of data that is in many external databases, including Google. Why "go" to a website when all of its content has already been absorbed and remixed into the collective datastream.
Does anyone agree with Green? Will feeds bring about the restructuring of "the way content is distributed, valued and consumed?" More on this here.
Posted by lisa lynch at 02:10 PM
| Comments (5)
tags: Libraries, Search and the Web , Online , Publishing, Broadcast, and the Press , RSS , blogging , blogs , darwin , darwinism , google , internet , singularity , syndication , web , xml
world digital library
11.22.2005, 7:41 AM
The Library of Congress has announced plans for the creation of a World Digital Library, "a shared global undertaking" that will make a major chunk of its collection freely available online, along with contributions from other national libraries around the world. From The Washington Post:
...[the] goal is to bring together materials from the United States and Europe with precious items from Islamic nations stretching from Indonesia through Central and West Africa, as well as important materials from collections in East and South Asia.
Google has stepped forward as the first corporate donor, pledging $3 million to help get operations underway. At this point, there doesn't appear to be any direct connection to Google's Book Search program, though Google has been working with LOC to test and refine its book-scanning technology.
Posted by ben vershbow at 07:41 AM
| Comments (0)
tags: Libraries, Search and the Web , books , digital , google , library , library_of_congress , literature , preservation , scanning
google print is no more
11.18.2005, 8:06 AM
Not the program, of course, just the name. From now on it is to be known as Google Book Search. "Print" obviously struck a little too close to home with publishers and authors. On the company blog, they explain the shift in emphasis:
No, we don't think that this new name will change what some folks think about this program. But we do believe it will help a lot of people understand better what we're doing. We want to make all the world's books discoverable and searchable online, and we hope this new name will help keep everyone focused on that important goal.
Posted by ben vershbow at 08:06 AM
| Comments (1)
tags: Libraries, Search and the Web , books , copyright , google , google_book_search , google_print , publishing , search
all your base are belong to google
11.16.2005, 7:04 AM
Google Base is live and ready for our stuff.
In AP: "New Project Will Expand Google's Reach"
Posted by ben vershbow at 07:04 AM
| Comments (0)
tags: Online , advertising , classifieds , craigslist , ebay , etail , google , google_base , search , web
the book in the network - masses of metadata
11.15.2005, 6:42 PM
In this weekend's Boston Globe, David Weinberger delivers the metadata angle on Google Print:
...despite the present focus on who owns the digitized content of books, the more critical battle for readers will be over how we manage the information about that content -- information that's known technically as metadata....we're going to need massive collections of metadata about each book. Some of this metadata will come from the publishers. But much of it will come from users who write reviews, add comments and annotations to the digital text, and draw connections between, for example, chapters in two different books.
As the digital revolution continues, and as we generate more and more ways of organizing and linking books -- integrating information from publishers, libraries and, most radically, other readers -- all this metadata will not only let us find books, it will provide the context within which we read them.
The book in the network is a barnacled spirit, carrying with it the sum of its various accretions. Each book is also its own library by virtue not only of what it links to itself, but of what its readers are linking to, of what its readers are reading. Each book is also a milk crate of earlier drafts. It carries its versions with it. A lot of weight for something physically weightless.
Posted by ben vershbow at 06:42 PM
| Comments (0)
tags: ISBN , Libraries, Search and the Web , books , ebook , electronic_literature , folksonomy , google , google_print , hypertext , library , literature , marginalia , metadata , social_software , tagging , weinberger
having browsed google print a bit more...
11.14.2005, 4:53 AM
...I realize I was over-hasty in dismissing the recent additions made since book scanning resumed earlier this month. True, many of the fine wines in the cellar are there only for the tasting, but the vintage stuff can be drunk freely, and there are already some wonderful 19th century titles, at this point mostly from Harvard. The surest way to find them is to search by date, or by title and date. Specify a date range in advanced search or simply enter, for example, "date: 1890" and a wealth of fully accessible texts comes up, any of which can be linked to from a syllabus. An astonishing resource for teachers and students.
The conclusion: Google Print really is shaping up to be a library, that is, of the world pre-1923 -- the current line of demarcation between copyright and the public domain. It's a stark reminder of how over-extended copyright is. Here's an 1899 English printing of The Mahabharata:
A charming detail found on the following page is this old Harvard library stamp that got scanned along with the rest:
Posted by ben vershbow at 04:53 AM
| Comments (0)
tags: Copyright and Copyleft , Libraries, Search and the Web , OCR , copyright , ebook , fair_use , google , google_print , library , mahabharata , scan
pages à la carte
11.04.2005, 7:20 AM
The New York Times reports on programs being developed by both Amazon and Google that would allow readers to purchase online access to specific sections of books -- say, a single recipe from a cookbook, an individual chapter from a how-to manual, or a particular short story or poem from an anthology. Such a system would effectively "unbind" books into modular units that consumers patch into their online reading, just as iTunes blew apart the integrity of the album and made digital music all about playlists. We become scrapbook artists.
It seems Random House is in on this too, developing a micropayment model and consulting closely with the two internet giants. Pages would sell for anywhere between five and 25 cents each.
Posted by ben vershbow at 07:20 AM
| Comments (1)
tags: Publishing, Broadcast, and the Press , Transliteracies , amazon , books , e-commerce , google , google_print , literature , media_consumption , publishing , randomhouse , reading
google print's not-so-public domain
11.03.2005, 4:16 PM
Google's first batch of public domain book scans is now online, representing a smattering of classics and curiosities from the collections of libraries participating in Google Print. Essentially snapshots of books, they're not particularly comfortable to read, but they are keyword-searchable and, since no copyright applies, fully accessible.
The problem is, there really isn't all that much there. Google's gotten a lot of bad press for its supposedly cavalier attitude toward copyright, but spend a few minutes browsing Google Print and you'll see just how publisher-centric the whole affair is. The idea of a text being in the public domain really doesn't amount to much if you're only talking about antique manuscripts, and these are the only books that they've made fully accessible. Daisy Miller's copyright expired long ago but, with the exception of Harvard's illustrated 1892 copy, all the available scanned editions are owned by modern publishers and are therefore only snippeted. This is not an online library, it's a marketing program. Google Print will undeniably have its uses, but we shouldn't confuse it with a library.
(An interesting offering from the stacks of the New York Public Library is this mid-19th century biographic registry of the wealthy burghers of New York: "Capitalists whose wealth is estimated at one hundred thousand dollars and upwards...")
Posted by ben vershbow at 04:16 PM
| Comments (0)
tags: Copyright and Copyleft , Libraries, Search and the Web , OCR , books , copyright , ebook , google , google_print , library , literature , public_domain , scan
the creeping (digital) death of fair use
11.02.2005, 1:13 PM
Meant to post about this last week but it got lost in the shuffle... In case anyone missed it, Tarleton Gillespie of Cornell has published a good piece in Inside Higher Ed about how sneaky settings in course management software are effectively eating away at fair use rights in the academy. Public debate tends to focus on the music and movie industries and the ever more fiendish anti-piracy restrictions they build into their products (the latest being the horrendous "analog hole"). But a similar thing is going on in education and it is decidedly under-discussed.
Gillespie draws our attention to the "Copyright Permissions Building Block," a new add-on for the Blackboard course management platform that automatically obtains copyright clearances for any materials a teacher puts into the system. It's billed as a time-saver, a friendly chauffeur to guide you through the confounding back alleys of copyright.
But is it necessary? Gillespie, for one, is concerned that this streamlining mechanism encourages permission-seeking that isn't really required, when teachers should simply invoke fair use. To be sure, a good many instructors never bother with permissions anyway, but if they stop to think about it, they probably feel that they are doing something wrong. Blackboard, by sneakily making permissions-seeking the default, plays to this misplaced guilt, lulling teachers away from awareness of their essential rights. It's a disturbing trend, since a right not sufficiently exercised is likely to wither away.
Fair use is what oxygenates the bloodstream of education, allowing ideas to be ideas, not commodities. Universities, and their primary fair use organs, libraries, shouldn't be subjected to the extortionist policies of the mainstream copyright regime, which, like some corrupt local construction authority, requires dozens of permits to set up a simple grocery store. Fair use was written explicitly into law in 1976 to guarantee protection. But the market tends to find a way, and code is its latest, and most insidious, weapon.
Amazingly, few academics are speaking out. John Holbo, writing on The Valve, wonders:
Why aren’t academics - in the humanities in particular - more exercised by recent developments in copyright law? Specifically, why aren’t they outraged by the prospect of indefinite copyright extension?......It seems to me odd, not because overextended copyright is the most pressing issue in 2005 but because it seems like a social/cultural/political/economic issue that recommends itself as well suited to be taken up by academics - starting with the fact that it is right here on their professional doorstep...
Most obviously on the doorstep is Google, currently mired in legal unpleasantness for its book-scanning ambitions and the controversial interpretation of fair use that undergirds them. Why aren't the universities making a clearer statement about this? In defense? In concern? Soon, when search engines move in earnest into video and sound, the shit will really hit the fan. The academy should be preparing for this, staking out ground for the healthy development of multimedia scholarship and literature that necessitates quotation from other "texts" such as film, television and music, and for which these searchable archives will be an essential resource.
Fair use seems to be shrinking at just the moment it should be expanding, yet few are speaking out.
Posted by ben vershbow at 01:13 PM
| Comments (4)
tags: Copyright and Copyleft , DRM , Education , academy , blackboard , copyright , fair_use , google , google_print
microsoft joins open content alliance
10.26.2005, 9:06 AM
Microsoft's forthcoming "MSN Book Search" is the latest entity to join the Open Content Alliance, the non-controversial rival to Google Print. ZDNet says: "Microsoft has committed to paying for the digitization of 150,000 books in the first year, which will be about $5 million, assuming costs of about 10 cents a page and 300 pages, on average, per book..."
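As a quick check on the arithmetic in that ZDNet quote, here is a minimal sketch that multiplies out the quoted figures. The numbers come straight from the quote; the function and variable names are just illustrative.

```python
# Figures quoted by ZDNet; the names below are illustrative, not from any source.
BOOKS = 150_000            # books Microsoft has committed to digitizing in year one
PAGES_PER_BOOK = 300       # assumed average length of a book
COST_PER_PAGE_CENTS = 10   # assumed scanning cost per page, in cents


def digitization_cost_dollars(books, pages_per_book, cents_per_page):
    """Return the total scanning cost in dollars."""
    total_cents = books * pages_per_book * cents_per_page
    return total_cents / 100


total = digitization_cost_dollars(BOOKS, PAGES_PER_BOOK, COST_PER_PAGE_CENTS)
print(f"${total:,.0f}")
```

Taken at face value, the quoted rates come to $4.5 million, which ZDNet rounds up to "about $5 million."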
Apparently having learned from Google's mistakes, OCA operates under a strict "opt-in" policy for publishers vis-a-vis copyrighted works (whereas with Google, publishers have until November 1 to opt out). Judging by the growing roster of participants, including Yahoo, the National Archives of Britain, the University of California, Columbia University, and Rice University, not to mention the Internet Archive, it would seem that less hubris equals more results, or at least lower legal fees. Supposedly there is some communication between Google and OCA about potential cooperation.
There is also a story in the NY Times.
Posted by ben vershbow at 09:06 AM
| Comments (2)
tags: Libraries, Search and the Web , Microsoft , OCA , books , brewster_kahle , copyright , google , google_print , library , open_content_alliance , search , web , yahoo
to some writers, google print sounds like a sweet deal
10.25.2005, 9:25 AM
Wired has a piece today about authors who are in favor of Google's plans to digitize millions of books and make them searchable online. Most seem to agree that obscurity is a writer's greatest enemy, and that the exposure afforded by Google's program far outweighs any intellectual property concerns. Sometimes to get more you have to give a little.
The article also mentions the institute.
Posted by ben vershbow at 09:25 AM
| Comments (0)
tags: Libraries, Search and the Web , Publishing, Broadcast, and the Press , books , copyright , google , google_print , publishing , search , web , writing
debating google print
10.22.2005, 5:53 PM
The Washington Post has run a pair of op-eds, one from each side of the Google Print dispute. Neither says anything particularly new. Moreover, they reinforce the perception that there can be only two positions on the subject -- an endemic problem in newspaper opinion pages with their addiction to binaries, where two cardboard boxers are allotted their space to throw a persuasive punch. So you're either for Google or against it? That's awfully close to you're either for technology -- for progress -- or against it. Unfortunately, like technology's impact, the Google book-scanning project is a little trickier to figure out, and a more nuanced conversation is probably in order.
The first piece, "Riches We Must Share...", is submitted in support of Google by University of Michigan President Sue Coleman (a partner in the Google library project). She argues that opening up the elitist vaults of the world's great (English) research libraries will constitute a democratic revolution. "We believe the result can be a widening of human conversation comparable to the emergence of mass literacy itself." She goes on to deliver some boilerplate about the "Net Generation" -- too impatient to look for books unless they're online etc. etc. (great to see a major university president being led by the students instead of leading herself).
Coleman then devotes a couple of paragraphs to the copyright question, failing to tackle any of its controversial elements:
Universities are no strangers to the responsible management of complex copyright, permission and security issues; we deal with them every day in our classrooms, libraries, laboratories and performance halls. We will continue to work within the current criteria for fair use as we move ahead with digitization.
The problem is, Google is stretching the current criteria of fair use, possibly to the breaking point. Coleman does not acknowledge or address this. She does, however, remind the plaintiffs that copyright is not only about the owners:
The protections of copyright are designed to balance the rights of the creator with the rights of the public. At its core is the most important principle of all: to facilitate the sharing of knowledge, not to stifle such exchange.
All in all a rather bland statement in support of open access. It fails to weigh in on the fair use question -- something about which the academy should have a few things to say -- and does not indicate any larger concern about what Google might do with its books database down the road.
The opposing view, "...But Not at Writers' Expense", comes from Nick Taylor, writer, and president of the Authors Guild (which sued Google last month). Taylor asserts that mega-rich Google is trampling on the dignity of working writers. But a couple of paragraphs in, he gets a little mixed up about contemporary publishing:
Except for a few big-name authors, publishers roll the dice and hope that a book's sales will return their investment. Because of this, readers have a wealth of wonderful books to choose from.
A dubious assessment, since publishing conglomerates are not exactly enthusiastic dice rollers. I would counter that risk-averse corporate publishing has steadily shrunk the number of available titles, counting on a handful of blockbusters to drive the market. Taylor goes on to defend not just the publishing status quo, but the legal one:
Now that the Authors Guild has objected, in the form of a lawsuit, to Google's appropriation of our books, we're getting heat for standing in the way of progress, again for thoughtlessly wanting to be paid. It's been tradition in this country to believe in property rights. When did we decide that socialism was the way to run the Internet?
First of all, it's funny to think of the huge corporations that dominate the web as socialist. Second, this talk about being paid when books are appropriated for a search database is revealing of the two totally different worldviews that are at odds in this struggle. The authors say that any use of their book requires a payment. Google sees including the books in the database as a kind of payment in itself. No one with a web page expects Google to pay them for indexing their site. They are grateful that it does! Otherwise, they would be totally invisible. This is the unspoken compact that underpins web search. Google assumed the same would apply with books. Taylor says not so fast.
Here's Taylor on fair use:
Google contends that the portions of books it will make available to searchers amount to "fair use," the provision under copyright that allows limited use of protected works without seeking permission. That makes a private company, which is profiting from the access it provides, the arbiter of a legal concept it has no right to interpret. And they're scanning the entire books, with who knows what result in the future.
Actually, Google is not doing all the interpreting. There is a legal precedent for Google's reading of fair use established in the 2003 9th Circuit Court decision Kelly v. Arriba Soft. In the case, Kelly, a photographer, sued Arriba Soft, an online image search system, for indexing several of his photographs in their database. Kelly believed that his intellectual property had been stolen, but the court ruled that Arriba's indexing of thumbnail-sized copies of images (which always linked to their source sites) was fair use: "Arriba's use of the images serves a different function than Kelly's use – improving access to information on the internet versus artistic expression." Still, Taylor's "with who knows what result in the future" concern is valid.
So on the one hand we have many writers and most publishers trying to defend their architecture of revenue (or, as Taylor would have it, their dignity). But I can't imagine how Google Print would really be damaging that architecture, at least not in the foreseeable future. Rather it leverages it by placing it within the frame of another architecture: web search. The irony for the authors is that the current architecture doesn't seem to be serving them terribly well. With print-on-demand gaining in quality and legitimacy, online book search could totally re-define what is an acceptable risk to publishers, and maybe more non-blockbuster authors would get published.
On the other hand we have the universities and libraries participating in Google's program, delivering the good news of accessibility. But they are not sufficiently questioning what Google might do with its database down the road, or the implications of a private technology company becoming the principal gatekeeper of the world's corpus.
If only this debate could be framed in a subtler way, rather than the for-Google-or-against-it paradigm we have now. I'm cautiously optimistic about the effect of having books searchable on the web. And I tend to believe it will be beneficial to authors and publishers. But I have other, deep reservations about the direction in which Google is heading, and feel that a number of things could go wrong. We think the censorship of the marketplace is bad now in the age of publishing conglomerates. What if one company has total control of everything? And is keeping track of every book, every page, that you read. And is reading you while you read, throwing ads into your peripheral vision. I'm curious to hear from readers what they feel could be the hazards of Google Print.
Posted by ben vershbow at 05:53 PM
| Comments (4)
tags: Libraries, Search and the Web , Publishing, Broadcast, and the Press , academy , books , copyright , google , google_print , michigan , publishing , writing
google is sued... again
10.20.2005, 8:08 AM
This time by publishers. Penguin Group USA, McGraw-Hill, Pearson Education, Simon & Schuster and John Wiley & Sons. The gripe is the same as with the Authors Guild, which filed suit last month alleging "massive copyright infringement." Publishers fear a dangerous precedent is set by Google's scanning of books to construct what amounts to a giant card catalogue on the web. Google claims "fair use" (see rationale), again pointing out that for copyrighted works only tiny "snippets" of text are displayed around keywords (though perhaps this is not yet fully in effect - I was searching around in this book and was able to look at quite a lot).
Google calls the publishers' suit "near-sighted." And it probably is. The benefit to readers and researchers will be tremendous, as will (Google is eager to point out) the exposure for authors and publishers. But Google Print is undoubtedly an earth-shaking program. Look at the reaction in Europe, where alarm bells rung by France warned of cultural imperialism, an English-drenched web. Heads of state and culture convened and initial plans for a European digital library have been drawn up.
What the transatlantic flap makes clear is that Google's book scanning touches a deep nerve, and the argument over intellectual property, significant though it is, distracts from a more profound human anxiety -- an anxiety about the form of culture and the shape of thoughts. If we try to grope back through the millennia, we can find an analogy in the invention of writing.
The shift from oral to written language froze speech into stable strings that could be transmitted and stored over distance and time. This change not only affected the modes of communication, it dramatically refigured the cognitive makeup of human beings (as McLuhan, Ong and others have described). We are currently going through another such shift. The digital takes the freezing medium of text and throws it back into fluidity. Like the melting of polar ice caps, it unsettles equilibriums, changes weather patterns. It is a lot to adjust to, and we wonder if our great-great-grandchildren will literally think differently from us.
But in spite of this disorienting new fluidity, we still have print, we still have the book. And actually, Google Print in many ways affirms this since its search returns will point to print retailers and brick-and-mortar libraries. Yet the fact remains that the canon is being scanned, with implications we can't fully perceive, and future uses we can't fully predict, and so it is understandable that many are unnerved. The ice is really beginning to melt.
In Phaedrus, Plato expresses a similar anxiety about the invention of writing. He tells the tale of Theuth, an Egyptian deity who goes around spreading the new technology, and one day encounters a skeptic in King Thamus:
...you who are the father of letters, from a paternal love of your own children have been led to attribute to them a power opposite to that which they in fact possess. For this discovery of yours will create forgetfulness in the minds of those who learn to use it; they will not exercise their memories, but, trusting in external, foreign marks, they will not bring things to remembrance from within themselves. You have discovered a remedy not for memory, but for reminding. You offer your students the appearance of wisdom, not true wisdom. They will be hearers of many things and will have learned nothing; they will appear to be omniscient and will generally know nothing; they will be tiresome company, having the show of wisdom without the reality.
As I type, I'm exhibiting wisdom without the reality. I've read Plato, but nowhere near exhaustively. Yet I can slash and weave texts on the web in seconds, throw together a blog entry and send it screeching into the commons. And with Google Print I can get the quote I need and let the rest of the book rot behind the security fence. This fluidity is dangerous because it makes connections so easy. Do we know what we are connecting?
Posted by ben vershbow at 08:08 AM
| Comments (5)
tags: Copyright and Copyleft , Libraries, Search and the Web , Transliteracies , copyright , google , literacy , mcluhan , ong , plato , publishing , search , web
google expands book-scanning project to europe
10.18.2005, 8:56 AM
This week Google will be paying a visit to the Frankfurt Book Fair to talk with European publishers and chief librarians (including arch nemesis Jean-Noël Jeanneney) about eight new local incarnations of Google Print. (more)
Posted by ben vershbow at 08:56 AM
| Comments (0)
tags: Libraries, Search and the Web , Online , books , copyright , ebook , europe , frankfurt , google , internet , library , publishing , search , web
google dystopia
10.10.2005, 10:06 AM
Google as big brother -- the paranoia certainly seems to be creeping into the mainstream. "Op-Art" by Randy Siegel from today's NY Times:
Posted by ben vershbow at 10:06 AM
| Comments (0)
tags: 1984 , 2084 , Libraries, Search and the Web , NYTimes , Online , algorithm , art , cartoon , dystopia , editorial , google , information , internet , newspaper , orwell , paranoia , privacy , satire , search , technology , web
yahoo! announces book-scanning project to rival google's
10.03.2005, 2:00 PM
Yahoo, in collaboration with The Internet Archive, Adobe, O'Reilly Media, Hewlett Packard Labs, the University of California, the University of Toronto, The National Archives of England, and others, will be participating in The Open Content Alliance, a book and media archiving project that will greatly enlarge the body of knowledge available online. At first glance, it appears the program will focus primarily on public domain works, and in the case of copyrighted books, will seek to leverage the Creative Commons.
Google Print, on the other hand, is more self-consciously a marketing program for publishers and authors (although large portions of the public domain will be represented as well). Google aims to make money off its indexing of books through keyword advertising and click-throughs to book vendors. Yahoo throwing its weight behind the "open content" movement seems on the surface to be more of a philanthropic move, but clearly expresses a concern over being outmaneuvered in the search wars. But having this stuff available online is clearly a win for the world at large.
The Alliance was conceived in large part by Brewster Kahle of the Internet Archive. He announced the project on Yahoo's blog:
To kick this off, Internet Archive will host the material and sometimes helps with digitization, Yahoo will index the content and is also funding the digitization of an initial corpus of American literature collection that the University of California system is selecting, Adobe and HP are helping with the processing software, University of Toronto and O'Reilly are adding books, Prelinger Archives and the National Archives of the UK are adding movies, etc. We hope to add more institutions and fine tune the principles of working together. Initial digitized material will be available by the end of the year.
More in:
NY Times
Chronicle of Higher Ed.
Posted by ben vershbow at 02:00 PM
| Comments (0)
tags: Libraries, Search and the Web , archive , book , books , brewster_kahle , digital , digitize , ebook , google , google_print , googleprint , internet_archive , kahle , library , literature , reading , scanning , yahoo , yahoo!
human versus algorithm
09.29.2005, 3:40 PM
I just came across Common Times, a new community-generated news aggregation page, part of something called the Common Media Network, that takes the social bookmarking concept of del.icio.us and applies it specifically to news gathering. Anyone can add a story from any source to a series of sections (which seem pre-set and non-editable) arranged on a newspaper-style "front page." You add links through a bookmarklet on the links bar on your browser. Whenever you come across an article you'd like to submit, you just click the button and a page comes up where you can enter the metadata like tags and comments. Each user has a "channel" - basically a stripped-down blog - where all their links are displayed chronologically with an RSS feed, giving individuals a venue to show their chops as news curators and annotators. You can set it up so links are posted simultaneously to a del.icio.us account (there's also a Firefox extension that allows you to post stories directly from Bloglines).
Human aggregation is often more interesting than what the Google News algorithm can turn up, but it can easily mould to the biases of the community. Of course, search algorithms are developed by people, and source lists don't just manufacture themselves (Google is notoriously tight-lipped about its list of news sources). In the case of something like Common Times, a slick new web application hyped on Boing Boing and other digital culture sites, the communities can be rather self-selecting. Still, this is a very interesting experiment in multi-player annotation. When I first arrived at the front page, not yet knowing how it all worked, I was impressed by the fairly broad spread of stories. And the tag cloud to the right is an interesting little snapshot of the zeitgeist.
(via Infocult)
Posted by ben vershbow at 03:40 PM
| Comments (0)
tags: Publishing, Broadcast, and the Press , aggregator , algorithm , bibliography , blog , blogging , bookmarking , del.icio.us , delicious , folksonomy , google , journalism , media , news , newspaper , search , socialsoftware , tag , tagging , tags
this laptop costs $100
09.28.2005, 1:03 PM

MIT has released some new images of its $100 laptop prototype, of which it hopes to have 5 to 15 million test units within the year. The laptops are much more durable than your average commercial machine, can be used as writing tablets or rotated 90 degrees as ebooks, and run on Linux - 100% free software. The idea is for the machines to provide a platform for an open source education movement throughout the South - a major hack of the current global order.
I love the hand cranks on the side, a backup charging option for remote or poorly provided areas where there is little or no electricity.
("The $100 laptop moves closer to reality" in CNET)

Posted by ben vershbow at 01:03 PM
| Comments (6)
tags: $100laptop , Education , MIT , Negroponte , The Ideal Device? , africa , asia , classroom , computer , gadget , google , justice , laptop , linux , notebook , opensource , poor , poverty , redhat , revolution , south , southamerica , tablet , technology
the database of intentions
09.16.2005, 11:16 AM
Interesting edition of Open Source last week on "Google Sociology" with David Weinberger and John Battelle, author of the just-published "The Search: How Google and Its Rivals Rewrote the Rules of Business and Transformed Our Culture". Listen here.
Weinberger has some interesting things to say about Google (and the other search engines) as "publishers." I have some thoughts on that too. More to come later.
Battelle has done a great deal of thinking on search from a variety of angles: the technology of search, the economics of search, and the more esoteric dimensions of a "search" culture. He touches briefly on this last point, laying out a construct that is probably treated more extensively in his book: the "database of intentions." By this he means the archive, or "artifact," of the world's search queries. A picture of the collective consciousness formed by the questions everyone is asking. Even now, when logged in to Google, a history of all your search query strings is kept - your own database of intentions. The potential value of this database is still being determined, but obvious uses are targeted advertising, and more relevant search results based on analysis of search histories.
As regards the collective database of intentions, Battelle speculates that future advances in artificial intelligence will likely draw on this enormous crop of information about how humans think and seek.
Posted by ben vershbow at 11:16 AM
| Comments (0)
tags: Libraries, Search and the Web , Online , algorithm , audio , battelle , database , google , internet , listen , opensource , podcast , radio , radioopensource , search , searchengine , web , weinberger
google blog search - still a long way to go
09.14.2005, 5:01 PM
Google's new blog search engine reminds me of how far we still have to go with blog search. The engine works much the same way as Google's general web search - with keywords and page ranking - only here it's searching RSS feeds. Recent posts with keyword matches fill the column, and a few links to related blogs come up at the top. But there's the rub. These so-called "related" blogs are only related by direct keyword matches in their title tagline. I just searched "poetry" and came up with only three related blogs. C'mon. A search for "gossip" turns up only one related blog - "Starbucks Gossip". There has to be some kind of promotion going on here, though their "about" page mentions nothing of the kind.
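To make the complaint concrete, here is a minimal Python sketch of the kind of matching that seems to be at work. It is a guess from the outside, not Google's actual method: the function name `related_blogs` and the sample entries (apart from the "Starbucks Gossip" title) are made up for illustration.

```python
# Hypothetical sample data: only the "Starbucks Gossip" title comes from the
# search described above; the taglines and the second blog are invented.
blogs = [
    {"title": "Starbucks Gossip", "tagline": "Monitoring the coffee chain"},
    {"title": "Red Carpet Daily", "tagline": "Celebrity rumors and scandal"},
]

def related_blogs(query, blogs):
    """Return titles of blogs whose title or tagline contains the query word."""
    q = query.lower()
    matches = []
    for blog in blogs:
        # Only the literal words of the title and tagline are compared;
        # nothing here knows what the blog is actually about.
        words = (blog["title"] + " " + blog["tagline"]).lower().split()
        if q in words:
            matches.append(blog["title"])
    return matches

print(related_blogs("gossip", blogs))
```

Under this assumption, a blog devoted entirely to celebrity rumor never surfaces for "gossip" unless the word itself appears in its title or tagline.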
A good engine would be capable of searching blogs by their subject, their preoccupation, their obsession. Many blogs could be considered "general," but just as many have a special focus, and readers are often searching with a particular theme in mind. They don't just want a list of transient posts, but whole sites that might potentially become regular destinations. Many blogs are valuable publications that prove themselves day after day. But blog search hasn't yet grown beyond the trendy "what's the latest chatter on the blogosphere" mode.
I do have to give credit to Technorati. Glitchy as it is, they're trying to think of creative ways - tagging, author-determined keywords - to help readers find interesting blogs and authors their audience. Then again, my greatest finds have usually been from other blogs. Humans will always be the smartest aggregators.
People out there, what do you use?
Posted by ben vershbow at 05:01 PM
| Comments (2)
tags: RSS , blog , blogger , blogging , blogs , blogsearch , feeds , feedster , googlblogsearch , google , pubsub , search , technorati , xml
hive mind
08.31.2005, 12:02 PM
I spend a lot of time looking for specific resources on the web. That means sifting through Google search results and following links that seem promising. A semi-interesting link may take me to an article with another semi-interesting link; that link takes me to another, and so on. As I progress, the articles become more thinly related to the topic, but I pursue them anyway, hoping they will lead me on a trajectory I hadn't thought of, to a great idea that I couldn't have anticipated.
During the whole process, however, I can't shake the unpleasant sensation that I am not the master of my own destiny. I come out of a Google session with a wrung-out feeling, like I've just been led along a path that was not entirely of my own choosing, marching behind an army of web searchers carving networked pathways into the information landscape, but not necessarily finding that unique morsel that will knit my ideas together. Lee Bryant explains this phenomenon as entanglement in the complex systems addressed by complexity theory. "Complexity theory," says Bryant, "shows us that from the seeds of such small inter-connected actions, large trees of system behaviour can grow. These physical phenomena are reflected online as well, where the emergence of the Wiki movement and the growing cult of Google both display a simple form of collective intelligence." He gives us this metaphor to consider:
The classic pop-science example that illustrates the point is the way in which ants forage for food. Ants display a kind of collective intelligence (described by some as a “hive mind”) that is based on apparently dumb rules, repetitively followed by thousands of individual insects. Each ant forages for food in an apparently random manner, but when it finds food it marks a pheromone trail back to its colony. Trails fade over time, but positive feedback means that well-travelled paths will attract more and more ants until the particular food source is exhausted. The system works because there are enough ants each following the same rules to ensure comprehensive coverage of any given area.
The fact that my participation in the web, even at the browsing level, means that I will be drawn, unavoidably, into the group effort evokes a mixed response. My independent artistic sensibility hates anything that erases the individual voice and immerses me in a placid groupthink. But my social human sensibility sincerely wants to know what everyone else is doing; it makes me want to dive in, pitch in, follow along, and celebrate the complex social web we are weaving.
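For the curious, the rules in Bryant's ant example are simple enough to sketch in a few lines of Python. This is a toy model under stated assumptions, not anything taken from Bryant's piece: the function `forage`, the site names, and the values chosen for ants, rounds, and evaporation are all hypothetical.

```python
import random

def forage(food, ants=20, rounds=50, evaporation=0.1, seed=0):
    """Toy model of pheromone foraging. `food` maps a site name to units of food."""
    rng = random.Random(seed)
    food = dict(food)                      # copy so the caller's dict is untouched
    trail = {site: 0.0 for site in food}   # pheromone strength per site
    sites = list(food)
    for _ in range(rounds):
        # Each ant picks a site: uniformly at random while no trails exist,
        # increasingly biased toward sites with stronger trails.
        weights = [1.0 + trail[s] for s in sites]
        picks = rng.choices(sites, weights=weights, k=ants)
        # Trails fade a little every round.
        for s in sites:
            trail[s] *= 1.0 - evaporation
        # An ant that finds food carries one unit home and marks the trail.
        for s in picks:
            if food[s] > 0:
                food[s] -= 1
                trail[s] += 1.0
    return trail, food

trail, remaining = forage({"bare": 0, "rich": 1000}, rounds=5)
print(trail, remaining)
```

While the "rich" site still has food, its trail strengthens and draws a growing share of the ants; once the food runs out, nothing renews the trail and it fades back toward zero, which is the feedback loop the quoted passage describes.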
Posted by Kim White at 12:02 PM
| Comments (1)
tags: ants , collective_intelligence , complex_systems , complexity , complexity_theory , emergence , google , hivemind , media_consumption
tower of babel or trivial pursuit?
12.20.2004, 3:59 PM
Read New York Times Article
In an article in yesterday’s NY Times, Alberto Manguel compares the Genesis story of Babel and the ambitions of the library at Alexandria with their alleged modern-day counterpart—Google’s commitment to digitize all human knowledge. Are we constructing a modern-day tower of Babel—a monument to the hubris of what might be possible if we could just get a little smarter? Will Google help us find answers to perennial puzzlers like: where did we come from? Is anyone or anything in charge? And, what’s the meaning of it all? I went online to find out. I Googled the question “What is the meaning of it all?” and got the following:
The Meaning of Emmanuel
... "What is the meaning of it all?" "What is its purpose?" The human tendency always is to forget origins. And now that Christmas has grown to be such a ...
The Kubrick Site: John Morgan on 2001 vs. 2010
... What is the meaning of it all? Is there a God? What is the purpose of Art? Is there a merging of Art and Science?' Where Clarke in comparison only asks ...
The meaning of life, the universe and everything
... What is the meaning of it all? 'Antennae' colliding galaxies. When we contemplate the unimaginable vastness of the universe, the incredible diversity ...
London theater musical on stage in London's West End Shaftesbury ...
... But what is the meaning of it all? Well, mainly that the dreamy idealist, Boney, had all he needed in Anastasia Barzee’s sweetly trilling Jo and never ...
'Rings' actor: 'It'll be the biggest film of all time'
... What is the meaning of it all? In some ways, that sort of inquiry is completely unfashionable. "I often think one of the reasons people are dismissive ...
Becoming a Wise Elder
... Questions such as "What is the meaning of it all?" and "Does my life make any kind of difference to anyone?" were very unlikely to arise. ...
Psychology Today: Still news
... PT: What is the meaning of it all now? BB: There was a recklessness in Kennedy's life that I didn't see, a sexual recklessness I don't understand. ...
None of these offerings brought me closer to a substantive answer. Demoralized by the thought of having to go through the other 517 possibilities, I decided to respond to the suggestion at the top of my page:
Tip: Have a question? Ask the researchers at Google Answers.
I clicked "Google Answers" and entered my question: What is the meaning of it all?
Then I had to set a price for my question between $2 and $200. I clicked on “How do I price my question?” and found the following guidelines:
*The more you pay, the more time and effort a Researcher will likely spend on your answer. However, this depends somewhat on the nature of your question.
*Above all - try to pay what the information is worth to you, not what you think you can get it for - that is the best way to get a good answer - but only you can know the value of the information you seek.
Hmm, what is the information worth to me?
I took a look at Google’s examples to get an idea of where my question might fit on the pay scale. Fifty dollars is the “minimum price appropriate for complex, multi-part questions. Researchers will typically spend at least one hour on $50 questions and be very responsive to follow-up questions.” One hundred dollar questions merit two to four hours of “highly thorough research.” Examples of hundred dollar questions included “Parking in New York City” and “How does infant-family bonding develop?” The two hundred dollar question required researchers to “spend extensive amounts of time (4 hours plus).” Examples of $200 questions included “Searching for Barrett's Ginger Beer,” “Applications using databases,” and “What is the impact of a baby with Down's Syndrome on its family?”
None of those examples seemed to be in the same league with “what’s the meaning of it all?” Can a Google researcher find the answer in 4 hours? Probably not, although I do wonder what they would come up with. Anyway, the point of all this is that Google is set up to search out trivial, quotidian sorts of things and it will be interesting to see how/if they can make the transition from those who can tell you how to “search for Barrett’s Ginger Beer,” to gatekeepers of all human knowledge.
Posted by Kim White at 03:59 PM
| Comments (0)
tags: Libraries, Search and the Web , babel , google , internet , meaning , search , semantic_web , web
enter the cybrarian
12.18.2004, 3:02 PM
The recent buzz surrounding Google's library initiative has everyone talking about the future of research, which inevitably raises the question: how will the digitization of library collections change the role of the librarian? I would guess that, far from becoming obsolete, their role will in fact be elevated in importance, if not necessarily in status. They could very well come to be our indispensable guides through the labyrinth - if perhaps invisible, engineering behind the digital walls.
It's also important to consider the question of visualization. When you run a search on Google you are given an enormous list. This is already deeply ingrained in the day-to-day business of finding information. But these lists are basically the electronic equivalent of scrolls, with the items algorithmically determined to be most relevant placed at the top. But sooner or later we have to admit that using scrolls for this kind of business is ludicrous. There has to be a better way of arraying these vast harvests of information in a way that allows the researcher to zoom across degrees of specificity and through associative chains of context and meaning. I see no reason why a search shouldn't take place in some kind of virtual library, emulating the physical architecture of research settings, and allowing for some of the associative or accidental echoes that so often enrich a paper trail blazed through a brick-and-mortar library. Or cannot knowledge resemble a tree, or an arterial matrix? Must we be bound to the scroll?
Returning to the question of the librarian's role, I recalled this passage from James J. O'Donnell's 1996 paper The Pragmatics of the New: Trithemius, McLuhan, Cassiodorus:
"The librarians of the world have, moreover, already led the way, for academics at least, into the new information environment, not least because they are caught between rising demand from their customers (faculty and students) and rising supply and prices from their suppliers, and so have already been making reality-based decisions about ownership versus access, print versus electronics, and so on. In short, they are just now our leading pragmatists. Can we imagine a time in our universities when the librarians are the well-paid principals and the teachers their mere acolytes in a distribution chain? I do not think we can or should rule out that possibility for a moment"
Related articles:
"Questions and Praise for Google Web Library" - NY Times
"Google's library plan 'a huge help'" - USA Today
"Making books readable on computer proves trying task" - USA Today
Also, I found this on Searchblog. For a trip down memory lane, check out the original Google in the Stanford archives (click on picture to right). Unfortunately, although it seems interactive, a search just brings up a bunch of stylesheets.
Posted by ben vershbow at 03:02 PM
| Comments (0)
tags: Libraries, Search and the Web , google , google_print , information_architecture , infoviz , librarian , library , michigan , search
google and big brother
12.15.2004, 7:35 PM
Can Google remain true to its "don't be evil" motto, now that it has shareholders to worry about, advertisers to please, and an ever-increasing reach into the repositories of human knowledge? Google still gives you that warm and fuzzy feeling. It's got the goofy name, those cute seasonal tailorings of its masthead, the lava lamps. And this is not to mention the various amusing pastimes - the "Google Whack" game in which you try to find two words that cohabit only one of the search engine's eight billion web pages; or every writer's guilty pleasure, the Googling of the self, the "auto-Google," that delicious act of cyber-onanism.
But where might it lead? One day, when I open my fridge, might a sensor not read my searching eye and know that I am looking for milk? And knowing that I have run out, suggest an array of retailers who might be able to replenish my supply? Could Google come to mediate every exchange of information, no matter how inane, or how carnal?
Or could it come to resemble something like the Central Intelligence Corporation in Neal Stephenson's Snow Crash - a cross between the CIA, the Library of Congress, and DARPA's "Total Information Awareness" program?
MercuryNews.com | 12/14/2004 | Does Google move augur commercialization of libraries?
Posted by ben vershbow at 07:35 PM
| Comments (0)
tags: Libraries, Search and the Web , evil , google , internet , library , library_of_congress , neal_stephenson , privacy , search , surveillance , web
Dr. Dial-up
12.15.2004, 1:05 PM
Click here to read more
There is a new initiative underway to make biomedical research immediately available online and free to the public. According to the Pew Internet & American Life Project, 66% of those with internet access have used it to look for health/medical information. That means that over 85 million Americans (and who knows how many people worldwide) went online last year to doctor themselves. Is this a new kind of do-it-yourselfer, the amateur physician, Google-ing a diagnosis and a cure? And when all of this new “information” becomes available, will the office visit -- which the HMOs are already putting the squeeze on -- become a thing of the past?
Posted by Kim White at 01:05 PM
| Comments (0)
tags: google , internet , medical , open_access , open_content , pew , research , web
books behind bars - the Google library project
12.14.2004, 4:34 PM
How useful will this service be for in-depth research when copyrighted books (which will account for a huge percentage of searchable texts) cannot be fully accessed? In such cases, a person will be able to view only a selection of pages (depending on agreements with publishers), and will find themselves bombarded with a variety of retail options. On a positive note, the search will be able to refer the user to any local libraries where the desired book is available, but still, the focus here remains squarely on digital texts as simply a means of getting to print texts.
Absent a major paradigm shift with regard to the accessibility and inherent virtue of electronic texts, this ambitious project will never achieve its full potential. For someone searching outside the public domain, the Google library project may amount to nothing more than a guided tour through a prison of incarcerated texts. I've found this to be true so far with Google Scholar - it turned up a lot of interesting stuff, but much of it was password protected or required purchase.
article in Filter: Google -- 21st Century Dewey Decimal System (washingtonpost.com)
Posted by ben vershbow at 04:34 PM
| Comments (0)
tags: Libraries, Search and the Web , books , copyright , digitization , ebooks , google , google_book_search , google_print , google_scholar , libraries , library