Listing entries tagged with google


subtitles and the future of reading Post date  02.22.2006, 6:57 PM

After enduring a weeks-long PR pummeling for its dealings in China, Google is hard at work to improve its image in the world, racking up some points for good after slipping briefly into evil. Recently they launched Google.org: a website for the Google Foundation, the corporation's philanthropic arm and central office of evil mitigation. Paying a visit to the site, the disillusioned among us will be pleased to find that the foundation is already sponsoring a handful of worthy initiatives, along with a grants program that donates free web advertising to nonprofit organizations. And just in case we were concerned that Google might not apply its techno-capitalist wizardry to altruism as zealously as to making profit, they just announced today they've named a new director for the foundation by the name of -- no joke -- Dr. Brilliant. So it seems the world is in capable hands.

One project in particular caught my eye in light of recent discussions about screen-based reading and genre-blending visions of the book. Planet Read is an organization that promotes literacy in India through Same Language Subtitling -- a simple but apparently effective technique for building basic reading skills, taking popular visual entertainment like Bollywood movies and adding subtitles in English and Hindi along the bottom of the screen. A number of samples (sadly no Bollywood, just videos or photo montages set to Indian folk songs) can be found on Google Video. Here's one that I particularly liked:

Watching the video -- managing the interplay between moving text and moving pictures -- I began to wonder whether there are possibly some clues to be mined here about the future of reading. Yes, Planet Read is designed first and foremost to train basic alphabetic literacy, turning a captive audience into a captive classroom. But in doing so, might it not also be nurturing another kind of literacy?

The problem with contemporary discussions about the future of the book is that they are mired -- for cultural and economic reasons -- in a highly inflexible conception of what a book can be. People who grew up with print tend to assume that going digital is simply a matter of switching containers (with a few enhancements thrown in the mix), failing to consider how the actual content of books might change, or how the act of reading -- which increasingly takes place in a dyanamic visual context -- may eventually demand a more dynamic kind of text.

Blurring the lines between text and visual media naturally makes us uneasy because it points to a future that quite literally (for us dinosaurs at least) could be unreadable. But kids growing up today, in India or here in the States, are already highly accustomed to reading in screen-based environments, and so they probably have a somewhat different idea of what reading is. For them, text is likely just one ingredient in a complex combinatory medium.

Another example: Nochnoi Dozor (translated "Night Watch") is a film that has widely been credited as the first Russian blockbuster of the post-Soviet era -- an adrenaline-pumping, special effects-infused, sci-fi vampire epic made entirely by Russians, on Russian soil and on Russian themes (it's based on a popular trilogy of novels). When it was released about a year and a half ago it shattered domestic box office records previously held by Western hits like Titanic and Lord of the Rings. Just about a month ago, the sequel "Day Watch" shattered the records set by "Night Watch."

nochnoi dozor.jpg

While highly derivative of western action movies, Nochnoi Dozor is moody, raucous and darkly gorgeous, giving a good, gritty feel of contemporary Moscow. Its plot grows rickety in places, and sometimes things are downright incomprehensible (even, I'm told, with fluent Russian), so I'm skeptical about its prospects on this side of the globe. But goshdarnit, Russians can't seem to get enough of it -- so in an effort to lure American audiences over to this uniquely Russian gothic thriller, start building a brand out of the projected trilogy (and presumably pave the way for the eventual crossover to Hollywood of director Timur Bekmambetov), Fox Searchlight just last week rolled the film out in the U.S. on a very limited release.

What could this possibly have to do with the future of reading? Well, naturally the film is subtitled, and we all know how subtitles are the kiss of death for a film in the U.S. market (Passion of the Christ notwithstanding). But the marketers at Fox are trying something new with Nochnoi Dozor. No, they weren't foolish enough to dub it, which would have robbed the film of the scratchy, smoke-scarred Moscow voices that give it so much of its texture. What they've done is played with the subtitles themselves, making them more active and responsive to the action in the film (sounds like some Flash programmer had a field day...). Here's a description from an article in the NY Times (unfortunately now behind pay wall):

...[the words] change color and position on the screen, simulate dripping blood, stutter in emulation of a fearful query, or dissolve into red vapor to emulate a character's gasping breaths.

And this from Anthony Lane's review in the latest New Yorker:

...the subtitles, for instance, are the best I have encountered. Far from palely loitering at the foot of the screen, they lurk in odd corners of the frame and, at one point, glow scarlet and then spool away, like blood in water. I trust that this will start a technical trend and that, from here on, no respectable French actress will dream of removing her clothes unless at least three lines of dialogue can be made to unwind across her midriff.

It might seem strange to think of subtitling of foreign films as a harbinger of future reading practices. But then, with the increasing popularity of Asian cinema, and continued cross-pollination between comics and film, it's not crazy to suspect that we'll be seeing more of this kind of textual-visual fusion in the future.

Most significant is the idea that the text can itself be an actor in a perfomance: a frontier that has only barely been explored -- though typography enthusiasts will likely pillory me for saying so.

Posted by ben vershbow at 06:57 PM | Comments (6) | TrackBack
tags: animation , books , cinema , digital_literature , ebooks , film , flash , google , google_video , india , language , literacy , reading , russia , subtitles , translation , typography , video

google gets mid-evil Post date  01.30.2006, 3:46 PM

At the World Economic Forum in Davos last Friday, Google CEO Eric Schmidt assured a questioner in the audience that his company had in fact thoroughly searched its soul before deciding to roll out a politically sanitized search engine in China:

We concluded that although we weren't wild about the restrictions, it was even worse to not try to serve those users at all... We actually did an evil scale and decided not to serve at all was worse evil.

(via Ditherati)

Posted by ben vershbow at 03:46 PM | Comments (0)
tags: Libraries, Search and the Web , Network_Freedom , censorship , china , evil , free_speech , google , internet , search , web

illusions of a borderless world Post date  01.27.2006, 3:57 PM

china google falun gong.jpg

A number of influential folks around the blogosphere are reluctantly endorsing Google's decision to play by China's censorship rules on its new Google.cn service -- what one local commentator calls a "eunuch version" of Google.com. Here's a sampler of opinions:

Ethan Zuckerman ("Google in China: Cause For Any Hope?"):

It’s a compromise that doesn’t make me happy, that probably doesn’t make most of the people who work for Google very happy, but which has been carefully thought through...

In launching Google.cn, Google made an interesting decision - they did not launch versions of Gmail or Blogger, both services where users create content. This helps Google escape situations like the one Yahoo faced when the Chinese government asked for information on Shi Tao, or when MSN pulled Michael Anti’s blog. This suggests to me that Google’s willing to sacrifice revenue and market share in exchange for minimizing situations where they’re asked to put Chinese users at risk of arrest or detention... This, in turn, gives me some cause for hope.

Rebecca MacKinnon ("Google in China: Degrees of Evil"):

At the end of the day, this compromise puts Google a little lower on the evil scale than many other internet companies in China. But is this compromise something Google should be proud of? No. They have put a foot further into the mud. Now let's see whether they get sucked in deeper or whether they end up holding their ground.

David Weinberger ("Google in China"):

If forced to choose — as Google has been — I'd probably do what Google is doing. It sucks, it stinks, but how would an information embargo help? It wouldn't apply pressure on the Chinese government. Chinese citizens would not be any more likely to rise up against the government because they don't have access to Google. Staying out of China would not lead to a more free China.

Doc Searls ("Doing Less Evil, Possibly"):

I believe constant engagement — conversation, if you will — with the Chinese government, beats picking up one's very large marbles and going home. Which seems to be the alternative.

Much as I hate to say it, this does seem to be the sensible position -- not unlike opposing America's embargo of Cuba. The logic goes that isolating Castro only serves to further isolate the Cuban people, whereas exposure to the rest of the world -- even restricted and filtered -- might, over time, loosen the state's monopoly on civic life. Of course, you might say that trading Castro for globalization is merely an exchange of one tyranny for another. But what is perhaps more interesting to ponder right now, in the wake of Google's decision, is the palpable melancholy felt in the comments above. What does it reveal about what we assume -- or used to assume -- about the internet and its relationship to politics and geography?

A favorite "what if" of recent history is what might have happened in the Soviet Union had it lasted into the internet age. Would the Kremlin have managed to secure its virtual borders? Or censor and filter the net into a state-controlled intranet -- a Union of Soviet Socialist Networks? Or would the decentralized nature of the technology, mixed with the cultural stirrings of glasnost, have toppled the totalitarian state from beneath?

Ten years ago, in the heady early days of the internet, most would probably have placed their bets against the Soviets. The Cold War was over. Some even speculated that history itself had ended, that free-market capitalism and democracy, on the wings of the information revolution, would usher in a long era of prosperity and peace. No borders. No limits.

jingjing_1.jpg chacha.jpg
"Jingjing" and "Chacha." Internet police officers from the city of Shenzhen who float over web pages and monitor the cyber-traffic of local users.

It's interesting now to see how exactly the opposite has occurred. Bubbles burst. Towers fell. History, as we now realize, did not end, it was merely on vacation; while the utopian vision of the internet -- as a placeless place removed from the inequities of the physical world -- has all but evaporated. We realize now that geography matters. Concrete features have begun to crystallize on this massive information plain: ports, gateways and customs houses erected, borders drawn. With each passing year, the internet comes more and more to resemble a map of the world.

Those of us tickled by the "what if" of the Soviet net now have ourselves a plausible answer in China, who, through a stunning feat of pipe control -- a combination of censoring filters, on-the-ground enforcement, and general peering over the shoulders of its citizens -- has managed to create a heavily restricted local net in its own image. Barely a decade after the fall of the Iron Curtain, we have the Great Firewall of China.

And as we've seen this week, and in several highly publicized instances over the past year, the virtual hand of the Chinese government has been substantially strengthened by Western technology companies willing to play by local rules so as not to be shut out of the explosive Chinese market. Tech giants like Google, Yahoo! , and Cisco Systems have proved only too willing to abide by China's censorship policies, blocking certain search returns and politically sensitive terms like "Taiwanese democracy," "multi-party elections" or "Falun Gong". They also specialize in precision bombing, sometimes removing the pages of specific users at the government's bidding. The most recent incident came just after New Year's when Microsoft acquiesced to government requests to shut down the My Space site of popular muckraking blogger Zhao Jing, aka Michael Anti.

MS_and_China.jpg
One of many angry responses that circulated the non-Chinese net in the days that followed.

We tend to forget that the virtual is built of physical stuff: wires, cable, fiber -- the pipes. Whoever controls those pipes, be it governments or telecomms, has the potential to control what passes through them. The result is that the internet comes in many flavors, depending in large part on where you are logging in. As Jack Goldsmith and Timothy Wu explain in an excellent article in Legal Affairs (adapted from their forthcoming book Who Controls the Internet? : Illusions of a Borderless World), China, far from being the boxed-in exception to an otherwise borderless net, is actually just the uglier side of a global reality. The net has been mapped out geographically into "a collection of nation-state networks," each with its own politics, social mores, and consumer appetites. The very same technology that enables Chinese authorities to write the rules of their local net enables companies around the world to target advertising and gear services toward local markets. Goldsmith and Wu:

...information does not want to be free. It wants to be labeled, organized, and filtered so that it can be searched, cross-referenced, and consumed....Geography turns out to be one of the most important ways to organize information on this medium that was supposed to destroy geography.

Who knows? When networked devices truly are ubiquitous and can pinpoint our location wherever we roam, the internet could be censored or tailored right down to the individual level (like the empire in Borges' fable that commissions a one-to-one map of its territory that upon completion perfectly covers every corresponding inch of land like a quilt).

The case of Google, while by no means unique, serves well to illustrate how threadbare the illusion of the borderless world has become. The company's famous credo, "don't be evil," just doesn't hold up in the messy, complicated real world. "Choose the lesser evil" might be more appropriate. Also crumbling upon contact with air is Google's famous mission, "to make the world's information universally accessible and useful," since, as we've learned, Google will actually vary the world's information depending on where in the world it operates.

Google may be behaving responsibly for a corporation, but it's still a corporation, and corporations, in spite of well-intentioned employees, some of whom may go to great lengths to steer their company onto the righteous path, are still ultimately built to do one thing: get ahead. Last week in the States, the get-ahead impulse happened to be consonant with our values. Not wanting to spook American users, Google chose to refuse a Dept. of Justice request for search records to aid its anti-pornography crackdown. But this week, not wanting to ruffle the Chinese government, Google compromised and became an agent of political repression. "Degrees of evil," as Rebecca MacKinnon put it.

The great irony is that technologies we romanticized as inherently anti-tyrannical have turned out to be powerful instruments of control, highly adaptable to local political realities, be they state or market-driven. Not only does the Chinese government use these technologies to suppress democracy, it does so with the help of its former Cold War adversary, America -- or rather, the corporations that in a globalized world are the de facto co-authors of American foreign policy. The internet is coming of age and with that comes the inevitable fall from innocence. Part of us desperately wanted to believe Google's silly slogans because they said something about the utopian promise of the net. But the net is part of the world, and the world is not so simple.

Posted by ben vershbow at 03:57 PM | Comments (3)
tags: ISP , Libraries, Search and the Web , Network_Freedom , broadband , capitalism , china , cyberspace , democracy , evil , falun_gong , free_speech , geography , globalization , glocalization , good , google , human_rights , search , spectrum , technology

cheney and google Post date  01.21.2006, 6:27 PM

(this is a follow-up to ben's recent post "the book is reading you."

i rarely read Maureen Dowd but the headline of her column in today's New York Times, "Googling past the Graveyard," caught my attention. Dowd calls Dick Cheney on the carpet for asking Google to release the search records of U.S. citizens. while i'm horrified that the govt. would even consider asking for such information, i'm concerned that the way this particular issue is playing out, Google is being portrayed as the poor beleaguered neutral entity caught between an over-reaching bureaucracy and its citizens. Cheney will expire eventually. in the meantime Google will collect even more data. Google is a very big corporation, who's power will grow over time. in the long run, why aren't people outraged that this information is in Google's hands in the first place. shouldn't we be?

Posted by bob stein at 06:27 PM | Comments (5)
tags: Libraries, Search and the Web , cheney , google , government , privacy

the book is reading you Post date  01.19.2006, 1:42 PM

I just noticed that Google Book Search requires users to be logged in on a Google account to view pages of copyrighted works.

google book search account.jpg

They provide the following explanation:

Why do I have to log in to see certain pages?

Because many of the books in Google Book Search are still under copyright, we limit the amount of a book that a user can see. In order to enforce these limits, we make some pages available only after you log in to an existing Google Account (such as a Gmail account) or create a new one. The aim of Google Book Search is to help you discover books, not read them cover to cover, so you may not be able to see every page you're interested in.

So they're tracking how much we've looked at and capping our number of page views. Presumably a bone tossed to publishers, who I'm sure will continue suing Google all the same (more on this here). There's also the possibility that publishers have requested information on who's looking at their books -- geographical breakdowns and stats on click-throughs to retailers and libraries. I doubt, though, that Google would share this sort of user data. Substantial privacy issues aside, that's valuable information they want to keep for themselves.

That's because "the aim of Google Book Search" is also to discover who you are. It's capturing your clickstreams, analyzing what you've searched and the terms you've used to get there. The book is reading you. Substantial privacy issues aside, (it seems more and more that's where we'll be leaving them) Google will use this data to refine Google's search algorithms and, who knows, might even develop some sort of personalized recommendation system similar to Amazon's -- you know, where the computer lists other titles that might interest you based on what you've read, bought or browsed in the past (a system that works only if you are logged in). It's possible Google is thinking of Book Search as the cornerstone of a larger venture that could compete with Amazon.

There are many ways Google could eventually capitalize on its books database -- that is, beyond the contextual advertising that is currently its main source of revenue. It might turn the scanned texts into readable editions, hammer out licensing agreements with publishers, and become the world's biggest ebook store. It could start a print-on-demand service -- a Xerox machine on steroids (and the return of Google Print?). It could work out deals with publishers to sell access to complete online editions -- a searchable text to go along with the physical book -- as Amazon announced it will do with its Upgrade service. Or it could start selling sections of books -- individual pages, chapters etc. -- as Amazon has also planned to do with its Pages program.

Amazon has long served as a valuable research tool for books in print, so much so that some university library systems are now emulating it. Recent additions to the Search Inside the Book program such as concordances, interlinked citations, and statistically improbable phrases (where distinctive terms in the book act as machine-generated tags) are especially fun to play with. Although first and foremost a retailer, Amazon feels more and more like a search system every day (and its A9 engine, though seemingly always on the back burner, is also developing some interesting features). On the flip side Google, though a search system, could start feeling more like a retailer. In either case, you'll have to log in first.

Posted by ben vershbow at 01:42 PM | Comments (5)
tags: Copyright and Copyleft , Libraries, Search and the Web , POD , amazon , books , e-commerce , e-publishing , ebooks , google , google_book_search , google_print , internet , print_on_demand , privacy , publishing , search , web

.tv Post date  01.09.2006, 6:15 PM

People have been talking about internet television for a while now. But Google and Yahoo's unveiling of their new video search and subscription services last week at the Consumer Electronics Show in Las Vegas seemed to make it real.

Sifting through the predictions and prophecies that subsequently poured forth, I stumbled on something sort of interesting -- a small concrete discovery that helped put some of this in perspective. Over the weekend, Slate Magazine quietly announced its partnership with "meaningoflife.tv," a web-based interview series hosted by Robert Wright, author of Nonzero and The Moral Animal, dealing with big questions at the perilous intersection of science and religion.

life_banner_mono.gif

Launched last fall (presumably in response to the intelligent design fracas), meaningoflife.tv is a web page featuring a playlist of video interviews with an intriguing roster of "cosmic thinkers" -- philosophers, scientists and religious types -- on such topics as "Direction in evolution," "Limits in science," and "The Godhead."

This is just one of several experiments in which Slate is fiddling with its text-to-media ratio. Today's Pictures, a collaboration with Magnum Photos, presents a daily gallery of images and audio-photo essays, recalling both the heyday of long-form photojournalism and a possible future of hybrid documentary forms. One problem is that it's not terribly easy to find these projects on Slate's site. The Magnum page has an ad tucked discretely on the sidebar, but meaningoflife.tv seems to have disappeared from the front page after a brief splash this weekend. For a born-digital publication that has always thought of itself in terms of the web, Slate still suffers from a pretty appalling design, with its small headline area capping a more or less undifferentiated stream of headlines and teasers.

Still, I'm intrigued by these collaborations, especially in light of the forecast TV-net convergence. While internet TV seems to promise fragmentation, these projects provide a comforting dose of coherence -- a strong editorial hand and a conscious effort to grapple with big ideas and issues, like the reassuringly nutritious programming of PBS or the BBC. It's interesting to see text-based publications moving now into the realm of television. As Tivo, on demand, and now, the internet atomize TV beyond recognition, perhaps magazines and newspapers will fill part of the void left by channels.

Limited as it may now seem, traditional broadcast TV can provide us with valuable cultural touchstones, common frames of reference that help us speak a common language about our culture. That's one thing I worry we'll lose as the net blows broadcast media apart. Then again, even in the age of five gazillion cable channels, we still have our water-cooler shows, our mega-hits, our television "events." And we'll probably have them on the internet too, even when "by appointment" television is long gone. We'll just have more choice regarding where, when and how we get at them. Perhaps the difference is that in an age of fragmentation, we view these touchstone programs with a mildly ironic awareness of their mainstream status, through the multiple lenses of our more idiosyncratic and infinitely gratified niche affiliations. They are islands of commonality in seas of specialization. And maybe that makes them all the more refreshing. Shows like "24," "American Idol," or a Ken Burns documentary, or major sporting events like the World Cup or the Olympics that draw us like prairie dogs out of our niches. Coming up for air from deep submersion in our self-tailored, optional worlds.

Posted by ben vershbow at 06:15 PM | Comments (6)
tags: Publishing, Broadcast, and the Press , TV , broadband , broadcast , documentary , google , internet , journalism , media , media_consumption , multimedia , network , photography , religion , science , slate , television , yahoo

useful rss Post date  01.04.2006, 1:58 PM

Hi. I'm Jesse, the latest member to join the staff here at the Institute. I'm interested in network effects, online communities, and emergent behavior. Right now I'm interested in the tools we have available to control and manipulate RSS feeds. My goal is to collect a wide variety of feeds and tease out the threads that are important to me. In my experience, mechanical aggregation gives you quantity and diversity, but not quality and focus. So I did a quick investigation of the tools that exist to manage and manipulate feeds.

Sites like MetaFilter and Technorati skim the most popular topics in the blogosphere. 62191942_5ef7f2ded3_m.jpgBut what sort of tools exist to help us narrow our focus? There are two tools that we can use right now: tag searches/filtering, and keyword searching. Tag searches (on Technorati) and tag filtering (on Metafilter) drill down to specific areas, like "books" or "books and publishing." A casual search on MetaFilter was a complete failure, but Technorati, with its combination of tags and keyword search results produced good material.

There is also the Google Blog search. As Google puts it, you can 'find blogs on your favorite topics.' PageRank works, so PageRank applied to blogs should work too. Unfortunately it results in too many pages that, while higher ranked in the whole set of the Internet, either fail to be on topic or exist outside of the desired sub-spheres of a topic. For example, I searched for "gourmet food" and found one of the premier food blogs on the fourth page, just below Carpundit. Google blog search fails here because it can't get small enough to understand the relationships in the blogosphere, and relies more heavily on text retrieval algorithms that sabotage the results.

Finally, let's talk about aggregators. There is more human involvement in selecting sites you're interested in reading. This creates a personalized network of sites that are related, if only by your personal interest. The problem is, you get what they want to write about. Managing a large collection of feeds can be tiresome when you're looking for specific information. Bloglines has a search function that allows you to find keywords inside your subscriptions, then treat that as a feed. This neatly combines hand-picked sources with keyword or tag harvesting. The result: a slice of from your trusted collection of authors about a specific topic.

What can we envision for the future of RSS? Affinity mapping and personalized recommendation systems could augment the tag/keyword search functionality to automatically generate a slice from a small network of trusted blogs. Automatic harvesting of whole swaths of linked entries for offline reading in a bounded hypertext environment. Reposting and remixing feed content on the fly based on text-processing algorithms. And we'll have to deal with the dissolving identity and trust relationships that are a natural consequence of these innovations.

Posted by jesse wilbur at 01:58 PM | Comments (5)
tags: RSS , aggregators , blog_search , bloglines , google , tools

Wikipedia to consider advertising Post date  12.30.2005, 4:29 PM

The London Times just published an interview with Wikipedia founder Jimmy Wales in which he entertains the jimmywales.jpgidea of carrying ads. This mention is likely to generate an avalanche of discussion about the commercialization of open-source resources. While i would love to see Wikipedia stay out of the commercial realm, it's just not likely. Yahoo, Google and other big companies are going to commercialize Wikipedia anyway so taking ads is likely to end up a no-brainer. As i mentioned in my comment on Lisa's earlier post, this is going to happen as long as the overall context is defined by capitalist relations. Presuming that the web can be developed in a cooperative, non-capitalist way without fierce competition and push-back from the corporations who control the web's infrastructure seems naive to me.

Posted by bob stein at 04:29 PM | Comments (1)
tags: advertising , capitalism , google , open_content , open_source , wikipedia , yahoo

why google and yahoo love wikipedia Post date  12.29.2005, 3:16 PM

wikipedia.png From Dan Cohen's excellent Digital Humanities Blog comes a discussion of the Wikipedia story that Cohen claims no one seems to be writing about — namely, the question of why Google and Yahoo give so much free server space and bandwith to Wikipedia. Cohen points out that there's more going on here than just the open source ethos of these tech companies: in fact, the two companies are becoming increasingly dependent on Wikipedia as a resource, both as something to repackage for commercial use (in sites such as Answers.com), and as a major component in the programming of search algorithms. Cohen writes:

Let me provide a brief example that I hope will show the value of having such a free resource when you are trying to scan, sort, and mine enormous corpora of text. Let's say you have a billion unstructured, untagged, unsorted documents related to the American presidency in the last twenty years. How would you differentiate between documents that were about George H. W. Bush (Sr.) and George W. Bush (Jr.)? This is a tough information retrieval problem because both presidents are often referred to as just "George Bush" or "Bush." Using data-mining algorithms such as Yahoo's remarkable Term Extraction service, you could pull out of the Wikipedia entries for the two Bushes the most common words and phrases that were likely to show up in documents about each (e.g., "Berlin Wall" and "Barbara" vs. "September 11" and "Laura"). You would still run into some disambiguation problems ("Saddam Hussein," "Iraq," "Dick Cheney" would show up a lot for both), but this method is actually quite a powerful start to document categorization.

Cohen's observation is a valuable reminder that all of the discussion of Wikipedia's accuracy and usefulness as an academic tool is really only skimming the surface of how and why the open-souce encyclopedia is reshaping the way knowledge is made and accessed. Ultimately, the question of whether or not Wikipedia should be used in the classroom might be less important than whether — or how — it is used in the boardroom, by companies whose function is to repackage, reorganize and return "the people's knowledge" back to the people at a tidy profit.

Posted by lisa lynch at 03:16 PM | Comments (7)
tags: Libraries, Search and the Web , google , wikipedia , yahoo

last week: wikipedia, r kelly, gaming and google panels, and more... Post date  12.18.2005, 4:27 PM

Here's an overview of what we've been posting over the last week. As well, a few of us having been talking about ways to graphically represent text, so I thought I would include a mind map of this overview.

wrapup_sm.jpg


As a follow up to the increasingly controversial wikipedia front, Daniel Brandt uncovered that Brian Chase posted false information about John Seignthaler that was reported here last week. To add fuel to the fire, Nature weighed in that Encyclopedia Britannica may not be as reliable as Wikipedia.

Business Week noted a possible future of pricing for data transfer. Currently, carries such as phone and cable companies are developing technology to identify and control what types of media (voice, images, text or video) are being uploaded. This ability opens the door to being able to charge for different uses of data transfer, which would have a huge impact on uploading content for personal creative use of the internet.

Liz Barry and Bill Wetzel shared some of their experiences from their "Talk to Me" Project. With their "talk to me" sign in tow, they travel around New York and the rest of the US looking for conversation. We were impressed at how they do not have a specific agenda besides talking to people. In the mediated age, they are not motivated by external political/ religious/ documentary intentions. What they do document is available on their website, and we look forward to see what they come up with next.

The Google Book Search debate continues as well, via a panel discussion hosted by the American Bar Association. Interestingly, publishers spoke as if the wide scale use of ebooks is imminent. More importantly and even if this particular case settles out of court, the courts have a pressing need to define copyright and fair use guidelines for these emerging uses.

With the protest of the WTO meetings in Hong Kong this past week, new journalism forms took one step forward. The website Curbside @ WTO covered the meetings with submissions from journalism students, bloggers and professional journalists.

McDonalds filed a patent which suggests that it intends to offer clips of movies instead of the traditional toys in their kids oriented Happy Meals. Lisa pondered if a video clip can successfully replace a toy, and if it does, what the effects on children's imaginations might be.

R. Kelly's experiments in form and the "serial song" through his Trapped in the Closet recordings. While R Kelly has varying success in this endeavor, Dan compared the experience of not only the serial novel, but also Julie Powell's foray into transferring her blog into book form and what she might have learned from R. Kelly (its hard to make unified pieces maintain an overall coherency.)

The world of academic publishing was challenged with a proposal calling to create an electronic academic press. This segment seems especially ripe for the shift to digital publishing as many journals with small circulations face raising printing and production costs.

Sol and others from the institute attended "Making Games Matter," a panel with contributors from The Game Design Reader: A Rules of Play Anthology, edited by Katie Salen and Eric Zimmerman. The discussion covered among other things: involving the academy in creating a discourse for gaming and game design, obstacles in studying and creating games, and the game "industry" itself. The book and panel called out for games and gaming to undergo a formal study akin to the novel and the experience of reading. Also, in the gaming world, the class economics of the real and virtual began to emerge as a Chinese firm pays employees to build up characters in MMOGs to sell to affluent gamers.

Posted by ray cha at 04:27 PM | Comments (0)
tags: Roundup , academia , broadband , e-publishing , fast_food , gaming , google , google_book_search , internet , mcdonalds , network_neutrality , publishing , r_kelly , video_games , wikipedia

google book search debated at american bar association Post date  12.15.2005, 3:50 PM

Last night I attended a fascinating panel discussion at the American Bar Association on the legality of Google Book Search. In many ways, this was the debate made flesh. Making the case against Google were high-level representatives from the two entities that have brought suit, the Authors' Guild (Executive Director Paul Aiken) and the Association of American Publishers (VP for legal counsel Allan Adler). It would have been exciting if Google, in turn, had sent representatives to make their case, but instead we had two independent commentators, law professor and blogger Susan Crawford and Cameron Stracher, also a law professor and writer. The discussion was vigorous, at times heated -- in many ways a preview of arguments that could eventually be aired (albeit under a much stricter clock) in front of federal judges.

The lawsuits in question center around whether Google's scanning of books and presenting tiny snippet quotations online for keyword searches is, as they claim, fair use. As I understand it, the use in question is the initial scanning of full texts of copyrighted books held in the collections of partner libraries. The fair use defense hinges on this initial full scan being the necessary first step before the "transformative" use of the texts, namely unbundling the book into snippets generated on the fly in response to user search queries.

google snippets.jpg
...in case you were wondering what snippets look like

At first, the conversation remained focused on this question, and during that time it seemed that Google was winning the debate. The plaintiffs' arguments seemed weak and a little desperate. Aiken used carefully scripted language about not being against online book search, just wanting it to be licensed, quipping "we're just throwing a little gravel in the gearbox of progress." Adler was a little more strident, calling Google "the master of misdirection," using the promise of technological dazzlement to turn public opinion against the legitimate grievances of publishers (of course, this will be settled by judges, not by public opinion). He did score one good point, though, saying Google has betrayed the weakness of its fair use claim in the way it has continually revised its description of the program.

Almost exactly one year ago, Google unveiled its "library initiative" only to re-brand it several months later as a "publisher program" following a wave of negative press. This, however, did little to ease tensions and eventually Google decided to halt all book scanning (until this past November) while they tried to smooth things over with the publishers. Even so, lawsuits were filed, despite Google's offer of an "opt-out" option for publishers, allowing them to request that certain titles not be included in the search index. This more or less created an analog to the "implied consent" principle that legitimates search engines caching web pages with "spider" programs that crawl the net looking for new material.

In that case, there is a machine-to-machine communication taking place and web page owners are free to insert programs that instruct spiders not to cache, or can simply place certain content behind a firewall. By offering an "opt-out" option to publishers, Google enables essentially the same sort of communication. Adler's point (and this was echoed more succinctly by a smart question from the audience) was that if Google's fair use claim is so air-tight, then why offer this middle ground? Why all these efforts to mollify publishers without actually negotiating a license? (I am definitely concerned that Google's efforts to quell what probably should have been an anticipated negative reaction from the publishing industry will end up undercutting its legal position.)

Crawford came back with some nice points, most significantly that the publishers were trying to make a pretty egregious "double dip" into the value of their books. Google, by creating a searchable digital index of book texts -- "a card catalogue on steroids," as she put it -- and even generating revenue by placing ads alongside search results, is making a transformative use of the published material and should not have to seek permission. Google had a good idea. And it is an eminently fair use.

And it's not Google's idea alone, they just had it first and are using it to gain a competitive advantage over their search engine rivals, who in their turn, have tried to get in on the game with the Open Content Alliance (which, incidentally, has decided not to make a stand on fair use as Google has, and are doing all their scanning and indexing in the context of license agreements). Publishers, too, are welcome to build their own databases and to make them crawl-able by search engines. Earlier this week, Harper Collins announced it would be doing exactly that with about 20,000 of its titles. Aiken and Adler say that if anyone can scan books and make a search engine, then all hell will break loose and millions of digital copies will be leaked into the web. Crawford shot back that this lawsuit is not about net security issues, it is about fair use.

But once the security cat was let out of the bag, the room turned noticeably against Google (perhaps due to a preponderance of publishing lawyers in the audience). Aiken and Adler worked hard to stir up anxiety about rampant ebook piracy, even as Crawford repeatedly tried to keep the discussion on course. It was very interesting to hear, right from the horse's mouth, that the Authors' Guild and AAP both are convinced that the ebook market, tiny as it currently is, is within a few years of exploding, pending the release of some sort of ipod-like gadget for text. At that point, they say, Google will have gained a huge strategic advantage off the back of appropriated content.

Their argument hinges on the fourth determining factor in the fair use exception, which evaluates "the effect of the use upon the potential market for or value of the copyrighted work." So the publishers are suing because Google might be cornering a potential market!!! (Crawford goes further into this in her wrap-up) Of course, if Google wanted to go into the ebook business using the material in their database, there would have to be a licensing agreement, otherwise they really would be pirating. But the suits are not about a future market, they are about creating a search service, which should be ruled fair use. If publishers are so worried about the future ebook market, then they should start planning for business.

To echo Crawford, I sincerely hope these cases reach the court and are not settled beforehand. Larger concerns about Google's expansionist program aside, I think they have made a very brave stand on the principle of fair use, the essential breathing space carved out within our over-extended copyright laws. Crawford reminded the room that intellectual property is NOT like physical property, over which the owner has nearly unlimited rights. Copyright is a "temporary statutory monopoly" originally granted ("with hesitation," Crawford adds) in order to incentivize creative expression and the production of ideas. The internet scares the old-guard publishing industry because it poses so many threats to the security of their product. These threats are certainly significant, but they are not the subject of these lawsuits, nor are they Google's, or any search engine's, fault. The rise of the net should not become a pretext for limiting or abolishing fair use.

Posted by ben vershbow at 03:50 PM | Comments (2)
tags: Copyright and Copyleft , Libraries, Search and the Web , copyright , ebooks , fair_use , google , google_book_search , publishing

where we've been, where we're going Post date  12.09.2005, 12:54 PM

Roundup-weed5L.gif

This past week at if:book we've been thinking a lot about the relationship between this weblog and the work we do. We decided that while if:book has done a fine job reflecting and provoking the conversations we have at the Institute, we wanted to make sure that it also seems as coherent to our readers as it does to us. With that in mind, we've decided to begin posting a weekly roundup of our blog posts, in which we synthesize (as much a possible) what we've been thinking and talking about from Monday to Friday.

So here goes. This week we spent a lot of time reflecting on simulation and virtuality. In part, this reflection grew out of our collective reading of a Tom Zengotita's book Mediated, which discusses (among other things) the link between alienation from the "real" through digital mediation and increased solipsism. Bob seemed especially interested in the dialectic relationship between, on one hand, the opportunity for access afforded by ever-more sophisticated form of simulation, and, on the other, the sense that something must be lost when as the encounter with the "real" recedes entirely.

This, in turn, led to further conversation about what we might think of as the "loss of the real" in the transition from books on paper to books on a computer screen. On one hand, there seems to be a tremendous amount of anxiety that Google Book Search might somehow make actual books irrelevant and thus destroy reading and writing practices linked to the bound book. On the other hand, one could take the position of Cory Doctorow that books as objects are overrated, and challenge the idea that a book needs to be digitally embodied to be "real."

As the debate over Google Book Search continually reminds us, one of the most challenging things in sifting through discussions of emerging media forms is learning to tell the difference between nostalgia and useful critical insight. Often the two are hopelessly intertwined; in this week's debates about Wikipedia, for example, discussion of how to make the open-source encyclopedia more useful was often tempered by the suggestion that encyclopedias of the past were always be superior to Wikipedia, an assertion easily challenged by a quick browse through some old encyclopedias.

Finally, I want to mention that we finally got around to setting up a del.icio.us account. There will be a formal link on the blog up soon, but you can take a look now. It will expand quickly.

Posted by lisa lynch at 12:54 PM | Comments (0)
tags: Roundup , book , google , search , simulation , wikipedia

google libraries podcast now available Post date  12.07.2005, 11:33 AM

In case you missed Open Source's Monday hour on Google Book Search... Listen here. Podcast RSS here. Show summary here.

Posted by ben vershbow at 11:33 AM | Comments (1)
tags: Libraries, Search and the Web , ebook , google , google_book_search , google_print , library , podcast , publishing

google on the air Post date  12.06.2005, 12:34 AM

librarybrazil.jpg

Open Source's hour on the Googlization of libraries was refreshingly light on the copyright issue and heavier on questions about research, reading, the value of libraries, and the public interest. With its book-scanning project, Google is a private company taking on the responsibilities of a public utility, and Siva Vaidhyanathan came down hard on one of the company's chief legal reps for the mystery shrouding their operations (scanning technology, algorithms and ranking system are all kept secret). The rep reasonably replied that Google is not the only digitization project in town and that none of its library partnerships are exclusive. But most of his points were pretty obvious PR boilerplate about Google's altruism and gosh darn love of books. Hearing the counsel's slick defense, your gut tells you it's right to be suspicious of Google and to keep demanding more transparency, clearer privacy standards and so on. If we're going to let this much information come into the hands of one corporation, we need to be very active watchdogs.

Our friend Karen Schneider then joined the fray and as usual brought her sage librarian's perspective. She's thrilled by the possibilities of Google Book Search, seeing as it solves the fundamental problem of library science: that you can only search the metadata, not the texts themselves. But her enthusiasm is tempered by concerns about privatization similar to Siva's and a conviction that a research service like Google can never replace good librarianship and good physical libraries. She also took issue with the fact that Book Search doesn't link to other library-related search services like Open Worldcat. She has her own wrap-up of the show on her blog.

Rounding out the discussion was Matthew G. Kirschenbaum, a cybertext studies blogger and professor of english at the University of Maryland. Kirschenbaum addressed the question of how Google, and the web in general, might be changing, possibly eroding, our reading practices. He nicely put the question in perspective, suggesting that scattershot, inter-textual, "snippety" reading is in fact the older kind of reading, and that the idea of sustained, deeply immersed involvement with a single text is largely a romantic notion tied to the rise of the novel in the 18th century.

A satisfying hour, all in all, of the sort we should be having more often. It was fun brainstorming with Brendan Greeley, the Open Source on "blogger-in-chief," on how to put the show together. Their whole bit about reaching out to the blogosphere for ideas and inspiration isn't just talk. They put their money where their mouth is. I'll link to the podcast when it becomes available.

image: Real Gabinete Português de Literatura, Rio de Janeiro - Claudio Lara via Flickr

Posted by ben vershbow at 12:34 AM | Comments (2)
tags: Libraries, Search and the Web , copyright , digitization , ebook , google , google_book_search , google_print , library , literature , metadata , reading , search

thinking about google books: tonight at 7 on radio open source Post date  12.05.2005, 4:58 PM

While visiting the Experimental Television Center in upstate New York this past weekend, Lisa found a wonderful relic in a used book shop in Owego, NY -- a small, leatherbound volume from 1962 entitled "Computers," which IBM used to give out as a complimentary item. An introductory note on the opening page reads:

The machines do not think -- but they are one of the greatest aids to the men who do think ever invented! Calculations which would take men thousands of hours -- sometimes thousands of years -- to perform can be handled in moments, freeing scientists, technicians, engineers, businessmen, and strategists to think about using the results.

This echoes Vannevar Bush's seminal 1945 essay on computing and networked knowledge, "As We May Think", which more or less prefigured the internet, web search, and now, the migration of print libraries to the world wide web. Google Book Search opens up fantastic possibilities for research and accessibility, enabling readers to find in seconds what before might have taken them hours, days or weeks. Yet it also promises to transform the very way we conceive of books and libraries, shaking the foundations of major institutions. Will making books searchable online give us more time to think about the results of our research, or will it change the entire way we think? By putting whole books online do we begin the steady process of disintegrating the idea of the book as a bounded whole and not just a sequence of text in a massive database?

The debate thus far has focused too much on the legal ramifications -- helped in part by a couple of high-profile lawsuits from authors and publishers -- failing to take into consideration the larger cognitive, cultural and institutional questions. Those questions will hopefully be given ample air time tonight on Radio Open Source.

Tune in at 7pm ET on local public radio or stream live over the web. The show will also be available later in the week as a podcast.

Posted by ben vershbow at 04:58 PM | Comments (0)
tags: Libraries, Search and the Web , books , copyright , ebook , google , google_book_search , google_print , library , literature , radio , research , university

the role of note taking in the information age Post date  12.03.2005, 3:19 PM

An article by Ann Blair in a recent issue of Critical Inquiry (vol 31 no 1) discusses the changing conceptions of the function of note-taking from about the sixth century to the present, and ends with a speculation on the way that textual searches (such as Google Book Search) might change practices of note-taking in the twenty-first century. Blair argues that "one of the most significant shifts in the history of note taking" occured in the beginning of the twentieth century, when the use of notes as memorization aids gave way to the use of notes as a aid to replace the memorization of too-abundant information. With the advent of the net, she notes:

Today we delegate to sources that we consider authoritative the extraction of information on all but a few carefully specialized areas in which we cultivate direct experience and original research. New technologies increasingly enable us to delegate more tasks of remembering to the computer, in that shifting division of labor between human and thing. We have thus mechanized many research tasks. It is possible that further changes would affect even the existence of note taking. At a theoretical extreme, for example, if every text one wanted were constantly available for searching anew, perhaps the note itself, the selection made for later reuse, might play a less prominent role.

The result of this externalization, Blair notes, is that we come to think of long-term memory as something that is stored elsewhere, in "media outside the mind." At the same time, she writes, "notes must be rememorated or absorbed in the short-term memory at least enough to be intelligently integrated into an argument; judgment can only be applied to experiences that are present to the mind."

Blair's article doesn't say that this bifurcation between short-term and long-term memory is a problem: she simply observes it as a phenomenon. But there's a resonance between Blair's article and Naomi Baron's recent Los Angeles Times piece on Google Book Search: both point to the fact that what we commonly have defined as scholarly reflection has increasingly become more and more a process of database management. Baron seems to see reflection and database management as being in tension, though I'm not completely convinced by her argument. Blair, less apocalyptic than Baron, nonetheless gives me something to ponder. What happens to us if (or when) all of our efforts to make the contents of our extrasomatic memory "present to our mind" happen without the mediation of notes? Blair's piece focuses on the epistemology rather than the phenomenology of note taking — still, she leads me to wonder what happens if the mediating function of the note is lost, when the triangular relation between book, scholar and note becomes a relation between database and user.

Posted by lisa lynch at 03:19 PM | Comments (1)
tags: Libraries, Search and the Web , book , google , internet , note_taking , search

google print on deck at radio open source Post date  12.01.2005, 8:07 AM

Open Source, the excellent public radio program (not to be confused with "Open Source Media") that taps into the blogosphere to generate its shows, has been chatting with me about putting together an hour on the Google library project. Open Source is a unique hybrid, drawing on the best qualities of the blogosphere -- community, transparency, collective wisdom -- to produce an otherwise traditional program of smart talk radio. As host Christopher Lydon puts it, the show is "fused at the brain stem with the world wide web." Or better, it "uses the internet to be a show about the world."

The Google show is set to air live this evening at 7pm (ET) (they also podcast). It's been fun working with them behind the scenes, trying to figure out the right guests and questions for the ideal discussion on Google and its bookish ambitions. My exchange has been with Brendan Greeley, the Radio Open Source "blogger-in-chief" (he's kindly linked to us today on their site). We agreed that the show should avoid getting mired in the usual copyright-focused news peg -- publishers vs. Google etc. -- and focus instead on the bigger questions. At my suggestion, they've invited Siva Vaidhyanathan, who wrote the wonderful piece in the Chronicle of Higher Ed. that I talked about yesterday (see bigger questions). I've also recommended our favorite blogger-librarian, Karen Schneider (who has appeared on the show before), science historian George Dyson, who recently wrote a fascinating essay on Google and artificial intelligence, and a bunch of cybertext studies people: Matthew G. Kirschenbaum, N. Katherine Hayles, Jerome McGann and Johanna Drucker. If all goes well, this could end up being a very interesting hour of discussion. Stay tuned.

UPDATE: Open Source just got a hold of Nicholas Kristof to do an hour this evening on Genocide in Sudan, so the Google piece will be pushed to next week.

Posted by ben vershbow at 08:07 AM | Comments (0)
tags: Libraries, Search and the Web , Online , copyright , google , google_book_search , google_print , library , open_source , podcast , publishing , radio , radio_open_source , search , web

sober thoughts on google: privatization and privacy Post date  11.30.2005, 8:18 AM

nypl reading room.jpg

Siva Vaidhyanathan has written an excellent essay for the Chronicle of Higher Education on the "risky gamble" of Google's book-scanning project -- some of the most measured, carefully considered comments I've yet seen on the issue. His concerns are not so much for the authors and publishers that have filed suit (on the contrary, he believes they are likely to benefit from Google's service), but for the general public and the future of libraries. Outsourcing to a private company the vital task of digitizing collections may prove to have been a grave mistake on the part of Google's partner libraries. Siva:

The long-term risk of privatization is simple: Companies change and fail. Libraries and universities last.....Libraries should not be relinquishing their core duties to private corporations for the sake of expediency. Whichever side wins in court, we as a culture have lost sight of the ways that human beings, archives, indexes, and institutions interact to generate, preserve, revise, and distribute knowledge. We have become obsessed with seeing everything in the universe as "information" to be linked and ranked. We have focused on quantity and convenience at the expense of the richness and serendipity of the full library experience. We are making a tremendous mistake.

This essay contains in abundance what has largely been missing from the Google books debate: intellectual courage. Vaidhyanathan, an intellectual property scholar and "avowed open-source, open-access advocate," easily could have gone the predictable route of scolding the copyright conservatives and spreading the Google gospel. But he manages to see the big picture beyond the intellectual property concerns. This is not just about economics, it's about knowledge and the public interest.

What irks me about the usual debate is that it forces you into a position of either resisting Google or being its apologist. But this fails to get at the real bind we all are in: the fact that Google provides invaluable services and yet is amassing too much power; that a private company is creating a monopoly on public information services. Sooner or later, there is bound to be a conflict of interest. That is where we, the Google-addicted public, are caught. It's more complicated than hip versus square, or good versus evil.

Here's another good piece on Google. On Monday, The New York Times ran an editorial by Adam Cohen that nicely lays out the privacy concerns:

Google says it needs the data it keeps to improve its technology, but it is doubtful it needs so much personally identifiable information. Of course, this sort of data is enormously valuable for marketing. The whole idea of "Don't be evil," though, is resisting lucrative business opportunities when they are wrong. Google should develop an overarching privacy theory that is as bold as its mission to make the world's information accessible - one that can become a model for the online world. Google is not necessarily worse than other Internet companies when it comes to privacy. But it should be doing better.

original google.jpg Two graduate students in Stanford in the mid-90s recognized that search engines would the most important tools for dealing with the incredible flood of information that was then beginning to swell, so they started indexing web pages and working on algorithms. But as the company has grown, Google's admirable-sounding mission statement -- "to organize the world's information and make it universally accessible and useful" -- has become its manifest destiny, and "information" can now encompass the most private of territories.

At one point it simply meant search results -- the answers to our questions. But now it's the questions as well. Google is keeping a meticulous record of our clickstreams, piecing together an enormous database of queries, refining its search algorithms and, some say, even building a massive artificial brain (more on that later). What else might they do with all this personal information? To date, all of Google's services are free, but there may be a hidden cost.

"Don't be evil" may be the company motto, but with its IPO earlier this year, Google adopted a new ideology: they are now a public corporation. If web advertising (their sole source of revenue) levels off, then investors currently high on $400+ shares will start clamoring for Google to maintain profits. "Don't be evil to us!" they will cry. And what will Google do then?

images: New York Public Library reading room by Kalloosh via Flickr; archive of the original Google page

Posted by ben vershbow at 08:18 AM | Comments (7)
tags: Copyright and Copyleft , Libraries, Search and the Web , books , copyright , ethics , google , google_book_search , google_print , intellectual_property , libraries , library , literature , privacy , publishing , university

flushing the net down the tubes Post date  11.29.2005, 8:11 AM

Grand theories about upheavals on the internet horizon are in ready supply. Singularities are near. Explosions can be expected in the next six to eight months. Or the whole thing might just get "flushed" down the tubes. This last scenario is described at length in a recent essay in Linux Journal by Doc Searls, which predicts the imminent hijacking of the net by phone and cable companies who will turn it into a top-down, one-way broadcast medium. In other words, the net's utopian moment, the "read/write" web, may be about to end. Reading Searls' piece, I couldn't help thinking about the story of radio and a wonderful essay Brecht wrote on the subject in 1932:

brecht-foto.jpg

Here is a positive suggestion: change this apparatus over from distribution to communication. The radio would be the finest possible communication apparatus in public life, a vast network of pipes. That is to say, it would be if it knew how to receive as well as to transmit, how to let the listener speak as well as hear, how to bring him into a relationship instead of isolating him. On this principle the radio should step out of the supply business and organize its listeners as suppliers....turning the audience not only into pupils but into teachers.

Unless you're the military, law enforcement, or a short-wave hobbyist, two-way radio never happened. On the mainstream commercial front, radio has always been about broadcast: a one-way funnel. The big FM tower to the many receivers, "prettifying public life," as Brecht puts it. Radio as an agitation? As an invitation to a debate, rousing families from the dinner table into a critical encounter with their world? Well, that would have been neat.

Now there's the internet, a two-way, every-which-way medium -- a stage of stages -- that would have positively staggered a provocateur like Brecht. But although the net may be a virtual place, it's built on some pretty actual stuff. Copper wire, fiber optic cable, trunks, routers, packets -- "the vast network of pipes." The pipes are owned by the phone and cable companies -- the broadband providers -- and these guys expect a big return (far bigger than they're getting now) on the billions they've invested in laying down the plumbing. Searls:

The choke points are in the pipes, the permission is coming from the lawmakers and regulators, and the choking will be done....The carriers are going to lobby for the laws and regulations they need, and they're going to do the deals they need to do. The new system will be theirs, not ours....The new carrier-based Net will work in the same asymmetrical few-to-many, top-down pyramidal way made familiar by TV, radio, newspapers, books, magazines and other Industrial Age media now being sucked into Information Age pipes. Movement still will go from producers to consumers, just like it always did.

If Brecht were around today I'm sure he would have already written (or blogged) to this effect, no doubt reciting the sad fate of radio as a cautionary tale. Watch the pipes, he would say. If companies talk about "broad" as in "broadband," make sure they're talking about both ends of the pipe. The way broadband works today, the pipe running into your house dwarfs the one running out. That means more download and less upload, and it's paving the way for a content delivery platform every bit as powerful as cable on an infinitely broader band. Data storage, domain hosting -- anything you put up there -- will be increasingly costly, though there will likely remain plenty of chat space and web mail provided for free, anything that allows consumers to fire their enthusiasm for commodities through the synapse chain.

rad30cathedral10.jpg If the net goes the way of radio, that will be the difference (allow me to indulge in a little dystopia). Imagine a classic Philco cathedral radio but with a few little funnel-ended hoses extending from the side that connect you to other listeners. "Tune into this frequency!" "You gotta hear this!" You whisper recommendations through the tube. It's sending a link. Viral marketing. Yes, the net will remain two-way to the extent that it helps fuel the market. Web browsers, like the old Philco, would essentially be receivers, enabling participation only to the extent that it encouraged others to receive.

You might even get your blog hosted for free if you promote products -- a sports shoe with gelatinous heels or a music video that allows you to undress the dancing girls with your mouse. Throw in some political rants in between to blow off some steam, no problem. That's entrepreneurial consumerism. Make a living out of your appetites and your ability to make them infectious. Hip recommenders can build a cosy little livelihood out of their endorsements. But any non-consumer activity will be more like amateur short-wave radio: a mildly eccentric (and expensive) hobby (and they'll even make a saccharine movie about a guy communing with his dead firefighter dad through a ghost blog).

Searls sees it as above all a war of language and metaphor. The phone and cable companies will dominate as long as the internet is understood fundamentally as a network of pipes, a kind of information transport system. This places the carriers at the top of the hierarchy -- the highway authority setting the rules of the road and collecting the tolls. So far the carriers have managed, through various regulatory wrangling and court rulings, to ensure that the "transport metaphor" has prevailed.

But obviously the net is much more than the sum of its pipes. It's a public square. It's a community center. It's a market. And it's the biggest publishing system the world has ever known. Searls wants to promote "place metaphors" like these. Sure, unless you're a lobbyist for Verizon or SBC, you probably already think of it this way. But in the end it's the lobbyists that will make all the difference. Unless, that is, an enlightened citizens' lobby begins making some noise. So a broad, broad as in broadband, public conversation should be in order. Far broader than what goes on in the usual progressive online feedback loops -- the Linux and open source communities, the creative commies, and the techno-hip blogosphere, that I'm sure are already in agreement about this.

Google also seems to have an eye on the pipes, reportedly having bought thousands of miles of "dark fiber" -- pipe that has been laid but is not yet in use. Some predict a nationwide "Googlenet." But this can of worms is best saved for another post.

Posted by ben vershbow at 08:11 AM | Comments (2)
tags: Network_Freedom , Publishing, Broadcast, and the Press , brecht , broadband , broadcast , cable , fiber , google , internet , linux , media , net , radio , short_wave , telecom , telephone , tubes , utopia , verizon , web

virtual libraries, real ones, empires Post date  11.28.2005, 12:36 PM

Handsworth readers.jpg Last Tuesday, a Washington Post editorial written by Library of Congress librarian James Billington outlined the possible benefits of a World Digital Library, a proposed LOC endeavor discussed last week in a post by Ben Vershbow. Billington seemed to imagine the library as sort of a United Nations of information: claiming that "deep conflict between cultures is fired up rather than cooled down by this revolution in communications," he argued that a US-sponsored, globally inclusive digital library could serve to promote harmony over conflict:

Libraries are inherently islands of freedom and antidotes to fanaticism. They are temples of pluralism where books that contradict one another stand peacefully side by side just as intellectual antagonists work peacefully next to each other in reading rooms. It is legitimate and in our nation's interest that the new technology be used internationally, both by the private sector to promote economic enterprise and by the public sector to promote democratic institutions. But it is also necessary that America have a more inclusive foreign cultural policy -- and not just to blunt charges that we are insensitive cultural imperialists. We have an opportunity and an obligation to form a private-public partnership to use this new technology to celebrate the cultural variety of the world.

What's interesting about this quote (among other things) is that Billington seems to be suggesting that a World Digital Library would function in much the same manner as a real-world library, and yet he's also arguing for the importance of actual physical proximity. He writes, after all, about books literally, not virtually, touching each other, and about researchers meeting up in a shared reading room. There seems to be a tension here, in other words, between Billington's embrace of the idea of a world digital library, and a real anxiety about what a "library" becomes when it goes online.

I also feel like there's some tension here — in Billington's editorial and in the whole World Digital Library project — between "inclusiveness" and "imperialism." Granted, if the United States provides Brazilians access to their own national literature online, this might be used by some as an argument against the idea that we are "insensitive cultural imperialists." But there are many varieties of empire: indeed, as many have noted, the sun stopped setting on Google's empire a while ago.

To be clear, I'm not attacking the idea of the World Digital Library. Having watch the Smithsonian invest in, and waffle on, some of their digital projects, I'm all for a sustained commitment to putting more material online. But there needs to be some careful consideration of the differences between online libraries and virtual ones — as well as a bit more discussion of just what a privately-funded digital library might eventually morph into.

Posted by lisa lynch at 12:36 PM | Comments (0)
tags: Libraries, Search and the Web , cultural , digital , google , imperialism , internet , libraries

explosion Post date  11.22.2005, 2:10 PM

250px-Nuclear_fireball.jpg A Nov. 18 post on Adam Green's Darwinian Web makes the claim that the web will "explode" (does he mean implode?) over the next year. According to Green, RSS feeds will render many websites obsolete:

The explosion I am talking about is the shifting of a website's content from internal to external. Instead of a website being a "place" where data "is" and other sites "point" to, a website will be a source of data that is in many external databases, including Google. Why "go" to a website when all of its content has already been absorbed and remixed into the collective datastream.

Does anyone agree with Green? Will feeds bring about the restructuring of "the way content is distributed, valued and consumed?" More on this here.

Posted by lisa lynch at 02:10 PM | Comments (5)
tags: Libraries, Search and the Web , Online , Publishing, Broadcast, and the Press , RSS , blogging , blogs , darwin , darwinism , google , internet , singularity , syndication , web , xml

world digital library Post date  11.22.2005, 7:41 AM

library of congress.jpg The Library of Congress has announced plans for the creation of a World Digital Library, "a shared global undertaking" that will make a major chunk of its collection freely available online, along with contributions from other national libraries around the world. From The Washington Post:

...[the] goal is to bring together materials from the United States and Europe with precious items from Islamic nations stretching from Indonesia through Central and West Africa, as well as important materials from collections in East and South Asia.

Google has stepped forward as the first corporate donor, pledging $3 million to help get operations underway. At this point, there doesn't appear to be any direct connection to Google's Book Search program, though Google has been working with LOC to test and refine its book-scanning technology.

Posted by ben vershbow at 07:41 AM | Comments (0)
tags: Libraries, Search and the Web , books , digital , google , library , library_of_congress , literature , preservation , scanning

google print is no more Post date  11.18.2005, 8:06 AM

Not the program, of course, just the name. From now on it is to be known as Google Book Search. "Print" obviously struck a little too close to home with publishers and authors. On the company blog, they explain the shift in emphasis:

No, we don't think that this new name will change what some folks think about this program. But we do believe it will help a lot of people understand better what we're doing. We want to make all the world's books discoverable and searchable online, and we hope this new name will help keep everyone focused on that important goal.

Posted by ben vershbow at 08:06 AM | Comments (1)
tags: Libraries, Search and the Web , books , copyright , google , google_book_search , google_print , publishing , search

all your base are belong to google Post date  11.16.2005, 7:04 AM

Google Base is live and ready for our stuff.

In AP: "New Project Will Expand Google's Reach"

Posted by ben vershbow at 07:04 AM | Comments (0)
tags: Online , advertising , classifieds , craigslist , ebay , etail , google , google_base , search , web

the book in the network - masses of metadata Post date  11.15.2005, 6:42 PM

In this weekend's Boston Globe, David Weinberger delivers the metadata angle on Google Print:

...despite the present focus on who owns the digitized content of books, the more critical battle for readers will be over how we manage the information about that content-information that's known technically as metadata.

...we're going to need massive collections of metadata about each book. Some of this metadata will come from the publishers. But much of it will come from users who write reviews, add comments and annotations to the digital text, and draw connections between, for example, chapters in two different books.

As the digital revolution continues, and as we generate more and more ways of organizing and linking books-integrating information from publishers, libraries and, most radically, other readers-all this metadata will not only let us find books, it will provide the context within which we read them.

The book in the network is a barnacled spirit, carrying with it the sum of its various accretions. Each book is also its own library by virtue not only of what it links to itself, but of what its readers are linking to, of what its readers are reading. Each book is also a milk crate of earlier drafts. It carries its versions with it. A lot of weight for something physically weightless.

Posted by ben vershbow at 06:42 PM | Comments (0)
tags: ISBN , Libraries, Search and the Web , books , ebook , electronic_literature , folksonomy , google , google_print , hypertext , library , literature , marginalia , metadata , social_software , tagging , weinberger

having browsed google print a bit more... Post date  11.14.2005, 4:53 AM

...I realize I was over-hasty in dismissing the recent additions made since book scanning resumed earlier this month. True, many of the fine wines in the cellar are there only for the tasting, but the vintage stuff can be drunk freely, and there are already some wonderful 19th century titles, at this point mostly from Harvard. The surest way to find them is to search by date, or by title and date. Specify a date range in advanced search or simply enter, for example, "date: 1890" and a wealth of fully accessible texts comes up, any of which can be linked to from a syllabus. An astonishing resource for teachers and students.

The conclusion: Google Print really is shaping up to be a library, that is, of the world pre-1923 -- the current line of demarcation between copyright and the public domain. It's a stark reminder of how over-extended copyright is. Here's an 1899 english printing of The Mahabharata:

mahabharata.jpg

A charming detail found on the following page is this old Harvard library stamp that got scanned along with the rest:

mahabharata harvard stamp.jpg

Posted by ben vershbow at 04:53 AM | Comments (0)
tags: Copyright and Copyleft , Libraries, Search and the Web , OCR ,