Showing posts with label loc. Show all posts
Showing posts with label loc. Show all posts

Wednesday, October 21, 2009

Crowdsourcing Redux

In an amusing twist, Flickr user Heather Falk discovered my post "Lick This": LOC, Flickr, and the Limits of Crowd Sourcing and said hi (actually she said "LOL"). It was of course Falk who posted "lick this" to the forehead of a 1942 woman aircraft worker at the Library of Congress Flickr photostream, a comment that I held up as illustrating the limits of historical crowdsourcing. In any case, Falk has a Flickr photostream and appears to be a perfectly nice person with a cute cat.

This reminds me that I had promised to follow up with another post about crowdsourcing showing some of the triumphs of this approach. Stay tuned.

Thursday, June 25, 2009

"Lick This": LOC, Flickr, and the Limits of Crowd Sourcing


[Update: This post has provoked quite the discussion over at the Flickr Commons board.]

In January of 2008 the Library of Congress and the photo-sharing web service Flickr announced a unique partnership. The Library of Congress Flickr Pilot Project put 3000 historic LOC photographs on the website Flickr and invited the public to view, annotate, tag, and generally mess with them. This was perhaps the LOC's first foray into the world of Web 2.0 and generated a tremendous buzz. "In the first 24 hours after launch, Flickr reported 1.1 million total views on our account, with 3.6 million views a week later," according to this LOC report on the project. The project--"a match made in photo heaven" according to the LOC blog--has been praised everywhere from the New York Times to the popular community weblog Metafilter.

The goals of the project are to "increase awareness of the Library and its collections; spark creative interaction with collections; provide LC staff with experience with social tagging and Web 2.0 community input; and provides leadership opportunities to cultural heritage and government communities." Especially talked about was the second goal--sparking interaction with the collections. The idea was that visitors to Flickr could add useful metadata LOC images, such things as the names of people in the photographs, locations, models of cars or other machinery, etc.

The project may well be a success overall, but as a way to add useful metadata to historical documents, the Library of Congress Flickr Pilot Project is a disappointment. Let me explain...



Above is a screen shot of this photograph, from the very popular 1930s-40s in Color photograph set. This iconic photograph is also used as the cover image on the LOC's Final Report Summary for the project. This one photograph, and the user-generated metadata attached to it, demonstrate the problems with inviting the general public to contribute to a historical collection.

One of the most innovative features of Flickr is the ability of visitors to add notes to the pictures. You can create a rectangular box over some portion of an image and add a text note. This is especially useful for identifying individuals in group photos or pointing out specific details.

So what sort of metadata have users added to supplement the sparse LOC identification ("
Bransby, David,, photographer. Woman aircraft worker, Vega Aircraft Corporation, Burbank, Calif. Shown checking electrical assemblies, 1942 June ") of the photo?

There are 20-30 notes on the photograph and not one contains useful historical information to give context or help us understand the photograph. Most are throw-away jokes or comments, "I love this fabric!" by Flickr user Mrelia and "Lick this" by user HeatherrFalk (referring to the woman's forehead!). Most of the rest of the notes refer to the woman's appearance or the composition of the picture. Almost useful is a little nested debate about the authenticity of the photograph--how staged was it?--but the discussion is hard to follow, requiring hovering the mouse over each box to see the comment.

Flickr users may also add comments and tags to images, and organize them together into sets. But here again the crowdsourced noise overwhelms the signal of useful historical information. There are over 100 comments attached to this one photograph, all but a few devoted to the picture's composition (well it is a photography website after all) or how pretty the woman is or posting just to post something. Within the chaff there are a few grains of wheat--as when user BeadMobile adds some pencil drawings made by his grandmother when she worked in a factory during World War Two. But you really have to dig.

What about tagging? User tagging is often presented as a simple and powerful way to crowdsource metadata in online archives. There are 71 user-generated tags for this image. Some are obvious and useful--"1942" and "rosie the riveter." Many others however are odd ("everyone did their part") or cryptic ("sfv" "LF").

And the sets? How have Flickr users organized this image with others? Well the woman in the picture should be proud that she is in the "Nation Of Domination. (We Rule The Universe)" photo pool and the "cable porn" pool.

The above might seem like a lot of text to bash on one image and its metadata, but the problems extend to all of the other images in the project. The notes are mostly smart-ass remarks, the comments are empty, the tags are idiosyncratic. The frustrating thing is that there really is some crowd sourced gold withing the flood of junk, such as the transcriptions of hand-lettered signs in the windows of the Brockton Enterprise newspaper office in this photo.

The most useful comment I found in this project? User Catskills Grrl's comment: "Gee, I wish the stupid, smart-ass notes would be deleted off these photos."

I will pick up the topic of crowd sourcing again in a future post, pointing towards some archives that I believe are doing it correctly.

Friday, February 22, 2008

Webcasts from the Library of Congress

The Library of Congress is the topic of a series of posts that I will make this week. The topic today: this rich selection of webcasts of LOC lectures and other public events. Categories include Biography, History, Performing Arts, Education, Government, World Affairs, Literature, Religion and Science. I am watching this panel: "Indian Religious Freedom, to Litigate or Legislate?" right now. Other intriguing titles include End of European Colonial Empires, Robert E. Lee, and 1507 Waldseemuller World Map. There are hundreds more!

Sadly the implementation of the webcasts is not what it could be. They are presented in the odious Realplayer format. (What is wrong with RealPlayer? Well PC Magazine listed it as #2 in their article "The 25 Worst Tech Products of All Time.") An unfortunate side effect of the choice to use Realplayer is that you can't easily download the webcasts onto a portable device for later viewing--you have to watch them online, at your computer. (There are of course ways around this.) These programs were put on at the taxpayers' expense and are free of copyright, we should be able to download them if we like. There are no interactive features, we can't leave comments at the webcast pages. And it is not possible to insert the webcasts into a blog or webpage, as I so often do with YouTube videos on this blog.

I am delighted by the wealth of material in these webcasts, but wish they had presented them differently. I wonder why the LOC didn't just get a YouTube channel?