VIAFbot Debriefing

Shortly after reaching the 1/4 million edits milestone VIAFbot finished linking Wikipedia biography articles to VIAF.org. Examining the bot’s logs reveals telling statistics about the landscape of Authorities on Wikipedia. We can now know how much linked authority data is on Wikipedia, it’s composition, and the similarities between languages. First, let’s understand the flow of …

More

Top Corporate Names in WorldCat

As I explained earlier, I have been doing some investigations into how MARC has been used over the last several decades. Curious about the contents of the 110 $a (corporate names), I parsed it and the top 30 headings are listed below. Keep in mind a few things, however: Entities can be put together in …

More

Top Topics in WorldCat

As I’ve described in a series of posts recently (“Adventures in Hadoop”, four so far), I’ve been having fun on our new compute cluster. Well, maybe “fun” isn’t exactly the right term for diving into the depths of the MARC format, but hey, librarians have to get their kicks somehow. Anyway, I’ve been doing some …

More

The Flipped Library

My colleague, Lorcan Dempsey, did a very nice synthesis of “MOOCs, Libraries, OCLC” for the OCLC Board of Trustees this morning. Given the massive attention and the surge of interest in MOOCs (witness that the article – Year of the MOOC – in the New York Times has stayed on the most emailed since it …

More