Archive for the 'Modeling new services' Category

Breaking Open the ILS Silos

Friday, August 20th, 2010 by Roy

In 2007-2008, the Digital Library Federation (DLF) convened a Task Group to recommend standard interfaces for integrating the data and services of the Integrated Library System (ILS) with new applications supporting user discovery. The group produced a report with recommendations in December 2008. After that not much happened.

In February 2010, at the Code4Lib Conference, Karen Coombs (the OCLC Developer Network manager) and I brought together some of the people who had been on that task group as well as other interested parties who were at the conference to take this work to the next stage. At this ad hoc meeting we agreed that we were ready to take this work to the next stage. The next stage, we felt, was to actually create a middleware layer that we could collaboratively maintain. Read the rest of this entry »

Terminologies in action

Monday, August 2nd, 2010 by Merrilee

A few weeks ago I listened in on Karen Coombs TAI-CHI webinar “OCLC Web Services in action” (you can find the slides here, and watch the webinar here). Even though I work at OCLC, I’m not always as up to speed on what’s going on as I should be, so I was quite amazed at just how many web services are available, and the variety of use cases.

I was particularly gratified to see the Terminology Services being put to use in several different prototypes, since Günter, Diane, Andy and I pulled together our Strawman document and held a meeting on uses of termonolgies way back when. All of the prototype work has been done around our top vote getter, which was “leveraging terminologies for search optimization.”

I particularly like this example, from Demian Katz (via Karen Coombs) which shows how VuFind could be used with WorldCat Identities and OCLC Terminology Services to provide users with suggested terms.

Terminology Services is quite experimental at this point — Karen and others who work with our Developers’ Network are looking for feedback in order to gauge the community’s level of interest in the Terminology Services before committing the resources to make it a production service. So if you have ideas about how to put the power of terminologies to work, I hope you will give it a whirl!

Other examples of use of the Terminologies Service are featured in the Application Gallery. You can also view other offerings in our TAI CHI webinar series.

Got MissingMaterials?

Tuesday, May 25th, 2010 by Jennifer

Theft and loss of materials held in libraries and archives worldwide is a concern not only for owning institutions, but also for the international antiquarian book trade and global law enforcement. Two years ago, OCLC Research was tapped by members of the Rare Books and Manuscripts Section (of ALA/ACRL) to develop a way to expose these stolen or “missing” materials in a centralized, highly-visible way, in order to help identify stolen materials, recover missing items and deter future crimes.

Together with the RLG Partnership, the RBMS Security Committee and the Antiquarian Booksellers’ Association of America, OCLC Research developed MissingMaterials.org, which (combined with the list functions available in WorldCat.org) creates a free mechanism for sharing reliable information about missing rare books and other materials at the network level. In order for this solution to work, however, it is of vital importance that the community use it. Therefore, I invite you, the community, to go forward, create lists, and add to MissingMaterials.org. It’s easy, it’s free. If you hadn’t been ripped off, it would even be fun.

A few weeks ago, Jen and I did a webinar that included some leading voices from the rare books and archives community, particularly those who have been committed to creating transparency around theft. Webinar presenters included: Katherine Kyes Leab (American Book Prices Current); Richard Oram (Associate Dir., Harry Ransom Center, University of Texas at Austin); Maria Holden, Prudence Backman, and Britanny Turner (New York State Archives).

If you watch the webinar, you will see a demonstration of how to create a list and add it to MissingMaterials.org. Or, you can go here to read a short explanation of how to do it (only 165 words!). Did I mention it was free? Did I mention it was easy?

More information about the project is available here. Please contact Jennifer Schaffner or me if you are interested in more information, or if you would like some handholding.

Research dissemination and ‘the archive’

Monday, April 26th, 2010 by John

Ithaka S+R recently published its Faculty Survey 2009: Key Strategic Insights for Libraries, Publishers, and Societies. It considers the way faculty views of the library are changing, and analyses library roles into three key functions:

“The library is a starting point or ’gateway’ for locating information for my research” (which we refer to as the gateway function). “The library pays for resources I need, from academic journals to books to electronic databases” (which we refer to as the buyer function). “The library is a repository of resources – in other words, it archives, preserves, and keeps track of resources” (which we refer to as the archive function).

Ithaka’s analysis shows that the gateway function has declined (its importance rating has dropped from 70%-58%) over the six years in which the biennnial studies have been made, while the buyer function has steadily increased (81%-90%). The archive function has remained relatively static at just over 70%.

Many of the findings in this report are interesting, and relevant to us as we focus - via our Working Group on Research Services - on the specific topic of Support for Research Dissemination. We have chosen the word dissemination with some care. What we will be looking at is researcher behaviours and practices concerning institutional repositories, individual websites, subject archives, virtual research environments, blogs, blog aggregations and other social venues. In other words, every research dissemination venue except the conventional (and still overpoweringly influential) modes of scholarly publishing - the journal, the monograph and the conference paper. We will look at the way researchers use these alternative venues to disseminate their work, and the factors that account for the types and rates of dissemination. Read the rest of this entry »

OCLC Research @ University of Calgary

Tuesday, February 16th, 2010 by Günter

As those of you who have listened to Tom Hickerson’s Distinguished Seminar Series lecture will know, the University of Calgary has embarked on an ambitious plan of integrating their libraries, archives and museums under a single administrative umbrella (Libraries & Cultural Resources or LCR). This convergence is catalyzed by a new building in the heart of the university’s campus, which will co-locate the units as well as many campus research, teaching & learning support functions. In latest news, last week a reorganization of LCR was announced to realign the staff with emerging priorities. The University of Calgary is our latest addition to the roster of institutions participating in the RLG Partnership, and to make proper mutual introductions, a team from OCLC Research visited Calgary last week.

In conversations preparing for our trip, we were asked to make a contribution in moving LAM integration at the university forward, and in particular, to focus on Calgary’s ambitions to create a single search across LCR resources. (Calgary currently experiments with Summon for single search - watch an introduction here). Our agenda (inspired by our LAM workshops) called for a broad discussion establishing key features for single search, followed by sessions focused on how archives, metadata services/libraries and museums can contribute to these features and the overarching goal of single search. You’ll find the presentation we used to set the scene for the single search discussions here - it also contains a number of examples from other institutions who have ventured down this path, including the Victoria & Albert, Yale & the Smithsonian.
Read the rest of this entry »

Next-Gen Harvesting

Thursday, February 4th, 2010 by Roy

Metadata harvesting (collecting metadata from others and aggregating it in a collection) is not new. Although there are any number of ways to do this, the OAI-PMH protocol for metadata harvesting is often used and has been around for years. It defines a small set of actions that allows anyone to discover what sets of metadata are available for harvesting from a digital repository, which metadata formats are offered, and select and download those records. Thousands of repositories worldwide support it, sometimes even unknowingly, because many repository applications such as DSpace and ePrints come with OAI-PMH support out of the box.

This has led to a world in which there are metadata aggregators and even agreggators of aggregators. It has also led to potential confusion and difficulty. Records that are picked up from their “native” location and indexed and displayed elsewhere may not be depicted as the creator of that metadata intended. They also may not be refreshed in a timely fashion, thereby potentially leading to records that are out-of-date persisting in various corners of the Internet.

This is why when my colleagues on the services side of the house announced the WorldCat Digital Collection Gateway I sat up and took notice. This heralds a new world in which those being harvested can exert some control over not only how frequently their records are updated, but also how those records are depicted in the aggregation — in this case, WorldCat. Through a simple web-based interface, you can provide your OAI-PMH base URL, have the Gateway test harvest some records, view how those records would display in WorldCat, and change the mapping if you wish. Another benefit is that your records will then appear in all of the places WorldCat is syndicated.

A pilot project to test the Digital Collection Gateway was just announced, beginning March 1, and we are seeking volunteers to try it out and provide feedback. During the pilot you will be asked to:

  • Attend a two-hour webinar reviewing the use of the Gateway
  • Upload a minimum of 500 metadata records to WorldCat
  • Offer feedback and input on your experience with the Gateway to our support and product teams so we can improve the tool and workflows

If you would like to help us create a next-generation harvesting infrastructure, in which you control your metadata more than ever before, email us at oaister@oclc.org.

ORCID and ISNI: Author, Swineherd, Taxman, Alcohol Researcher

Saturday, January 30th, 2010 by Jim

At recent meetings I attended in Washington D.C. there was significant hallway discussion about the Open Researcher Contributor Identification (ORCID) initiative. Given the science orientation of the meetings this initiative to resolve the problem of name ambiguity and attribution in scholarly publication was particularly welcomed. As you’ll see if you visit the ORCID site this is early days for this pre-competitive multi-publisher effort whose goal is to establish

“an open, independent registry that is adopted and embraced as the industry’s de facto standard.” Their mission is “to resolve the systemic name ambiguity, by means of assigning unique identifiers linkable to an individual’s research output, to enhance the scientific discovery process and improve the efficiency of funding and collaboration.”

Meeting one was convened by Thomson Reuters and Nature Publishing not long ago with the first meeting in November 2009. The roster of participants is impressive and the continued involvement of Elsevier made those with whom I talked hopeful that this would be as successful an effort as CrossRef has been. A recent editorial in Nature Credit where credit is due (pdf) is quite to the point about the implications of success.

My colleagues, Thom Hickey and Janifer Gatenby, have been involved. OCLC has much to contribute here given Thom’s leadership of the Virtual International Authority File (VIAF) effort and Janifer’s in the development of the International Standard Name Identifier (ISNI). The scope of ORCID is narrower than ISNI as the latter is intended for the identification of “identities used publicly by parties involved throughout the media content industries in the creation, production, management, and content distribution chains.” This goes across all fields of creative activity not just science. As Janifer said,

“ISNI could become a cross domain identifier so that a researcher who also plays in a rock band (and wants it known that he is one and the same) can be identified.”

Read the rest of this entry »

National systems of research assessment and implications for libraries

Tuesday, December 22nd, 2009 by John

Research assessment is a very big deal in some countries. Countries whose university systems are largely publicly-funded routinely check up on the research quality of individual universities to ensure that they are squeezing the best possible performance out of their systems. They do this because they see a link between high-quality research and economic development. The economic potential of research is growing in importance as national ‘knowledge economies’ recognise the need for international research excellence, and see universities as a key driver.

We have just published a report which reviews the research assessment regimes of five countries, and the role of libraries in the processes of assessment that exist. This report was produced by Key Perspectives Ltd, a UK consultancy, and it surveys the research assessment situation in the Netherlands, Ireland, the UK, Denmark and Australia. We chose countries that we knew were doing interesting things in assessment - or in preparation for its introduction. The high political stakes involved were evident even as the report was being written. In the UK, the pilot exercise for the system that will replace the Research Assessment Exercise (RAE) ditched one of its proposed new thrusts (bibliometrics) and found another (economic impact) for the country’s universities to stress about. In Australia, a recent change of government led to temporary abandonment of a system that tied assessment outcomes to government funding, and arguably lost the country some ground in the international scramble for both reputation and economic advantage.

The Review provides a fascinating account of different cultural understandings of the purposes of assessment, and a glimpse of the trend of concentrating research excellence in a small number of top universities that is now taking shape in many countries, as the competition for research income, top faculty and students becomes one that occurs within a single international marketplace. We found countries that tied research assessment to large amounts of government funding, and others that did not (yet); countries that operated systems based on bibliometrics and others that mistrusted them; countries that devised league tables of journals and awarded points to researchers on those they published in - and others that assembled national panels of experts to determine the rankings.

Libraries are involved in these assessment exercises in a range of ways, from the clerical (data entry) to the highly strategic, and from the specialist (bibliometric expertise) to a role as providers of general infrastructure (institutional repositories). Whatever differences there may be in the assessment systems adopted by different countries, they all share a focus upon the research outputs produced by their researchers and faculty. These outputs are managed by libraries - both indirectly (via publications) and, increasingly directly (via arrangements with the authors themselves at pre-publication stages). Does this suggest that libraries play a central role in research assessment within their institutions? Or that they should? At the very least, shouldn’t libraries seek a shared view on this question?

Getting to Know Pebbles

Friday, December 11th, 2009 by Roy

Recently I’ve required access to more computing power than I normally have available, since as part of the “Cloud Library” project managed by my colleague Constance Malpas we need to process millions of records from WorldCat, the Hathi Trust, and a regional storage facility. Luckily, OCLC Research sits on quite a bit of processing power, and it’s called “pebbles”.

Pebbles is a cluster of computers that consists of one “head” node and 32 “compute” nodes. Each node has 4 CPUs, 16 GB RAM, and around 1.5 TB of disk storage. The cluster takes its name from the fact that we use the Rocks open source cluster administration software to manage our cluster. Those of you who have been paying attention will remember that my colleague Thom Hickey wrote about this cluster when it was new, some 2 1/2 years ago.

Read the rest of this entry »

Climate change for libraries

Monday, November 30th, 2009 by John

At the RLG Partnership Annual Meeting in 2007, Timothy Burke told the assembled research librarians ‘you have to figure out how to be hydraulic engineers of information flow rather than the guardians of the fortress’. It’s an image that has stuck with me. Everywhere now in our professional literature we see the challenges of our work represented by the imagery of flow and fluidity. We try to scope and identify workflows that are changing or need to change. The platform of the web dips and peaks faster and differently than we can predict, and as it does so content suddenly flows in different directions, taking new channels. Stability in this environment is rare, and a relief when we find it, even though it may lie in places that librarians take some time to trust - like Google and Wikipedia.

I often show a slide produced by Rick Luce, Vice-Provost and Director of Libraries at Emory University, when describing the territory of our Research Information Management (RIM) programme. This appeals to me because it indicates that library attention needs to be focused on the workflow layer, rather than the repository layer that sits below it.

Understanding the particular environments of researchers, and the flows that matter to them, is perhaps not a new challenge for research libraries, but it is a newly urgent one. In the pre-digital world the flows were not digital flows, with the capture challenges and opportunities that now exist. The library dealt mainly in the solid world of published literature. It collected from the physical outputs that emerged at the end of flow processes, and could structure its operations around that bounded reality (within its ‘fortress’ print stores, to use Tim Burke’s analogy). Now, we see potential for library services everywhere, because we have systems that capture flows, and allow them to combine, split and replicate wherever it is useful for them to do so, and legal barriers do not obstruct. But to do so optimally, we need to understand researchers’ worlds at a level of detail that is still not familiar to libraries. Read the rest of this entry »