Library Journal Mobile
Log In  |  Register          Free Newsletter Subscription
Subscribe to LJ Magazine

So, Can Google Use OCLC Records? Yes, But

Questions remain about the impact of WorldCat on Google's metadata

Norman Oder -- Library Journal, 9/10/2009

Go back to the
Academic Newswire
for more stories
  • Google has access to WorldCat metadata
  • Google says bad metadata comes from external providers
  • No restrictions on which WorldCat metadata fields can be used

The discussion about Google’s many metadata errors raised by linguistics researcher Geoff Nunberg has now led to questions about Google’s relationship with OCLC and WorldCat records.

In a comment on Nunberg's Language Log post, Karen Coyle questioned whether Google was using library metadata that is part of the WorldCat database and/or whether restrictions on use of that metadata hampered Google. 

In a blog post, she went on to question whether Google Book Search had a contract with OCLC and whether it had any restrictions. 

OCLC’s response
Subsequently, Chip Nilges of OCLC posted a mailing list comment, writing: "We've made the entire WorldCat database (excluding certain metadata records that OCLC is contractually prohibited from providing) available to Google to support discovery of the books Google has digitized from library collections. In exchange, Google has agreed to display a link to libraries on pages describing library digitized materials."

Google’s response
LJ queried whether Google is using WorldCat data. Google metadata point man Jon Orwant responded, “Chip is correct. We get a nearly-full OCLC feed and it substantially improves the quality of our metadata. We've been using it for years and are happy with it. We also get individual library catalogs and commercial data feeds. We have over 100 metadata sources, and this is why we have so many errors: if you have only one source of truth, you never have any doubt.”

“When Google Books exposes errors in metadata, it is sometimes the result of errors in the feeds and catalogs, sometimes the result of varying cataloging practices (e.g., different libraries storing serials information in different places in the MARC record), and sometimes errors in how we adjudicate between conflicting metadata sources,” Orwant added, pointing out that bad metadata on Google Books “almost always comes from an external metadata provider.”

WorldCat a priority?
LJ followed up, asking whether Google considered using WorldCat as the default, thus presumably improving the data. 

Orwant hasn’t yet responded.

Questioning OCLC
Coyle was skeptical of OCLC's response, asking, “Are there restrictions on what MARC fields and subfields Google can include in its metadata?”

Nilges responded, "To answer Karen's most recent post, Google can use any WC metadata field. And it's important to note as well that our agreement with Google is not exclusive. We're happy to work with others in the same way. The goal, as I said in my original post, is to support the efforts of our members to bring their collections online, make them discoverable, and drive traffic to library services.”

Read more Newswire stories:

At Congressional Hearing, Register of Copyrights Slams Google Settlement

Academics, ProQuest, Networks Object to Google Settlement


Columns:
Library Cloud Atlas: A Guide to Cloud Computing and Storage | Stacking the Tech

An Open Question | Peer to Peer Review

The Stories Those Tour Guides Tell | From the Bell Tower


Ovid expands agreement with CABI; EBSCO releases Business Continuity & Disaster Recovery Reference Center; Summon goes live at University of Huddersfield

Architectural Questionnaire for Academic Libraries

Best Sellers in Biology

Sponsored Links




 
Advertisement
Sponsored Links

MOST POPULAR PAGES

More Content

  • Blogs
  • Podcasts
  • Photos

Blogs

  • Cheryl LaGuardia
    E-Views

    November 20, 2009
    Portable Libraries, Mobile Students
    I attended this excellent ACRL-NE Information Information Technology Interest Group (ITIG) Social pr...
    More
  • Cheryl LaGuardia
    E-Views

    November 20, 2009
    Parker Library on the Web
    Corpus Christi College (Cambridge) and Stanford University Libraries recently released t...
    More
  • » VIEW ALL BLOGS RSS

Photos

  • Design Institute 2007
    December 11, 2007 at Chicago's Harold Washington Library Center:Design Institute 2007
  • Learning Gardens
    New York's GreenBranches program links the library to the street.
  • Green Picks: LBD May 2007
    Want to reduce your library's carbon footprint? Join the Cradle-to-Cradle revolution. Helen Milling shares the green products her firm is using.
Advertisements





LJ NEWSLETTERS

Booksmack
LJXpress
LJ Academic Newswire
LJReview Alert
LJ Criticas Review Alert
SLJ Extra Helping
Curriculum Connections
SLJTeen
PWDaily
Children's Bookshelf
PW Comics Week
Cooking the Books
Religion BookLine
Please read our Privacy Policy
©2009 Reed Business Information, a division of Reed Elsevier Inc. All rights reserved.
Use of this Web site is subject to its Terms of Use | Privacy Policy
Please visit these other Reed Business sites