Login  |  Register          Free Newsletter Subscription
Subscribe to LJ Magazine
Email
Print
Reprint
Learn RSS

Digital Libraries- Metadata As If Libraries Depended on It

By Roy Tennant -- Library Journal, 4/15/2002

In digital library work, we often talk about metadata, what I jokingly call "cataloging by those paid better than librarians." A more serious definition would be "a structured description of an object or collection of objects." A MARC record for a book is clearly metadata but so, too, is an archival finding aid encoded in EAD, or even descriptive information embedded in META tags at the top of a web page. Metadata is increasingly everywhere-sometimes hidden, sometimes apparent.

But, as any cataloger will tell you, not all metadata is created equal. Excellent metadata is constructed in layers. First you must settle on what is important to you or to those you serve. You create containers (defined and codifed by schemas) to squirrel away the tidbits that will, taken as a whole, describe something well enough for your purposes.

Qualities of containers

LINK LIST

Dublin Core
dublincore.org

EAD
www.loc.gov/ead

IFLA Metadata
Resources

ifla.inist.fr/II/
metadata.htm

MARC
www.loc.gov/marc

METS
www.loc.gov/
standards/mets

MODS
www.loc.gov/
standards/mods

Metadata containers are a tricky business. A new metadata schema or definition requires the correct mix of power, flexibility, and complexity for a given user community and its readiness for it. Do it well-as with MARC when it was created in 1968-and you'll prosper. Do it poorly, and you're toast.

A good container must solve a problem that people care about, but it must do so in a method that allows and encourages adoption (e.g., by being readily understandable). It should be extensible and alterable without breaking systems that will come to rely upon it. It also should be interoperable with systems that may have other metadata requirements.

Also, a metadata container can succeed or fail depending on how well it deals with granularity issues-how finely you chop your metadata. This is so critical an issue that next month's column will focus on it alone.

To create a new metadata container, you must imagine all required uses and provide the right hooks to make those uses possible. If you have done your work really well, you will have enabled uses that you never imagined.

A variety of containers

When it comes to metadata containers, we live in interesting and fruitful times. Besides such longstanding workhorses as MARC, we also have the Dublin Core, Metadata Encoding and Transfer Syntax (METS), and most recently Metadata Object Description Schema (MODS).

The Dublin Core has been in development for several years and is meant to be a common meeting ground among richer descriptive metadata standards. METS is relatively new and aimed to encapsulate a digital object or set of objects in a standard wrapper. Descriptive metadata can either be embedded directly in the METS object, or the METS record can point to an external description using another metadata standard such as MODS or MARC. MODS, the newest draft standard, is much richer and more granular than Dublin Core but simpler than MARC.

Consider the following scenario: a book is digitized, and METS is used to organize all its files. The METS record points to (or embeds within it) a descriptive record using MODS. A simplified version of the MODS record is made available for harvesting so that service providers can build cross-collection search services.

More standards are not necessarily a problem, as long as they can fruitfully interoperate and exchange data when required. As libraries digitize collections, metadata is required to organize and provide access to this content outside of, or in association with, the library catalog. Also, some libraries are developing such things as working paper repositories, which have different metadata requirements. With such diversity of uses, a diversity of metadata containers is likely essential.

Qualifying information

A good metadata container must provide for a method of qualifying the information held within a particular element, e.g, a subject heading. It may be useful to know that the text found within a subject field comes from a controlled vocabulary. If so, you can then qualify that information with a reference to the controlled vocabulary from which it comes. Using Dublin Core within an HTML META tag, this could be expressed as:

<meta name='DC.Subject' scheme='LCSH'

content='Vietnamese Conflict, 1961-1975'>

Another classic case that may require user qualification is the role that a person has in relation to the item. Dublin Core specifies a 'creator' field, but what does that mean? In one case the person identified may be an author; in another, a painter. There should be some method of providing this information.

Usage guidelines and information

A standard container and a means to qualify the information contained therein is only a start. Without guidelines on how the container is to be used, errors in interpretation and usage could render the metadata less useful. A classic example of usage guidelines is the Anglo-American Cataloging Rules (AACR), which defined how and with what information a cataloger was to fill the MARC metadata container.

Such guidelines are still a dream away for some recent entries into the metadata container field. But these issues are understood by the librarians involved, so it is only a matter of time before they are codified. Also, usage of these emerging metadata standards will inform and enhance the process of codifying their use.

Finally, we arrive at the ingredient that is at the very center of this enterprise-the information itself. What we know about that information both informs our intellect and limits our imagination as we develop new schema. In the 1960s, when MARC was created, who would have thought that it would help to have a place to put such things as book reviews and not so helpful to have a physical description?

What this means to you

Catalogers must learn new things, certainly, but also must apply much of their longstanding knowledge and many skills in new and interesting ways. Reference librarians must understand both the possibilities for user services presented by these new schema and the consequences to your mission should they not be used. Check the great list of metadata resources compiled by the International Federation of Library Associations and Institutions (IFLA).

Library administrators, pay close attention. Your library has the potential to serve your clientele in ways you never dreamed of. Your library also might be increasingly marginalized until your clientele never supports a bond issue again. Not everything libraries need to collect or provide access to can be usefully described using MARC, or accessed via a database vendor for a fee.

We're in a brave new world, in which MARC must make room for METS and MODS and whatever else becomes necessary to do our work better. Stand by your staff, give those who are eager what they need to learn and grow, and provide incentives to those who are reluctant.


Author Information
Roy Tennant (roy.tennant@ucop.edu) is Manager, eScholarship Web & Services Design, California Digital Library. He is founder and manager of the electronic discussion lists Web4Lib and Current Cites
Email
Print
Reprint
Learn RSS

Talkback

We would love your feedback!

Post a comment

» VIEW ALL TALKBACK THREADS

Related Content

Related Content

 

By This Author

Sponsored Links




 
Advertisement
Sponsored Links

More Content

  • Blogs
  • Podcasts
  • Photos

Blogs


Sorry, no blogs are active for this topic.

» VIEW ALL BLOGS RSS

Photos

  • Design Institute 2007
    December 11, 2007 at Chicago's Harold Washington Library Center:Design Institute 2007
  • Learning Gardens
    New York's GreenBranches program links the library to the street.
  • Green Picks: LBD May 2007
    Want to reduce your library's carbon footprint? Join the Cradle-to-Cradle revolution. Helen Milling shares the green products her firm is using.
Advertisements





LJ NEWSLETTERS

Click on a title below to learn more.

LJ BookSmack
LJXPRESS
LJ ACADEMIC NEWSWIRE
LJ REVIEW ALERT
CRÍTICAS
©2009 Reed Business Information, a division of Reed Elsevier Inc. All rights reserved.
Use of this Web site is subject to its Terms of Use | Privacy Policy
Please visit these other Reed Business sites