More about MODS (and XML)

At the Metadata Discussion Group meeting on March 8 April 5, 2016, we will talk about some of the challenges of mapping a descriptive metadata structure standard (in this case, MODS) from a XML-based expression to one that is RDF-based. This post will explain what MODS is and what it’s used for.

MODS: the ‘Who, What, and When’

The Metadata Object Description Schema (MODS) was published in 2002 by the Library of Congress’ Network Development and MARC Standards Office. The standard is maintained by an editorial committee comprised of library metadata practitioners from North America and Europe.

MODS is a “bibliographic element set” that may be used to describe information resources. MODS consists of 108 elements and subelements (there are 20 top-level or “parent” elements). At this point, I’ll urge you to go read the brief overview of MODS on the Library of Congress’ Standards website.

Go ahead. I’ll wait.

You read that bit about MODS being more or less based on MARC21, right? In the example below, I’ve described a sheet map using MODS elements and MARC tags.

DATA (formulated according to AACR2, if that sort of thing matters to you) MODS ELEMENT MARC TAG (and mapped MARC data value, when applicable)
Campbell County, Wyoming title 245 $a
Campbell County Chamber of Commerce (Wyo.) namePart 110 $a
cartographic typeOfResource Leader/06 “e”
Gillette, Wyo. place 260/264 $a
Campbell County Chamber of Commerce publisher 260/264 $b
[1982?] dateIssued 260/264 $c
1 map ; 33 x 15 cm extent 300

Table 1. Data expressed in MODS elements and MARC tags.

There’s a full mapping of MARC21 tags to MODS elements available, if you’re really curious. This example demonstrates that, although there are a few divergences, MARC21 was built to map almost directly into a MODS element.

MODS encodes descriptive metadata, or information about resources (title, creator, etc.). MODS and MARC21 are examples of data structure standards. Elements or tags are meant to serve as containers for data. Structure standards do not give any directions about how to formulate data—those directions come from data content standards (AACR2, RDA, DACS, etc.). The main purpose for structure standards (Dublin Core, EAD, and TEI are other examples of metadata structure standards) is to encode data so that it can be manipulated by machines. Elements separate discreet information for use in search and browse indices. Data structure standard elements often convey the meaning of the data. The MODS:title element only contains the word or words that are used to refer to a resource. MODS:title will never serve as a container for the resource’s size.

MODS: the ‘Where, Why, and How’

MODS was built “for library applications.” MODS has been chiefly implemented to support discovery of digital library collections. At IUB Libraries, MODS is the metadata standard of choice for the digital objects that are ingested into our digital collections repository, Fedora.

MODS elements are expressed in XML. XML is a metalanguage, which means that XML is an alphabet, of sorts, for expressing other languages. The figure below illustrates the XML syntax (the “alphabet”) by which XML expresses another language. A fake language with a bogus element named “greeting” is encoded in Figure 1.

An XML statement is shown. The syntax components--start and end tags, element name, attribute name, attribute value, and content value--are highlighted.
Figure 1. Anatomy of an XML statement. [click image to enlarge]
HTML (the language responsible for displaying this webpage to you right now), EAD, and TEI are also expressed using XML.

From the beginning, MODS was designed to be expressed as an XML schema. Schemata are the sets of rules for how languages work: which elements are valid and what their semantic meanings are, which elements nest within others, whether or not an element can be modified by attributes (e.g., the MODS:titleInfo might have an attribute called “type”), and whether there is a controlled list of values for a given attribute (e.g., the MODS:titleInfo “type” attribute is limited to the values “abbreviated, “translated,” “alternative,” “uniform”).

MODS records are created in a number of ways. You could open up an XML editor and start creating a MODS/XML record. If you want to really get to the know the MODS standard, that wouldn’t be a bad idea. However, if you wish to create metadata for a half a million photographs, editing raw XML won’t be terribly efficient. At IU, we have a few different methods for creating MODS records for digital objects. My favorite is the Image Collections Online cataloging tool. Use of the tool is restricted but I’ve included a screenshot below.

Screenshot of the Image Collection Online cataloging tool. The web form include fields for title, subjects, etc. A thumbnail of the digital object (an image) and an option for transforming the metadata to MODS are included.
Figure 2. Screenshot of the metadata interface for the Image Collection Online cataloging tool. [click image to enlarge]
Once a collection manager decides which metadata elements are desired and has consulted with the metadata specialist for digital collections (our own Julie Hardesty), those elements will display in a web form. Data may then be entered without needing to know XML or MODS. In Figure 1, you’ll see a box in the lower right-hand corner “Transform metadata to…” Clicking on that link that says “mods” allows me to download the data that I input into the web form as MOD/XML. You may view the full record for this photograph below.

That’s the 5 cent tour of MODS, as it’s expressed in XML. Questions? Leave a comment below!

Resources

Sample MODS Record

<?xml version="1.0" encoding="UTF-8"?>
<mods:mods xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
 xmlns:mods="http://www.loc.gov/mods/v3"
 xmlns:xlink="http://www.w3.org/1999/xlink"
 version="3.3"
 xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
 <mods:titleInfo>
 <mods:title>James Whitcomb Riley and Lew Wallace</mods:title>
 </mods:titleInfo>
 <mods:name type="personal">
 <mods:namePart>Hohenberger, Frank M.</mods:namePart>
 <mods:role>
 <mods:roleTerm authority="marcrelator" type="text">Photographer</mods:roleTerm>
 <mods:roleTerm authority="marcrelator" type="code">pht</mods:roleTerm>
 </mods:role>
 </mods:name>
 <mods:typeOfResource>still image</mods:typeOfResource>
 <mods:genre authority="bgtchm">Photographs</mods:genre>
 <mods:genre authority="">Group portraits</mods:genre>
 <mods:originInfo>
 <mods:issuance>monographic</mods:issuance>
 </mods:originInfo>
 <mods:physicalDescription>
 <mods:internetMediaType>image/jpeg</mods:internetMediaType>
 <mods:extent>8x10</mods:extent>
 <mods:digitalOrigin>reformatted digital</mods:digitalOrigin>
 </mods:physicalDescription>
 <mods:note type="citation">Hohenberger mss., Lilly Library, Indiana University, Bloomington, Indiana.</mods:note>
 <mods:subject authority="local-hohenberger">
 <mods:topic>Wallace/Riley</mods:topic>
 </mods:subject>
 <mods:subject authority="lctgm">
 <mods:topic>Poets</mods:topic>
 </mods:subject>
 <mods:subject authority="lctgm">
 <mods:topic>Authors</mods:topic>
 </mods:subject>
 <mods:subject authority="local-geo">
 <mods:geographic>Crawfordsville (Ind.)</mods:geographic>
 </mods:subject>
 <mods:subject authority="">
 <mods:name type="personal">
 <mods:namePart>Riley, James Whitcomb, 1849-1916</mods:namePart>
 <mods:role>
 <mods:roleTerm type="code" authority="marcrelator">dpc</mods:roleTerm>
 <mods:roleTerm type="text" authority="marcrelator">Depicted</mods:roleTerm>
 </mods:role>
 </mods:name>
 </mods:subject>
 <mods:subject authority="">
 <mods:name type="personal">
 <mods:namePart>Wallace, Lew, 1827-1905</mods:namePart>
 <mods:role>
 <mods:roleTerm type="code" authority="marcrelator">dpc</mods:roleTerm>
 <mods:roleTerm type="text" authority="marcrelator">Depicted</mods:roleTerm>
 </mods:role>
 </mods:name>
 </mods:subject>
 <mods:relatedItem type="host">
 <mods:titleInfo>
 <mods:title>Frank M. Hohenberger Photograph Collection</mods:title>
 </mods:titleInfo>
 <mods:location>
 <mods:url>http://www.dlib.indiana.edu/collections/lilly/hohenberger/</mods:url>
 </mods:location>
 </mods:relatedItem>
 <mods:relatedItem type="series">
 <mods:titleInfo>
 <mods:title>Wallace/Riley</mods:title>
 </mods:titleInfo>
 </mods:relatedItem>
 <mods:identifier type="local">Hoh051.000.0001</mods:identifier>
 <mods:identifier type="local-callnumber">Wallace/Riley, Item 1</mods:identifier>
 <mods:location>
 <mods:physicalLocation>Lilly Library (Indiana University, Bloomington)</mods:physicalLocation>
 <mods:url access="preview">http://purl.dlib.indiana.edu/iudl/lilly/hohenberger/thumbnail/Hoh051.000.0001</mods:url>
 <mods:url access="raw object">http://purl.dlib.indiana.edu/iudl/lilly/hohenberger/large/Hoh051.000.0001</mods:url>
 <mods:url access="object in context" usage="primary display">http://purl.dlib.indiana.edu/iudl/lilly/hohenberger/Hoh051.000.0001</mods:url>
 </mods:location>
 <mods:accessCondition type="restriction on access">There are no restrictions on access.</mods:accessCondition>
 <mods:accessCondition type="use and reproduction">Copyright and reproduction rights for all Frank Hohenberger photographs are held and administered by the Lilly Library, Indiana University, Bloomington, In 47405. Additional permissions may be required prior to any reproduction of images of works by artists and photographers other than Frank M. Hohenberger that are retained in the Hohenberger Collection.</mods:accessCondition>
 <mods:recordInfo>
 <mods:recordContentSource>InU-Li</mods:recordContentSource>
 <mods:recordIdentifier>/lilly/hohenberger/Hoh051.000.0001</mods:recordIdentifier>
 <mods:recordOrigin>MODS record generated by transforming the photo cataloging metadata</mods:recordOrigin>
 <mods:languageOfCataloging>
 <mods:languageTerm type="text">English</mods:languageTerm>
 <mods:languageTerm type="code" authority="iso639-2b">eng</mods:languageTerm>
 </mods:languageOfCataloging>
 </mods:recordInfo>
</mods:mods>

Next meeting: Where might we end up?

In the final part of our series, Past, Present, Future: Understanding Technological Contexts for Metadata Creation at IUB Libraries, we will look at what’s on the horizon for metadata standards in general (BIBFRAME, EAD3) as well as technology stacks to be implemented at IU (DSI, Hydra, Kuali OLE, etc.).When technologies change, what are the workflow implications for library metadata creation and maintenance? What advances might we see in discoverability of library resources? Does technological development affect cataloging practice?

Join us to discuss!

DATE: Tuesday, May 12
TIME: 9-10 am
PLACE: Wells Library Room 043
TOPIC: Part III: Where might we end up?
MODERATOR: Jennifer Liss