As part of my archives and management class, I’ve been researching about Electronic Archival Description (EAD) on account it seems to be a very significant part of the metadata world. Back in 1997, shortly after EAD was officially released to the archival community, the America Archivist did a special two-part issue on this most complicated and most useful scheme. While most of the articles are boringly technical (detailed descriptions of what the <frontmatter> element versus the <eadheader> element is and is supposed to do), there are a couple of articles that are extremely interesting and bring up some compelling points about just why EAD ought to by all rights take the archival community by storm: here is where it becomes funny, for me, reading these arguments eight years after EAD has taken archives, shook them up, and sparked countless debates about the actual usefulness of this not-so-humble metadata scheme.
Janice Ruth hedges on the boringly technical as she discusses the finer points of the development of EAD and why the working group made the decisions it did. She begins by arguing the merits of SGML as an open-source "technique for defining and expressing the logical structure of documents." Ruth then goes onto discuss the merits of some of the more significant decisions of the EAD working group and attempts to prove why the EAD DTD is ideally suited to archival use.
In the beginning of her article, Ruth presents an overview of SGML. She explains that the working group created EAD to reflect the content and not the structure of the traditional finding-aids since local practices vary so much in how finding-aids actually look since SGML is better suited to establishing a logical context rather than a strict physical structure (the look and feel) of a document.. That the SGML DTD specifies where and when a particular element may appear in the EAD record is not physical structure but logical context for certain bits of information that belong in one element and not the other so that the machine can process it. Finally, she explains that a powerful value of SGML is the ability to create attributes that create further machine-readable granularity. As a broad summary of the technical merits of SGML, this 1.5 pages is invaluable justification for the EAD metadata scheme over, say, a legacy Access database or, worse still, traditional paper finding-aids.
With this knowledge in hand, Ruth goes onto to justify the decisions that the EAD working group made. First, she justifies the decision to minimize the number of elements created, arguing that very early on it was decided not to create an element for every strucutural decision that might have been made in various local practices and instead allow for local practice with the <odd> and <add> (Other Descriptive Data and Additional Descriptive Data respectively). Each has its own use: the <odd> element provides a space for local practice data while the <add> element allows an archivist to present further information as he or she sees fit, enclosing a series of <p> tags within this poncho style element. The other major decision made, according to Ruth’s article, was to use generic terms rather than more specific vocabulary that may be familiar in some institutions but not in others.
When she gets into a discussion of hierarchy in EAD, Ruth gets to the heart and soul of the value of EAD for the archival community. This is probably why she spends the bulk of her essay discussing the finer points of this topic. The EAD DTD is divided into four major sections: the <eadheader>, <frontmatter>, <titlepage>, and the <archdesc> elements. Each element encloses information relevant to the overall collection of records, but for hierarchy the <archdesc> element is the most important. Using the <dsc> and <did> elements, according to Ruth, the EAD working group allowed for near infinite subordinate components--records series, record groups, sub groups, sub sub groups, and so on, to be presented in a single EAD file and therefore reflect the hierarchical and non-item-specific nature of an archival repository. In this way, the intellectual arrangement of the finding-aids are reflected in the EAD DTD but not the physical structure of the documents themselves, since stylesheets can be used to transform the EAD records into HTML documents that are human-readable and therefore the EAD serves as an intermediary like most metadata records in the digital era and not the final product presented to the user as metadata was used pre-digital.
Ruth spends the rest of the article presenting a broad overview (with samples) of the EAD record itself. In this A few highlights is an example of the hierarchical work of the <dsc> element, which allows for varying levels of description from record series to group to sub group to item all of which is inherited from higher levels by the lower levels. She concludes then with a summary of the meritorious history of EAD development, which has been done since the beginning according to the needs of the archival community in order to benefit their users.
The article is an interesting glimpse, for me, into the development of a major metadata scheme as well as a revealing look into SGML, of which I know very little. It is helpful to see why certain decisions were made (from an individual who was present during those decisions) and why they were made. It is also helpful to see that many times decisions are made not for short-term convenience, but rather for long-term quality and ease of use of the finished products. The EAD DTD was developed in order to facilitate intellectual access to all the archival collections and while this is an ambitious, long-term project it is a worthy goal to pursue and for that reason the DTD was developed with a mind towards making the highest quality finding-aids possible for use in the digital age. It has its problems, as many have pointed out, but EAD is an admirable if complicated metadata scheme.