case history: library of congress audio-visual prototyping project mets opening day october 27, 2003...
TRANSCRIPT
Case History:Library of Congress
Audio-Visual Prototyping Project
METS Opening Day
October 27, 2003
Carl Fleischhauer
Office of Strategic Initiatives
Library of Congress
The AV Project
• Preservation, sense one: reformatting into digital-file form
• Preservation, sense two: sustaining digital objects
• Participation by Motion Picture, Broadcasting, and Recorded Sound Division (M/B/RS) and the American Folklife Center
Reformatting Documentation
• About the source – original disc or tape being reformatted– <amdSec><sourceMD><AMD audio ext schema>
• About the process– how the copy file was made, what devices/tools– <amdSec><digiProvMD><PMD process ext schema>
• About the outcome– characteristics and features of the copy file– <amdSec><techMD><AMD audio ext schema>
PRODUCERS
ADMINISTRATION
DATAMANAGEMENT
ARCHIVALSTORAGE
INGEST ACCESS
CONSUMERS
PRESERVATION PLANNING
Reference Model for an Open Archival Information System (OAIS)
SIPs (Submission Information Packages) will be
produced by the AV preservation activity, ready to
submit to LC’s future digital repository.
Recorded Sound Processing Section
Content selected for reformatting
1. Initial creation or copying-in of metadata
Workflow Sidebar
Recorded Sound Processing Section
Content selected for reformatting
1. Initial creation or copying-in of metadata
LC Recording Lab or offsite contractor
Scanning activity
2. Creation of second layer of metadata
Workflow Sidebar
Recorded Sound Processing Section
Content selected for reformatting
1. Initial creation or copying-in of metadata
LC Recording Lab or offsite contractor
Scanning activity
2. Creation of second layer of metadata
3. Return loop to processing, edit and possible addition of third layer of metadata
Workflow Sidebar
OUTCOME ONE: A VIRTUAL DIGITAL OBJECT (SIP)
Logical storage structure based in a UNIX filesystem
master -- family of logical directories where the master files are stored (there is a parallel set of “service” directories)
afc -- “owner” is the American Folklife Center
afc1941001 -- group or aggregate of items, often from an actual collection
sr05 -- item directory (at the level of the digital object, counterpart to a bib record or “line” in a finding aid)
sr05am.wav -- the master file for side A of this disc
sr05am.wav -- the master file for side B of this disc
Index of master/afc/afc1941001/sr05
OUTCOME ONE: VIRTUAL DIGITAL OBJECT
The fileGrp segment of a METS instance “binds” the object
Includes logical pathnames for files, future switch to persistent names possible.
Selection from the database diagram: tables for extension schema data for image source, video source, and audio source
Builder: tool to shape a structMap using indent, outdent, up, and down. May be used in both template and individual object modes.
Two samples from the MODS entry and editing tool.
+ repeats the section
x and – delete sections or subsections
Some Shortcomings
• Cumbersome data entry – many screens, many actions
• Bugs – hard to get them all fixed now that the contractor is gone
• Best if users understand METS and the structMap – barrier to entry for new team members
• Does not include tools for bulk compilation from pre-existing data
Distributed Data Entry
• Hoped-for future• Each teams enters its own data in less cumbersome
“local” tools• Tool for descriptive data, especially copying in and out
of the ILS
• Tool for data about the source item and certain technical aspects, copied in and out of MAVIS
• Tool for digiProv data, “the engineers’ form”
• Tool or a MAVIS extension to encode the structMap
Supporting Tools
• To support the hoped-for future
• Centralized tool to gather and compile the various XML data units into a METS instance
• Facility to manage the METS XML documents