Download - Create your own digital repository
Create Your Own Create Your Own
Digital Digital RepositoryRepository
Leah PrescottLeah PrescottGetty Research InstituteGetty Research Institute
October 18, 2007October 18, 2007Master of St. Bartholomew,Master of St. Bartholomew, The Meeting of the Three The Meeting of the Three Kings (detail), Kings (detail), ca. 1480 J. Paul Getty Museumca. 1480 J. Paul Getty Museum
DIGITIZING IN A MATERIAL DIGITIZING IN A MATERIAL WORLDWORLD
What is a Digital Repository?
Digital Library, Digital Repository, Institutional Repository, Trusted Digital Repository, Digital Asset Management, Content Management System, Digital Commons...
Are they all the same thing?
What is a Digital Repository?
“A managed collection of digital objects... with consistent data and metadata structures for similar objects, enabling resource discovery by the ‘Communities of Practice’ for whom the objects are of interest”
Geoff Payne
What is a Digital Repository?
OAIS Reference ModelOpen Archival Information System
What is a Digital Repository?
Access Preservation
Institu
tion
al
Rep
osito
ry
Asset M
an
ag
em
en
t S
yste
m
Colle
ctio
ns M
an
ag
em
en
t S
yste
m
Tru
ste
d D
igita
l R
ep
osito
ry
Emblem (“Emblem (“Premitur, non opprimiturPremitur, non opprimitur”) from Claude Paradin, ”) from Claude Paradin, The heroicall devises of M. Claudius Paradin (London: W. The heroicall devises of M. Claudius Paradin (London: W. Kearney, 1591) Kearney, 1591) http://emblem.libraries.psu.edu/parad176.htmhttp://emblem.libraries.psu.edu/parad176.htm
Planning for a Repository
Planning for a Repository
What is the mission?What kinds of content will you
provide?Who are the key users?Who are the key stakeholders?What can you afford?What are your top service priorities?
Planning for a Repository
What Users Want from Digital Image Collections
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users tend to be impatient with long waits
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users tend to be impatient with long waits◊ Users need both image and text versions
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users tend to be impatient with long waits◊ Users need both image and text versions◊ Users are generally content with current
image quality
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users tend to be impatient with long waits◊ Users need both image and text versions◊ Users are generally content with current
image quality◊ Completeness and legibility of pages with
minimal scrolling is the primary user requirement for text- based documents.
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users tend to be impatient with long waits◊ Users need both image and text versions◊ Users are generally content with current
image quality◊ Completeness and legibility of pages with
minimal scrolling is the primary user requirement for text- based documents.
◊ Users want multiple views to support their different research needs and some are interested in tonal and color fidelity
Planning for a Repository
What Users Want from Digital Image Collections
◊ Large format graphical images such as maps are difficult to display and fully comprehend online – need tools that provide zoom, pan, and peripheral- view capabilities
Planning for a Repository
What Users Want from Digital Image Collections
◊ Large format graphical images such as maps are difficult to display and fully comprehend online – need tools that provide zoom, pan, and peripheral- view capabilities
◊ Users prefer a simple interface
Planning for a Repository
What Users Want from Digital Image Collections
◊ Large format graphical images such as maps are difficult to display and fully comprehend online – need tools that provide zoom, pan, and peripheral- view capabilities
◊ Users prefer a simple interface◊ Users want to navigate the structural and
intellectual content of image collections – backward and forward, jump to specific pages, table of contents, index
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users want to manipulate graphic images
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users want to manipulate graphic images Zoom to view detail
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by
side
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by
side Save search result sets
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by
side Save search result sets Sort search result sets
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by
side Save search result sets Sort search result sets Export images into other software
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by
side Save search result sets Sort search result sets Export images into other software
Produce high-quality printouts
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by side Save search result sets Sort search result sets Export images into other software
Produce high-quality printouts Annotate images with comments and
save them to a notebook
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by side Save search result sets Sort search result sets Export images into other software
Produce high-quality printouts Annotate images with comments and save
them to a notebook Use image-editing tools
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users need a variety of search functions
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users need a variety of search functions◊ Preferences for access points vary
according to the collection and the type of image Some perform searches for known items Other pose queries that contain a
preponderance of subject terms
Planning for a Repository
What Users Want from Digital Image Collections
◊ Users need a variety of search functions◊ Preferences for access points vary
according to the collection and the type of image Some perform searches for known items Other pose queries that contain a
preponderance of subject terms
◊ Search engines, query formulation and database indexing affect search results
Planning for a Repository
System Requirements – Technical
File formats supportedMetadata standardsInteroperability: OAI compliance, Z39.50 etc.Persistent URLSearch/Browse of metadataFull-text searchWorkflow, submission for content approvalUser authenticationCustomization features
Planning for a Repository
System Requirements – Technical
Free vs. commercial software (licence, subscription fees
Open Source vs. Proprietary
Technical Support availablefor fee vs. freeby phoneby emailvia online forms
Planning for a RepositorySystem Requirements - Metadata
““data about data” data about data” — — data data categoriescategories
data describing a discrete data data describing a discrete data object or objectsobject or objects
cataloging or indexing information cataloging or indexing information created to arrange, describe, and created to arrange, describe, and otherwise enhance access to an otherwise enhance access to an information objectinformation object
Planning for a RepositorySystem Requirements – Metadata
What is it?
Why is metadata Why is metadata important?important?
for enhanced for enhanced accessibilityaccessibility
for retention of contextfor retention of context for expanding usefor expanding use for multi-versioningfor multi-versioning for legal issuesfor legal issues for preservation of datafor preservation of data
Types of MetadataTypes of Metadata
AdministrativeAdministrative DescriptiveDescriptive PreservationPreservation TechnicalTechnical StructuralStructural UseUse
Dublin Core-Dublin Core- http://dublincore.org/
METS:METS:A Metadata “Wrapper” A Metadata “Wrapper” for Digital Information for Digital Information
ObjectsObjects
sample METS Objectsample METS Object
MODSMODS(Metadata Object Description (Metadata Object Description
Schema)Schema)An initiative of Network An initiative of Network Development and MARC Standards Development and MARC Standards Office at the Library of Congress Office at the Library of Congress http://www.loc.gov/standards/mods/ Uses XML schema Uses XML schema Originally designed for library Originally designed for library applications, but may be used for applications, but may be used for othersothers A derivative of MARCA derivative of MARC
Title InfoTitle InfoNameNameType of resourceType of resourceGenreGenreOrigin InfoOrigin InfoLanguageLanguagePhysical Physical descriptiondescriptionAbstractAbstractTable of contentsTable of contentsTarget audienceTarget audience
NoteNote
SubjectSubject
ClassificationClassification
Related itemRelated item
IdentifierIdentifier
LocationLocation
Access Access conditionsconditions
ExtensionExtension
Record InfoRecord Info
MODS high-level MODS high-level elementselements
Why do you need Why do you need standards?standards?
DATA STANDARDS: DATA STANDARDS: A BRIEF TYPOLOGYA BRIEF TYPOLOGY
Data structureData structure standards (metadata element standards (metadata element sets):sets): MARC, EAD, Dublin Core, METS, CDWA, VRA MARC, EAD, Dublin Core, METS, CDWA, VRA
CoreCore Data contentData content standards (cataloging rules): standards (cataloging rules):
AACR (AACR (RDA), ISBD, CCO, DA:CSRDA), ISBD, CCO, DA:CS Data valueData value standards (vocabularies): standards (vocabularies):
LCSH, LCNAF, TGM, AAT, TGN, ULANLCSH, LCNAF, TGM, AAT, TGN, ULAN Data formatData format standards (standards expressed in standards (standards expressed in
machine-readable form):machine-readable form): MARC, MARCXML, EAD, CDWA Lite XML, MARC, MARCXML, EAD, CDWA Lite XML,
Dublin Core Simple XML schema, DC Dublin Core Simple XML schema, DC Qualified XML schema, VRA Core XML Qualified XML schema, VRA Core XML schemaschema
Implementing a digital repository
Content SelectionDoes the item or collection have sufficient value to and demand from a current audience to justify digitization?
Does the proposed item or collection have active current users?Is there greater demand than can be served by the original or a traditional type of copy?Does it support high priority activities such as teaching of core courses that have large enrollments?Is it marketable to a group of specialists widely dispersed who all need access?
Implementing a digital repository
Content SelectionDo limitations on handling of fragile or valuable originals create a source of demand for high quality surrogates?
How does it fit with other materials on the same subject?Does it help build a distributed online collection?Do we have the legal right to create a digital version?Do we have the legal right to disseminate it?
Implementing a digital repository
Content SelectionCan the materials be digitized successfully?
Does or can digitization add something beyond simply creating a copy?
Can and should images be manipulated to make them more legible than the original items?OCRing for searchable text
Is the cost appropriate?
Issues with the Issues with the web...web...
Are your collections Are your collections “reachable” by commercial “reachable” by commercial search engines?search engines?
If yes, how will you If yes, how will you “contextualize” individual “contextualize” individual collection objects?collection objects?
If not, what is your strategy If not, what is your strategy to lead web users to your to lead web users to your search page?search page?
The “Visible web” The “Visible web” versus the “Deep versus the “Deep
web”web” The Visible web is what you see in the The Visible web is what you see in the results pages from general web search results pages from general web search engines & subject directories (static engines & subject directories (static web pages)web pages)
The Invisible or Deep web consists of The Invisible or Deep web consists of data from dynamically searchable data from dynamically searchable databases that cannot be indexed by databases that cannot be indexed by search engines, because they aren’t search engines, because they aren’t “stored” anywhere.“stored” anywhere.
PreservationPreservationEnduring CareBitstream CopyingDurable, Persistent MediaMigrationEmulationEncapsulationTechnology PreservationReinterpretation
EvaluationEvaluation Use metadata and usability Use metadata and usability
analysis should be a routine analysis should be a routine part of digital library work.part of digital library work.
Study end-user behavior Study end-user behavior (including your own)(including your own)
Joseph Ducreux, Joseph Ducreux, Yawning (Self-Yawning (Self-Portrait)Portrait), before 1783, before 1783
J. Paul Getty MuseumJ. Paul Getty Museum
Thank you Thank you for your for your
attention!attention!
[email protected]@getty.edudu