Download - Smithsonian institution libraries
Smithsonian Institution Libraries“Metadata Mixing & Matching For
Discovery”
CLSC 877
Metadata and Project Management
Suzanne C. Pilsk ~ Smithsonian Institution Libraries ~ 2010
Facts and FiguresSmithsonian Institution Libraries
– Washington, D.C.• Anacostia Museum & Center for African American History and Culture
Library
• Anthropology Library
• Botany and Horticulture Library
• The Dibner Library of the History of Science and Technology
• Freer Gallery of Art and Arthur M. Sackler Gallery Library
• Hirshhorn Museum and Sculpture Garden Library
• Joseph F. Cullman 3rd Library of Natural History
Facts and FiguresSmithsonian Institution Libraries
Washington, D.C. (continued)• Museum Studies & Reference Library
• National Air and Space Museum Library
• National Museum of American History Library
• National Museum of Natural History Library
• National Postal Museum Library
• National Zoological Park Library
• Smithsonian American Art Museum/National Portrait Gallery Library
• Warren M. Robbins Library, National Museum of African Art
Facts and FiguresSmithsonian Institution Libraries
• Elsewhere– Suitland, Md.
• Museum Support Center Library
• National Museum of the American Indian Library
– Edgewater, Md.
• Smithsonian Environmental Research Center Library
– New York City
• Cooper-Hewitt, National Design Museum Library
– Republic of Panama
• Smithsonian Tropical Research Institute Library
Facts and FiguresSmithsonian Institution Libraries
• African Art
• African American History and Culture
• Anthropology
• American Art
• American History
• Asian and Middle Eastern Art
• Aviation history and Space Flight
• Design and Decorative Arts
• Environmental Management and Ecology
• History of Science and Technology
• Latino History and Culture
• Materials Research
• Modern and Contemporary Art
• Museology
• Native American History and Culture
• Natural History
• Postal History
• Tropical Biology
• Trade Literature
• World’s Fair Ephemera
What’s So Special?
Public Museum
Smithsonian Institution is the largest museum complex in the world …
“The Nation’s Attic”
“Increase and Diffusion of Knowledge”
Unlock the Mysteries of the Universe
Understanding and Sustaining a Biodiverse Planet
Valuing World Cultures
Understanding the American Experience
SIL Mission(Smithsonian Directive 500)
As the largest and most diverse museum library in the world, SIL leads the Smithsonian in taking advantage of the opportunities of the digital society. SIL provides authoritative information and creates innovative services and programs for Smithsonian Institution researchers, scholars and curators, as well as the general public, to further their quest for knowledge. Through paper preservation and digital technologies, SIL ensures broad and enduring access to the Libraries’ collections for all users.
SIL’s Strategic Plan “Focus on Service”
• GOAL 1: COLLABORATING ACROSS BOUNDARIES– SIL creates a compelling environment for connecting, collaborating and
exploring across disciplines and information boundaries
• GOAL 2: DISCOVERING INFORMATION– SIL enhances and eases the discovery of information in our collections
for SI scholars, researchers, scientists, and the larger world of learners
• GOAL 3: CONNECTING WITH USERS– SIL understands and meets user needs, serving users where they live
and work
• GOAL 4: BUILDING EXPERTISE– SIL builds expertise on information discovery, navigation and
management
• GOAL 5: ENABLING OUR MISSION– SIL ensures its success through increased financial strength, effective
administrative support, and organizational excellence
Facts and FiguresSmithsonian Institution Libraries
Total volumes
> 1.7 million
50,000 are rare books
10,000 manuscripts
Trade Catalogs
> 500, 000 items
> 30,000 companies
dating from the 1800s
Facts and Figures
• 102 Smithsonian Libraries Staff
• 17 Souls in Cataloging Services (with contractors)
• Traditional Library
• Traditional Services
Integrated Library System
Smithsonian Institution Research Information System (SIRIS)
– MARC
– AACR2r
– ISBD
– LC Classification
– LC Subject Headings
Traditional Cataloging
• Monographs
• Serials
• Videos
• Microfilm/fiche
• Sound Recordings
• CD/DVDs
• Electronic Resources
Traditional Cataloging
• OCLC
• Program for Cooperative Cataloging
– NACO
– SACO
– BIBCO
SI Libraries Serves
• Curators
• Researchers
• Post-Docs
• Museum Administrators
• Public
How to make THIS into 0’s and 1’s
Virtual Library defined in theOnline Dictionary for
Library and Information Science
A "library without walls" in which the collections do not exist … [in] tangible form at a physical location but are electronically accessible in digital format via computer networks. The term digital library is more appropriate because virtual (borrowed from "virtual reality") suggests that the experience of using such a library is not the same as the "real" thing when in fact the experience of reading or viewing a document on a computer screen may be qualitatively different from reading the same publication in print, but the information content is the same regardless of format.~ http://lu.com/odlis/odlis_v.cfm
Digital Library defined in theOnline Dictionary for
Library and Information Science
A library in which a significant proportion of the resources are available in machine-readable format … . The digital content may be locally held or accessed remotely via computer networks. … In libraries, the process of digitization began with the catalog, moved to periodical indexes and abstracting services, then to periodicals and large reference works, and finally to book publishing.~ http://lu.com/odlis/odlis_v.cfm
Traditional Digital Library
• Electronic Journals & Databases
• Digital Editions
• Online Exhibitions
• Online Catalog
• Digital Reference
Will they find it?
If you digitize it …
Search Gone BAD!
IFLA’s Functional Requirements for Bibliographic Data
To Find
To Identify
To Select
To Obtain
To USE
Metadata
Metadata – failure to serve
Metadata: MARC
MARC
110 Oscar Mayer & Co.
650 Frankfurters
Metadata
Dublin Core
Creator:
Oscar Mayer & Co.
Subject:
Frankfurters
02761nam 2200469 4500001000700000005001700007008004100024010002300065019001300088035001400101035002300115040006100138049002700199050001500226100004200241245019300283260008300476300001700559504033500576505015400911590010901065590009601174650002601270945002101296945007301317945003101390945004801421945004801469945004701517945007901564945004401643945004601687945004801733945007601781945004401857945005101901945005101952945007102003945009002074945009602164945003102260-459797-20050131154400.0-731129m19021933enk b 000 0 lat c- aagr03000069 //r582- a14018362- aABY6485LB- a(OCoLC)ocm00751549- aU.S. Dept. of Agr. Libr.cRIUdOCLdCHSdSERdSMIdWaOLN- aSMI$aSMIMaSMIEaSMIB-00aQL354b.S5-1 aOscar Mayer & Co.-10aPronto pup:bhot dogs hamburgers/ca Oscar Mayer and Company.- aNew Orleans, La. :bBourbon Street Foods,c2000.
Metadata: Real MARC – Still failure to serve
Metadata: MARCXML
<?xml version="1.0" encoding="UTF-8" ?><collection xmlns="http://www.loc.gov/MARC21/slim"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd">
<record><leader>02761nam a2200469 4500</leader><controlfield tag="001">459797</controlfield><controlfield tag="005">20050131154400.0</controlfield><controlfield tag="008">731129m19021933enk b 000 0 lat
c</controlfield><datafield tag="010" ind1=" " ind2=" "><subfield code="a">agr03000069 //r582</subfield></datafield>
- Specimen- Plate or other visual image- Taxonomic description
Beyond the Traditional
Taxonomic Literature Needs/Requests
• Beyond the Scan
• Beyond the Re-Keyed
• Marking up the data in metadata schemas
MARC
LCSH/LCCS
ISBDFeed the cat
Pick up dry cleaning
Milk, eggs, lactaid
Make dentist appt.
AACR
MARC
AACR
LCSH/LCCS
ISBD
Feed the cat
Pick up dry cleaning
MODSXML
Dublin Core
ONIX
METs
TEI
FRBR
Access
Hierarchical
Faceted
relatedItem
Milk, eggs, lactaid
Add hotdogs to grocery list
Dewey
XMP
RDA
Discoverable
Interoperability
Open AccessFeed the cat
Pick up dry cleaning
Milk, eggs, lactaid
Make dentist appt.
Collaboration
Biodiversity Heritage Library (BHL)
EOL species need
CuratorRequest
“gap-fill”for other BHL library
Pull from stacks
Circ in ILS Preliminary metadata checkAnd physical check
Goin’ down the rows
serial?
“Bid”on title, select in picklist
The Stacks
Meta-datacheck
Preser-vationreview
Other library“bid” ?
Circ to cataloging for MARC editing
Update picklist if item record has been changedDuring cataloging touch-upCirc to scanner
Put on shipping cart, generate‘packinglist’ invoice
Evaluate titleNeed is…
pass
yes
pass
fail
fail
no
Carts delivered to scanner
Picklist DatabaseStores Select / reject / shipstate & suppliesitem metadatato IA
Select title in picklist,upload to monograph de-duper
Duplicate?yes
BibliographicData from SIRIS
noReject in picklist,Circ in HorizonReturn to stacks
noReject in picklist,return to stacks
IA scanning processUnique IA id is assignedMetadata is gathered fromSIRIS and the picklist dbAnd associated with the scanJP2000s generated& transformedServed on archive.orgQA is done by IA on 10%
Books are returned, cart contents areverified against invoice
SIL does 20% QAChecking for metadata matchingWith item, scan quality etc
Updated in picklist as scannedCirc in HorizonPlace BHL sticker near barcodeReturn to Stacks
Pass QA?
BHL PortalPeriodically harvestsMarc.xml (bib) and itemRecords, along with JP2000 fromArchive.orgTo index and displayIn the portal
yesno
Update picklist to indicate rescan
Put on shipping cart, generate ‘packinglist’ Invoice, alert scanning center
Carts delivered to scanner
Download .csv from portal with SIL barcodes, Portal URLs
Send URLs to SIRISOffice for batch updates
Ernest IngersollHand-book to the National Museum … Smithsonian Institution, 1886
Mass Scanning Workflow
BHL
•Bid Lists•Serials Management•Pick Lists•Packing Lists•Monographic Management•Local data flow•WonderFetchtm
•Return of data•Return of material•Billing
1. Select Book ~Pull from Shelf
2. Review Physically and Metadata
3. Establish viability and create Wonderfetchtm
4. Send to IA scanning center
5. Book is scanned & QA
6. Page images loaded
7. Derivatives created
8. Book returned to library
9. Files harvested from IA portal to BHL
10. Taxonomic Intelligence Added
11. Available through BHL
BHL
Monographic DeDuper
The BHL Portal is not a library catalog
Circulation
Sort and Filter for Appropriate Material
Packing/ Receiving Lists
Identify at time of scan –
quality review
Ingested in to
Discovery Tools
Linked
Discovery
to Delivery
Receive NEW Data
Incorporate,
Re-Purpose, Re-Use
Hooks for Known and Unknown Re-Use
Descriptive Data
Chronology Issues
Holdings
(Badly done graphic)
Collections.SI.edu ~ SI Libraries
842,000 Records in ILS27,805 Trade literature
74,613 Art and Artists files4,000 SI Digital Repository
(SI Research Online)
Not in Collections.Si.Edu
Collections.SI.edu ~ Freer + Sackler
53% of the ENTIRE
collection at www.asia.si.edu
& collections.si.edu
12,269 objects online
NOT: F/S G’s Study Collection – 10,872 objects only for study not for exhibit – will never go online
Collections.SI.edu ~ NPM
12,000 Records
Collections.si.edu
16,000 Records in the ARAGO
214,000 Records in the database
6 Million objects
= 0.2% in Collections.si.edu
Collections.SI.edu ~ NMNH
NMNH estimates 126 Million Specimens
Collections.SI.edu ~ NMNH
NMNH estimates 126 Million Specimens
5,400,000 Catalog Records in collection management system –
5,218,793 available on collections.nmnh.si.edu (181,207 records not available)
Collections.SI.edu ~ NMNH
Coming soon:
IZ 992,000 (68,000 with media)
Bot 788,000 (1,300 with media)
Collections.SI.edu ~ NMNH
NMNH estimates 126 Million Specimens
5,400,000 Catalog Records in collection management system – 5,218,793 available on
collections.nmnh.si.edu (181,207 records not available)
6 out of 10 units supplying data to collections.si.edu = 2,527,557 records
(153,418 have images)
Collections.SI.edu
50% of the records are from 1 source
(NMNH and still growing 2,527,557 records
with 153,418 images)
4,600,000 Records
445,000 Images
40 Data sources
SI Wide Estimations
• 136.9 MILLION objects
• 13 MILLION digital records
• 821,000 digital images
“The worth and importance of the Institution is not to be estimated by what it accumulates within the walls of its building, but by what it sends forth to the world.”
—Joseph HenryThe Smithsonian Institution’s First Secretary
1852
Credits
Thanks to staff at
NMAI SIL
NMNH MBL/WHOI Library
NPM MoBot
Freer/Sackler NYBG
BHL
Smithsonian Institution Libraries“Metadata Mixing & Matching For
Discovery”
CLSC 877
Metadata and Project Management
Suzanne C. Pilsk Smithsonian Institution Libraries