Linked Data in Archives
Presented 2012-08-23
SAA 2012, Linking Data Across Libraries, Archives, and Museums
Corey A Harper
Publish, Enrich, Refine, Reconcile, Relate
2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 2
Semantic Web
• Tim Berners-Lee’s original vision
“Weaving the Web” – 1999
• Then: Focus on Machine Reasoning
Scientific American Article
• Now: Focus on things & links
Reasoning & Inferencing less central
Semantic Web
• Originally:
Metadata standard built on XML
Metadata about “Web” things (documents)
• Eventually:
Metadata about all sorts of things
And about relationships between things
Linked Open Data
• Use URIs as names for things
• Use HTTP URIs so that people can look
up those names.
• When someone looks up a URI, provide
useful information.
• Include links to other URIs, so that they
can discover more things.
http://www.w3.org/DesignIssues/LinkedData.html
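The four rules above can be tried from any HTTP client. As a minimal sketch (not from the slides), the Python below builds, without sending, a request that looks up a URI and asks for RDF rather than a web page. The DBpedia URI and the Turtle media type are illustrative assumptions; any HTTP URI that names a thing would do.

```python
from urllib.request import Request

def build_lookup(uri: str, media_type: str = "text/turtle") -> Request:
    """Build (but do not send) an HTTP GET that asks for RDF about `uri`."""
    # The Accept header is how a client asks for data instead of a web page.
    return Request(uri, headers={"Accept": media_type}, method="GET")

# Hypothetical example URI, chosen only for illustration.
req = build_lookup("http://dbpedia.org/resource/Society_of_American_Archivists")
print(req.get_method(), req.full_url, req.get_header("Accept"))
```

Passing `req` to `urllib.request.urlopen` would perform the actual lookup; whether a given server honors the Accept header depends on that server.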
Linked Data
• Metadata as a Graph
• Typed “things”, named by URIs
• The relationships between those
things, also built on URIs
• Ease of integration *across* data
sources – “merging graphs”
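To make "merging graphs" concrete, here is a small sketch, assuming a graph is modeled as a plain Python set of (subject, predicate, object) triples rather than with an RDF library. All example.org URIs are invented for illustration; the FOAF and Dublin Core property URIs are real vocabulary terms.

```python
from typing import Set, Tuple

Triple = Tuple[str, str, str]  # (subject URI, predicate URI, object URI or literal)

# Two hypothetical data sources that use the same URI for the same person.
PERSON = "http://example.org/person/1"
CREATOR = "http://purl.org/dc/terms/creator"
NAME = "http://xmlns.com/foaf/0.1/name"

archive_graph: Set[Triple] = {
    (PERSON, NAME, "Jane Doe"),
    ("http://example.org/collection/42", CREATOR, PERSON),
}
library_graph: Set[Triple] = {
    (PERSON, NAME, "Jane Doe"),
    ("http://example.org/book/7", CREATOR, PERSON),
}

def merge(*graphs: Set[Triple]) -> Set[Triple]:
    """Merging graphs is set union: identical triples collapse into one."""
    merged: Set[Triple] = set()
    for g in graphs:
        merged |= g
    return merged

def created_by(graph: Set[Triple], agent: str) -> Set[str]:
    """Return every subject linked to `agent` by dcterms:creator."""
    return {s for (s, p, o) in graph if p == CREATOR and o == agent}

merged = merge(archive_graph, library_graph)
print(len(merged))  # 3: the shared name triple appears only once
print(sorted(created_by(merged, PERSON)))
```

Because both sources name the person with the same URI, the merged graph answers a question ("what did this person create?") that neither source could answer alone.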
Growth of the Linked Data cloud
DBpedia
• Structured Wikipedia Data
• Genres, Influences, External Links
• Multi-lingual / Multi-script labels
• Rich Semantics
• Many linkages to other datasets
DBpedia Model
• Partial basis in data entry conventions
• InfoBoxes, and InfoBox Templates
• Metadata Entry Format
• Partial source of Ontology
Class Structure
Vocabulary Design
DBpedia
• 3.4 Million “things” described
• Ontology based on “infoboxes”
1.5 million things classified
http://wiki.dbpedia.org/Ontology
• Approx. 50,000 “Properties”
Approx. 1,200 defined in ontology
http://thinkbase.cs.auckland.ac.nz/start.jsp
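DBpedia's properties, such as the influences mentioned earlier, can be queried with SPARQL. The sketch below only builds the query text and a GET URL; it does not contact any server. The endpoint address, the `dbo:influencedBy` property, and the Bob Dylan resource are assumptions chosen for illustration, and the data returned today may differ from what was available in 2012.

```python
from urllib.parse import urlencode

# Assumed public endpoint; availability and contents may have changed.
ENDPOINT = "http://dbpedia.org/sparql"

def influences_query(resource_uri: str, limit: int = 10) -> str:
    """Return a SPARQL query for things that `resource_uri` was influenced by."""
    return (
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        "SELECT ?influence WHERE {\n"
        f"  <{resource_uri}> dbo:influencedBy ?influence .\n"
        f"}} LIMIT {limit}"
    )

def query_url(query: str) -> str:
    """Encode the query as a GET URL using the SPARQL protocol's `query` parameter."""
    return ENDPOINT + "?" + urlencode({"query": query})

# Hypothetical example resource.
url = query_url(influences_query("http://dbpedia.org/resource/Bob_Dylan"))
print(url)
```

Opening the printed URL in a browser, or fetching it with an Accept header for a SPARQL results format, would run the query against the endpoint.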
Google Knowledge Graph
Google Refine
Automated Authorities?
Belgians! http://freeyourmetadata.org/
BBC Chimps
BBC Wildlife
BBC Programmes
http://datagov.clarkparsia.com/
http://weblog.clarkparsia.com/2010/05/26/another-reason-semantic-web-kicks-ass/
RelFinder
LinkSailor: http://linksailor.com/nav
LAWDI http://opencontext.org/
LAWDI
• Linked Ancient World Data Institute
Archeologists, Numismatists, Classicists
Quasi-Digital Humanities
• Doing their own Linked Data
• Excited about Libraries helping
VIAF, id.loc.gov, FAST, OCLC numbers, etc.
• Actively modeling ancient place names
W3C Linked Library Data Incubator
• Collected, Curated and Clustered over
50 Use Cases
• Mined use cases for functional
requirements and design patterns
• Recommendations to W3C
• http://www.w3.org/2005/Incubator/lld/
Use Case Categories
• Bibliographic Data
• Authority Data
• Archives & Heterogeneous Metadata
• Citations
• Digital Objects
• Collections
• Social & New Uses
So What Can You Do?
• Iterative changes to metadata
• Adding identifiers where you can
Unit Titles, Component Levels
• Access points at series, subseries, folder
• Relationships rather than (or in addition
to) prose
• RDFa embedded in HTML Finding Aids
• Start playing with tools and techniques
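As one way to picture "RDFa embedded in HTML Finding Aids", the sketch below renders a single finding-aid component as an HTML fragment carrying RDFa attributes. It is an illustration under assumptions, not a recommended profile: the function name, the choice of Dublin Core terms, and every example.org URI are hypothetical.

```python
from html import escape

DCTERMS = "http://purl.org/dc/terms/"

def rdfa_component(uri: str, title: str, creator_uri: str, creator_name: str) -> str:
    """Render one finding-aid component as an HTML fragment with RDFa attributes."""
    return (
        # `about` gives the component its own identifier (the subject).
        f'<div prefix="dcterms: {DCTERMS}" about="{escape(uri)}">\n'
        # `property` marks the visible text as a literal value.
        f'  <span property="dcterms:title">{escape(title)}</span>\n'
        # `rel` + `href` state a relationship to another URI, not just prose.
        f'  <a rel="dcterms:creator" href="{escape(creator_uri)}">'
        f'{escape(creator_name)}</a>\n'
        f'</div>'
    )

# Hypothetical series-level component and creator identifier.
print(rdfa_component(
    "http://example.org/findingaid/1/series/2",
    "Letters & Diaries",
    "http://example.org/person/1",
    "Jane Doe",
))
```

The human-readable finding aid stays the same; the added attributes let a machine read the series title and follow the creator link as an access point.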
Daily Worker
Refine
ViewShare
LC Bibliographic Framework Transition http://www.loc.gov/marc/transition/
• Distributed information ecosystem
Linking Data
Focus on identification over description
• Create navigable, browsable information
landscapes
• Relationships between resources weave
context & enrich user experiences
Next Steps & Works in Progress
• Provenance
• Licensing
• Best Practices, Modeling & Infrastructure
• DCMI & W3C Work!
DC Abstract Model / Application Profiles /
Description Sets
Vocabulary Management
Schema.org mappings
Provenance Ontology
• Interface Design
New Interfaces
Not Just for Libraries, Archives, Museums!
• Providing models & resources for
scholars and researchers
• Digital Humanities (LAWDI)
• Adding authoritative, stable URIs to the
grid that others can link to
• Pouring our history of information management into tools
like Freebase
Thanks!
212.998.2479
@chrpr