linked data in archives - society of american...

44
Linked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across Libraries, Archives, and Museums Corey A Harper Publish, Enrich, Refine, Reconcile, Relate

Upload: others

Post on 02-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

Linked Data in Archives

Presented 2012-08-23

SAA 2012, Linking Data Across Libraries, Archives, and Museums

Corey A Harper

Publish, Enrich, Refine, Reconcile, Relate

Page 2: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 2

Semantic Web

• TBL’s original vision

“Weaving the Web” – 1999

• Then: Focus on Machine Reasoning

Scientific American Article

• Now: Focus on things & links

Reasoning & Inferencing less central

Page 3: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 3

Semantic Web

• Originally:

Metadata standard built on XML

Metadata about “Web” things (documents)

• Eventually:

Metadata about all sorts of things

And about relationships between things

Page 4: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 4

Linked Open Data

• Use URIs as names for things

• Use HTTP URIs so that people can look

up those names.

• When someone looks up a URI, provide

useful information.

• Include links to other URIs. so that they

can discover more things. http://www.w3.org/DesignIssues/LinkedData.html

Page 5: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 5

Linked Data

• Metadata as a Graph

• Typed “things”, named by URIs

• The relationships between those

things, also built on URIs

• Ease of integration *across* data

sources – “merging graphs”

Page 6: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 6

Growth of the Linked Data cloud

Page 7: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 7

Page 8: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 8

Page 9: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 9

DBpedia

• Structured Wikipedia Data

• Genres, Influences, External Links

• Multi-lingual / Multi-script labels

• Rich Semantics

• Many linkages to other datasets

Page 10: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 10

DBpedia Model

• Partial basis in data entry conventions

• InfoBoxes, and InfoBox Templates

• Metadata Entry Format

• Partial source of Ontology

Class Structure

Vocabulary Design

Page 11: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 11

DBpedia

• 3.4 Million “things” described

• Ontology based on “infoboxes”

1.5 million things classified

http://wiki.dbpedia.org/Ontology

• Approx. 50,000 “Properties”

Approx. 1,200 defined in ontology

Page 12: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across
Page 13: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across
Page 14: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across
Page 15: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

http://thinkbase.cs.auckland.ac.nz/start.jsp

Page 16: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 16

Google Knowledge Graph

Page 17: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 17

Google Refine

Page 18: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 18

Automated Authorities?

Page 19: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 19

Belgians! http

://free

yo

urm

eta

da

ta.o

rg/

Page 20: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 20

BBC Chimps

Page 21: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 21

BBC Wildlife

Page 22: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 22

BBC Programmes

Page 23: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

http://datagov.clarkparsia.com/

http://weblog.clarkparsia.com/2010/05/26/another-reason-semantic-web-kicks-ass/

Page 24: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 24

RelFinder

Page 25: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 25

LinkSailor: http://linksailor.com/nav

Page 26: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 26

Page 27: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 27

Page 28: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 28

Page 29: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 29

LAWDI http

://openconte

xt.o

rg/

Page 30: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 30

LAWDI

• Linked Ancient World Metadata Institute

Archeologists, Numismatists, Classists

Quasi- Digital Humanities

• Doing their own Linked Data

• Excited about Libraries helping

VIAF, id.loc, FAST, OCLC #’s etc…

• Actively modeling ancient place names

Page 31: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 31

W3C Linked Library Data Incubator

• Collected, Curated and Clustered over

50 Use Cases

• Mined use cases for functional

requirements and design patterns

• Recommendations to W3C

• http://www.w3.org/2005/Incubator/lld/

Page 32: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 32

Use Case Categories

• Bibliographic Data

• Authority Data

• Archives & Heterogeneous Metadata

• Citations

• Digital Objects

• Collections

• Social & New Uses

Page 33: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 33

So What Can You Do?

• Iterative changes to metadata

• Adding identifiers where you can

Unit Titles, Component Levels

• Access points at series, subseries, folder

• Relationships rather than (or in addition

to) prose

• RDFa embedded in HTML Finding Aids

• Start playing with tools and techniques

Page 34: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 34

Daily Worker

Page 35: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 35

Page 36: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 36

Refine

Page 37: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 37

ViewShare

Page 38: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 38

Page 39: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 39

LC Bibliographic Framework Transition http

://ww

w.lo

c.g

ov/m

arc

/transitio

n/

Page 40: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 40

• Distributed information ecosystem

Linking Data

Focus on identification over description

• Create navigable, browsable information

landscapes

• Relationships between resources weave

context & enrich user experiences

Page 41: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 41

Next Steps & Works in Progress

• Provenance

• Licensing

• Best Practices, Modeling & Infrastructure

• DCMI & W3C Work! (Add links on new slide)

DC Abstract Model / Application Profiles /

Description Sets

Vocabulary Management

Schema.org mappings

Provenance Ontology

• Interface Design

Page 42: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 42

New Interfaces

Page 43: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 43

Not Just for Libraries, Archives, Museums!

• Providing models & resources for

scholars and researchers

• Digital Humanities (LAWDI)

• Adding authoritative, stable URIs to the

grid that others can link to

• Pouring our history of info mgt into tools

like Freebase

Page 44: Linked Data in Archives - Society of American …files.archivists.org/conference/sandiego2012/401-Harper.pdfLinked Data in Archives Presented 2012-08-23 SAA 2012, Linking Data Across

2012-08-10 SAA12 -- Linking Data Across Libraries, Archives, and Museums 44

Thanks!

[email protected]

212.998.2479

@chrpr