europeana network association agm 2016 - 9 november - speaker: joan cobb
TRANSCRIPT
The Getty Datasets and the Semantic WebJoan Cobb
Technical Lead, Getty Vocabularies, J. Paul Getty Trust
Europeana Network Association AGM Riga, Latvia, 8-9 November 2016jcobb
@getty.edu
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
The J. Paul Getty TrustA cultural and philanthropic institution dedicated to the presentation,
conservation, and interpretation of the world’s artistic legacy.www.getty.edu
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
The Getty Foundation
The Getty Conservation Institute
The Getty Research Institute
The J. Paul GettyMuseum
Constituent Programs of the J. Paul Getty Trust
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
The Getty’s Open Content Program
Began in August 2013 with the release of roughly 6,400 high-resolution images from the collections of the Getty Museum and the Getty Research Institute
The Getty Search Gateway now contains more than 1 million items http://search.getty.edu/gateway/landing
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
The Challenge
To find a way to publish the Getty’s datasets as Linked Open Data
that would preserve the richness of the content
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Some 2013 LOD InfluencesJan Feb Mar Apr May Jun Jul Aug Sep
Tim Berners-Lee TED Talkhttp://www.ted.com/talks/tim_berners_lee_the_year_open_data_went_worldwide.html
Dr. Zeng delivered a 62 page report on why we should care about the semantic web
Wanted very much to align with the Getty Vocabs
Eero Hyvönen’s bookPublishing and Using Cultural Heritage Linked Data on the Semantic Web
Europeana Video – Sometimes a picture is worth a thousand words – in this case, it’s a video http://vimeo.com/36752317
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
The Getty Vocabularies were the first datasets to be published as LOD
Reasons:• AAT, TGN, and ULAN were already linked by design • The data was clean• We had complete control of the custom, in-house developed, databases and applications that
support and publish the Getty vocabularies• We had been receiving request for years to publish these datasets as LOD• The Getty vocabularies would help to connect the rest of the Getty resources to each other
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
AAT AAT
AAT
Vocabularies Provide Access Points to Works of Art
http://www.getty.edu/art/collection/objects/548
TGN
ULAN
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Another Example of Enhanced Access
http://www.getty.edu/art/collection/objects/1155/
ULAN
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Where do the terms and concepts come from?
Example of some of the top contributors
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
International Terminology Working Group
August 2010 November 2011
August 2016
January 2013
September 2014
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Art & Architecture Thesaurus®
Current totals 57,824 concepts 370,310 terms
Scope includes generic terms for work types, roles, materials, styles, cultures, techniques, attributes, abstract concepts
AATReleased as LOD20 February 2014
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Getty Thesaurus of Geographic Names®
Current totals2,518,360 places4,064,885 terms
Scope includes cities, nations, empires, archaeological sites, physical features
TGNReleased as LOD 21 August 2014
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Released as LOD 30 April 2015
Union List of Artist Names®
Current totals262,443 agents682,328 names
Scope includes artists, architects, firms, studios, patrons, sitters; named and anonymous
ULAN
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Bi-weekly exports to Linked Open Data
Publication Cycle
Editorial System (VCS)
Bi-weekly exports to the public web sites
Web Service APIs
BatchOnline Forms
Contributions to AAT, TGN, ULAN, & CONA
Relational Table Releases
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Human-readable Datahttp://vocab.getty.edu/page/aat/300198841
http://vocab.getty.edu/aat/300198841
Machine-readable Data
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
© 2016 J. Paul Getty Trust, author: Joan Cobb. For educational purposes only. Do not distribute.
Machine-readable Data Formats AvailableJSON JSONLD RDF
N-Triples N3/Turtle
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
General Information Site
Target audience: Anyone interested in general information about the project
http://www.getty.edu/research/tools/vocabularies/lod/index.html
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
SPARQL Endpointhttp://vocab.getty.edu/
Target audience: Technical developers who are interested in making use of the machine-readable data
94 pages of in-depth documentation
40 diagramsfull datasets
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
To join send email to
Public Discussion Forum• Ask questions• Discuss Issues• Find solutions• Share usage stories• Learn from each other
© 2016 J. Paul Getty Trust, author: Joan Cobb. For educational purposes only. Do not distribute.
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Tracking Usage Can Be Difficult
• Once data is truly open it is difficult to track usage.• Usage is often not visible because the LOD links are part of
the machine code that create what is visible.• Ways we know the data is being used:
• Comments on Twitter• Comments on other cultural heritage sites like Europeana or
LODLAM (Linked Open Data for Libraries, Archives, and Museums)• Discussions on our public forum• Publications• Email from users
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Usage Stories• To date, we have received 42 usage stories from people
kind enough to let us know how they are making use of the LOD publications of AAT, TGN and ULAN.
• In the future we want to provide links to these resources from the Getty’s LOD sites.
• The following slides show some examples of usage.
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
EADitor
Ethan GruberUS Numismatic Society
Nomisma.orgKerameikos.org
© 2014 J. Paul Getty Trust, author: Joan Cobb .For educational purposes only. Do not distribute.
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
http://pro.europeana.eu/pro-blog/-/blogs/2293682
© 2014 J. Paul Getty Trust, author: Joan Cobb .For educational purposes only. Do not distribute.
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
http://www.partage-plus.eu/
Partage Plus Digitising and Enabling
Art Nouveau for Europeana
© 2014 J. Paul Getty Trust, author: Joan Cobb .For educational purposes only. Do not distribute.
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Indexing Plugin for
Adobe Bridge
Greg Reser UC San Diego
Library
© 2014 J. Paul Getty Trust, author: Joan Cobb .For educational purposes only. Do not distribute.
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
© 2014 J. Paul Getty Trust, author: Joan Cobb .For educational purposes only. Do not distribute.
Athanasios Velios
Ligatus
University of theArts London
Now includes all three vocabularies
https://www.drupal.org/sandbox/avelios/2328571
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Thomas Gilcrease MuseumTulsa, Oklahoma
About the CreatorInformation comes from ULAN LOD
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
Academia Sinica
Digitization of the works of
Chengpo Chen (1895-1947)
Using AAT LOD to bridge between the descriptions of the artworks in the Chen collection and those in other collections in the world.
© 2014 J. Paul Getty Trust, author: Joan Cobb .For educational purposes only. Do not distribute.
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
What’s Next?
Connect the silos in the process• Museum Objects• Library resources• Provenance• Conservation• Research• Images• Getty Vocabularies
Build on Vocabularies to Transform Cultural Heritage Resources as “Open Content” through
Linked Open Data (LOD) and International Image Interoperability Framework
The
Getty
Dat
aset
s a
nd th
e Se
man
tic W
eb
With thanks to my Getty colleagues:
Patricia HarpringManaging Editor, Getty [email protected]
Gregg GarciaLead Developer, Getty [email protected]
Murtha Baca, HeadDigital Art History [email protected]
Rob SandersonSemantic [email protected]
Joan CobbTechnical Lead, Getty Vocabularies
The J. Paul Getty Trust1200 Getty Center Drive
Los Angeles, CA 90049