linked data for digital history presentation for vu symposium "connecting data for...

32
Linked Data for Digital History Connecting Data for Research Victor de Boer With input from Christophe Guéret, Serge ter Braake, Niels Ockeloen, Antske Fokkens, Dirk Roorda, Lora Aroyo, Johan Oomen, Oana Inel, Jan Wielemaker, Jeroen Entjes

Upload: victor-de-boer

Post on 19-Feb-2017

744 views

Category:

Education


0 download

TRANSCRIPT

Page 1: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Linked Data for Digital History

Connecting Data for Research

Victor de Boer

With input from Christophe Guéret, Serge ter Braake, Niels Ockeloen, Antske Fokkens, Dirk Roorda, Lora

Aroyo, Johan Oomen, Oana Inel, Jan Wielemaker, Jeroen Entjes

Page 2: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Victor de Boer

Web & Media Group, CS, Vrije Universiteit AmsterdamNetherlands Institute for Sound and Vision

Cultural HeritageDigital History

Linked Data for Development

Page 3: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Digital HistorySub-discipline of digital humanities

Part of the effort of historian is moved from the physical archives to digital ones

Cross-domain collaborationImg:www.doaks.org, www.dkrz.de

Page 5: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

“That is great. I would love that…

…but my research questions are slightly different.”

Img:Monty Python

Page 6: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Aging

Data Tool

C. Guéret based on http://redmonk.com/jgovernor/2007/04/05/why-applciations-are-like-fish-and-data-is-like0wine/

Page 7: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Even betterDo not bake the data into the tool and treat data as an end product.Build tools on top of the data.Make sure others can do so as well.

Fig: C. Guéret

Page 8: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Linked Data for Digital History

• Represent heterogeneous datasets with their own data models in common format: Resource Description Format (RDF)– Link what can be linked

• re-use and re-usability

• Linked Data is the (technically) best way to publish and share your (research) data

OBJECT EVENT

PLACE

TIME

PERSON

CONCEPT

PROVENANCE

Page 9: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Some examples

Page 10: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Dutch Ships and Sailors

Page 11: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

The Problem:((Maritime) historical) data is not integrated

Page 12: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

KB NEWSPAPERS

Dutch-Asiatic Shipping “VOC Opvarenden”

Jur Leinenga Matthias van Rossum

Elbing voyagesArchangel voyages

Page 13: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

DIFFERENT but LINKED DATAMODELS BASED ON COMPETENCY QUESTIONS

dss:Recordgzmvoc:Telling

gzmvoc:telling-1046-De_Berkel

__bnode_1

gzmvoc:aziatischeBemanning

dss:Shipgzmvoc:Schip

gzmvoc: schip-1046-De_Berkel

dss:has_shipgzmvoc:schip

"1046"

“Schip”

“De Berkel”

rdfs:labeldss:scheepsnaam

gzmvoc:scheepsnaam

dss:ShipTypegzmvoc:Scheepstype

gzmvoc: type-Shipdss:has_shiptype

gzmvoc:has_shiptype

gzmvoc:scheepstype

“21”

“Moorse mattroosen”

dss:azRegistratieKop

gzmvoc:azAantalMatrozen

gzmvoc:telling

gzmvoc:heeft DAS heenreis

dss:Recorddas:Voyage

das:voyage-1918_61

Page 14: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

ACCESS IT ATHTTP://DUTCHSHIPSANDSAILORS.NL/DATA

OR

HTTP://SEMANTICWEB.CS.VU.NL/DSS

SELECT * WHERE { ?record dss:hasOriginalScan ?scan. ?record dss:has_kb_link ?kblink. ?record mdb:schip ?schip. ?schip mdb:scheepstype ?shiptype. ?shiptype skos:exactMatch ?em. ?em skos:broader* aat:kustvaarders. }

Page 15: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Data analysis and visualisation

Page 17: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

MEDIA HISTORIANS AND RESEARCHERS Media researcher Lars Arve Røssland of the University of Bergen. (Photo: Andreas R. Graven)

EXPLORATIVE SEARCH

Digital Hermeneutics: The combination of digital (Web) technology and theory of interpretation

Page 19: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

ENTITY EXTRACTION

CROWDTRUTH.ORG

ENTITY EXTRACTION

EVENTS CROWDSOURCING AND LINKING TO CONCEPTS THROUGH CROWDTRUTH.ORG

SEGMENTATION & KEYFRAMES

LINKING EVENTS AND CONCEPTS TO KEYFRAMES

Page 20: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

DATA CONNECTED IN KNOWLEDGE GRAPH

DIVE:MEDIA OBJECT

SEM:EVENT

SEM:PLACE

SEM:TIME

SEM:ACTOR

SKOS:CONCEPT

OA:ANNOTATION

LINKS TO EUROPEANA LINKS TO DBPEDIA

Page 21: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

“DIGITAL SUBMARINE” INTERFACE

DIVE.BEELDENGELUID.NL

Page 22: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

BiographyNetStarting Point: Biography Portal of the Netherlands; www.biografischportaal.nl

125,000 short biographical descriptions with limited metadata from 23 Dutch biographical dictionaries (~76,000 individuals)

What kind of historical questions can be answered with these data with the help of computational methods

Biographynet.nl

Page 23: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Johan Rudolph Thorbecke werdin 1798 geboren op 14 januari in Zwolle en komt uit een half-Duitse…

Johan Rudolph Thorbecke werdin 1798 geboren op 14 januari in Zwolle en komt uit een half-Duitse…

Linked Data for BiograpyNet

Thorbecke

Biographical Description

ProvenanceMeta Data

NNBW

PersonMeta Data

“Thorbecke”

BiographyParts

Birth1798Event

Biographical Description

Enrichment NLP Tool

PersonMeta Data

EventBirth

Johan Rudolph Thorbecke werdin 1798 geboren op 14 januari in Zwolle en komt uit een half-Duitse…

Zwolle1798-01-14

Biographynet.nl

Page 24: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

a

Provenance in BiographynetEnsure credibility of the demonstrator, to evaluate its performance and to improve the academic status of the tool

Information involved Sources, but also: NER input data, etc. Processes involved All steps in enrichment, aggregation…People involved Who was responsible for pipeline, tool,

Biographynet.nl*Daniel Garijo, Yolanda Gil; http://www.opmw.org/model/p-plan

Page 25: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Interface for historians

Biographynet.nl

Page 26: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Framework generic solutions with historians1. Preprocess, Clean, Model, Link, Enrich data in a collaboration

with domain experts

2. Access heterogeneous datasets in a convenient way to get an intuition of the character and anomalies of the (linked) data;

3. Perform arbitrary queries to retrieve results relevant to their research questions;

4. Verify the veracity of query results, by following provenance links to original material

5. Retrieve and analyze the data with tool of preference.

6. Republish and share results

Page 27: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Historical tool criticism… willingness from historians to invest the time to learn about computer processes (at least the basic principles)

Possibilities for education at universities to bridge the gap between computer science and humanities studies and make tool criticism an integral part of student’s curricula

“Why do we still teach history student to decipher 17th Century handwriting, but not SQL”

Page 28: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Thank you!

Victor de Boer

http://[email protected]

@victordeboer

Page 29: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Verrijkt Koninkrijk

Page 30: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

30

National-Socialist29%

Social-Democrat21%Protestant

13%

Liberal12%

R-Catholic12%

Com

munist8%

Jewish5%

http://semanticweb.cs.vu.nl/verrijktkoninkrijk/http://search.loedejongdigitaal.nl/

Page 31: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

Results are links to paragraphs

Page 32: Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

re-usability

http://qhp.science.uva.nl/