openhpi 6.4 - linked data engineering

26
This file is licensed under the Creative Commons Attribution-NonCommercial 3.0 (CC BY-NC 3.0 ) Dr. Harald Sack Hasso Plattner Institute for IT Systems Engineering University of Potsdam Spring 2013 Semantic Web Technologies Lecture 6: Applications in the Web of Data 04: Linked Data Engineering

Upload: harald-sack

Post on 16-Jan-2015

349 views

Category:

Documents


3 download

DESCRIPTION

 

TRANSCRIPT

Page 1: OpenHPI 6.4 - Linked Data Engineering

This file is licensed under the Creative Commons Attribution-NonCommercial 3.0 (CC BY-NC 3.0)

Dr. Harald Sack

Hasso Plattner Institute for IT Systems Engineering

University of Potsdam

Spring 2013

Semantic Web Technologies

Lecture 6: Applications in the Web of Data04: Linked Data Engineering

Page 2: OpenHPI 6.4 - Linked Data Engineering

Semantic Web Technologies , Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

2

Lecture 6: Applications in the Web of DataOpen HPI - Course: Semantic Web Technologies

Page 3: OpenHPI 6.4 - Linked Data Engineering

Semantic Web Technologies , Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

3

04 - Linked Data EngineeringOpen HPI - Course: Semantic Web Technologies - Lecture 6: Applications in the Web of Data

Page 4: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

4

•Data can only be found on the Web, if it is available at some website

Database

Web-Server

JDBC

HTTPHTML

Browser

How to Get Data From the Web?

Page 5: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

5

How to Get Data From the Web?

•There is a number of different (proprietary) Web APIs, data exchange formats, and Mashups on top of that

Database 1

WebAPI 1

WebAPI 2

WebAPI 3

WebAPI 4

Database 2 Database 3 Database 4

Mashup

Page 6: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

6

In the Web today...

•Data is locked up in small data islands

•Other applications usually cannot access this data...

Database

Database

Database

Database

Database

Database

Database

Database

Database

Page 7: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

7

•Apply semantic technologies:

•to publish structured data on the web

•to draw connections from one data source to data from other data sources

Database 1 Database 2 Database 3 Database 4

RDF Data RDF Data RDF Data RDF Data

RDF Links

RDF Links

RDF Links

How to Get Rid of Closed Data Islands on the Web?

Page 8: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

8

Linked Data Principles

(1) Use URIs as names for things.

(2) Use HTTP URIs, so that people can look up those names.

(3) When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL)

(4) Include links to other URIs, so that they can discover more things.

Page 9: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

9

Linked Data Principles

(1) Use URIs as names for things.

•URIs do not only identify documents but also arbitrary objects of the real world as well as abstract concepts

http://dbpedia.org/resource/Albert_Einstein

http://musicbrainz.org/artist/b10bbbfc-cf9e-42e0-be17-e2c3e1d2600d

http://yovisto.blogspot.com/

Page 10: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

10

Linked Data Principles

(2) Use HTTP URIs, so that people can look up those names.

•HTTP URIs (URLs) as globally unique names enable dereferencing of assiciated information in the Web

•via http Content Negotiation

•303 URIshttp Response Code 303 ,See Other‘ (redirect)

•Hash URIshttp://example.com/Harald#me

Page 11: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

11

Linked Data for Humans and Computers

■URI should deliver information as well as for humans as for computers, i.e.

URI

Accept: application/rdf+xml Accept: text/html

(Thing)

(RDF data) (HTML page)

Page 12: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

12

■Server delivers different HTTP responses dependent ofHTTP-Accept-Header (Content Negotiation)

http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial/

Linked Data for Humans and Computers

Page 14: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

14

Linked Data Principles

(3) When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL)

•RDF as universal data model for publishing structured data on the Web

•Make all URIs in the RDF graph dereferencable

•Avoid RDF constructs that cause problems in Linked Data context

•RDF Reification

•RDF Collections und Containers

•unnamed Blank Nodes

Page 15: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

15

Linked Data Principles

(4) Include links to other URIs, so that they can discover more things.

•Link RDF references among data between different data sources, to find information related by content

•Relationship LinksLinks to external LOD Entitites related with the original entity

• Identity LinksLinks to external LOD Entities referring to the same object or concept

•Vocabulary LinksLinks to definitions of the original entity

Page 16: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

16

The application of the Linked Data Principles leads to a ,Web of Data‘

Page 17: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

17

Development of the ,Web of Data‘

May 2007

Page 18: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

18

Nov 2007

Development of the ,Web of Data‘

Page 19: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

19

Development of the ,Web of Data‘

Page 20: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

20

July 2009

Development of the ,Web of Data‘

Page 21: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

21

September 2010

Development of the ,Web of Data‘

Page 22: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

22

September 2011

300 Datasets 31B RDF Triples504M Links

Development of the ,Web of Data‘

Page 23: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

23

Linked Open Data

■Public Linked Data resources in the Web, licensed as„Creative Common CC-BY“

■ Tim Berners-Lee‘s 5-Star Criteria for Linked Open Data

Available on the web (whatever format) but with an open licence, to be Open Data

Available as machine-readable structured data (e.g. excel instead of image scan of a table)

as (2) plus non-proprietary format (e.g. CSV instead of excel)

All the above plus: use open standards from W3C (RDF and SPARQL) to identify things, so that people can point at your stuff

All the above, plus: link your data to other people’s data to provide context

★ ★

★ ★ ★★ ★ ★ ★

★ ★ ★ ★ ★

Page 24: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

24

Linked Open Data

Page 25: OpenHPI 6.4 - Linked Data Engineering

Vorlesung Semantic Web, Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

25

□Semantic Mashups are applications that use linked RDF data from various data sources.

□ in difference to interfaces and exchange formats or ordinary Web APIs, Linked Data offers the following benefits:

□ a flexible and standardized data format (RDF)□ standardized access mechanism (http)□ possibility to put links (RDF-Links) among different data

sources» enables navigation» is supported by search engines (crawler)» enables expressive search facilities over the crawled

data and beyond

Semantic Mashups

Page 26: OpenHPI 6.4 - Linked Data Engineering

Semantic Web Technologies , Dr. Harald Sack, Hasso-Plattner-Institut, Universität Potsdam

26

05 - Linked Data Engineering (Part 2)Open HPI - Course: Semantic Web Technologies - Lecture 6: Applications in the Web of Data