planetdata in a nutshell

27
PlanetData in a nutshell Elena Simperl, KIT 1st year review Luxembourg, December 2011

Upload: cayla

Post on 16-Jan-2016

37 views

Category:

Documents


0 download

DESCRIPTION

PlanetData in a nutshell. Elena Simperl, KIT 1st year review Luxembourg, December 2011. The idea. PlanetData‘s aim and objectives. Aim: establish an interdisciplinary, sustainable European community on large-scale data management Purposeful data exposure Novel and improved applications - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: PlanetData in a nutshell

PlanetData in a nutshell

Elena Simperl, KIT

1st year reviewLuxembourg, December 2011

Page 2: PlanetData in a nutshell

The idea

Page 3: PlanetData in a nutshell

PlanetData‘s aim and objectives

• Aim: establish an interdisciplinary, sustainable European community on large-scale data management• Purposeful data exposure• Novel and improved applications

• Objectives• Addressing challenges through integrated research• Data and technology provisioning through PlanetData Lab• Impact through training, dissemination, standardization

and networking• Openness and flexibility through PlanetData Programs

Databases

Data and Web

MiningSemantics

Page 4: PlanetData in a nutshell

Work plan and expected results

Page 5: PlanetData in a nutshell

Work packages and activities

Activity 1 Research

WP 1 Data Streams and Dynamicity

WP 2 Context Representation and Quality Assessment

WP 3 Provenance and Access

Policies

Activity 2 Data Provisioning and Management

WP 4 Data Provisioning

WP 5 Data Management

Act

ivit

y 3

Im

pact

Activity 4 Management

WP 6

Tra

inin

g

WP 7

D

isse

min

ati

on a

nd

Com

mun

ity

Build

ing

WP 8 Project Management

Page 6: PlanetData in a nutshell

Expected outcomes (i)

• Research on publishing and managing new species of interlinked data sets• Methods and techniques to publish, access and

manage stream data

• Research on improving the usefulness of existing linked data sources• Quality assessment for interlinked data sets (for

LOD and stream data), including best practices for the representation and usage of contextual information

• Provenance, trust, access control (for LOD and stream data)

Page 7: PlanetData in a nutshell

Expected outcomes (ii)

• Catalogues of data sets and vocabularies, including best practices for publishing and managing self-descriptive data

• Catalogues of data provisioning and management tools, including best practices on how to exploit clouds and clusters for distributed and large-scale data management

• Linked services and processes as an instrument to develop applications

Page 8: PlanetData in a nutshell

Expected results (iii)

• Yearly summer school co-located with the ESWC

• Open training infrastructure

• Semantic Web video journal

• PlanetData Programs

Page 9: PlanetData in a nutshell

Example scenario

relational DB

stream DB stream DB RDF (stream) DB

CSV twitter

C-SPARQL/SPARQL-STR/HTTP

registry

Qualit

y c

on

trol

Provenance and access control

ontologiesSSN

Page 10: PlanetData in a nutshell

Simplified scenario

GADM NUTS

Provenance and access control

Qualit

y c

on

trol

SPARQL SSN

NeoGeo

Page 11: PlanetData in a nutshell

Highlights of the first year

Page 12: PlanetData in a nutshell

Publishing and managing new species of interlinked data sets

• W3C SSN ontology documenting RDF data streams

• URI definition • Supporting technology

• Extensions to SPARQL (SPARQL-STR)• Data stream management systems (e.g.,

MonetDB)• Transformation/characterisation tools (e.g.,

Pachube2RDF)

Page 13: PlanetData in a nutshell

Improving the usefulness of linked data sourcesQuality assessment combining database techniques

with requirements from the Web of Data (LOD), and model-based techniques to clean sensor data

GeoVocab to represent geospatial informationGADM and NUTS region data and services published

with mapping to relevant data sets in the LODC

• Provenance of SPARQL queries based on relational models

• Annotation model for access control access control mechanism taking into account RDFS entailnment

Geovocab.org

Page 14: PlanetData in a nutshell

Cataloguing

Surveys◦ Metadata for Semantic Sensor Networks

◦ Vocabularies for datasets and streams

◦ Geospatial data, geospatial Ontology

http://vocab.cc

Page 15: PlanetData in a nutshell

Training

Page 16: PlanetData in a nutshell

Dissemination

Page 17: PlanetData in a nutshell

PlanetData Programs

• 1st Call: 37d proposals submitted with a total requested contribution of almost 3.000.000 €

• “Consuming and Quality Assessment of Linked Data in Urban Environments through Games with a Purpose” led by CEFRIEL

• “Consuming and Improving Norwegian Linked Open Data for Regional Development and Environmental Friendly Behavior” led by Computas AS

• “ParkMe: Linked Open Parking Data” led by Open University

Page 18: PlanetData in a nutshell

The management

Page 19: PlanetData in a nutshell

Some facts and figures

• European Network of Excellence in Call 5 of FP7

• 4 years, started in October 2010• Budget: 3.7 million €; EC contribution: 3

million €, 0.6 million € allocated to open calls

• 9 partners from 7 European countries

Page 20: PlanetData in a nutshell

The team

Page 21: PlanetData in a nutshell

Management structure

Page 22: PlanetData in a nutshell

Core & associate partners

Page 23: PlanetData in a nutshell

Project wiki

: wiki.planet-data.eu

Page 24: PlanetData in a nutshell

The agenda

Page 25: PlanetData in a nutshell

Agenda: Day 1

• 07-Dec-2011 • 14:45 - 17:00 Improving the usefullness of existing Linked

Data sets • 14:45 - 15:30 Data quality and repair (Pablo Mendes, FUB;

and Giorgos Flouris, FORTH)• 15:30 – 16:00 Representing contextual aspects of data

(Andreas Harth, KIT)• 16:00 – 16:15 Coffee break• 16:15 - 17:00 Data provenance and access control (Irini

Fundulaki, FORTH)• 17:00 - 18:00 Towards Linked Stream Data (Oscar Corcho,

UPM)

Page 26: PlanetData in a nutshell

Agenda: Day 2

• 08-12-2011 • 09:00 - 09:45 Data sets, vocabularies and tools

(Pablo Mendes, FUB)• 09:45 - 10:30 Dissemination and community building

(Lyndon Nixon, STI International)• 10:30 - 11:15 Training (Mitja Jermol, JSI)• 11:15 - 11:30 Coffee break • 11:30 - 12:00 PlanetData Programs (Elena Simperl,

KIT)• 12:00 - 12:30 PlanetData outlook (Elena Simperl, KIT)• 12:30 – 13:30 Lunch break • 13:30 – 14:15 Closed session (PO+reviewers) • 14:15 – 15:00 Feedback session

Page 27: PlanetData in a nutshell