PlanetData in a nutshell

Elena Simperl, KIT

1st year review, Luxembourg, December 2011

The idea

PlanetData's aim and objectives

• Aim: establish an interdisciplinary, sustainable European community on large-scale data management
  ◦ Purposeful data exposure
  ◦ Novel and improved applications

• Objectives
  ◦ Addressing challenges through integrated research
  ◦ Data and technology provisioning through PlanetData Lab
  ◦ Impact through training, dissemination, standardization and networking
  ◦ Openness and flexibility through PlanetData Programs

Databases

Data and Web Mining

Semantics

Work plan and expected results

Work packages and activities

Activity 1 Research
• WP 1 Data Streams and Dynamicity
• WP 2 Context Representation and Quality Assessment
• WP 3 Provenance and Access Policies

Activity 2 Data Provisioning and Management
• WP 4 Data Provisioning
• WP 5 Data Management

Activity 3 Impact
• WP 6 Training
• WP 7 Dissemination and Community Building

Activity 4 Management
• WP 8 Project Management

Expected outcomes (i)

• Research on publishing and managing new species of interlinked data sets
  ◦ Methods and techniques to publish, access and manage stream data

• Research on improving the usefulness of existing linked data sources
  ◦ Quality assessment for interlinked data sets (for LOD and stream data), including best practices for the representation and usage of contextual information
  ◦ Provenance, trust, access control (for LOD and stream data)

Expected outcomes (ii)

• Catalogues of data sets and vocabularies, including best practices for publishing and managing self-descriptive data

• Catalogues of data provisioning and management tools, including best practices on how to exploit clouds and clusters for distributed and large-scale data management

• Linked services and processes as an instrument to develop applications

Expected outcomes (iii)

• Yearly summer school co-located with the ESWC

• Open training infrastructure

• Semantic Web video journal

• PlanetData Programs

Example scenario

[Figure labels: relational DB; stream DB; RDF (stream) DB; CSV; Twitter; C-SPARQL/SPARQL-STR/HTTP; registry; quality control; provenance and access control; ontologies; SSN]

Simplified scenario

[Figure labels: GADM; NUTS; provenance and access control; quality control; SPARQL; SSN; NeoGeo]

Highlights of the first year

Publishing and managing new species of interlinked data sets

• W3C SSN ontology documenting RDF data streams
• URI definition
• Supporting technology
  ◦ Extensions to SPARQL (SPARQL-STR)
  ◦ Data stream management systems (e.g., MonetDB)
  ◦ Transformation/characterisation tools (e.g., Pachube2RDF)
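As a rough illustration of what a transformation tool of this kind does, the following Python sketch turns one sensor reading into N-Triples. It is not Pachube2RDF's actual code or output format: the `example.org` namespace, the `reading_to_ntriples` helper and the `vocab#value` / `vocab#time` properties are hypothetical, and the choice of the SSN terms `Observation` and `observedBy` is an assumption made for illustration.

```python
# Namespaces; EX is a hypothetical namespace used only for this sketch.
SSN = "http://purl.oclc.org/NET/ssnx/ssn#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
XSD = "http://www.w3.org/2001/XMLSchema#"
EX = "http://example.org/"


def reading_to_ntriples(sensor_id, timestamp, value):
    """Describe one sensor reading as a list of N-Triples lines."""
    obs = f"<{EX}observation/{sensor_id}/{timestamp}>"
    sensor = f"<{EX}sensor/{sensor_id}>"
    return [
        # The reading is typed as an SSN observation made by the sensor.
        f"{obs} <{RDF_TYPE}> <{SSN}Observation> .",
        f"{obs} <{SSN}observedBy> {sensor} .",
        # Hypothetical properties for the measured value and its time.
        f'{obs} <{EX}vocab#value> "{value}"^^<{XSD}double> .',
        f'{obs} <{EX}vocab#time> "{timestamp}"^^<{XSD}dateTime> .',
    ]


if __name__ == "__main__":
    for line in reading_to_ntriples("42", "2011-12-07T14:45:00Z", 21.5):
        print(line)
```

Minting one URI per observation keeps each reading individually addressable, which is what allows later quality or provenance statements to refer to it.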

Improving the usefulness of linked data sources

• Quality assessment combining database techniques with requirements from the Web of Data (LOD), and model-based techniques to clean sensor data
• GeoVocab to represent geospatial information
• GADM and NUTS region data and services published with mapping to relevant data sets in the LODC
• Provenance of SPARQL queries based on relational models
• Annotation model for access control; access control mechanism taking into account RDFS entailment
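To make the last point concrete, here is a minimal Python sketch of one way access annotations could be carried over to triples inferred by RDFS. It is an assumption-laden toy, not the project's annotation model: labels are sets of role names, only the subclass rule (if `x rdf:type C` and `C rdfs:subClassOf D`, then `x rdf:type D`) is applied, and an inferred triple is readable only by roles that may read both premises. The function name `infer_with_labels` and the `ex:` identifiers are hypothetical.

```python
TYPE, SUBCLASS = "rdf:type", "rdfs:subClassOf"


def infer_with_labels(annotated):
    """annotated maps (s, p, o) -> frozenset of roles allowed to read it.

    Returns a new dict that also contains the rdf:type triples derived
    through rdfs:subClassOf, each labelled with the roles that can read
    every premise of at least one of its derivations.
    """
    result = dict(annotated)
    changed = True
    while changed:  # repeat until no label changes (fixpoint)
        changed = False
        for (s, p, o), roles_type in list(result.items()):
            if p != TYPE:
                continue
            for (c, p2, d), roles_sub in list(result.items()):
                if p2 != SUBCLASS or c != o:
                    continue
                derived = (s, TYPE, d)
                # Both premises must be readable to see the conclusion.
                roles = roles_type & roles_sub
                # Alternative derivations (or an explicit triple) add roles.
                merged = result.get(derived, frozenset()) | roles
                if merged != result.get(derived):
                    result[derived] = merged
                    changed = True
    return result


if __name__ == "__main__":
    data = {
        ("ex:s1", TYPE, "ex:TempSensor"): frozenset({"admin", "analyst"}),
        ("ex:TempSensor", SUBCLASS, "ex:Sensor"): frozenset({"admin"}),
    }
    print(infer_with_labels(data)[("ex:s1", TYPE, "ex:Sensor")])
```

Intersecting the labels of the premises is the conservative choice: a role never learns an inferred fact it could not have derived from triples it is already allowed to see.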

Geovocab.org

Cataloguing

Surveys
◦ Metadata for Semantic Sensor Networks
◦ Vocabularies for datasets and streams
◦ Geospatial data, geospatial ontology

http://vocab.cc

Training

Dissemination

PlanetData Programs

• 1st Call: 37 proposals submitted with a total requested contribution of almost 3.000.000 €

• “Consuming and Quality Assessment of Linked Data in Urban Environments through Games with a Purpose” led by CEFRIEL

• “Consuming and Improving Norwegian Linked Open Data for Regional Development and Environmental Friendly Behavior” led by Computas AS

• “ParkMe: Linked Open Parking Data” led by Open University

The management

Some facts and figures

• European Network of Excellence in Call 5 of FP7

• 4 years, started in October 2010

• Budget: 3.7 million €; EC contribution: 3 million €, 0.6 million € allocated to open calls

• 9 partners from 7 European countries

The team

Management structure

Core & associate partners

Project wiki: wiki.planet-data.eu

The agenda

Agenda: Day 1

• 07-Dec-2011
• 14:45 - 17:00 Improving the usefulness of existing Linked Data sets
  ◦ 14:45 - 15:30 Data quality and repair (Pablo Mendes, FUB; and Giorgos Flouris, FORTH)
  ◦ 15:30 - 16:00 Representing contextual aspects of data (Andreas Harth, KIT)
  ◦ 16:00 - 16:15 Coffee break
  ◦ 16:15 - 17:00 Data provenance and access control (Irini Fundulaki, FORTH)
• 17:00 - 18:00 Towards Linked Stream Data (Oscar Corcho, UPM)

Agenda: Day 2

• 08-Dec-2011
• 09:00 - 09:45 Data sets, vocabularies and tools (Pablo Mendes, FUB)
• 09:45 - 10:30 Dissemination and community building (Lyndon Nixon, STI International)
• 10:30 - 11:15 Training (Mitja Jermol, JSI)
• 11:15 - 11:30 Coffee break
• 11:30 - 12:00 PlanetData Programs (Elena Simperl, KIT)
• 12:00 - 12:30 PlanetData outlook (Elena Simperl, KIT)
• 12:30 - 13:30 Lunch break
• 13:30 - 14:15 Closed session (PO + reviewers)
• 14:15 - 15:00 Feedback session
