planetdata in a nutshell
DESCRIPTION
PlanetData in a nutshell. Elena Simperl, KIT 1st year review Luxembourg, December 2011. The idea. PlanetData‘s aim and objectives. Aim: establish an interdisciplinary, sustainable European community on large-scale data management Purposeful data exposure Novel and improved applications - PowerPoint PPT PresentationTRANSCRIPT
PlanetData in a nutshell
Elena Simperl, KIT
1st year reviewLuxembourg, December 2011
The idea
PlanetData‘s aim and objectives
• Aim: establish an interdisciplinary, sustainable European community on large-scale data management• Purposeful data exposure• Novel and improved applications
• Objectives• Addressing challenges through integrated research• Data and technology provisioning through PlanetData Lab• Impact through training, dissemination, standardization
and networking• Openness and flexibility through PlanetData Programs
Databases
Data and Web
MiningSemantics
Work plan and expected results
Work packages and activities
Activity 1 Research
WP 1 Data Streams and Dynamicity
WP 2 Context Representation and Quality Assessment
WP 3 Provenance and Access
Policies
Activity 2 Data Provisioning and Management
WP 4 Data Provisioning
WP 5 Data Management
Act
ivit
y 3
Im
pact
Activity 4 Management
WP 6
Tra
inin
g
WP 7
D
isse
min
ati
on a
nd
Com
mun
ity
Build
ing
WP 8 Project Management
Expected outcomes (i)
• Research on publishing and managing new species of interlinked data sets• Methods and techniques to publish, access and
manage stream data
• Research on improving the usefulness of existing linked data sources• Quality assessment for interlinked data sets (for
LOD and stream data), including best practices for the representation and usage of contextual information
• Provenance, trust, access control (for LOD and stream data)
Expected outcomes (ii)
• Catalogues of data sets and vocabularies, including best practices for publishing and managing self-descriptive data
• Catalogues of data provisioning and management tools, including best practices on how to exploit clouds and clusters for distributed and large-scale data management
• Linked services and processes as an instrument to develop applications
Expected results (iii)
• Yearly summer school co-located with the ESWC
• Open training infrastructure
• Semantic Web video journal
• PlanetData Programs
Example scenario
relational DB
stream DB stream DB RDF (stream) DB
CSV twitter
C-SPARQL/SPARQL-STR/HTTP
registry
Qualit
y c
on
trol
Provenance and access control
ontologiesSSN
Simplified scenario
GADM NUTS
Provenance and access control
Qualit
y c
on
trol
SPARQL SSN
NeoGeo
Highlights of the first year
Publishing and managing new species of interlinked data sets
• W3C SSN ontology documenting RDF data streams
• URI definition • Supporting technology
• Extensions to SPARQL (SPARQL-STR)• Data stream management systems (e.g.,
MonetDB)• Transformation/characterisation tools (e.g.,
Pachube2RDF)
Improving the usefulness of linked data sourcesQuality assessment combining database techniques
with requirements from the Web of Data (LOD), and model-based techniques to clean sensor data
GeoVocab to represent geospatial informationGADM and NUTS region data and services published
with mapping to relevant data sets in the LODC
• Provenance of SPARQL queries based on relational models
• Annotation model for access control access control mechanism taking into account RDFS entailnment
Geovocab.org
Cataloguing
Surveys◦ Metadata for Semantic Sensor Networks
◦ Vocabularies for datasets and streams
◦ Geospatial data, geospatial Ontology
http://vocab.cc
Training
Dissemination
PlanetData Programs
• 1st Call: 37d proposals submitted with a total requested contribution of almost 3.000.000 €
• “Consuming and Quality Assessment of Linked Data in Urban Environments through Games with a Purpose” led by CEFRIEL
• “Consuming and Improving Norwegian Linked Open Data for Regional Development and Environmental Friendly Behavior” led by Computas AS
• “ParkMe: Linked Open Parking Data” led by Open University
The management
Some facts and figures
• European Network of Excellence in Call 5 of FP7
• 4 years, started in October 2010• Budget: 3.7 million €; EC contribution: 3
million €, 0.6 million € allocated to open calls
• 9 partners from 7 European countries
The team
Management structure
Core & associate partners
Project wiki
: wiki.planet-data.eu
The agenda
Agenda: Day 1
• 07-Dec-2011 • 14:45 - 17:00 Improving the usefullness of existing Linked
Data sets • 14:45 - 15:30 Data quality and repair (Pablo Mendes, FUB;
and Giorgos Flouris, FORTH)• 15:30 – 16:00 Representing contextual aspects of data
(Andreas Harth, KIT)• 16:00 – 16:15 Coffee break• 16:15 - 17:00 Data provenance and access control (Irini
Fundulaki, FORTH)• 17:00 - 18:00 Towards Linked Stream Data (Oscar Corcho,
UPM)
Agenda: Day 2
• 08-12-2011 • 09:00 - 09:45 Data sets, vocabularies and tools
(Pablo Mendes, FUB)• 09:45 - 10:30 Dissemination and community building
(Lyndon Nixon, STI International)• 10:30 - 11:15 Training (Mitja Jermol, JSI)• 11:15 - 11:30 Coffee break • 11:30 - 12:00 PlanetData Programs (Elena Simperl,
KIT)• 12:00 - 12:30 PlanetData outlook (Elena Simperl, KIT)• 12:30 – 13:30 Lunch break • 13:30 – 14:15 Closed session (PO+reviewers) • 14:15 – 15:00 Feedback session