Dissemination and visualisation of linked statistical data

– a practical approach –

[email protected]

Agenda
  Background
  Open data and linked data
  RDF vocabularies for statistical data
  Data visualization tool
  More resources and way forward
  Q&A

European Environment Agency
  transparency: https://taskman.eionet.europa.eu/projects
  open source software: https://github.com/eea
  open data: http://www.eea.europa.eu/data-and-maps#tab-datasets
  exposed as RDF via the Semantic Data Service: http://semantic.eea.europa.eu/
    16,000 datasets
    SPARQL endpoint: http://semantic.eea.europa.eu/sparql

Global Open Data Index http://index.okfn.org/place

5 stars open data
  *      on the web, open license
  **     structured
  ***    non-proprietary open format
  ****   uses URIs to denote things
  *****  provides context through links to other data
  http://5stardata.info/en/#costs-benefits

Linked data
  www.w3.org/TR/ld-glossary/#linked-data
  linkeddatafragments.org
  resolvable HTTP URIs
    http://dbpedia.org/resource/Bucharest
    http://dbpedia.org/data/Bucharest.jsonld (application/ld+json)
    http://dbpedia.org/page/Bucharest (text/html)
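
The three DBpedia URIs above pair one resource with two representations, each tied to a media type. A client normally selects a representation by sending an Accept header. The sketch below only prepares such a request and does not send it; the helper name build_request is an assumption made for illustration, and how the server answers (for example with a redirect to the /data/ or /page/ URI) is not something the slides spell out.

```python
from urllib.request import Request

# Media types listed on the slide for the two DBpedia representations.
JSON_LD = "application/ld+json"
HTML = "text/html"


def build_request(uri: str, mime_type: str) -> Request:
    """Prepare (but do not send) a GET request asking for one representation.

    build_request is a hypothetical helper, not part of any library.
    """
    return Request(uri, headers={"Accept": mime_type})


if __name__ == "__main__":
    req = build_request("http://dbpedia.org/resource/Bucharest", JSON_LD)
    # Shows the method, the resource URI and the requested media type.
    print(req.get_method(), req.full_url, req.get_header("Accept"))
```

Passing the prepared request to urllib.request.urlopen would perform the actual lookup; that step is left out so the example runs without network access.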

Open Data Portals

Linked data producers
  by research projects and web enthusiasts
  conversion of existing data to RDF
  increasingly adopted by the primary data producers

Open Data Portals
  most of them based on CKAN (by OKFN)
  open-data.europa.eu
  catalog.data.gov
  data.gov.ccTLD, e.g. data.gov.ro
  datahub.io
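
Because most of these portals run CKAN, dataset searches tend to look the same from one portal to the next. The sketch below builds a search URL for CKAN's Action API (package_search with the q and rows parameters). It assumes the portal exposes the API under the default /api/3/action/ path over HTTPS, which an individual portal may not do; the helper name ckan_search_url is made up for this example.

```python
from urllib.parse import urlencode


def ckan_search_url(portal_host: str, query: str, rows: int = 10) -> str:
    """Build a dataset search URL for a CKAN-based portal.

    Assumption: the portal serves the CKAN Action API at /api/3/action/.
    ckan_search_url is a hypothetical helper, not part of CKAN.
    """
    params = urlencode({"q": query, "rows": rows})
    return f"https://{portal_host}/api/3/action/package_search?{params}"


if __name__ == "__main__":
    print(ckan_search_url("catalog.data.gov", "air quality", rows=5))
```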

http://lod-cloud.net

Statistical data
  structurally different from other [linked] data
  typically distributed as datasets
  concerned with measures, indicators, series, time periods, statistical/geographical regions
  multiple dimensions and measures
    generic - place, time
    or domain-specific
  slices, aggregations, totals, denominators
  various RDF vocabularies for modeling datasets
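
To make the terms above concrete, here is a small sketch of a dataset as a list of observations, each locating one measured value by its dimensions. Slicing fixes some dimensions, and a total aggregates the measure over whatever remains. The class, the dimension names and the figures are all invented for illustration and do not come from any real dataset.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Observation:
    dimensions: Dict[str, str]  # e.g. place, time, indicator
    measure: float              # the observed value


def slice_cube(observations: List[Observation], **fixed: str) -> List[Observation]:
    """Keep the observations whose dimensions match every fixed value."""
    return [
        o for o in observations
        if all(o.dimensions.get(name) == value for name, value in fixed.items())
    ]


def total(observations: List[Observation]) -> float:
    """Aggregate the measure over a set of observations."""
    return sum(o.measure for o in observations)


# Illustrative values only.
CUBE = [
    Observation({"area": "RO", "year": "2014", "indicator": "broadband"}, 10.0),
    Observation({"area": "RO", "year": "2015", "indicator": "broadband"}, 12.0),
    Observation({"area": "BG", "year": "2015", "indicator": "broadband"}, 7.5),
]

if __name__ == "__main__":
    romania = slice_cube(CUBE, area="RO")
    print(len(romania), total(romania))
```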

XML-based standards
  SDMX: Statistical Data and Metadata eXchange
    probably the most widely used standard for statistical data exchange
    adopted by major producers of statistical data (ECB, Eurostat, IMF, OECD, UNSD, UNESCO, World Bank)
    http://ec.europa.eu/eurostat/web/sdmx-web-services
  DSPL: Dataset Publishing Language
    can be processed by Google Public Data Explorer

RDF vocabularies
  Dublin Core Terms
  DCAT http://www.w3.org/TR/vocab-dcat
    catalogues, dataset metadata, distribution
  VoID http://www.w3.org/TR/void
    linked datasets
  RDF Data Cube Vocabulary http://www.w3.org/TR/vocab-data-cube
    built on top of existing vocabularies
    focused on statistical data
    integrates dataset metadata, structure, codelists and observations
    uses the core SDMX Information Model
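
As a rough illustration of how the Data Cube Vocabulary ties an observation to its dataset, dimensions and measure, the sketch below writes one observation as Turtle. The qb: and sdmx- terms are real vocabulary terms; the function name and every example.org URI are placeholders invented for this example, and a real cube would also need a data structure definition, which is omitted here.

```python
# Namespace declarations for the vocabularies used in the observation.
PREFIXES = """@prefix qb: <http://purl.org/linked-data/cube#> .
@prefix sdmx-dimension: <http://purl.org/linked-data/sdmx/2009/dimension#> .
@prefix sdmx-measure: <http://purl.org/linked-data/sdmx/2009/measure#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
"""


def observation_turtle(obs_uri: str, dataset_uri: str, area_uri: str,
                       period_uri: str, value: str) -> str:
    """Serialize one observation as Turtle (hypothetical helper).

    value is passed as a string so the decimal is written exactly as given.
    """
    return (
        f"<{obs_uri}> a qb:Observation ;\n"
        f"    qb:dataSet <{dataset_uri}> ;\n"
        f"    sdmx-dimension:refArea <{area_uri}> ;\n"
        f"    sdmx-dimension:refPeriod <{period_uri}> ;\n"
        f'    sdmx-measure:obsValue "{value}"^^xsd:decimal .\n'
    )


if __name__ == "__main__":
    # All URIs below are placeholders, not real identifiers.
    print(PREFIXES)
    print(observation_turtle(
        "http://example.org/obs/1",
        "http://example.org/dataset/demo",
        "http://example.org/area/RO",
        "http://example.org/period/2015",
        "3.5",
    ))
```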

Semantic interoperability
  SPARQL - native query language of RDF knowledge bases
  http://worldbank.270a.info/sparql

    select * where { ?s ?p <http://dbpedia.org/resource/Temperature> }
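
A query like the one above is usually sent to an endpoint over HTTP. Under the SPARQL Protocol, a GET request carries the query text in a parameter named query. The sketch below only builds that URL; the helper name is an assumption, and the slides do not say whether this endpoint is still reachable, so nothing is sent.

```python
from urllib.parse import urlencode

ENDPOINT = "http://worldbank.270a.info/sparql"
QUERY = "select * where { ?s ?p <http://dbpedia.org/resource/Temperature> }"


def sparql_get_url(endpoint: str, query: str) -> str:
    """Build a SPARQL Protocol GET URL (hypothetical helper).

    The query text is percent-encoded into the "query" parameter.
    """
    return endpoint + "?" + urlencode({"query": query})


if __name__ == "__main__":
    print(sparql_get_url(ENDPOINT, QUERY))
```

A client would typically add an Accept header such as application/sparql-results+json when requesting this URL, to receive the bindings as JSON.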

http://worldbank.270a.info/classification/variable/tas

http://worldbank.270a.info/classification/variable

world-bank-climates/month-average-historical.html

SPARQL

  PREFIX qb: <http://purl.org/linked-data/cube#>
  PREFIX sdmx-dimension: <http://purl.org/linked-data/sdmx/2009/dimension#>
  PREFIX owl: <http://www.w3.org/2002/07/owl#>
  PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
  PREFIX property: <http://worldbank.270a.info/property/>
  PREFIX d-climates: <http://worldbank.270a.info/dataset/world-bank-climates/>

  select ?o ?value where {
    ?o a qb:Observation ;
       qb:dataSet d-climates:month-average-historical ;
       sdmx-dimension:refArea [ owl:sameAs <http://dbpedia.org/resource/Romania> ] ;
       property:variable [ skos:exactMatch <http://dbpedia.org/resource/Temperature> ] ;
       property:recurring-interval <http://reference.data.gov.uk/def/intervals/March> ;
       property:month-average ?value
  }

Result:
  ?o      http://worldbank.270a.info/dataset/world-bank-climates/month-average-historical/1900-2009/03/RO/tas
  ?value  "3.781612"^^<http://www.w3.org/2001/XMLSchema#decimal>
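
When an endpoint returns results in the SPARQL JSON results format, each row is a set of bindings keyed by variable name. The sketch below reads the ?o and ?value bindings and converts the literal to a Decimal. The sample document is written by hand from the single result shown on the slide; it is not a captured endpoint response, and the function name is an assumption.

```python
import json
from decimal import Decimal
from typing import List, Tuple

OBSERVATION_URI = (
    "http://worldbank.270a.info/dataset/world-bank-climates/"
    "month-average-historical/1900-2009/03/RO/tas"
)

# Hand-written sample in the SPARQL JSON results layout, using the slide's values.
SAMPLE = json.dumps({
    "head": {"vars": ["o", "value"]},
    "results": {"bindings": [{
        "o": {"type": "uri", "value": OBSERVATION_URI},
        "value": {
            "type": "literal",
            "datatype": "http://www.w3.org/2001/XMLSchema#decimal",
            "value": "3.781612",
        },
    }]},
})


def extract_values(results_json: str) -> List[Tuple[str, Decimal]]:
    """Return (observation URI, decimal value) pairs from a results document."""
    doc = json.loads(results_json)
    return [
        (row["o"]["value"], Decimal(row["value"]["value"]))
        for row in doc["results"]["bindings"]
    ]


if __name__ == "__main__":
    for uri, value in extract_values(SAMPLE):
        print(uri, value)
```

Decimal is used instead of float so the xsd:decimal literal keeps its exact digits.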

Observation page

Romania page

http://digital-agenda-data.eu
  European Commission, DG for Communications Networks, Content & Technology
  Digital Agenda, Europe 2020 strategy
  https://ec.europa.eu/digital-agenda/

The Digital Economy & Society Index (DESI)

User requirements
  target statistical data / Data Cube Vocabulary
  non-technical audience
  simple and intuitive navigation
  lots of explanatory notes, labels and metadata
  100+ indicators inside a single large dataset
  no SPARQL
  embedding, export, share, bookmark, etc.
  moderated comments
  good looking charts

Dataset metadata

Dataset metadata

Dataset contents

Sample column chart

Visualisations

Chart configurator

Chart configurator
  type of chart
  single/multiple selection for dimensions series
  layout of filters
  order and grouping in each filter
  sorting of values
  chart titles, tooltips, explanatory texts
  legend and metadata from code lists
  and more...
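
The options listed above could be captured in a small configuration object. The sketch below is purely hypothetical: the class names, field names and sample values are invented to show the kind of settings involved and do not reflect how the actual tool stores its chart definitions.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FilterConfig:
    """One filter shown next to the chart (hypothetical schema)."""
    dimension: str                  # dimension the filter selects on
    multiple: bool = False          # single or multiple selection
    sort_descending: bool = False   # sorting of values in the filter


@dataclass
class ChartConfig:
    """Settings for one chart (hypothetical schema)."""
    chart_type: str                 # e.g. "column" or "line"
    title: str
    filters: List[FilterConfig] = field(default_factory=list)

    def multi_select_dimensions(self) -> List[str]:
        """Dimensions whose filter allows more than one selected value."""
        return [f.dimension for f in self.filters if f.multiple]


# Invented example configuration.
EXAMPLE = ChartConfig(
    chart_type="column",
    title="Indicator by country",
    filters=[
        FilterConfig("indicator"),
        FilterConfig("time-period", sort_descending=True),
        FilterConfig("ref-area", multiple=True),
    ],
)

if __name__ == "__main__":
    print(EXAMPLE.multi_select_dimensions())
```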

Chart configurator

Hierarchical code list

Sample column chart

Sample line charts

Additional explanations and user interaction

Navigation widget

Digital Economy and Society Index

http://digital-agenda-data.eu
  Digital Agenda Key Indicators: 170 indicators, 700k observations
  Digital Economy and Society Index: 50 indicators, 8k observations
  Lead Indicators for DG Connect policy priorities: 31 indicators, 35k observations

Other resources
  Use Cases and Lessons for the Data Cube Vocabulary
    http://www.w3.org/TR/vocab-data-cube-use-cases
  Technical information
    http://digital-agenda-data.eu/documentation
  Try it
    test.digital-agenda-data.eu
    github.com/digital-agenda-data
    Vagrant build

Linked statistical data – our experience
  good:
    URIs – break data silos of statistical offices
    cross-dataset queries
    metadata and methodology
  not so good:
    missing from RDF Data Cube: aggregate values
    size of RDF serialized form
    performance
  challenging:
    generating valid DSD
    data maintenance
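
One reason a valid DSD (data structure definition) takes effort is that the Data Cube specification attaches integrity constraints to it. Among them, every observation must carry a value for each dimension the DSD declares. The sketch below checks that one rule on plain dictionaries. It is a simplified stand-in that assumes the observations have already been loaded from RDF; the function names, dimension names and sample values are invented.

```python
from typing import Dict, Iterable, List


def missing_dimensions(dsd_dimensions: Iterable[str],
                       observation: Dict[str, object]) -> List[str]:
    """List the declared dimensions for which the observation has no value."""
    return [d for d in dsd_dimensions if observation.get(d) is None]


def invalid_observations(dsd_dimensions: Iterable[str],
                         observations: List[Dict[str, object]]) -> Dict[int, List[str]]:
    """Map the index of each incomplete observation to its missing dimensions."""
    dims = list(dsd_dimensions)
    report: Dict[int, List[str]] = {}
    for index, obs in enumerate(observations):
        missing = missing_dimensions(dims, obs)
        if missing:
            report[index] = missing
    return report


# Invented sample data: the second observation lacks its time period.
DSD_DIMENSIONS = ["refArea", "refPeriod", "indicator"]
OBSERVATIONS = [
    {"refArea": "RO", "refPeriod": "2015", "indicator": "bb_speed", "obsValue": 1.0},
    {"refArea": "BG", "indicator": "bb_speed", "obsValue": 2.0},
]

if __name__ == "__main__":
    print(invalid_observations(DSD_DIMENSIONS, OBSERVATIONS))
```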

Find out more
  joinup.ec.europa.eu
    interoperability solutions for public administrations
    share and reuse
    communities, guidelines, source code
  SEMIC 2016 Semantic Interoperability Conference
    Rome, Italy – 12 May 2016
  StatDCAT application profile for data portals
    joinup.ec.europa.eu/asset/stat_dcat_application_profile/home