lod2 open government data stakeholder survey, michael martin and martin kaltenböck
DESCRIPTION
Slides of the presentation by Michael Martin (ULEI, INFAI) and Martin Kaltenböck (Semantic Web Company) at the OKCon2011 in Berlin on 30th of June 2011: The LOD2 Open Government Data Stakeholder SurveyTRANSCRIPT
Creating Knowledge out of Interlinked Data
The Open Government Data Stakeholder Survey Michael Martin, University of Leipzig (Germany)
Martin Kaltenböck, Semantic Web Company (Austria) Sören Auer (University of Leipzig), Helmut Nagy (Semantic Web Company)
OKCon2011 - Berlin, 30.06. 2011
These slides are published under : http://creativecommons.org/licenses/by/3.0
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
Agenda
• LOD2 Project & the OGD Stakeholder Survey
• Results of the OGD Stakeholder Survey
• Publishing the Stakeholder Survey as LOD
• Conclusion & Outreach - What‘s next
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
OGD Stakeholder Survey: Results
LOD2 Creating Knowledge out of
Interlinked Data
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
LOD2 in a Nutshell
Creating Knowledge out of Interlinked Data
Research focus • Very large RDF data management • Enrichment & Interlinking • Fusion & Information Quality • Adaptive User Interfaces 3 Use Cases • Media & Publishing • Linked Enterprise Data • Open Government Data
10 Partners of 7 countries • University of Leipzig, Germany • DERI Galway, Ireland • FU Berlin, Germany • Semantic Web Company, Austria • OpenLink Software, UK • TenForce, Belgium • Exalead, France • Wolters Kluwer, Germany • Open Knowledge Foundation, UK • CWI, Netherlands
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
3 Use Cases in LOD2
Objective Applying Linked Data technologies in an enterprise stack to support Human Resources (HR) related issues. ENTERPRISE
APPLICATIONS
Exalead, France
MEDIA &
PUBLISHING
Wolters Kluwer Germany
OPEN GOVERNMENT
DATA
Open Knowledge Foundation, UK
Objective Improving accessibility, findability & reusability of Open Government Data in Europe: publicdata.eu
Objective Supporting content-related production workflows in the media & publishing industry.
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
Open Governmental Data – and ideal testbed for Linked Data?
Close cooperation with W3C eGov IG, OKFN’s OpenEUdata, PSI & grassroots efforts
CKAN.org | semic.eu | EIF European Interoperability Framework | ICT2010 Networking Session
UIs and Personalization
Individual mashups of data with other sources
Notification/subscription service based on personal preferences
Transparency wishlists, upload revisions, derivates
Create and publish queries, reports and visualisations
Single Point of Access: European registry & collaboration for open government data Outreach & involve data providers - local, regional, national and European
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
LOD2 Open Government Data Use Case: publicdata.eu
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
OGD Stakeholder Survey: Results
The LOD2 Open Government Data (OGD)
Stakeholder Survey
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
WHY
• Involve OGD community in Europe (& worldwide) in publicdata.eu process
• Ask for their needs & requirements in the area of OGD & publicdata.eu
• Use results for requirements elicitation for the LOD2 use case: publicdata.eu
HOW
• Set up by OKFN & SWC with support from DERI, Wolters Kluwer and ULEI
• Easy to use online survey tool (surveygizmo.com)
• Promoted via blogs, mailings, mailing lists and additional viral marketing
channels as well as at related events in Europe & by the EC
• Duration: 5 weeks
• 329 participants
• Results available since May 2011: http://survey.lod2.eu/
• Published in HTML, PDF & raw survey data in CSV & RDF
for re-use under CC-BY license
The LOD2 OGD Stakeholder Survey
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
OGD Stakeholder Survey: Results
Interest in ‚domains of data‘
For the complete results you can see that "Geospatial information", "Scientific data" and "Environment" data are the top ranked domains.
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
Currently used formats
For the preferences on the format of data you can see that current "traditional" formats like HTML, PDF, CSV/XLS and DOC/RTF are preferred.
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
Requested (future) formats
In general you can also see that ideally ("in future") formats like "XML", "RDF" and "APIs" will become more importance. It seems that all user types find DOC, RTF and PDF not a suitable solution for a future Open Government Data infrastructure. As we haven't listed JSON, RSS and YAML in this list, respondents have urged that in the free comment field.
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
From where should the data come?
The results show that "national" data is most important followed by "regional", "EU-wide" and "worldwide" data.
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
What is important for quality assurance?
The complete results show that "provenance/source of data" followed by "format of data", "completeness of meta data", "ranking / comments by users" and "official certificates"
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
What features does a EU data catalogue need?
The complete results show that "providing raw datasets", "information about versioning of data sets" and "searching, exploring, grouping and clustering of data sets" are the features which are "expected to have" while "crowd sourcing mechanism (e.g. data repair)", "alerts on regional information" and "analysis and visualisation tools" have the highest rating in the "like to have" category.
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
What is also expected on publicdata.eu?
For this question "white papers & best practice", "news on Open Government Data" and "use cases & sucess stories" are "expected to have" while "ideas for apps", "events" and again "use cases & success stories" are ranked highest in the "like to have" category.
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
OGD Stakeholder Survey: Results
Publishing the Survey as RDF
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
- Questionnaire and results of the survey published as - HTML,
- CSV and - PDF
-These formats are only for humans which represent use case specific views
Publishing the Survey as RDF
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
- Publishing in RDF give further benefits: -Aggregating data to enable other user(-generated) aspects (SPARQL)
- Interlink the data with other datasets … - … which enable (advanced) users to create more complex queries (aggregating resources from non-local information spaces like DBpedia )
- Schema and data can be queried together
Publishing the Survey as RDF
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
RDF-Schema creation to represent Surveys
Publishing the Survey as RDF
http://ns.aksw.org/survey/
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
Transformation of the Data Questionnaire delivered as PDF containing 5 survey sections, 60 questions, 221 Options for single -/multiple - choice questions -> modeled with the survey vocabulary Resultset of the Survey tool delivered as CSV which is being transformed with PHP
Creating RDF-Resources for every : Participant: Response (329) Answers to Questions (12891)
Overall ~ 70,000 triples
Publishing the Survey as RDF
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
Deployment of the Data Upload the knowledge base into Virtuoso
Questionnaire and survey results as one model
Universal Server with multiple RDF / LOD functionalities
Adding metadata to the OGD-Survey-model (contributor, publisher, license)
Publishing the Survey as RDF
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
Publishing the model via OntoWiki human and machine friendly HTML/JS – Web interface to explore and maintain the data SPARQL Editor / Endpoint LOD client and server
Publishing the Survey as RDF
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
Publishing the Survey as RDF
http://data.lod2.eu/2010/ogd-survey/
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
OGD Stakeholder Survey: Results
Conclusion & Outreach
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
• Analyzing the 329 responses showed the importance of facilitating
Open Government Data
• Geospatial information, Scientific & Environmental Data are the top
ranked requested domains.
• National & regional data seems to be most important for users
• It shows a shift in currently used formats to new formats
• It shows that the source of a dataset is the most important indicator
for quality assurance
• White Papers, Best Practise and Success Stories are requested as
additional information on publicdata.eu
Conclusion
OGD Stakeholder Survey: Results
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
• LOD2 PUBLINK Consultancy 2010/2011 • Greater London Authority, UK
• City of Vienna, Austria
• Umweltbundesamt, Austria
• Historische Kommission, Germany
• Parliament of Finland, Finland
• Instituto Canario de Estadística – ISTAC
• Next PUBLINK call Sept 2011 (LOD consumption)
• LOD2 Webinar Series
• Results of LOD2 OGD Stakeholder Survey 2010 A new survey is planned for late 2011
• LOD2 Technology Stack coming soon (autumn2011)
• Open Data Camp 2011 powered by LOD2 in
Warsaw, Poland - around 21st of October 2011
http://lod2.eu http://blog.lod2.eu http://survey.lod2.eu
LOD2 Outreach- What’s next in LOD2
LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu
Thank you for your attention!
Web: http://www.semantic-web.at Blog: http://blog.semantic-web.at Mail: [email protected] Phone: +43 - 1 - 402 12 35 – 25
LOD2 Project: http://lod2.eu
LOD2 Blog: http://blog.lod2.eu
LOD2 OGD Stakeholder Survey: http://survey.lod2.eu/
LOD2 OGD Stakeholder Survey data: http://data.lod2.eu/2010/ogd-survey/
PUBLINK LOD Consultancy: http://lod2.eu/Article/Publink.html
Martin Kaltenböck, Semantic Web Company Web: http://aksw.org Blog: http://blog.aksw.org Mail: [email protected] Phone: +49 341 97-32322
Michael Martin, University of Leipzig
These slides are published under : http://creativecommons.org/licenses/by/3.0