the cso open data experience

24
The CSO Open Data Experience Databank and Dissemination, Central Statistics Office, Cork, Ireland Eoin MacCuirc [email protected] (00 353 21) 453 5504

Upload: dub-linked

Post on 04-Aug-2015

90 views

Category:

Data & Analytics


3 download

TRANSCRIPT

Page 1: The CSO Open Data Experience

The CSO Open Data ExperienceDatabank and Dissemination,

Central Statistics Office, Cork, IrelandEoin MacCuirc [email protected] (00 353 21) 453 5504

Page 2: The CSO Open Data Experience

The Tower of Babel

“If as one people speaking the same language they have begun to do this, then nothing they plan to do will be impossible for them. Come, let us go down and confuse their language so they will not understand each other.”

Page 3: The CSO Open Data Experience

Ireland — Central Statistics Office

“The collection, compilation, extraction and dissemination for statistical purposes of information relating to economic, social and general activities and conditions in the State”

Page 4: The CSO Open Data Experience

Dublin – 255,000,000 hitslooking for a needle in a haystack

Page 5: The CSO Open Data Experience

The data delugehttps://www.google.ie/search?q=brett+ryder&espv=2&biw=1920&bih=979&source=lnms&tbm=isch&sa=X&ei=0YtIVdmnPKHT7Qb__ICgDA&ved=0CAYQ_AUoAQ#imgrc=iBXJ3IZb2fMfYM

%253A%3BxQDi6MJ7528D_M%3Bhttp%253A%252F%252Fimage.sciencenet.cn%252Falbum%252F%252Fphoto%252Fupload%252Fbigimg%252F201083114955198.jpg%3Bhttp%253A%252F%252Fblog.sciencenet.cn%252Fblog-420554-351226.html%3B499%3B281

Page 6: The CSO Open Data Experience

The Web of Things – The Internet of Thingshttp://semanticweb.com/34702/

The Internet of Things is coming, but it needs a semantic backbone to flourish. With some 25 billion devices expected to be connected to the Internet by 2015 and 50 billion by 2020, providing interoperability among the things on the IoT “is one of the most fundamental requirements to support object addressing, tracking, and discovery as well as information representation, storage, and exchange.” So write the authors of Semantics for the Internet of Things: Early Progress and Back to the Future, Payam Barnaghi and Wei Wang, Centre for Communication Systems Research, University of Surrey, Guildford, UK and Cory Henson, Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing.

“The suite of technologies developed in the Semantic Web … such as ontologies, semantic annotation, Linked Data and semantic Web services … can be used as principal solutions for the purpose of realizing the IoT,” they state. “Defining an ontology and using semantic descriptions for data will make it interoperable for users and stakeholders that share and use the same ontology.”

Page 7: The CSO Open Data Experience

Tim Berners Lee – Founder of the Web“In an extreme view, the world can be seen as only connections, nothing else. We think of a dictionary as the repository of meaning, but it defines words only in terms of other words. I liked the idea that a piece of information is really defined only by what it's related to, and how it's related. There really is little else to meaning. The structure is everything. There are billions of neurons in our brains, but what are neurons? Just cells. The brain has no knowledge until connections are made between neurons. All that we know, all that we are, comes from the way our neurons are connected.”

Page 8: The CSO Open Data Experience

Linked Open Data cloud

http://lod-cloud.net/

Media

Government

Geo

Publications

User-generated

Life sciences

Cross-domain

Page 9: The CSO Open Data Experience

How open is the data? - Linked Open Data star scheme

Tim Berners-Lee suggested a 5-star deployment scheme for Linked Open Data and Ed Summers provided a nice rendering of it. In the following, examples are given for each level. The example data used throughout is 'the temperature forecast for Galway, Ireland for the next 3 days':

★ make your stuff available on the Web (whatever format) under an open license 1 example ...

★★ make it available as structured data (e.g., Excel instead of image scan of a table) 2 example ...

★★★ use non-proprietary formats (e.g., CSV instead of Excel) 3 example ... ★★★★ use URIs to identify things, so that people can point at your stuff4

example ... ★★★★★ link your data to other data to provide context 5 example

http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/

Page 10: The CSO Open Data Experience

URI – Uniform Resource Identifier give the thing a name and an address

The following picture shows the desired relationships between a resource and its representing documents:

Page 11: The CSO Open Data Experience

Tim’s cool URIs

Cool URIs don't changeWhat makes a cool URI?A cool URI is one which does not change.What sorts of URI change?URIs don't change: people change them.

It is the the duty of a Webmaster to allocate URIs which you will be able to stand by in 2 years, in 20 years, in 200 years. This needs thought, and organization, and commitment.

Page 12: The CSO Open Data Experience

Where is the CSO with all this?• Publishes data on www.cso.ie

• Maintains a statistics portal www.statcentral.ie

• Publishes data in JSON-Stat API (Beta) format http://www.cso.ie/webserviceclient/

• One of the first NSIs in the world to upload census data as linked open data – data.cso.ie – Census 2011 http://data.cso.ie/

• Involved in the EU Open Cube project http://opencube-project.eu/

• Hosts data for other government departments http://www.cso.ie/en/databases/

• Actively engaged with the Irish Open Government Partnership http://www.ogpireland.ie/

• Organises the apps4gaps competition www.apps4gaps.ie

Page 13: The CSO Open Data Experience

www.cso.ie

Page 14: The CSO Open Data Experience

www.statcentral.ie

Page 16: The CSO Open Data Experience

http://data.cso.ie/

Page 17: The CSO Open Data Experience

Census – Linked Open Data

• 12 million RDF triples from Census

• Geographical entities (counties, cities, etc.)

• Codelists

Page 18: The CSO Open Data Experience

To be delivered December 2015

Open Cube Project Pilots

Page 19: The CSO Open Data Experience

• Own the data.cso.ie process and technology– Enable in-house maintenance, changes, etc.

• Publish StatBank* data as Linked Open Data– Ongoing publication process– Adhering to release schedule is critical– Publish data that are regularly updated (monthly, quarterly,

annual) as linked open data ( Census 2011 static data)*StatBank is the CSO published time series database (PC Axis)

• Deploy tools that enable analytics and exploitation of linked data– Both internally and externally

CSO goals (independent from OpenCube)

Page 20: The CSO Open Data Experience

Public Sector Statistics Network

Page 21: The CSO Open Data Experience

www.apps4gaps.ie

Page 22: The CSO Open Data Experience

Building capacity – Using the data

• Liaison Groups • Seminar Series (Administrative Data and Business

Statistics) • Oireachtas Briefings • Social Media (Twitter, YouTube, FaceBook)• Education Outreach (CensusAtSchool, John Hooper

Medal for Statistics, IPA Diploma in Official Statistics)• Visualisations and Engagement (Key Economic

Indicators, Infographics, Exploristica)

Page 23: The CSO Open Data Experience

The Tower of Babel

“If as one people speaking the same language they have begun to do this, then nothing they plan to do will be impossible for them. Come, let us go down and confuse their language so they will not understand each other.”

Page 24: The CSO Open Data Experience

Thank You

Questions?