soton2013 opendata

68
Get Started With Open Data Tony Hirst Dept of Communication and Systems, The Open University

Upload: tony-hirst

Post on 27-Jan-2015

119 views

Category:

Documents


2 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Soton2013 opendata

Get Started With Open Data

Tony HirstDept of Communication and Systems,

The Open University

Page 2: Soton2013 opendata

So what do we mean by

“OPEN DATA”

Page 3: Soton2013 opendata

Open Public Data

Page 4: Soton2013 opendata

- copy, publish, distribute and transmit the Information;- adapt the Information;- exploit the Information commercially for example, by combining it with other Information, or by including it in your own product or application

You are free to:

Page 5: Soton2013 opendata

You must:- acknowledge the source of the Information by including any attribution statement specified by the Information Provider(s) and, where possible, provide a link to this licence;- ensure that you do not use the Information in a way that suggests any official status;- ensure that you do not mislead others or misrepresent the Information or its source;- ensure that your use of the Information does not breach the Data Protection Act 1998 or the Privacy and Electronic Communications (EC Directive) Regs 2003.

Page 6: Soton2013 opendata

Exemptions:- personal data;- Information that has neither been published nor disclosed under information access legislation (FOI) by or with the consent of the Information Provider;- departmental or public sector organisation logos, crests etc;- third party rights the Information Provider is not authorised to license;- Information subject to other IPR

Page 7: Soton2013 opendata

Availability and Access

Reuse and Redistribution

Universal Participation

The Open Knowledge Foundation

Page 8: Soton2013 opendata

Availability and Access: the data must be available as a whole and at no more than a reasonable reproduction cost, preferably by downloading over the internet. The data must also be available in a convenient and modifiable form.

The Open Knowledge Foundation

Page 9: Soton2013 opendata

Reuse and Redistribution: the data must be provided under terms that permit reuse and redistribution including the intermixing with other datasets.

The Open Knowledge Foundation

Page 10: Soton2013 opendata

Universal Participation: everyone must be able to use, reuse and redistribute – there should be no discrimination against fields of endeavour or against persons or groups. For example, ‘non-commercial’ restrictions that would prevent ‘commercial’ use, or restrictions of use for certain purposes (e.g. only in education), are not allowed.

The Open Knowledge Foundation

Page 11: Soton2013 opendata

/via http://antictrl.com/chapter-3-2-regulability-of-the-internet/

Lessig’s “

dot”

Page 12: Soton2013 opendata

Licensing

DATAAuthentication

Closed standards

Messy Data

Crappy spreadsheets

Paywalls

“Privacy”

FOI exemptions

Data protection Act

PDFs

Page 13: Soton2013 opendata
Page 14: Soton2013 opendata
Page 15: Soton2013 opendata

Right to access data

Page 16: Soton2013 opendata
Page 17: Soton2013 opendata
Page 18: Soton2013 opendata
Page 19: Soton2013 opendata
Page 20: Soton2013 opendata
Page 21: Soton2013 opendata
Page 22: Soton2013 opendata

So where’s the data?

Page 23: Soton2013 opendata
Page 24: Soton2013 opendata
Page 25: Soton2013 opendata
Page 26: Soton2013 opendata
Page 27: Soton2013 opendata
Page 28: Soton2013 opendata
Page 29: Soton2013 opendata

“First” generation:data catalogues

Page 30: Soton2013 opendata

Breathing life into data…

Page 31: Soton2013 opendata

=importData(“CSV_URL”)

Google Sheets

Page 32: Soton2013 opendata

the spreadsheet becomes

A DATABASE

Page 33: Soton2013 opendata

Google Charts

Visualisation API

Page 34: Soton2013 opendata

Google Charts

Visualisation API

Page 35: Soton2013 opendata

Google Charts

Visualisation API

Page 36: Soton2013 opendata

“Second” generation:data management

systems

Page 37: Soton2013 opendata

DMS – Data Management System

Page 38: Soton2013 opendata

Digging for data…

Page 39: Soton2013 opendata

BUT

Page 40: Soton2013 opendata

There’s lots more data that’s locked up in web pages…

Page 41: Soton2013 opendata

Scraping…

Page 42: Soton2013 opendata

“grabbing web content in a machine readable

format and then processing it for your

own purposes”

Page 43: Soton2013 opendata

Original HTML web

page

Accessible web page

Extract Information

-> data

Page 44: Soton2013 opendata
Page 45: Soton2013 opendata
Page 46: Soton2013 opendata
Page 47: Soton2013 opendata

Recreating the database that was used

to populate a (templated) page

Page 48: Soton2013 opendata
Page 49: Soton2013 opendata
Page 50: Soton2013 opendata
Page 51: Soton2013 opendata
Page 52: Soton2013 opendata

“Creating” Data

Page 53: Soton2013 opendata
Page 54: Soton2013 opendata

[Disruptive Innovation?]

Page 55: Soton2013 opendata
Page 56: Soton2013 opendata

Company

DirectorDirector

DirectorDirector

CompanyCompany

CompanyCompany

Page 57: Soton2013 opendata
Page 58: Soton2013 opendata

Barriers to Use

Page 59: Soton2013 opendata

OpenRefine

Page 60: Soton2013 opendata
Page 61: Soton2013 opendata
Page 62: Soton2013 opendata
Page 63: Soton2013 opendata

Also:- month overflows at week end- year overflows

- Character string dates- Erratic whitespace- Arbitrary separators- Excel Dates

Page 64: Soton2013 opendata
Page 65: Soton2013 opendata
Page 66: Soton2013 opendata
Page 67: Soton2013 opendata

Open is as open

does… DATA

Page 68: Soton2013 opendata

@psychemedia

blog.ouseful.info