unlocking the geospatial potential of survey data

18
UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA Tom Ensom & Veerle Van den Eynden wwww.data-archive.ac.uk

Upload: tomensom

Post on 30-Jun-2015

397 views

Category:

Technology


1 download

DESCRIPTION

Paper on a JISC-funded project based at the UK Data Archive, as presented at the GISRUK 2012 conference, Lancaster University. The project set out to better enable the use of Archive datasets in GIS, primarily by addressing metadata and quality issues of geospatial identifiers.

TRANSCRIPT

Page 1: Unlocking the geospatial potential of survey data

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Tom Ensom & Veerle Van den Eynden

wwww.data-archive.ac.uk

Page 2: Unlocking the geospatial potential of survey data

Archived survey data presents a vast wealth of material with potential for

secondary use in GIS

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 3: Unlocking the geospatial potential of survey data

UK DATA ARCHIVE

• Over 5,000 datasets

• Popular survey data series include:

Quarterly Labour Force Survey

British Household Panel Survey / Understanding Society

British Crime Survey

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 4: Unlocking the geospatial potential of survey data

We set out to explore the availability and usability of geo-identifiers in the UK Data

Archive collection

These identifiers come in the form of ‘spatial units’ e.g. Ward and Constituency

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 5: Unlocking the geospatial potential of survey data

• The availability of geo-referenced data is ever increasing

• The usability of geo-referenced data ‘out-of-the-box’ is still generally poor

Reflective of and contributing too a divide between:

• GIS experts – idiosyncratic methodologies• Untrained with interest – steep learning

curve

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 6: Unlocking the geospatial potential of survey data

1. SELECTION

2. QUALITY

3. METADATA

Three key features of ‘ready-to-link’ survey data for GIS

Page 7: Unlocking the geospatial potential of survey data

1. SELECTION

Include geographical identifiers which:

• Can be readily transformed

• Are of sufficient resolution to allow for fine-grained analysis

• Are appropriate to the data subject

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 8: Unlocking the geospatial potential of survey data

2. QUALITY

Include geographical identifiers which:

• Use standard names

• Are coded with a standard coding schemee.g. ONS’ GSS Coding and Naming

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 9: Unlocking the geospatial potential of survey data

3. METADATA

Include geographical identifiers which are:

• Time-referencede.g. Government Office Region as defined in 2001 as opposed to 1998

• Well documented in their derivation

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 10: Unlocking the geospatial potential of survey data

Those collecting data need to adjust their workflows to enable this

Those curating data need to adjust their workflows to enable this

Page 11: Unlocking the geospatial potential of survey data

What should data collectors be doing?

• Considering geographic identifiers BEFORE data collection!

• Considering standards• INSPIRE/GEMINI• GSS Coding and Naming

• Documenting the provenance of geographic identifiers

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 12: Unlocking the geospatial potential of survey data

What will we be doing at the UK Data Archive?

• INSPIRE compliance(we have published a metadata mapping for DDI-INSPIRE-GEMINI)

• Improving spatial unit definitions through extensive data cleansing

Standardised Time referenced

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 13: Unlocking the geospatial potential of survey data

What will we be doing at the UK Data Archive?

• Improving resource discovery tools / interface

User friendly Lessen time spent searching through text Consider semantics

• Feeding back to data depositors

Guidance on best practise

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 14: Unlocking the geospatial potential of survey data

U·Geo Browser

A new web tool for resource discovery

• Revised and augmented variable metadata

• Information clarifying the quality of the geo-identifier

• Integrated spatial unit definitions

• Links to boundary files

Live beta at: geo.data-archive.ac.uk

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 15: Unlocking the geospatial potential of survey data
Page 16: Unlocking the geospatial potential of survey data
Page 17: Unlocking the geospatial potential of survey data

U·Geo Browser

• A demo tool using a simple, pragmatic approach

• This tech will be integrated into a central Archive resource discovery tool, and catalogued data will be updated to reflect these refinements

-

• A step in the right direction but we need formal semantics built on persistent vocabularies

• A drive needed to establish this

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Page 18: Unlocking the geospatial potential of survey data

UNLOCKING THE GEOSPATIAL POTENTIAL OF SURVEY DATA

Tom Ensom

[email protected]

wwww.data-archive.ac.uk

@UKDataArchive

Thanks to:

• all those at the UK Data Archive

• to EDINA for their contributions as consultants