e-infrastructure for social science data: obesity e-lab & methodbox

e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox Ian Dunlop 15/03/11 [email protected]



Page 1: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Ian Dunlop

15/03/11 [email protected]

Page 2: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Terminology

• Obesity e-Lab is the ESRC project: www.obesityelab.org.uk

• MethodBox is the product: www.methodbox.org

Page 3: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox
Page 4: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Obesity e-Lab Aims

• Enable socially networked research between the social sciences, health sciences and public health

• Add value to archived datasets by developing technologies to help on-line users

• Seed an “open source” approach to social research publication

Page 5: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Project Objectives

• Engagement (‘More with less’)
– Research communities (Obesity/Cancer, Education)
– Public health researchers (Academic, NHS, LA)
– Key data providers (ESDS/UKDA)

• Reduce barriers
– For survey datasets
– Formation of research communities (cross-disciplinary)

• Develop tools
– Online digital laboratory, an ‘e-Lab’ known as MethodBox

• Data * Methods * People

Page 6: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

e-Lab

Socially-stimulating science, in-silico

Research Object

Find, Share, Reuse

Data-sources

Data-preparation scripts

Research protocol

Statistical analysis scripts

Slides

Working datasets

Figures/Graphics

Manuscripts

References

Analysis-logs & notes

Page 7: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Where we are up to

• MethodBox launched at ESDS government event April 2010 (scored 5.7/7 from 15 responses)

• 80 registered users, 45 scripts and 58 data extracts.

• 21 public health researchers trained using a combination of social science and health science approaches

• Methodological approach adopted by North West e-Health (www.nweh.org.uk) project (which is 20x bigger than us)

Page 8: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Context, Features, Architecture

• Context
– Investigation Cycle
– Survey (Meta) Data overload
– How MethodBox fits in

• MethodBox
– Architecture
– Screenshots
– E-Infrastructure

• Future Directions

Page 9: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Investigation Cycle

• Our Tooling focus is (survey) Data and Analysis

• Our main Community focus is Expertise via Methods/Analysis/Scripts

[Diagram: investigation cycle linking Questions, Data, Analysis, Models, Results, and Publications, Reports or Decisions; Tooling and Community mark the two focus areas]

Page 10: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Examples: HSE 2006

13 pages

208 pages

Variable Definitions

Variable Categories

Variable SPSS code

Questionnaire Instructions

224 pages

Questions used to set variables

148 pages

Survey Description

9 pages

Variable Value Domains

351 pages

46 MB data files

Data and Variable Codebook

X 17 All HSE

@1800 Variables

Page 11: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

How MethodBox fits in

UK Data Archive (UKDA)

MethodBox

Economic and Social Data Service (ESDS)

Survey Curation

Survey Mapping

diagram not to scale

Survey Navigation

Survey Commissioning & Collection

etc…

Improving Access & Use

Page 12: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Ruby delayed job

Ruby on Rails

Data providers

User Dataset import

File system

MySQL

Metadata import

User data and metadata import

Request ‘catalog’ information

Provide metadata
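The architecture slide lists Ruby delayed job next to Ruby on Rails, which suggests that slow work such as metadata import runs in the background rather than inside a web request. The sketch below illustrates that general pattern only. It is written in Python rather than Ruby, it is not MethodBox code, and every name in it (`metadata_store`, `import_metadata`, `enqueue`, `worker`, `run_demo`, the example dataset) is a hypothetical stand-in.

```python
import queue
import threading

# Hypothetical in-memory stand-ins for the metadata database and the job table.
metadata_store = {}
jobs = queue.Queue()


def import_metadata(dataset_name, variables):
    """Hypothetical job body: record the variable names of one dataset."""
    metadata_store[dataset_name] = sorted(variables)


def enqueue(func, *args):
    """Called by the web layer: queue the work and return immediately."""
    jobs.put((func, args))


def worker():
    """Background worker: run queued jobs until a None sentinel arrives."""
    while True:
        job = jobs.get()
        if job is None:
            jobs.task_done()
            break
        func, args = job
        func(*args)
        jobs.task_done()


def run_demo():
    """Start a worker, queue one import, and wait for it to finish."""
    thread = threading.Thread(target=worker)
    thread.start()
    enqueue(import_metadata, "example_survey", ["weight", "height", "age"])
    jobs.put(None)  # tell the worker to stop once the queue is drained
    jobs.join()     # block until every queued item has been processed
    thread.join()
    return metadata_store
```

Calling `run_demo()` queues a single import and returns the store once the worker has processed it. A production job queue would normally persist its jobs in a database so that they survive a restart; the in-memory queue here is only for illustration.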

Page 13: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Search

Page 14: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Results

Page 15: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Variable info with Stats

Page 16: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Profiles

Page 17: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

People & Expertise

Page 18: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Methods

Page 19: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Method Information

Page 20: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Data Extracts

Page 21: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Making the data extract visible…

Linking a data extract with a script for deriving variables…

Sharing and visibility
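As a rough illustration of linking a data extract with a script that derives variables, the sketch below selects a few variables from survey rows and then adds a derived one. It assumes the standard body mass index formula (weight in kilograms divided by height in metres squared). The function names, variable names and rows are invented for the example; they are not real HSE data and not MethodBox's own format.

```python
def make_extract(rows, variables):
    """Keep only the requested variables from each survey row."""
    return [{name: row[name] for name in variables} for row in rows]


def derive_bmi(extract):
    """Hypothetical derivation script: add BMI = weight (kg) / height (m) squared."""
    derived = []
    for row in extract:
        new_row = dict(row)  # copy so the original extract is left unchanged
        new_row["bmi"] = round(row["weight_kg"] / row["height_m"] ** 2, 1)
        derived.append(new_row)
    return derived


# Invented illustrative rows, not taken from any real survey.
survey_rows = [
    {"id": 1, "weight_kg": 70.0, "height_m": 1.75, "region": "NW"},
    {"id": 2, "weight_kg": 90.0, "height_m": 1.80, "region": "SE"},
]

extract = make_extract(survey_rows, ["id", "weight_kg", "height_m"])
result = derive_bmi(extract)
```

Keeping the extract and the derivation as separate steps means the same script can be shared and reapplied to another extract that contains the same variables.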

Page 22: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox
Page 23: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

MethodBox as e-Infrastructure

• Data Providers
– Existing infrastructure (NESSTAR/NESSTAR Server)
– Cautious
  • Adopt only ‘proven’ technologies
  • Willing to ‘try’ things if risk/work is low

• MethodBox offers
– Social Layer, sharing, data tooling
– Integration
  • Existing data provider infrastructure – NESSTAR Server
  • Security infrastructure (Shibboleth)
  • Automated running of scripts for new datasets (using institutional/national compute)

• Deployment
– ESDS/CCSR first instance (exit strategy)
  • Obesity e-Lab project ends 31/03/12

Page 24: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox

Future work

• MethodBox as e-Infrastructure
– Target deployment as part of ESDS/CCSR
– Integration with NESSTAR system

• Focus on communities
– Greater Manchester Public Health Inequalities Research Network
– University of Manchester School of Education
– North West e-Health and Arthritis Research UK

• Ability to ‘run’ methods
– Part funded by Obesity e-Lab work in JISC ‘National e-Infrastructure for Social Simulation’ project

video at http://bit.ly/methodbox11

Page 25: e-Infrastructure for Social Science data: Obesity e-Lab & MethodBox