martin donnelly sarah jones dmp online

31
Future Perfect 2012: Digital Preservation by Design Te Papa Tongarewa, Wellington, New Zealand 26 – 27 March 2012 Research data management: from policy to practice with DMP Online Martin Donnelly Digital Curation Centre University of Edinburgh Sarah Jones Digital Curation Centre University of Glasgow

Upload: future-perfect-2012

Post on 06-May-2015

606 views

Category:

Technology


0 download

DESCRIPTION

Research data management: from policy to practice with DMP OnlineMartin Donnelly Sarah Jones

TRANSCRIPT

Page 1: Martin Donnelly Sarah Jones DMP Online

Future Perfect 2012: Digital Preservation by DesignTe Papa Tongarewa, Wellington, New Zealand

26 – 27 March 2012

Research data management: from policy to practice with DMP Online

Martin DonnellyDigital Curation CentreUniversity of Edinburgh

Sarah JonesDigital Curation CentreUniversity of Glasgow

Page 2: Martin Donnelly Sarah Jones DMP Online

Running order (c. 25 mins)1. Introduction to the DCC & research data management 2. Data-related policies in the UK 3. The DCC & data management planning4. DMP Online v3.05. Connections and collaborations6. Putting it into practice (UMF work and other things)7. Summary / conclusion

Sarah

Martin

Page 3: Martin Donnelly Sarah Jones DMP Online

1. The Digital Curation Centre

- Founded in 2004- Three partners: Edinburgh, Glasgow and Bath- Primary funder is JISC

Helping to build capacity, capability and skills in data management and curation across the UK’s higher education research community

- DCC Phase 3 Business Plan

Page 4: Martin Donnelly Sarah Jones DMP Online

What does the DCC do?

• Develop tools – CARDIO, DAF, DRAMBORA, DMP Online

• Offer guidance – helpdesk, briefing papers, how-to guides

• Run training & events– DC101, roadshow, RDMF, IDCC

• Support the JISC – esp. the Managing Research Data programmes

Page 5: Martin Donnelly Sarah Jones DMP Online

“the active management and appraisal of data over the lifecycle of scholarly and

scientific interest”

Data management is part of good research practice

What is Research Data Management?

Manage

Share

Page 6: Martin Donnelly Sarah Jones DMP Online

How does RDM affect preservation?

The costs of ingest – receiving data, preparing it for long-term storage, and incorporating it into the digital archive – receives the largest allocation of resources.

- Keeping Research Data Safe 2

Page 7: Martin Donnelly Sarah Jones DMP Online

2. Data-related policies in the UK

http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies

Page 8: Martin Donnelly Sarah Jones DMP Online

RCUK Common Principles• Publicly funded research data are a public good, produced in the public interest,

which should be made openly available with as few restrictions as possible in a timely and responsible manner that does not harm intellectual property.

• Institutional and project specific data management policies and plans should be in accordance with relevant standards and community best practice. Data with acknowledged long-term value should be preserved and remain accessible and usable for future research.

• To enable research data to be discoverable and effectively re-used by others, sufficient metadata should be recorded and made openly available ....

7 principles agreed by all the UK research councils in May 2011

http://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx

Page 9: Martin Donnelly Sarah Jones DMP Online

UK research funder expectations

• timely release of data– once patents are filed or on (acceptance for) publication

• open data sharing – minimal or no restrictions– deposit in data centres, structured databases, data enclave

• preservation of data – most funders state expect 5-10+ years

• submission of data management and sharing plans…

Page 10: Martin Donnelly Sarah Jones DMP Online

3. The DCC and DMP

Links to all DMP resources via http://www.dcc.ac.uk/resources/data-management-plans

We’ve responded to requirements by offering support

Analysed requirements

Developed a Checklist

Provided tools & guidance

Page 11: Martin Donnelly Sarah Jones DMP Online

What is a DMP?

UK research funders typically ask for:

• A short statement/plan submitted in grant applications

• An outline of what you will create/collect, methods, standards, data management and long-term plans

• How and why – justify your decisions and any limits

Page 12: Martin Donnelly Sarah Jones DMP Online

Common DMP questions

• What data will be created (format, types) and how?

• How will the data be documented and described?

• How will you manage ethics and Intellectual Property?

• What are the plans for data sharing and access?

• What is the strategy for long-term preservation?

Page 13: Martin Donnelly Sarah Jones DMP Online

§1: Introduction and Context§2: Data Types, Formats, Standards and Capture

Methods§3: Ethics and Intellectual Property§4: Access, Data Sharing and Re-use§5: Short-Term Storage and Data Management§6: Deposit and Long-Term Preservation§7: Resourcing§8: Adherence and Review§9: Agreement/Ratification by Stakeholders§10: Annexes

DCC Checklist Coverage

Checklist for a Data Management Plan v3.0 (Donnelly and Jones,

March 2011)

http://www.dcc.ac.uk/resources/data-management-plans

Page 14: Martin Donnelly Sarah Jones DMP Online

DMP-related resources

– “Dealing with Data” (Lyon, 2008)– Analysis of Funder Policies (Jones, 2009)– Checklist for a Data Management Plan

(Donnelly and Jones, 2009)– “How to Develop a Data Management and

Sharing Plan” (Jones, 2011) Edinburgh: Digital Curation Centre

– “Data Management Plans and Planning” (Donnelly, 2012) in Pryor (ed.) Managing Research Data, London: Facet

Links to all DCC resources via http://www.dcc.ac.uk/resources/data-management-plans

Page 15: Martin Donnelly Sarah Jones DMP Online

Key things to remember

All research projects are different

The DMP will depend upon the nature of the research AND the context (funder, domain, institution(s) etc)

DMPs are useful communication tools

Page 16: Martin Donnelly Sarah Jones DMP Online

Not a UK phenomenon

“Data Management Plans and Planning” (Donnelly, 2012) in Pryor (ed.) Managing Research Data, London: Facet

“Research data policies: principles, requirements and trends” (Jones, 2012) in Pryor (ed.) Managing Research Data, London: Facet

Read about the international policy and DMP landscape in:

Page 17: Martin Donnelly Sarah Jones DMP Online

4. www.dcc.ac.uk/dmponline

Page 18: Martin Donnelly Sarah Jones DMP Online

What does do?

A web-based tool that enables users to...

i. Create, store and update multiple versions of Data Management Plans across the research lifecycle

ii. Meet a variety of specific data-related requirements (from funders, institutions, publishers, etc.)

iii. Get tailored guidance on best practice and helpful contacts, at the point of need

iv. Customise export are share DMPs in a variety of formats in order to facilitate communications within and beyond research projects

* N.B. The templates have varying degrees of endorsement from funders, stakeholder communities, etc. More on this shortly…

Page 19: Martin Donnelly Sarah Jones DMP Online

Technologies involved (v3.0)

– Ruby on Rails (v3.1.3)– JavaScript (jQuery v1.7.1)– MySQL database (v5+)– Hosting: University of Edinburgh Information Services

Virtual Hosting (13 managed servers across 2 sites)– Authentication: registered users with passwords encrypted

in DB (we are also testing Shibboleth for integration with UK Access Management Federation for Education and Research)

– Various export formats (DOCX, PDF, XML, CSV, etc)

Page 20: Martin Donnelly Sarah Jones DMP Online

DMP Online v3.0: Spring 2012- Improved user interface, inc. customisable

institutional versions- New features

- Overlaying multiple templates for ‘hybrid’ DMPs- Template phases (e.g. pre- / during / post-project)- Granular read / write / share permissions- API for systems interoperability (e.g. this project)- Shibboleth authentication- Multilingual support / boilerplate text

- Endorsement from funders

Page 21: Martin Donnelly Sarah Jones DMP Online

- Generic data management guidance ( in conjunction with )

- Funder-specific guidance developed in collaboration with the funders themselves

- Institution-specific guidance developed with key institutional contacts

- Discipline-specific guidance developed and deployed with JISC MRD projects (e.g. DMT Psych at York)

- Joint training programmes organised and delivered by DCC and UKDA

- Provided advice to US consortium

Collaborations

Page 22: Martin Donnelly Sarah Jones DMP Online

Templates: Stakeholder Liaison (i)RCUK funders Status

Arts and Humanities Research Council (AHRC) Discussions beginning

Biotechnology and Biological Sciences Research Council (BBSRC)

Discussions ongoing

Engineering and Physical Sciences Research Council (EPSRC)

No explicit data management plan requirements: DCC referenced in roadmap requirements

Economic and Social Research Council (ESRC) Template and guidance developed in collaboration with ESRC and ESDS. Funder’s online guidance points applicants towards tool.

Medical Research Council (MRC) Template in preparation through collaboration with funder

NERC (Natural Environment Research Council) Discussions ongoing

Science and Technology Facilities Council (STFC) DCC resources referenced in data requirements

Other funders Status

The Wellcome Trust Template and guidance endorsed by funder

National Science Foundation (US) Template developed by Sherry Lake, University of Virginia

Page 23: Martin Donnelly Sarah Jones DMP Online

Templates: Stakeholder Liaison (ii)Disciplinary templates Status

History Developed in conjunction with University of Hull and University of Hertfordshire

Psychology Developed by DMT Psych project, led by University of York

Mechanical Engineering Developed as part of REDm-MED project, led by University of Bath

Health sciences Developed by DATUM for Health project, led by University of Northumbria

Spatial information (INSPIRE) Developed in conjunction with EDINA (UK national data centre) and trialled with Freshwater Biological Association

Institutional templates Status

University of Northampton Developed in collaboration with Information Services department

More institutional and subject-based templates are being developed through the JISC RDM projects and UMF institutional engagements…

Page 24: Martin Donnelly Sarah Jones DMP Online

Institutional Engagements: Putting it into practice

- Working with eighteen institutions over approximately 18 months to improve data management capabilities

- A broad variety of institutional types and sizes, from research intensive ancient universities, to new universities and small specialist institutions (e.g. art colleges)

- Institutions select from a ‘menu’ of tools and services, e.g. (next slide)

Page 25: Martin Donnelly Sarah Jones DMP Online

Components of a Data Management Strategy (Research and Admin)

DCC Tools DCC Services

Policy Data Asset Framework (DAF)

Policy development

Planning DMP Online Strategy development

Advocacy CARDIO Training

Tools DRAMBORA Workflow assessment

Training Costing

Institutional data catalogues (discovery)

The Menu

Page 26: Martin Donnelly Sarah Jones DMP Online

Workflow connectionsDMP Online can also be used in conjunction with other tools that support the data management/curation lifecycle, e.g.…

- DAF (Data Asset Framework)- DRAMBORA (Digital Repository Audit Method

Based On Risk Assessment)- CARDIO (Collaborative Assessment of

Research Data Infrastructure and Objectives)

Also non-DCC tools:

- LIFE- Planets tools- and more

Page 27: Martin Donnelly Sarah Jones DMP Online

For machine readership…

- Facilitates quick public sharing

- Compatible with API for linking with other systems

- Minimal formatting

For human readership…

- Pleasant formatting

- Editable. Can be used in conjunction with (e.g. MS Sharepoint)

- Removes all formatting

How to connect: six export formats

Page 28: Martin Donnelly Sarah Jones DMP Online

Systems– CRIS / admin systems– RCUK Je-S system– Institutional Repositories– DDI repository– DMP Tool (US)– Other instances of DMP

Online via federated model (? -TBC)

External connectionsStandards / protocols– CERIF*

– SWORD2– DDI* – RDF (? - TBC)

* via RESTful API

Page 29: Martin Donnelly Sarah Jones DMP Online

Researcher(s)

Research Support Office

Computing Support

Faculty Ethics Committee Etc...

DATAMANAGEMENTPLAN

UNRULYDATA

Data Library / Repository / Archive

Page 30: Martin Donnelly Sarah Jones DMP Online

To sum...

All of our DMP-related resources available online via:

www.dcc.ac.uk/dmponline/

Page 31: Martin Donnelly Sarah Jones DMP Online

Thank you

Image credits: Slide 1 - http://upload.wikimedia.org/wikipedia/commons/8/88/LernaeanHydraRephael.jpg Slide 5 - http://www.dcc.ac.uk/resources/curation-lifecycle-model Slide 6 (The Scream) - http://www.flickr.com/photos/terryfreedman/6548040049 Slide 6 (OAIS) - http://public.ccsds.org/publications/archive/650x0b1.pdf Slide 29 - http://en.wikipedia.org/wiki/File:Hercules_slaying_the_Hydra.jpg Slide 30 - http://www.treehugger.com/picture-is-worth-sum-car-parts.jpg

This work is licensed under the Creative Commons Attribution 2.5 UK: Scotland License.

Martin DonnellyDigital Curation CentreUniversity of Edinburgh

[email protected]: @mkdDCC

Sarah JonesDigital Curation CentreUniversity of Glasgow

[email protected]: @sjDCC

Check out DCC at: www.dcc.ac.uk or follow us on twitter @digitalcuration and #ukdcc