

Page 1: Dealing with Research Data and Dissertations (repozitar.techlib.cz/record/1182/files/Workshop_presentation_Schopfel_Prost.pdf)

Dealing with Research Data and Dissertations Workshop

Joachim Schöpfel & Hélène Prost

This presentation is licensed under the Creative Commons: CC-BY-SA-4.0, via http://www.nusl.cz/ntk/nusl-369340

Page 2

Time schedule

• 13:00 First part
– What you should know about data
– What you should know about data literacy, attitudes and needs
– What you should know about data related to PhD dissertation
– What you should know about service development

• 15:00 Pause

• 15:20 Second part
– Presentation of Lille project
– Discussion

• 16:50 Short feedback

• 17:00 End

Workshop Data and Dissertations - October 18, 2017 - NTK 2

Page 3

Our objectives

Our intention is that each participant leaves the workshop with a better understanding of

• A realistic model of RDM with PhD students on the campus

• Critical issues (anticipation of problems, risk analysis)

• Key success factors (governance, education, cooperation)

Page 4

Your expectations?

Page 5

Preliminary questions

Do you know

• the international directory of research data repositories re3data?

• DMPonline for the creation of data management plans?

• the two data repositories Zenodo and figshare?

• the Educopia ETD+ Toolkit for the management of the students’ research output?

Page 6

FIRST PART

What you should know…

Page 7

What you should know about data


A popular but uncertain concept:

• « Big Data is the Information asset characterised by such a High Volume, Velocity and Variety to require specific Technology and Analytical Methods for its transformation into Value » (De Mauro et al. 2016)

• « What constitutes data is determined by a given community of interest that produces the data. However, an investigator may be part of multiple, overlapping communities of interest, each of which may have different notions of what are data » (Koltay 2016)

• « The recorded factual material commonly accepted in the scientific community as necessary to validate research findings » (OMB Circular A-110)

Page 8

GL19 – October 23-24, 2017 – National Research Council of Italy - Roma 8

A conceptual approach

Page 9

Typology of data


« Data are most often defined by example, such as facts, numbers, letters and symbols » (Borgman et al. 2015)

• Observation

• Experimentation

• Simulation

• etc.

Often more conditioned by instruments and methods than by disciplines and communities.

Page 10

Primary and secondary data

• Data as material (resource) for research

• Data as research results

• Long tail of data

• Unequal categories

• Domain-specific profiles

Page 11

A functional approach


• Politics
– Increase transparency
– Increase efficiency of public action
– Provide fuel for economy

• Economics
– Optimize (valorize) public research
– Accelerate innovation (health, environment)

• Science
– Explore (reuse)
– Visualize results (also: data journalism)
– Compare and/or control results
– Validate hypotheses
– Also: citizen science

Page 12

Data and research process

Data are dynamic. They have their own life cycle.

Figure: research data life cycle (http://library.nuigalway.ie/media/digitalscholarship/images/research-data-lifecycle.jpg)

Page 13

What you should know about data literacy, attitudes and needs

• An increasing number of surveys

• Institutional and disciplinary differences

• However, some common characteristics

• Schöpfel, J., Prost, H., 2016. Research data management in social sciences and humanities: A survey at the university of Lille 3 (France). LIBREAS. Library Ideas 29, 98-112. http://hal.univ-lille3.fr/hal-01395816

Page 14

Data literacy


Diversity of data

Charts: Research data results (n=211); Research data sources (n=214)

Page 15

Data literacy

Storage and sharing

• 83% on private computer

• 49% on professional computer

• 97% declare themselves responsible for data backup

Data sharing limited

• 64% do not share their research data with colleagues or other people; nobody else has access to their data

Page 16

Attitudes toward sharing: experience, motivation

Chart: Deposit of research data in a data repository (n=162)

Page 17

Attitudes toward sharing: which kind of data?

Chart: Data deposit (n=187)

Page 18

Attitudes toward sharing: data publishing

Chart: Data publishing (n=147)

Page 19

Attitudes toward sharing: preferred data repository

Chart: Preferred data repository (n=173)

Page 20

RDM related needs: above all, storage

Chart: Support and services needed (n=188)

Page 21

What you should know about data related to PhD dissertation

ETD

Research data

Institutional repositories


Originality

Linked to a scientific program

No commercial and public character

Page 22

The potential of ETDs

• Contain the results of at least three years of scientific work

• Variety and richness of appendices

• Availability in open access

• Contribute to eScience

Page 23

ETDs and data

GL17 - Dec 1-2, 2015 23

Page 24

The size of data appendices

Page 25

Data type and discipline

Page 26

What you should know about service development

• Five basic questions of strategic service marketing

– What is our business?

– Who are our customers?

– What is our value for them?

– Where is the business going?

– Where should we go?

• Collective choice

• Compliance with institutional policy

Page 27

Strategic analysis with SWOT


Internal vs external factors

Helpful vs harmful factors

https://canvanizer.com/images/canvas-thumb/swot-canvas.png
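The two axes of the SWOT grid (internal vs external, helpful vs harmful) can be sketched as a small classification routine. This is only an illustrative sketch: the `Factor` class, the function names and the example factors are hypothetical and are not taken from the workshop material.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Factor:
    """One factor in a SWOT analysis (hypothetical structure)."""
    description: str
    internal: bool  # True: inside the institution; False: external environment
    helpful: bool   # True: supports the objective; False: works against it


def swot_quadrant(factor: Factor) -> str:
    """Map the two SWOT axes to one of the four quadrants."""
    if factor.internal:
        return "Strength" if factor.helpful else "Weakness"
    return "Opportunity" if factor.helpful else "Threat"


def build_swot(factors):
    """Group factor descriptions by quadrant."""
    grid = {"Strength": [], "Weakness": [], "Opportunity": [], "Threat": []}
    for factor in factors:
        grid[swot_quadrant(factor)].append(factor.description)
    return grid


# Invented example factors, for illustration only.
example_factors = [
    Factor("Existing ETD workflow in the library", internal=True, helpful=True),
    Factor("No dedicated RDM staff", internal=True, helpful=False),
    Factor("Funder requirements for data management plans", internal=False, helpful=True),
    Factor("Unclear national preservation infrastructure", internal=False, helpful=False),
]

swot = build_swot(example_factors)
for quadrant, items in swot.items():
    print(quadrant, items)
```

Keeping the two axes as separate booleans mirrors the slide: the quadrant is derived from them rather than assigned by hand.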

Page 28

Preparing change with the Strategyzer Canvas

https://strategyzer.com/canvas/business-model-canvas

Page 29

Taking Murphy’s law seriously: "Anything that can go wrong will go wrong"

• Risk analysis: what can go wrong, and what should be done?

– Nature of risk

– Probability

– Impact (severity)

– Overall assessment (probability * impact)

– Prevention (what can be done to avoid risk)

– Action (what can be done if it goes wrong)
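The risk analysis steps above can be sketched as a simple risk register in which the overall assessment is computed as probability * impact. This is a minimal sketch under assumptions: the 1 to 5 scales, the `Risk` class and the example entries are hypothetical and not part of the workshop material.

```python
from dataclasses import dataclass


@dataclass
class Risk:
    """One line of a risk register (hypothetical structure)."""
    nature: str
    probability: int  # assumed scale: 1 (unlikely) to 5 (almost certain)
    impact: int       # assumed scale: 1 (minor) to 5 (severe)
    prevention: str   # what can be done to avoid the risk
    action: str       # what can be done if it goes wrong

    @property
    def score(self) -> int:
        """Overall assessment: probability * impact."""
        return self.probability * self.impact


def rank_risks(risks):
    """Return the risks sorted from highest to lowest overall score."""
    return sorted(risks, key=lambda risk: risk.score, reverse=True)


# Invented example entries, for illustration only.
register = [
    Risk("Low uptake of the deposit service", 3, 3,
         "Involve doctoral schools early", "Run targeted training sessions"),
    Risk("Data lost before deposit", 4, 5,
         "Offer institutional storage", "Help recover from local backups"),
    Risk("Personal data in deposited files", 2, 5,
         "Checklist at submission", "Restrict access and review the deposit"),
]

ranked = rank_risks(register)
for risk in ranked:
    print(risk.score, risk.nature)
```

Sorting by score gives a first priority list; the prevention and action fields stay attached to each risk so the register can be reviewed line by line.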

Page 30

Questions?

Page 31

PAUSE

20 minutes

Page 32

SECOND PART

Our project, your initiatives…

Page 33

The Lille project

• White paper on data in dissertations

• 2015-2018

Page 34


Local ETD data workflow

Page 35

Main issues of discussion

• Content and coverage
– Granularity
  • Reuse
– Data format
  • Checklist
– Database

• Metadata
– Indexing
  • ETD metadata
– Data structure
  • METS
– Referentials
  • 5 DC elements
– Identifier
  • Handle
– Source code

• Other issues
– Legal aspects
– Deposit
  • Who has access?
– Data size
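To make the metadata discussion more concrete, here is a minimal sketch of a descriptive record with five Dublin Core elements, serialized as XML. The slides do not say which five elements are used, so the selection below (title, creator, date, type, identifier), the `record` wrapper element, the `build_dc_record` helper and all values (including the placeholder Handle) are assumptions for illustration. This is not a METS document, only the descriptive part that such a container could carry.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core Metadata Element Set 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def build_dc_record(fields: dict) -> ET.Element:
    """Build a flat record with one dc:* child per entry (hypothetical helper)."""
    record = ET.Element("record")  # hypothetical wrapper element
    for name, value in fields.items():
        child = ET.SubElement(record, f"{{{DC_NS}}}{name}")
        child.text = value
    return record


# Invented example values; the identifier is a placeholder, not a real Handle.
example_fields = {
    "title": "Interview transcripts (appendix to a PhD dissertation)",
    "creator": "Doe, Jane",
    "date": "2017",
    "type": "Dataset",
    "identifier": "hdl:00000/example",
}

record = build_dc_record(example_fields)
xml_text = ET.tostring(record, encoding="unicode")
print(xml_text)
```

A small fixed set of elements keeps the deposit form short for students; richer structure (files, relations to the ETD) would belong in the container format rather than in these fields.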

Page 36

Other issues

• Long term preservation
– National infrastructure

• Quality
– Validation?
– Filter?

• Promotion
– But no « data sharing ideology »

• Technical documentation
– For students
– For staff

Page 37

The PhD training program

Page 38

Key elements of training program

• Mixed team (scientists, librarian)

• Multidisciplinarity (but limited to SSH)

• 20 hours, six months

• Different levels of PhD projects
– May be discontinued

• Mix of (some) theory and (much) practice

• PhD DMP as the common thread of the training program
– With DMP platform

• Individual follow-up and time for discussion

• Evaluation

Page 39

Examples of students’ DMP

• On French CNRS platform DMP OPIDoR

• Dynamic character of DMP (initial, mid-term, final)

• Different data literacy level of students, depending on

– Progress of dissertation project

– Discipline (e.g. ethical and privacy issues in psychology, sociology…)

Page 40

Your initiatives?

Page 41

Questions?

• Main characteristics of an RDM program with (for) PhD students

• Key factors of success

– Governance, education, viral marketing, partnerships…

• Major risks

– Leadership, ideology (evangelism), focus on tools not people

Page 42

Feedback?

Page 43

THANK YOU!


References: http://www.citeulike.org/user/Schopfel/tag/data_management
