Update on the Canadian RDM Landscape
TRANSCRIPT
Jeff Moon, Director, Portage
Atlantic Canada RDM Day | 21 October 2020
Funding in support of the Portage Network's stewardship of research data within Canada is administered through the New Digital Research Infrastructure Organization (NDRIO).
DMP Exemplars
DMP Templates
Image courtesy: Daimon Tayler-McLeod
Agenda
1. What is Research Data Management?
2. Introduction to Portage
3. Making data FAIR
   1. Tri-agency RDM Policy
   2. Supports for institutions & researchers
   3. Other initiatives
4. Looking forward → DM under NDRIO
Photo by Thomas Renaud on Unsplash
What is Research Data Management?
Photo by fabio on Unsplash
"Research data management concerns the
organisation of data, from its entry to the research
cycle through to the dissemination and archiving
of valuable results. It aims to ensure reliable
verification of results, and permits new and
innovative research built on existing information."
Whyte, A., Tedds, J. (2011). ‘Making the Case for Research Data Management’. DCC Briefing Papers. Edinburgh: Digital Curation Centre. Available online
Photo by Shahadat Rahman on Unsplash
Why manage research data?
Because it is good practice.
Photo by Alex on Unsplash
Why manage research data?
Because it is practical.
Photo by Sergey Zolkin on Unsplash
Why manage research data?
Because it will be required.
Photo by Christina @ wocintechchat.com on Unsplash
Research
Life Cycle
RDM Drivers
Making effective use of public funds
Improving discoverability & accessibility
Extending research
Facilitating interoperability
Supporting replicability
Growing demand for data
Avoiding duplication
Verification of research results
Meeting funder & journal requirements
Growing public awareness of data
Enabling good public policy making
Aligning with international best practices & standards
steer by zidney from the Noun Project
What keeps me up at night . . .
Data
Commercial interests
Preventing data loss
. . . it can happen to you!
From the Noun Project: infrastructure by Nithinan Tatah | computing by Adrien Coquet | tools by tanu doank | training by Adrien Coquet
Network of Experts
Training
Services
Tools
Infrastructure Platforms
International & Domain Focus
Facilitation & Convening
Planning
Policies
Design Research Data Management Plan
Reuse
Finding data
Data citation
Collect & Analyze
Capture and organize data
Active Storage and backup
Documentation & metadata
File naming & formats
Collaborate
Deposit & Preserve
Reformatting
Standards
Archival Storage
Publish
Data sharing
Copyright & Ownership
Ethics
Research Data Management Life Cycle
Network of Experts
DMP Expert Group
Discovery & Metadata Expert Group
Preservation Expert Group
Sensitive Data Expert Group
Curation Expert Group
National Training Expert Group
Research Intelligence Expert Group
Data Repositories Expert Group
130+ Experts, 60+ Organizations
National Support
DMP Coordinator
Discovery & Metadata Coordinator
Preservation Coordinator
Policy, Privacy, & Sensitive Data Coordinator
Curation Coordinator & Officers
National Training Coordinator
Research Intelligence & Assessment Coordinator
Communications & Project Officers
How can I make my data FAIR?
Photo by Alexander Sinn on Unsplash
FAIR Principles
Findable, Accessible, Interoperable, Reusable
A set of principles to ensure that data are shared in a way that enables and enhances reuse by humans and machines
Funder Policies
Data Management Plans
Institutional Strategy
Deposit
DRAFT Tri-Agency Research Data Management Policy For Consultation
Broad uptake
Revised in 2020
Data Management Plans
Institutional Strategy
Deposit
RDM Strategy Template
National, Multi-disciplinary Repository Options
DMP Assistant
National, online, bilingual, Data Management Planning Tool
New version imminent
New discipline-specific Exemplars & Templates
Dataverse & Federated Research Data Repository
https://publons.com/benefits/institutions
How do I make my data FAIR?
Institutional Buy-in
STRATEGY COMPONENTS
Raise awareness
Assess institutional readiness
Formalize RDM practices
Define a Roadmap
Institutional RDM Strategy
How do I make my data FAIR?
Planning!
Photo by Joanna Kosinska on Unsplash
What is a Data Management Plan (DMP)?
➔ Describes what data you expect to acquire or generate during the course of a research project and why
➔ Explains how you will manage, describe, analyze, and store your data and who will be responsible
➔ Details when and where you will deposit your data and how it will be shared
https://communitylivingstmarys.ca/services/community-development-and-planning-services/
Why DMPs?
➔ Helps you think ahead and map out how you will manage, describe, analyze, store, and share your data
➔ Helps identify areas for improvement & questions that need to be answered
➔ Provides you & others with a record of what you intend(ed) to do
➔ They are (or will be) required
https://media.defense.gov/2017/Nov/13/2001842185/-1/-1/0/171026-F-RN211-001.JPG
Infrastructure and Support
DMP Assistant
➔ National, online and bilingual
➔ Step-by-step, easy-to-use
➔ Framed around key sections, questions, & guidance
➔ Update anytime, share with collaborators
➔ Output in funder-ready formats
Visit: https://assistant.portagenetwork.ca/
How do I make my data FAIR?
Deposit!
Photo by Joanna Kosinska on Unsplash
Research Data Storage Continuum
Active Storage: Controlled Access | Working Copy | Short-term (duration of project) | Used to complete research
Repository Storage: Open (as appropriate) | Dissemination Copy | Medium-term (beyond duration of project) | Discovery & Access
Archival Storage: Open (as appropriate) | Preservation Copy | Long-term | Disaster recovery / copy of last resort
From the Noun Project: storage by Nithinan Tatah | Share by Prasad | Time by Alice Design | working by Ranah Pixel Studio | Access by Adrien Coquet | Unlock by Zulfa Mahendra | Research by sandra | click by Delwar Hossain
Research Life Cycle: Active Storage | Repository Storage | Archival Storage
Benefits of Data Repositories
● Ensure data are discoverable & accessible beyond the original study
-- The Availability of Research Data Declines Rapidly with Article Age (Vines et al., 2014)
● Support publishing datasets for discovery and re-use
● Assign a Digital Object Identifier (DOI) for unambiguous citation
● Set licensing terms specifying how datasets may be used
● Monitor research impact by tracking use of published datasets
Repository Options in Canada: A Portage Guide
Storage, Discovery & Access
Infrastructure and Support: Repositories
A scalable, federated platform for digital research data management and the discovery of Canadian research data
Big data capable & scalable
Retains file hierarchies
Geographically distributed
From the Noun Project: Big data by Arafat Uddin | Server by Graphic Tigers | Subfolder by shashank singh
Federated Research Data Repository
Total Datasets in FRDR Repository: 135
Total number of FRDR accounts: 275
Total Published: 14.9 TB (Oct 2020)
Institutions: 51
Dataverses: 450+
Datasets: 2,307
Files: 29,862
Downloads: 178,642
Nov 12, 2019
University of British Columbia licensed
Simon Fraser University
University of Northern British Columbia
UAL
Dataverse Dataverse
Scholars Portal
Dataverse
https://www.technologynetworks.com/informatics/news/deep-learning-algorithm-could-remove-materials-discovery-bottleneck-339063
How do I make my data FAIR?
Ensure they are findable & accessible
FRDR.ca
National Discovery Layer
Metadata harvested to FRDR
Domain-specific Repositories
General Repositories
Government Repositories
Improve discovery of Canadian research (meta)data
Break down repository siloes
Drive traffic to existing repository sites
Create interoperability between Canadian and international platforms
Harvested Canadian repositories: 79
https://techbeacon.com/app-dev-testing/seven-key-enablers-continuous-testing
How do I make my data FAIR?
Other initiatives…
How do I make my data FAIR?
Persistent Identifiers
Photo by Nina PhotoLab on Unsplash
FAIR Enablers: Persistent Identifiers
➔ Persistent identifier (PID): a long-lasting reference to a digital resource
➔ Provides the information required to reliably identify, verify and locate
➔ Example: Digital Object Identifier (DOI)
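As a rough illustration of how a DOI acts as a persistent identifier, the Python sketch below normalizes a DOI string and builds the https://doi.org/ resolver link, which redirects to wherever the resource currently lives. The helper names, the loose syntax pattern, and the sample DOI 10.1234/example-dataset are assumptions made for illustration only; none of them come from the slides.

```python
import re
from urllib.parse import quote

# Loose shape check: "10.", a 4-9 digit registrant prefix, "/", then a suffix.
# This is a common heuristic, not the formal DOI specification.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")

# Prefixes people commonly paste in front of a bare DOI.
KNOWN_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalize_doi(raw: str) -> str:
    """Return the bare DOI, stripping a 'doi:' label or resolver URL if present."""
    doi = raw.strip()
    for prefix in KNOWN_PREFIXES:
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    return doi


def doi_to_url(raw: str) -> str:
    """Build the resolver URL for a DOI, or raise ValueError if it looks malformed."""
    doi = normalize_doi(raw)
    if not DOI_PATTERN.match(doi):
        raise ValueError(f"does not look like a DOI: {raw!r}")
    # Percent-encode unusual characters but keep the "/" separators.
    return "https://doi.org/" + quote(doi, safe="/")


if __name__ == "__main__":
    # Hypothetical DOI, used only to show the shape of the output.
    print(doi_to_url("doi:10.1234/example-dataset"))
```

The point of the resolver link is that a citation keeps working even if the repository later moves the dataset, because only the DOI's registered target needs updating.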
FAIR Enablers: Persistent Identifiers
DataCite Canada Consortium
➔ Supports Canadian institutions in managing and providing DOIs
➔ Allows Canadian researchers to obtain DOIs for their research outputs easily and without direct costs
FAIR Enablers: Persistent Identifiers
ORCID-CA: The ORCID Consortium in Canada
➔ Obtain an ORCID iD for free from https://orcid.org/
➔ Publish information about your research interests and collate all your research outputs in one location
➔ Solve name ambiguity and researcher identification problems
➔ Major publishers, funders and research institutions have been adopting ORCID iDs
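One reason an ORCID iD works well for disambiguation is that its last character is a check digit, so most typing errors can be caught before a record is linked to the wrong person. The Python sketch below shows that check as ORCID documents it (ISO 7064 MOD 11-2). The function names are illustrative assumptions, and the sample iD is the one ORCID publishes for its fictitious researcher Josiah Carberry.

```python
def orcid_check_digit(base_digits: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    # A result of 10 is written as the letter X.
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid: str) -> bool:
    """Check the shape and check digit of an iD written as 0000-0000-0000-0000."""
    compact = orcid.replace("-", "").upper()
    base, check = compact[:15], compact[15:]
    if len(compact) != 16 or not (base.isascii() and base.isdigit()):
        return False
    return orcid_check_digit(base) == check


if __name__ == "__main__":
    print(is_valid_orcid("0000-0002-1825-0097"))  # ORCID's documented sample iD
    print(is_valid_orcid("0000-0002-1825-0098"))  # same iD with the last digit changed
```

This only confirms that the iD is well formed; it does not confirm that the iD is registered or that it belongs to a particular researcher.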
Other Potential PIDs
Source: https://www.slideshare.net/OpenAIRE_eu/new-pid-developments
FAIR Enablers: Repository Certification
Internationally endorsed set of core characteristics of trustworthy data repositories
https://www.coretrustseal.org/why-certification/certified-repositories/
FAIR Enablers: Repository Certification
Align with emerging TRUST Principles:
• Transparency
• Responsibility
• User Focus
• Sustainability
• Technology
https://www.nature.com/articles/s41597-020-0486-7
FAIR Enablers: Metadata, Controlled Vocabularies & Discovery
FAST Subject Headings (Faceted Application of Subject Terminology)
Improving & expanding metadata harvesting from Canadian Repositories
Improving Geospatial discovery through GEODISY Project
Geospatial Discovery (beta): https://geo.frdr-dfdr.ca/
FAIR Enablers: Sensitive Data
Chandra Kavanagh, Memorial University
✓ Glossary of sensitive data terms
✓ Defining risks related to roles of individuals
✓ Deposit-friendly text for ethics, informed consent
✓ Data access agreements
✓ Research Data Risk Matrix
Risks to Research Participants
Risks to Groups, Communities and Third Parties
Risks to Researchers
Risks to Institutions
Risks to Data
Risk Management
FAIR Enablers: Sensitive Data
✓ FRDR-aligned project to support sensitive data
✓ Exploring viability of zero-knowledge encryption in an RDM context
✓ Developing tools to encrypt packages at time of deposit
✓ Supporting metadata-only review and discovery of sensitive datasets
FAIR Enablers: Training!
Strengthening Research Data Management in Canada:
A National Training Strategy
[Draft] October 2020
Looking Forward
DRI before…
DRI after…
New CEO: Nizar Ladak
NDRIO Board
https://engagedri.ca/wp-content/uploads/2020/09/Fact-Sheet.pdf
https://thebulletin.org/doomsday-clock/current-time/
Key priorities for DM
Transition to NDRIO
Merger of Portage & RDC Committees
Integration with ARC & RS
Needs Assessment & Input from Researcher Council
RDM Platform Support & Development
Develop and operationalize national, collaborative RDM platforms & services
National Data Stewardship Support
Providing coordinated support at the national level, across the data life cycle
DM Timeline for transition to NDRIO
[Timeline figure. Funding phases: CANARIE; CANARIE/ISED Directed Funding; NDRIO Directed Funding Under Corporate Plan; NDRIO DM Funding Under Strategic Plan. Activities: Coordination & Alignment with ARC, RS, & Network; Strategic Plan; Briefing Notes; Corp Plan Dev. Dates shown: Oct 1 2019; Jan 31 2020; April 1 2020; Sept 30 2020; Dec 2020; Mar 31 2021; 2022 – 2023; Mar 31 2024.]
Looking Forward
• Improving discovery
• Developing & deploying a national RDM training strategy
• Growing a national network of data curation support
• Supporting the evolution of national preservation services
• Addressing issues related to sensitive data
• Working with domains to address discipline-specific issues
• Promoting data management planning
• Advancing repository certification efforts in Canada
• Successfully merging DM into NDRIO
http://gojmff.org/program-areas/other-initiatives.cfm
Thanks again to our partners
CARL Portage is supported by directed funding from Innovation, Science & Economic Development Canada (ISED), flowing through NDRIO
Network of Experts