A story of preprints and curation networks: efficiently scaling community outreach using public goods infrastructure Jeffrey Spies, Co-founder and CTO, Center for Open Science Philip Cohen, Professor of Sociology, University of Maryland Claire Stewart, Associate University Librarian for Research and Learning, University of Minnesota Cynthia Hudson-Vitale, Data Services Coordinator, Washington University in St. Louis


Post on 09-Jun-2020


TRANSCRIPT

Page 1: A story of preprints and curation networks: efficiently scaling community outreach ... › wp-content › uploads › 2017 › 01 › CNI_Story... · 2017-05-30 · A story of preprints

A story of preprints and curation networks: efficiently scaling community outreach using public goods infrastructure

Jeffrey Spies, Co-founder and CTO, Center for Open Science
Philip Cohen, Professor of Sociology, University of Maryland
Claire Stewart, Associate University Librarian for Research and Learning, University of Minnesota
Cynthia Hudson-Vitale, Data Services Coordinator, Washington University in St. Louis

Page 3

SHARE is a free, open dataset of research activity across the research workflow supported by free, open source tools.

The OSF is a free, open source workflow management, integration, and sharing platform.

Page 4

Public good

● A commodity or service that is provided without profit to all members of a society, either by the government or a private individual or organization. (Oxford)

● A good that is both non-excludable and non-rivalrous in that individuals cannot be effectively excluded from use and where use by one individual does not reduce availability to others. (Wikipedia)

Page 5

Openness fosters inclusivity, collaboration, and innovation.

Page 6

Scaling

3^0 = 1

3^1 = 3

3^2 = 9

Page 7

3^7 = 2187
3^6 = 729
3^5 = 243
3^4 = 81
3^2 = 9
3^1 = 3
3^0 = 1
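One way to read these powers of three, assuming (as an illustration only, not something the slide states) that every engaged person or institution goes on to engage three more: the number newly reached at step n is 3^n. A minimal sketch, where `reach_per_step` and `branching_factor` are hypothetical names:

```python
def reach_per_step(branching_factor: int, steps: int) -> list[int]:
    """Return how many participants are newly reached at each step.

    Assumes every participant engages `branching_factor` new participants,
    so step n reaches branching_factor ** n participants.
    """
    return [branching_factor ** n for n in range(steps + 1)]


if __name__ == "__main__":
    newly_reached = reach_per_step(branching_factor=3, steps=7)
    for step, count in enumerate(newly_reached):
        print(f"3^{step} = {count}")
    # Cumulative reach is the sum of all steps so far.
    print("total reached:", sum(newly_reached))
```

Under this assumption, seven rounds of outreach reach thousands of participants even though no single participant engages more than three others.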

Page 8

How to achieve efficient scaling

● Engage broadly (and allow others to engage broadly)
● Facilitate experts being experts

○ Especially as force multipliers--make others more efficient

● Facilitate (unknown) innovations

By

● Respecting current incentives and current workflows
● Allowing people to be selfish
● Creating virtuous cycles
● Reusing/repurposing modular, open infrastructure
● Crediting

Page 9

OSF

Page 10
Page 11

Research workflow (diagram): Search / Discovery, Develop Idea, Design Study, Collect Data, Store Data, Analyze Data, Write Report, Publish Report

Page 12

http://osf.io

Page 13
Page 14
Page 15

Let experts be experts.

Page 16

Research workflow (diagram): Search / Discovery, Develop Idea, Design Study, Collect Data, Store Data, Analyze Data, Write Report, Publish Report

OSF can integrate rather than append expertise to the research workflow.

Page 17

Research workflow (diagram): Search / Discovery, Develop Idea, Design Study, Collect Data, Store Data, Analyze Data, Write Report, Publish Report

Preservation

Page 18
Page 19

Research workflow (diagram): Search / Discovery, Develop Idea, Design Study, Collect Data, Store Data, Analyze Data, Write Report, Publish Report

OSF can integrate rather than append expertise to the research workflow.

SHARE can engage local experts to curate descriptions of the research workflow.

Page 20

Diagram labels: Providers, Gather, API, Consumers

Page 21

SHARE

● Give this increased audience curation tools and APIs
● Give them incentives via the virtuous cycle of open

○ Make it in the consumer’s best interest to contribute
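The flow from providers through a gather step to an API that consumers query can be sketched in a few lines. This is a toy, in-memory illustration and not SHARE's actual code or API: the `Record` fields and the `gather` and `search` functions are hypothetical names, and the sample records are made up.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """A hypothetical, simplified description of a research output."""
    title: str
    provider: str


def gather(providers: dict[str, list[str]]) -> list[Record]:
    """Collect titles from every provider into one normalized dataset.

    Titles are stripped and de-duplicated case-insensitively; the first
    provider to report a title is kept as its source.
    """
    seen: set[str] = set()
    dataset: list[Record] = []
    for provider, titles in providers.items():
        for raw_title in titles:
            title = raw_title.strip()
            key = title.lower()
            if key and key not in seen:
                seen.add(key)
                dataset.append(Record(title=title, provider=provider))
    return dataset


def search(dataset: list[Record], term: str) -> list[Record]:
    """A stand-in for the API step: consumers query the gathered dataset."""
    needle = term.lower()
    return [record for record in dataset if needle in record.title.lower()]


if __name__ == "__main__":
    # Made-up sample input, for illustration only.
    providers = {
        "repository_a": ["Open Data Practices ", "Preprints in Sociology"],
        "repository_b": ["preprints in sociology", "Curation Workflows"],
    }
    dataset = gather(providers)
    for record in search(dataset, "preprints"):
        print(record.provider, "-", record.title)
```

In this sketch a consumer who cleans up a record improves the shared dataset for everyone else, which is one way to picture the virtuous cycle the slide describes.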

Page 22

OSF Application Framework

• Workflow
• Authentication
• Permissions
• File Storage
• File Rendering
• Meta-database
• Persistence
• Integrations
• Search
• SHARE

osf.io

osf.io/preprints

osf.io/registries

journals

grants management

university systems

Page 23

Modularity and abstraction support scaling.
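As a rough illustration of this point, the sketch below has two front ends share one storage module instead of each reimplementing it. It is not the OSF codebase; `Storage`, `InMemoryStorage`, and `Service` are hypothetical names chosen to echo the framework list (file storage, services built on top).

```python
from typing import Protocol


class Storage(Protocol):
    """Abstract file-storage module that any front end can reuse."""

    def save(self, name: str, content: bytes) -> None: ...

    def load(self, name: str) -> bytes: ...


class InMemoryStorage:
    """One interchangeable implementation of the Storage interface."""

    def __init__(self) -> None:
        self._files: dict[str, bytes] = {}

    def save(self, name: str, content: bytes) -> None:
        self._files[name] = content

    def load(self, name: str) -> bytes:
        return self._files[name]


class Service:
    """A front end (e.g. preprints or registries) built on shared modules."""

    def __init__(self, label: str, storage: Storage) -> None:
        self.label = label
        self.storage = storage

    def submit(self, name: str, content: bytes) -> str:
        # Namespacing by label keeps each service's files separate
        # while the storage module itself is written only once.
        key = f"{self.label}/{name}"
        self.storage.save(key, content)
        return key


if __name__ == "__main__":
    shared = InMemoryStorage()
    preprints = Service("preprints", shared)
    registries = Service("registries", shared)
    print(preprints.submit("paper.pdf", b"draft"))
    print(registries.submit("plan.txt", b"registration"))
```

Adding a third service in this sketch costs one line, because the storage abstraction is already in place; that is the sense in which modular infrastructure scales.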

Page 24

OSF Application Framework

• Workflow
• Authentication
• Permissions
• File Storage
• File Rendering
• Meta-database
• Persistence
• Integrations
• Search
• SHARE

osf.io/preprints

Page 25
Page 26
Page 27

http://osf.io/preprints

Page 28
Page 29

OSF Application Framework

• Workflow
• Authentication
• Permissions
• File Storage
• File Rendering
• Meta-database
• Persistence
• Integrations
• Search
• SHARE

osf.io/preprints

Page 30

OSF Application Framework

• Workflow
• Authentication
• Permissions
• File Storage
• File Rendering
• Meta-database
• Persistence
• Integrations
• Search
• SHARE

osf.io

osf.io/preprints

osf.io/registries

journals

grants management

university systems

Page 31
Page 32
Page 33
Page 34
Page 35

How to achieve efficient scaling

● Engage broadly (and allow others to engage broadly)
● Facilitate experts being experts

○ Especially as force multipliers--make others more efficient

● Facilitate (unknown) innovations

By

● Respecting current incentives and current workflows
● Allowing people to be selfish
● Creating virtuous cycles
● Reusing/repurposing modular, open infrastructure
● Crediting

Page 37

▪ Open archive of the social sciences

Free, open-source, open-access
Soft launch July
New interface went up last week

▪ Created by sociologists and librarians

Administered at U. of Maryland

▪ Partners: Center for Open Science

Powered by SHARE
On the Open Science Framework

SocArxiv.org ● @socarxiv ● [email protected] ● Facebook.com/SocArXiv

Philip N. Cohen
U. of [email protected]
@familyunequal

Page 38


Page 39


Pitch to hesitant social scientists: Reach

Page 40

Pitch to hesitant social scientists: Time


Page 41

> Working paper – when it’s ready to share
Yes, most journals will still let you submit it later

> Preprint – when it’s ready to publish
Yes, most journals permit pre-publication posting

> Post-print – when it’s behind a paywall
Yes, most journals permit post-publication posting


Page 42

On the OSF | Preprints server

> Versions
Update your paper as it evolves
Persistent URL, citation, and optional DOI

> Analytics, social media sharing, linked IDs

> Add optional data and code
Public settings, collaboration

> Down the road
Overlay journals
Post-publication review


Page 43

Get involved!

Post papers

Spread the word

Volunteer

Raise money / contribute


Page 44

SHARE Curation Associates

Cynthia Hudson-Vitale
Washington University in St. Louis

@cynhudson

Develop digital curation and computational thinking skills to enhance local institutional repositories in a service-learning setting

Page 45

Jennifer Phillips, National Center for Atmospheric Research (NCAR)
Jonathan Cain, University of Oregon
Theresa Polk, University of Texas at Austin
Dana Chandler, Tuskegee University
Fred Reiss, University of Oklahoma
Zach Coble, New York University
Wendy Robertson, University of Iowa
Shane Coleman, Virginia Tech
Mark Shelstad, Colorado State University
Deborah Cornell, College of William and Mary
Iyanna Sims, North Carolina A&T State University
Amanda Gooch, The George Washington University

Ashley Adair, University of Texas at Austin
Brianna Marshall, University of Wisconsin
Mary Alexander, University of Alabama
Kim Mears, Augusta University
Talea Anderson, Washington State University
Jeremy Myntti, University of Utah
Elizabeth Bedford, University of Washington
Lisa Palmer, University of Massachusetts Medical School
Lisa Stienbarger, University of Notre Dame
Joanne Paterson, Western University
Cunera Buys, Northwestern University
Julie Hardesty, Indiana University

Vicky Steeves, New York University
Matthew Harp, Arizona State University
Emily Stenberg, Washington University in St. Louis
Steven Holloway, James Madison University
Nicole Sump-Crethar, Oklahoma State University
Salwa Ismail, Georgetown University
Kelly Thompson, University of Minnesota
Sherry Lake, University of Virginia
Kathleen Luschek, University of Hawai’i at Mānoa
Dainan Skeem, Brigham Young University

Page 46
Page 47

Local curation enhancements and projects that provide benefits locally

Page 48

Curation Track

Page 49

First 6 months:

● Metadata review
● Gap analysis
● Digital preservation review
● Draft 3-3-3 plan

Upcoming:

● Implement 3-3-3 plan

Local enhancement activities

Page 50

Project Track

Page 51

Populating an OA IR using the SHARE data set

Members: Zach Coble, NYU; Sherry Lake, UVA; Joanne Paterson, Western University

https://osf.io/c3veb/

Page 52

Graduate Student Profiles

Members: Cunera Buys, Northwestern University; Brianna Marshall, University of Wisconsin-Madison

https://osf.io/w9dm4/

Page 53

Research Data Searching

Members: Talea Anderson, Washington State University; Elizabeth Bedford, University of Washington; Sherry Lake, UVA; Kelly Thompson, University of Minnesota

https://osf.io/hy3cq/

Page 54

ORCID

Members: Jonathan Cain, University of Oregon; Steven Holloway, JMU; Salwa Ismail, Georgetown University; Victoria Steeves, NYU

Page 55

Data Curation Network

Planning a network of expertise model for curating research data in academic libraries

The Data Curation Network project is supported by a grant from the ALFRED P. SLOAN FOUNDATION.

2016-2017

Page 56

Rise of a data sharing culture
Researchers are increasingly required/incentivised to share data
● Funder data sharing mandates
● Journal data sharing policies
● Disciplinary practices → emphasis on transparency and reproducibility

Data repositories: it’s not enough to just keep the files!

Goal of data curation ⇒ Prepare and maintain research data in ways that make it findable, accessible, interoperable and reusable (FAIR).

Data curation = metadata, documentation, access, preservation, and more...


Page 57

Data curation activities


● Code review
● Contextualize
● Documentation
● Embargo
● File Format Transformations
● Persistent Identifier
● Quality Assurance
● Use Analytics
● Versioning
● Data Citation
● Deidentification
● File Audit
● File Inventory or Manifest
● File validation
● Metadata
● Metadata Brokerage
● Rights Management
● Risk Management
● Terms of Use
● Peer-review
● Technology Monitoring and Refresh
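Several of the listed activities lend themselves to small scripts. As one hedged example, a file inventory or manifest with checksums, plus a simple file audit against an earlier manifest, might look like the sketch below. The function names `build_manifest` and `changed_files`, the choice of SHA-256, and the manifest fields are assumptions for illustration, not a Data Curation Network specification.

```python
import hashlib
from pathlib import Path


def build_manifest(root: Path) -> list[dict[str, object]]:
    """List every file under `root` with its size and SHA-256 checksum."""
    manifest = []
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        # read_bytes() loads the whole file; fine for a sketch,
        # but large datasets would need chunked hashing.
        digest = hashlib.sha256(path.read_bytes()).hexdigest()
        manifest.append(
            {
                "path": path.relative_to(root).as_posix(),
                "bytes": path.stat().st_size,
                "sha256": digest,
            }
        )
    return manifest


def changed_files(
    old: list[dict[str, object]], new: list[dict[str, object]]
) -> list[str]:
    """Return paths in `new` that are absent from `old` or have a different checksum."""
    old_sums = {entry["path"]: entry["sha256"] for entry in old}
    return [
        str(entry["path"])
        for entry in new
        if old_sums.get(entry["path"]) != entry["sha256"]
    ]


if __name__ == "__main__":
    for entry in build_manifest(Path(".")):
        print(entry["sha256"], entry["bytes"], entry["path"])
```

Storing the manifest alongside a deposit gives a curator a baseline for later audits: rerunning `build_manifest` and comparing with `changed_files` flags files that were added or altered.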

Page 58

Challenge for institutional data curation services

How to scale data curation services across all disciplines?

Multiple data curation experts are needed to effectively curate the diverse data types an institution typically generates.

Data curation expertise needed:
- File format -- GIS, spreadsheet/tabular, statistical/survey, software code, video/audio, images/3D, simulations...
- Discipline-specific -- genomic sequence, chemical spectra, biological image...
- Frequency -- Centers of excellence, departmental concentration


Page 59

Data Curation Network


The Data Curation Network will enable academic institutions to better support researchers who are faced with a growing number of requirements to ethically share their research data.

http://z.umn.edu/datacuration

Page 60

Kirchner, Joy, Jose Diaz, Geneva Henry, Susan Fliss, John Culshaw, Heather Gendron, and Jon E. Cawthorne. “The Center of Excellence Model for Information Services.” Council on Library and Information Resources (CLIR), February 2015.

Page 61

Our Vision for the Next 3-5 Years


1. Develop standards-driven data curation techniques for all types of repository workflows and infrastructure.

2. Expand into a sustainable entity that grows beyond our initial six partner institutions.

3. Datasets curated by the Data Curation Network will be used to advance research and education in ways that are measurably of greater reuse value than non-curated data.

4. Build an innovative community that enriches capacities for data curation writ large.

Page 62

Data Curation Network Partners


Page 63

Planning the Data Curation Network


(Current) Planning phase, supported by the Alfred P. Sloan Foundation, to:
● Develop a Data Curation Network ‘model of expertise’ for data curation staff that includes the projected staffing, costs, skill sets, and demand necessary for implementation.

(Future) Pilot phase will:
● Test the model across our six institutions
● Plan for how to grow and sustain the Network

Page 64


Draft Model for the Data Curation Network

Page 65

Our Planning Phase activity to date

✓ Summer → Assessed infrastructure/policy/workflow differences and monitored the demand across institutions. Baseline report.

● Just completed Oct/Nov 2016 → Sought input from researchers to better understand how data curation services fit into their research workflow (focus groups).

● Jan 2017 → ARL Spec Kit survey on library data curation activities.

● Spring 2017 → Develop financial/governance models. Share our draft Data Curation Network model with stakeholders for feedback.


Page 66

Researcher Engagements


Page 67

Results: Researcher Engagements


Goal: Identify the value/importance placed on 40+ data curation activities in order to identify gaps in important curation activities that are either not happening or not happening well.

Completed engagements, analysis underway (~90 participants at 6 institutions)

Page 68

Results: Repository Curation Workflows


Page 69

Results: Repository Technologies


Page 70

Results: Repository Policies


Page 71

Thanks!

Web: https://sites.google.com/site/DataCurationNetwork

Twitter #DataCurationNetwork

Claire Stewart
University of Minnesota Libraries

[email protected]
