NISO DataCite EZID presentation
Presentation at the August 11 "Show Me the Data" NISO Webinar
Persistent Citation & Identification for Datasets:
DataCite and EZID
John Kunze, Associate Director, UC3
Joan Starr, Manager, Strategic & Project Planning, CDL
Problem statement: the rocky landscape
1. “My grant requires a data sustainability plan”
2. “I know I should be doing something more to protect my stuff, but I don’t know what”
3. “I don’t want to preserve my stuff, just store it forever”
University of California Curation Center, California Digital Library
Digital curation provides the answer.
UC3 at CDL
Web Archiving Service
Chronopolis
Media Vault Program
Curation microservices
Problem: the research trajectory
Data are collected, analysed, synthesised, interpreted
… becomes Information
… is published
… becomes Knowledge
Publication
… is accessible
… is traceable
Data
… is lost!
In other words: a gap... between published research and underlying data
A gap... between published research and underlying data
As a result, datasets are
– Difficult to discover
– Difficult to access
– Difficult to archive
– Second-class citizens in the scholarly record
Second-class citizens in the scholarly record.
Research data
Data is difficult to manage after project funding ceases
Who has it?
How do I get it?
What is its impact?
Where is it?
Journal article
Libraries keep it safe.
Many libraries and archives have it.
Many libraries and archives have it and will share it.
I can monitor its impact.
I know how to find it.
A choice
If the scientific record is at risk
– Results can’t be reproduced
– Science fails, global catastrophe ensues
Better data publishing, sharing, and archiving
OR
Planetary destruction?
Roberto Rizzato
DataCite members
• Technische Informationsbibliothek (TIB), Germany
• Australian National Data Service (ANDS)
• The British Library
• California Digital Library, USA
• Canada Institute for Scientific and Technical Information (CISTI)
• L’Institut de l’Information Scientifique et Technique (INIST), France
• Library of the ETH Zürich
• Library of TU Delft, The Netherlands
• Purdue University, USA
• Technical Information Center of Denmark
Before DataCite...
Publishers Data centres
With DataCite…
Publishers Data centres
DataCite structure
[Figure: DataCite organizational diagram. Box labels: International DOI Foundation; DataCite; Member; Managing Agent (TIB); Member Institution (shown twice, followed by ". . ."); Data Centre; Data Center, Library, Publisher; Associate Stakeholder, e.g., Library; Data Researcher or Producer. Relationship labels: "Carries", "Works with".]
DataCite example
[Figure: workflow diagram]
Participants:
– CDL
– Research scientist
– DataONE Member Node data archive (e.g., Dryad)
– DataONE Coordinating Node metadata catalog (e.g., UNM or UCSB)
– (opt) CDL-hosted EZID id minting service
– EZID resolver and registration service
– DOI resolver and TIB registration
Labeled flows:
1. data + metadata
2. metadata + URL + id
3. citation + URL + id
4. save full citation
5. URL plus id
6. full citation
7. full citation
Unnumbered flows: get unique id string (shown twice)
EZID
One stop shop for DataCite DOIs & more
• California Digital Library is a trusted service provider
• EZID creates ids, stores metadata and resolver target URLs.
• EZID supports DataCite DOIs and lower-cost ids (ARKs, URLs)
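The three roles named above (create ids, store metadata, store resolver target URLs) can be pictured with a small in-memory model. This is only a sketch under stated assumptions: the class `ToyIdRegistry`, its method names, the prefix string, and the example URL are all hypothetical and do not represent EZID's actual API.

```python
from dataclasses import dataclass, field
from itertools import count


@dataclass
class ToyIdRegistry:
    """Hypothetical in-memory stand-in for an id service (not EZID's API)."""
    prefix: str
    _records: dict = field(default_factory=dict)
    _counter: count = field(default_factory=lambda: count(1))

    def mint(self, target_url: str, metadata: dict) -> str:
        """Create a new id, then store its metadata and resolver target URL."""
        identifier = f"{self.prefix}{next(self._counter):06d}"
        self._records[identifier] = {
            "target": target_url,
            "metadata": dict(metadata),
        }
        return identifier

    def resolve(self, identifier: str) -> str:
        """Return the target URL registered for an id."""
        return self._records[identifier]["target"]

    def describe(self, identifier: str) -> dict:
        """Return a copy of the metadata stored for an id."""
        return dict(self._records[identifier]["metadata"])


# Placeholder prefix and URL, chosen for illustration only.
registry = ToyIdRegistry(prefix="ark:/99999/demo")
new_id = registry.mint("http://example.org/dataset/1", {"title": "Example dataset"})
```

Keeping the target URL separate from the id is what lets a dataset move without its citation changing: only the stored target needs updating.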
How it could look: eScholarship and DataCite
Supplementary Data
Reichl, R., Waldinger, R., et al. (2006)
Table A: Survey of Attitudes and…
Table B: Latinos in LA Basin…
Linking data to article
Dataset
G. Yancheva, N. R. Nowaczyk et al (2007) Rock magnetism and X-ray flourescence spectrometry analyses on sediment cores of the Lake Huguang Maar, Southeast China, PANGAEA doi:10.1594/PANGAEA.587840
Article
G. Yancheva, N. R. Nowaczyk et al (2007) Influence of the intertropical convergence zone on the East Asian monsoon, Nature 445, 74-77, doi:10.1038/nature05431
Cites
Cites back to article
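The two-way link on this slide can be sketched as a pair of records that each carry a DOI and list the DOI of the other. The `CitableItem` class and its field names are hypothetical, introduced only for illustration; the citation layout simply mirrors the strings shown on the slide, and the titles are copied from it as written.

```python
from dataclasses import dataclass, field


@dataclass
class CitableItem:
    """Hypothetical record for a citable dataset or article."""
    authors: str
    year: int
    title: str
    source: str  # data publisher, or journal reference
    doi: str     # bare DOI, without the "doi:" prefix
    cites: list = field(default_factory=list)  # DOIs of linked items

    def citation(self) -> str:
        """Format a citation in the same layout as the slide."""
        return (f"{self.authors} ({self.year}) {self.title}, "
                f"{self.source} doi:{self.doi}")


dataset = CitableItem(
    authors="G. Yancheva, N. R. Nowaczyk et al",
    year=2007,
    title=("Rock magnetism and X-ray flourescence spectrometry analyses "
           "on sediment cores of the Lake Huguang Maar, Southeast China"),
    source="PANGAEA",
    doi="10.1594/PANGAEA.587840",
)
article = CitableItem(
    authors="G. Yancheva, N. R. Nowaczyk et al",
    year=2007,
    title=("Influence of the intertropical convergence zone "
           "on the East Asian monsoon"),
    source="Nature 445, 74-77",
    doi="10.1038/nature05431",
)

# Record the link in both directions by storing each other's DOI.
article.cites.append(dataset.doi)
dataset.cites.append(article.doi)

print(dataset.citation())
```

Storing DOIs rather than object references keeps each record self-contained, so either side of the link can be published or archived on its own.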
How it could look
ark:/a50600/rb2468097
doi:10.5060/rb2468097
http://n2t.net/a5060/rb2468097
Links to data
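An identifier string becomes a clickable link once it is handed to a resolver. The sketch below maps the `doi:` and `ark:` forms shown on the slide to resolver URLs. The function name `resolver_url` is hypothetical, and the URL patterns are assumptions based on the public `doi.org` and `n2t.net` resolvers as they operate today, so the ARK result differs in shape from the `n2t.net` URL printed on the slide.

```python
def resolver_url(identifier: str) -> str:
    """Map a doi: or ark: identifier to a resolver URL (illustrative only)."""
    scheme, sep, rest = identifier.partition(":")
    if not sep or not rest:
        raise ValueError(f"not a scheme-prefixed identifier: {identifier!r}")
    scheme = scheme.lower()
    if scheme == "doi":
        # Assumed pattern: the DOI proxy takes the bare DOI as the path.
        return f"https://doi.org/{rest}"
    if scheme == "ark":
        # Assumed pattern: N2T takes the full ARK, including the "ark:" label.
        return f"https://n2t.net/ark:{rest}"
    raise ValueError(f"unsupported identifier scheme: {scheme!r}")


# Identifier strings copied from the slide as written.
for example in ("doi:10.5060/rb2468097", "ark:/a50600/rb2468097"):
    print(example, "->", resolver_url(example))
```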
Bridging the data gap
• DataCite/EZID empowers researchers
• DataCite/EZID supports data centers
• DataCite/EZID extends libraries
• DataCite/EZID enables publishers
http://datacite.org
EZID
Now in production, V. 1.0, API
• Limited release.
• First customer is DataONE member, Dryad
Upcoming milestones
EZID
September 2010: V2.0 UI
Expanded release base to include UC partners +
DataCite Metadata Standard
Community review to begin August 2010
Thank you!