open research data
TRANSCRIPT
OPEN RESEARCH DATAJessica Gallinger
Systems & Data Services Librarian
Okanagan College
INTRODUCTION TO OPEN ACCESS
Thomas Jefferson, the third President of the United States, described knowledge in the
following way: "He who receives an idea from me, receives instruction himself without lessening
mine; as he who lights his taper at mine, receives light without darkening me." In doing so,
Jefferson anticipated the modern concept of a public good.
… A public good [depends on] non-rivalrous consumption – the consumption of one individual
does not detract from that of [another]. Knowledge of a mathematical theorem clearly satisfies
[this] attributes: if I teach you the theorem, I continue to enjoy the knowledge of the theorem at
the same time that you do.
Stiglitz (1999). Knowledge as a Global Public Good.
OUTLINE
Introduction to Open Access
What is open research data?
What are some examples of open data?
Merits of publishing research data
Finding repositories
Finding datasets
Research data management
References
Appendixes
WHAT IS OPEN RESEARCH DATA?
Format: machine-readable digital files
License: for re-use
“Rese
arc
h D
ata
”
materialsample
biological specimen
chemical
artifacts
datatabular
microdata [.sav, .txt]
aggregate [.xls]spatial [.shp, .kml]
imagelab notes [.pdf]
pictures [.jpg]
AVaudio [.wav]
video [.mp4]
text transcript [.txt]
WHAT ARE SOME EXAMPLES OF OPEN DATA?
Example: Inter-university Consortium for Political and Social Research (ICPSR)
Example: Kelowna Open Data Catalogue
Example: Abacus Dataverse Network
MERITS OF PUBLISHING DATA
May citation frequency
permits mashups
May integrity
Metadata record discoverability
searchability
contact-ability
* Data repositories accept null results
* Data repositories help you comply with Tri-Council funding requirements
Sharing Detailed Research Data Is
Associated with Increased Citation
Rate: Principal Findings
[The study] examined the citation history
of 85 cancer microarray clinical trial
publications with respect to the
availability of their data. The 48% of
trials with publicly available microarray
data received 85% of the aggregate
citations. Publicly available data was
significantly (p = 0.006) associated with
a 69% increase in citations,
independently of journal impact factor,
date of publication, and author country
of origin using linear regression.
FINDING REPOSITORIES
Types of repositories:
Un/curated
Restricted use vs on demand
Disclosure risk
Digital preservation (or not)
Registry of Research Data Repositories > re3data.org
FINDING DATASETS
DATA MANAGEMENT PLANS
DMP Assistant is a bilingual tool for preparing data management plans (DMPs). The tool follows best practices in data stewardship and walks researchers step-by-step through key questions about data management.
REFERENCES
King, Gary. (2007). An Introduction to the Dataverse Network as an Infrastructure for Data Sharing. Sociological Methods and Research 36: 173-199. copy at http://j.mp/iHJcAa
Piwowar HA, Day RS, Fridsma DB. (2007). Sharing Detailed Research Data Is Associated with Increased Citation Rate. PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308
Stiglitz, J. (1999). Knowledge as a global public good, in Kaul, I., Grunberg, I. and Stern, M. (eds.), Global Public Goods: International cooperation in the 21st century. New York: Oxford University Press, pp. 308–325.
Wicherts JM, Bakker M, Molenaar D. (2011). Willingness to Share Research Data Is Related to the Strength of the Evidence and the Quality of Reporting of Statistical Results. PLoS ONE 6(11): e26828. doi:10.1371/journal.pone.0026828
Willinsky, J. (2010). Open access and academic reputation. Annals of Library and Information Studies 57: pp. 296-302 http://goo.gl/UzTAW
APPENDIX OF RESOURCES
Highlighted repositories:
Inter-university Consortium for Political and Social Research (ICPSR)
Kelowna Open Data Catalogue
Abacus Dataverse Network
Registry of Research Data Repositories: re3data.org
DataCite Metadata Search
Portage DMP Assistant
APPENDIX: TRI-COUNCIL POLICIES
Funding Agency Open Data Policies
SSHRC : Research Data Archiving Policy
NSERC : Responsibilities and Accountability (see: Intellectual Property)
CIHR : Tri-Agency Open Access Policy: Publication-related Research Data