emerging standards: data and data exchange in scholarly publishing

19
Emerging Standards: Data and Data Exchange in Scholarly Publishing Jay Henry Chief Marketing Officer

Upload: ringgold-inc

Post on 04-Aug-2015

92 views

Category:

Data & Analytics


0 download

TRANSCRIPT

Page 1: Emerging Standards: Data and Data Exchange in Scholarly Publishing

Emerging Standards: Data and Data Exchange in Scholarly Publishing

Jay HenryChief Marketing Officer

Page 2: Emerging Standards: Data and Data Exchange in Scholarly Publishing

We spend most of our time here.On the ground

Page 3: Emerging Standards: Data and Data Exchange in Scholarly Publishing

DOI

ISSN

Author ORCID

Author Affiliations(ISNIs or RING IDs)

Title

Year Published

Subjects

CirculationData

Abstract

Page 4: Emerging Standards: Data and Data Exchange in Scholarly Publishing

Okay, I have a better idea of what’s going on in my own neck of the woods

Page 5: Emerging Standards: Data and Data Exchange in Scholarly Publishing

DataInformationKnowledge

Page 6: Emerging Standards: Data and Data Exchange in Scholarly Publishing

Standard Identifiers contribute to interoperability*

*This means data that can be linked together through unambiguous identification and exchanged with others

GovernedTrustedTransparentAnd contain appropriate metadata

In order to be effective, identifiers must be:

Page 7: Emerging Standards: Data and Data Exchange in Scholarly Publishing

What are standard identifiers?Persistent numeric or alpha-numeric designations associated with a single entity

Entities can be an institution, person, or piece of content (People, Places, & Things)

What do they do?1. Disambiguate, aka enforce uniqueness

2. Enable linking, aka data integration and interoperability

In other words, they provide a simple basis for data governance

Page 8: Emerging Standards: Data and Data Exchange in Scholarly Publishing

Standard identifiers are the cornerstone of linking data among internal and external systems◦ Break down silos ◦ Keep data current and

synchronised◦ Enable staff to interact

with data more effectively

◦ Simplify data exchange◦ Improve overall data

quality

Institutional

Identifiers

Page 9: Emerging Standards: Data and Data Exchange in Scholarly Publishing

Impact of Identifiers on workflows

• Resources & personnel required to join existing records to IDs or an authority file

• Build customized solutions mapping systems together

• Improve data capture to require an ID upon record creation

• Manual vs programmatic cost-benefit questions• Design new reporting and analysis tools to

leverage newly linked datasets

Page 10: Emerging Standards: Data and Data Exchange in Scholarly Publishing

Impact on StakeholdersResearchers – create Current Research Information Systems (CRIS) – one portal to figure out how to best conduct research, who to work with, who will fund it, what else has been contributed to the subject thus far, where is the best equipment to help further the research.

Funders – Want to track areas of interest, identify worthwhile pursuits, and see where their money goes.

Institutions – Demonstrate research output more accurately and precisely describe the institution’s contribution and who is affiliated with that work.

Publishers – Facilitate transactions of all types from content discovery to delivery of author royalties. Improved market analysis and targeted advertising.

Page 11: Emerging Standards: Data and Data Exchange in Scholarly Publishing

ISO Standard 27729 ISNI is designed to be a

“bridge identifier” Covers any type of entity

ISNI Number ISNI Number

Party ID 2Party ID 1

Proprietary Information and/or

Metadata

Proprietary Information and/or

Metadata

International StandardNameIdentifier

Page 12: Emerging Standards: Data and Data Exchange in Scholarly Publishing

ISNIIn cooperation with ProQuest, OCLC, and other public and commercial entities, Ringgold has been working to map ISNIs to deeper datasets for the past two years.

It’s taken time due to the problems with the raw source data, and the policies for assignment of the unique ISNI identifier.

Page 13: Emerging Standards: Data and Data Exchange in Scholarly Publishing

Ringgold: an ISNI Registration Agency

At the same time ISNI records are loaded to the Ringgold Identify Database we will being issuing ISNIs for institutions. ProQuest (Bowker) is a Registration Agency as well, focusing on individuals.

Page 14: Emerging Standards: Data and Data Exchange in Scholarly Publishing

RINGGO

LD

BOWKERISNI

(OCLC tech)

Third Parties

MEMEBERS

MEMEBERS

MEMEBERS

MEMEBERS

MEMEBERS

MEMEBERS

Public Data Proprietary Databases

Members submit data to RAGs:a. auto-match b. audit matchc. RAG assigns new ISNIs d. RAGs synch w/ ISNIe. ISNI used as bridge via

Public Data

Members can access “full” ISNI information but cannot provide or assign numbers to 3rd parties-- ISNI data can be used w/in internal systems (e.g. library may assign ISNIs to all individuals and departments within their institution

ISNI – RAGs & Members

Page 15: Emerging Standards: Data and Data Exchange in Scholarly Publishing

ISNI Record (Individual)

Page 16: Emerging Standards: Data and Data Exchange in Scholarly Publishing

ISNI – Institutional Record

Page 17: Emerging Standards: Data and Data Exchange in Scholarly Publishing

An issue with identification

It was a desire to “help” authors differentiate and disambiguate themselves that got ISNI started.

Along the way, a lot has been learned. A specific example, that often doesn’t get a lot of attention, is the need for privacy protection whenever there is an Identification process underway… this holds true for individuals and institutions.

Our industry spends a great deal of time discussing “open data”, but there are many times when that data should not (or cannot) be made public (physicist romance author, animal tester, military applications, etc….)

Page 18: Emerging Standards: Data and Data Exchange in Scholarly Publishing

The future:

The Semantic Web cannot exist without well structured data

Things take on a life of their own

VastnessVaguenessUncertaintyInconsistencyDeceit

The challenges to creating a world of content tagged with meaning:

Standard Identifiers can help with the middle three – Artificial Intelligence

will handle Vastness and Deceit

Page 19: Emerging Standards: Data and Data Exchange in Scholarly Publishing

Thank youJay Henry

Chief Marketing Officer

[email protected]

www.ringgold.com