emerging standards: data and data exchange in scholarly publishing

Post on 04-Aug-2015

92 Views

Category:

Data & Analytics

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Emerging Standards: Data and Data Exchange in Scholarly Publishing

Jay HenryChief Marketing Officer

We spend most of our time here.On the ground

DOI

ISSN

Author ORCID

Author Affiliations(ISNIs or RING IDs)

Title

Year Published

Subjects

CirculationData

Abstract

Okay, I have a better idea of what’s going on in my own neck of the woods

DataInformationKnowledge

Standard Identifiers contribute to interoperability*

*This means data that can be linked together through unambiguous identification and exchanged with others

GovernedTrustedTransparentAnd contain appropriate metadata

In order to be effective, identifiers must be:

What are standard identifiers?Persistent numeric or alpha-numeric designations associated with a single entity

Entities can be an institution, person, or piece of content (People, Places, & Things)

What do they do?1. Disambiguate, aka enforce uniqueness

2. Enable linking, aka data integration and interoperability

In other words, they provide a simple basis for data governance

Standard identifiers are the cornerstone of linking data among internal and external systems◦ Break down silos ◦ Keep data current and

synchronised◦ Enable staff to interact

with data more effectively

◦ Simplify data exchange◦ Improve overall data

quality

Institutional

Identifiers

Impact of Identifiers on workflows

• Resources & personnel required to join existing records to IDs or an authority file

• Build customized solutions mapping systems together

• Improve data capture to require an ID upon record creation

• Manual vs programmatic cost-benefit questions• Design new reporting and analysis tools to

leverage newly linked datasets

Impact on StakeholdersResearchers – create Current Research Information Systems (CRIS) – one portal to figure out how to best conduct research, who to work with, who will fund it, what else has been contributed to the subject thus far, where is the best equipment to help further the research.

Funders – Want to track areas of interest, identify worthwhile pursuits, and see where their money goes.

Institutions – Demonstrate research output more accurately and precisely describe the institution’s contribution and who is affiliated with that work.

Publishers – Facilitate transactions of all types from content discovery to delivery of author royalties. Improved market analysis and targeted advertising.

ISO Standard 27729 ISNI is designed to be a

“bridge identifier” Covers any type of entity

ISNI Number ISNI Number

Party ID 2Party ID 1

Proprietary Information and/or

Metadata

Proprietary Information and/or

Metadata

International StandardNameIdentifier

ISNIIn cooperation with ProQuest, OCLC, and other public and commercial entities, Ringgold has been working to map ISNIs to deeper datasets for the past two years.

It’s taken time due to the problems with the raw source data, and the policies for assignment of the unique ISNI identifier.

Ringgold: an ISNI Registration Agency

At the same time ISNI records are loaded to the Ringgold Identify Database we will being issuing ISNIs for institutions. ProQuest (Bowker) is a Registration Agency as well, focusing on individuals.

RINGGO

LD

BOWKERISNI

(OCLC tech)

Third Parties

MEMEBERS

MEMEBERS

MEMEBERS

MEMEBERS

MEMEBERS

MEMEBERS

Public Data Proprietary Databases

Members submit data to RAGs:a. auto-match b. audit matchc. RAG assigns new ISNIs d. RAGs synch w/ ISNIe. ISNI used as bridge via

Public Data

Members can access “full” ISNI information but cannot provide or assign numbers to 3rd parties-- ISNI data can be used w/in internal systems (e.g. library may assign ISNIs to all individuals and departments within their institution

ISNI – RAGs & Members

ISNI Record (Individual)

ISNI – Institutional Record

An issue with identification

It was a desire to “help” authors differentiate and disambiguate themselves that got ISNI started.

Along the way, a lot has been learned. A specific example, that often doesn’t get a lot of attention, is the need for privacy protection whenever there is an Identification process underway… this holds true for individuals and institutions.

Our industry spends a great deal of time discussing “open data”, but there are many times when that data should not (or cannot) be made public (physicist romance author, animal tester, military applications, etc….)

The future:

The Semantic Web cannot exist without well structured data

Things take on a life of their own

VastnessVaguenessUncertaintyInconsistencyDeceit

The challenges to creating a world of content tagged with meaning:

Standard Identifiers can help with the middle three – Artificial Intelligence

will handle Vastness and Deceit

Thank youJay Henry

Chief Marketing Officer

jay.henry@ringgold.com

www.ringgold.com

top related