Creating a Data Interchange Standard for Researchers, Research, and Research Resources: VIVO-ISF Dean B. Krafft Brian Lowe Coalition for Networked Information 10 December 2013


Page 1: Creating a Data Interchange Standard for Researchers, Research, and Research Resources: VIVO-ISF Dean B. Krafft Brian Lowe Coalition for Networked Information

Creating a Data Interchange Standard for Researchers, Research, and Research Resources:

VIVO-ISF

Dean B. Krafft, Brian Lowe

Coalition for Networked Information, 10 December 2013

Page 2:

What is VIVO?

• Software: An open-source, semantic-web-based researcher and research discovery tool

• Data: Institution-wide, publicly visible information about research and researchers

• Standards: A standard ontology (VIVO data) that interconnects researchers, communities, and campuses using Linked Open Data

• Community: An open community with strong national and international participation

Page 3:

VIVO Normalizes Complex Inputs

Figure: diagram of VIVO input sources. Labels: People; Grants; Data; Google Scholar; Center/Dept/Program websites; Research Facilities & Services; Courses; Tech transfer; Publications; VP Research; Univ. Communications; HPC; HR data; Faculty Reporting; Grad School; Pubmed; CrossRef; Researcher.gov; arXiv; other databases; NIH RePorter; Self-editing; Other campuses

Page 4:

VIVO connects scientists and scholars with and through their research and scholarship

Page 6:

SKE Knowledge Environment

http://ske.las.ac.cn/

Page 9:

Customization

Page 12:

The VIVO Community is now over 100 institutions worldwide

Page 13:

Why is VIVO important?

• It is the only standard way to exchange information about research and researchers across diverse institutions

• It provides authoritative data from institutional databases of record as Linked Open Data

• Structured VIVO data supports search, analysis and visualization across institutions and consortia

• It is highly flexible and extensible to cover research resources, facilities, datasets, and more

Page 14:

An HTTP request can return HTML or data
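
Serving either a web page or data from one address is commonly done with HTTP content negotiation on the Accept header. The sketch below illustrates that general technique only; `choose_representation`, the list of supported media types, and the fallback to HTML are assumptions made for this example, not a description of VIVO's own code.

```python
# Minimal sketch of Accept-header content negotiation (illustrative only;
# not VIVO's actual code). The supported types below are assumptions.
SUPPORTED = ["text/html", "text/turtle", "application/rdf+xml"]


def parse_accept(header):
    """Return (media_type, q) pairs from an Accept header, highest q first."""
    prefs = []
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type = fields[0].lower()
        if not media_type:
            continue
        q = 1.0
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0
        prefs.append((media_type, q))
    # sorted() is stable, so entries with equal q keep the client's order.
    return sorted(prefs, key=lambda p: p[1], reverse=True)


def choose_representation(accept_header):
    """Pick the first supported type the client accepts; default to HTML."""
    for media_type, q in parse_accept(accept_header):
        if q <= 0:
            continue
        if media_type in SUPPORTED:
            return media_type
        if media_type == "*/*":
            return SUPPORTED[0]
    # Assumption: fall back to HTML instead of answering 406 Not Acceptable.
    return "text/html"
```

A browser sending `text/html` would get the page, while a harvester sending `text/turtle` or `application/rdf+xml` would get RDF for the same resource.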

Page 15:

Value for institutions and consortia

• Common data substrate
– Public, granular and direct
– Discovery via external and internal search engines
– Available for reuse at many levels

• Distributed curation
– E.g., affiliations beyond what HR system tracks
– Data coordination across functional silos
– Feeding changes back to systems of record
– Direct linking across campuses

• Data that is visible gets fixed

Page 16:

Example: U.S. Dept. of Agriculture

• Multiple agencies including Agricultural Research Service and U.S. Forest Service

• VIVO portal for 45,000 intramural researchers

• Goal to link to Land Grant universities and international agricultural research centers

• Using VIVO as an integration tool to send data for federal STAR METRICS/SciENCV projects

• RDF exposed via a SPARQL endpoint constitutes compliance

Page 17:

VIVO Exploration and Analytics

• Since VIVO is structured data, it can be navigated, analyzed, and visualized uniformly within or across institutions

• VIVO can visualize the strengths of networks within and across institutions

• You can create dashboards to help understand academic outputs and collaborations

• VIVO can map research engagements and impact

Page 24:

Providing the Context for Research Data

• Context is critical to finding, understanding, and reusing research data

• Contexts include:
– Narrative publications
– The researcher, research resources, grants, etc.
– Dataset registries
– Structured Knowledge Environments
– The web of Linked Open Data

Page 28:

VIVO Dataset Registries

• VIVO/ANDS consortium in Australia
– Link research data with researcher profiles and publications
– Harvest to national registry

• Datastar data registry tool
– Add-on to VIVO or independent companion
– Complement to other library data-related services
– Institute of Museum and Library Services (IMLS) grant

Page 29:

Melbourne Central Research Data Registry

Page 32:

What is VIVO Today?

• An open community hosted by the DuraSpace 501(c)3 with strong national and international participation, for which we are currently hiring a full-time VIVO Project Director

• An open suite of software tools

• A growing body of interoperable data

• An ontology (VIVO-ISF) with a community-driven process for extension

Page 33:

VIVO and the Integrated Semantic Framework

Page 34:

What is the Integrated Semantic Framework?

• A semantic infrastructure to represent people based on all the products of their research and activities
– To support both networking and reporting

• A partnership between VIVO, eagle-i, and ShareCenter

• A Clinical and Translational Information Exchange Project (CTSAconnect)
– 18 months (February 2012 – August 2013)
– Funded by NIH NCATS via Booz Allen Hamilton

Page 35:

CTSAconnect Team

OHSU: Melissa Haendel, Carlo Torniai, Nicole Vasilevsky, Shahim Essaid, Eric Orwoll

Cornell University: Jon Corson-Rikert, Dean Krafft, Brian Lowe

University of Florida: Mike Conlon, Chris Barnes, Nicholas Rejack

Stony Brook University: Moises Eisenberg, Erich Bremer, Janos Hajagos

Harvard University: Daniela Bourges-Waldegg, Sophia Cheng

Share Center: Chris Kelleher, Will Corbett, Ranjit Das, Ben Sharma

University at Buffalo: Barry Smith, Dagobert Soergel

Page 36:

People and Resources

Figure: diagram linking people and resources. Labels: techniques; training; protocols; affiliation; roles; grants; credentials; genes; anatomy; manufacturer; publications

Page 37:

Connecting researchers, resources, and clinical activities

Page 38:

Beyond Static CVs

• Distributed data

• Research and scholarship in context

• Context aids in disambiguation

• Contributor roles

• Outputs and outcomes beyond publications

Page 39:

Ontologies for Linked Data

Page 40:

Linked Data Vocabularies

FOAF (people, organizations, groups)

VCard (contact information)

BIBO (publications)

SKOS (terminologies)

Page 41:

Open Biomedical Ontologies

OBI (Ontology for Biomedical Investigations)

ERO (eagle-i Research Resource Ontology)

RO (Relationship Ontology)

IAO (Information Artifact Ontology)

Page 42:

Basic Formal Ontology

Figure labels: Continuant; Occurrent; Process; Role; Site; Spatial Region

Photo credit: Szabolcs Toth http://www.flickr.com/photos/necccc/5726970855/

Page 43:

Relationships

Diagram: Person - Position - Org.

Diagram: Person - Authorship - Article
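
The diagrams on this slide treat a relationship (a Position or an Authorship) as a node of its own, sitting between the two things it connects. A minimal sketch of that pattern using plain tuples as triples follows; every identifier, including the `ex:` names and the `relates` predicate, is a hypothetical placeholder and is not taken from the VIVO-ISF ontology files.

```python
# Illustrative only: identifiers and the "relates" predicate are hypothetical
# placeholders, not terms copied from the VIVO-ISF ontology.
triples = {
    ("ex:person1", "rdf:type", "ex:Person"),
    ("ex:org1", "rdf:type", "ex:Organization"),
    ("ex:article1", "rdf:type", "ex:Article"),
    # The Position is its own node linking a person to an organization.
    ("ex:position1", "rdf:type", "ex:Position"),
    ("ex:position1", "relates", "ex:person1"),
    ("ex:position1", "relates", "ex:org1"),
    # The Authorship is its own node linking a person to an article.
    ("ex:authorship1", "rdf:type", "ex:Authorship"),
    ("ex:authorship1", "relates", "ex:person1"),
    ("ex:authorship1", "relates", "ex:article1"),
}


def types_of(node):
    """Return the set of types asserted for a node."""
    return {o for s, p, o in triples if s == node and p == "rdf:type"}


def related_nodes(relationship):
    """Return every node a relationship node points at."""
    return {o for s, p, o in triples if s == relationship and p == "relates"}


def linked_through(person, relationship_type, target_type):
    """Find target_type nodes reached from person via a relationship node."""
    results = set()
    relationships = {s for s, p, o in triples if p == "relates" and o == person}
    for rel in relationships:
        if relationship_type not in types_of(rel):
            continue
        for other in related_nodes(rel):
            if other != person and target_type in types_of(other):
                results.add(other)
    return results
```

Because the relationship is a node, it can carry further statements of its own (a title, a rank in the author list, a time interval) without changing the person or the organization.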

Page 44:

Aggregate Data over Time

Diagram: Person - Position - Org., plus a time interval

Page 45:

Aggregate Data over Time

Diagram: Person - Position 1 - Org. 1, with time interval 1; Person - Position 2 - Org. 2, with time interval 2
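
Keeping one time interval per position is what lets a profile accumulate history instead of overwriting it. The sketch below shows one way to ask "where was this person on a given day"; the `Position` class, its field names, and the dates are all invented for illustration.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class Position:
    """A person-to-organization link that carries its own time interval."""
    person: str
    org: str
    start: date
    end: Optional[date] = None  # None means the position is still held


def positions_on(positions: List[Position], person: str, day: date) -> List[Position]:
    """Return the positions a person held on a given day (inclusive bounds)."""
    return [
        p for p in positions
        if p.person == person
        and p.start <= day
        and (p.end is None or day <= p.end)
    ]


# Hypothetical sample data mirroring the slide: one person, two organizations.
history = [
    Position("Person", "Org. 1", date(2005, 1, 1), date(2010, 6, 30)),
    Position("Person", "Org. 2", date(2010, 7, 1)),
]
```

Calling `positions_on(history, "Person", date(2008, 3, 1))` returns only the Org. 1 position, while a 2012 date returns only the Org. 2 position; neither record had to be deleted when the person moved.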

Page 46:

Aggregate Data over Time

Diagram: Person - VCard - Name, plus a time interval

Page 47:

Aggregate Data over Time

Diagram: Person - VCard 1 - Old Name, with time interval 1; Person - VCard 2 - New Name, with time interval 2

Page 48:

Aggregate Data over Time

Diagram labels: Person; Authorship; VCard; time interval

Page 49:

Beyond Publication Bylines

Diagram: Person - Role - Project

• What are people doing?

• Roles in projects, activities

• Other kinds of scholarly contribution

• Datasets, resources

Page 50:

Roles and Outputs

Diagram labels: Person; Role; Project; document / resource / etc.

Page 51:

Application Examples: Search

Page 52:

Application Examples: Search

Figure: multi-site search architecture. Sources: Ponce VIVO; WashU VIVO; IU VIVO; Cornell Ithaca VIVO; Weill Cornell VIVO; UF VIVO; Scripps VIVO; Other VIVOs; eagle-i research resources; Harvard Profiles RDF; Digital Vita RDF; Iowa Loki RDF. Other labels: Linked Open Data; vivosearch.org; Solr search index; Alternate Solr index
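
The figure shows records from many sites ending up in one shared search index. As a rough sketch of that idea, the code below builds a tiny in-memory inverted index over labeled records from several sites. It is a stand-in for illustration, not Solr and not the vivosearch.org code; the site names, URIs, and labels are made up.

```python
from collections import defaultdict


def build_index(sites):
    """Map each lowercase token to the (site, uri) pairs whose label has it."""
    index = defaultdict(set)
    for site, records in sites.items():
        for uri, label in records:
            for token in label.lower().split():
                index[token].add((site, uri))
    return index


def search(index, query):
    """Return entries matching every token in the query (AND semantics)."""
    tokens = query.lower().split()
    if not tokens:
        return set()
    hits = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        hits &= index.get(token, set())
    return hits


# Hypothetical harvested records: (uri, label) pairs grouped by source site.
sites = {
    "site-a": [
        ("http://a.example/person1", "Soil science researcher"),
        ("http://a.example/person2", "Plant pathology researcher"),
    ],
    "site-b": [
        ("http://b.example/person9", "Soil chemistry laboratory"),
    ],
}
index = build_index(sites)
```

A query such as `search(index, "soil")` returns hits from both sites, which is the practical benefit of indexing shared, consistently structured data in one place.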

Page 53:

Application Examples: Search

Page 55:

Use Cases

• Find publications supported by grants

• Discover and re-use expensive equipment and resources

• Demonstrate importance of facilities services to research results

• Discover people with access to resources or with expertise in techniques
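
Each of these use cases amounts to following a short chain of links through the data. A minimal sketch of the first one, assuming grants fund projects and projects produce publications, is shown below; the predicate names (`funds`, `produced`, `used`) and all identifiers are invented for this example and are not VIVO-ISF property names.

```python
# Hypothetical facts as (subject, predicate, object) tuples.
facts = [
    ("grant:G1", "funds", "project:X"),
    ("project:X", "produced", "pub:P1"),
    ("project:X", "produced", "pub:P2"),
    ("project:X", "used", "resource:Microscope"),
    ("grant:G2", "funds", "project:Y"),
    ("project:Y", "produced", "pub:P3"),
]


def objects(subject, predicate):
    """Return all objects linked from subject by predicate."""
    return {o for s, p, o in facts if s == subject and p == predicate}


def publications_supported_by(grant):
    """Follow grant -> project -> publication."""
    pubs = set()
    for project in objects(grant, "funds"):
        pubs |= objects(project, "produced")
    return pubs


def grants_that_used(resource):
    """Find grants whose funded projects used a given resource."""
    return {
        grant for grant, predicate, project in facts
        if predicate == "funds" and resource in objects(project, "used")
    }
```

The same two-hop pattern, run in the other direction, is how a facility could show which grants and publications depended on its equipment.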

Page 56:

Linking People through Terminologies

Diagram: Clinicians (ICD9 codes) and Researchers (MeSH keywords) connected as linked data through ISF + UMLS

http://cstaconnect.org/
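
One way to read this diagram: clinicians are described with ICD9 codes and researchers with MeSH keywords, and a shared concept layer lets the two vocabularies meet. The sketch below shows that join in miniature. The codes, terms, and concept identifiers are deliberately fake placeholders; a real crosswalk would come from UMLS, and the function names are assumptions for this example.

```python
# Placeholder crosswalks: fake codes mapped to fake shared concept IDs.
ICD9_TO_CONCEPT = {"ICD9:000.1": "CONCEPT:A", "ICD9:000.2": "CONCEPT:B"}
MESH_TO_CONCEPT = {"MeSH:Term One": "CONCEPT:A", "MeSH:Term Two": "CONCEPT:C"}

# Hypothetical people and the terms attached to them.
clinician_codes = {
    "clinician1": {"ICD9:000.1"},
    "clinician2": {"ICD9:000.2"},
}
researcher_keywords = {
    "researcher1": {"MeSH:Term One"},
    "researcher2": {"MeSH:Term Two"},
}


def concepts(terms, mapping):
    """Translate vocabulary terms to shared concept IDs, skipping unknowns."""
    return {mapping[t] for t in terms if t in mapping}


def link_people():
    """Pair clinicians and researchers who share at least one concept."""
    links = []
    for clinician, codes in sorted(clinician_codes.items()):
        clinical = concepts(codes, ICD9_TO_CONCEPT)
        for researcher, keywords in sorted(researcher_keywords.items()):
            shared = clinical & concepts(keywords, MESH_TO_CONCEPT)
            if shared:
                links.append((clinician, researcher, sorted(shared)))
    return links
```

With the placeholder data, only `clinician1` and `researcher1` are linked, because they are the only pair whose terms map to the same concept.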

Page 57:

Humanities and Artistic Works

• Performances of a work

• Translations

• Collections and exhibits

Steven McCauley and Theodore Lawless, Brown University

http://www.vivoweb.org/files/vivo2013/friday_pm/VIVO-Humanities_McCauley.pdf

Page 58:

Collaborative Development

• DuraSpace VIVO-ISF Working Group

• Biweekly calls (Wed 2 pm ET)

https://wiki.duraspace.org/display/VIVO/ - look for “Ontology Working Group”

Page 59:

Interest Groups

Page 60:

Linked Data for Libraries: Creating a Scholarly Resource Semantic Information Store (SRSIS)

Page 61:

Linked Data for Libraries

• On December 5, 2013, the Andrew W. Mellon Foundation made a two-year $999K grant to Cornell, Harvard, and Stanford, starting in January 2014

• Partners will work together to develop an ontology and linked data sources that provide relationships, metadata, and broad context for Scholarly Information Resources

• Leverages existing work by both the VIVO project and the Hydra Partnership

Page 62:

The Project Team

• Cornell: Dean Krafft, Jon Corson-Rikert, Brian Lowe, Simeon Warner, and 1.5 new FTE

• Harvard: David Weinberger, Paul Deschner, and an outside consultant

• Stanford: Tom Cramer and 1 new FTE

Page 63:

“The goal is to create a Scholarly Resource Semantic Information Store model that works both within individual institutions and through a coordinated, extensible network of Linked Open Data to capture the intellectual value that librarians and other domain experts add to information resources when they describe, annotate, organize, select, and use those resources, together with the social value evident from patterns of usage.”

Page 64:

Project timeline 2014

• Jan-June 2014: Initial ontology design; identify data sources; identify external vocabularies; begin SRSIS and Hydra ActiveTriples development

• July-Dec 2014: Complete initial ontology; complete initial ActiveTriples development; pilot initial data ingests into Vitro-based SRSIS instance at Cornell

Page 65:

Workshop – December 2014

• Hold a two-day workshop for 25 attendees from 10-12 interested library, archive, and cultural memory institutions

• Demonstrate initial prototypes of SRSIS and ontology

• Obtain feedback on initial ontology design

• Obtain feedback on overall design and approach

• Make connections to support participants in piloting this approach at their institutions

• Understand how institutions see this approach fitting in with their own multi-institutional collaborations and existing cross-institutional efforts such as the Digital Public Library of America, VIVO, and SHARE

Page 66:

Project timeline Jan-June 2015

• Pilot SRSIS instances at Harvard and Stanford

• Populate Cornell SRSIS instance from multiple data sources including MARC catalog records, EAD finding aids, VIVO data, CuLLR, and local digital collections

• Develop a test instance of the SRSIS Search application harvesting RDF across the three partner institutions

• Integrate SRSIS with ActiveTriples

Page 67:

Project timeline July-Dec 2015

• Implement fully functional SRSIS instances at Cornell, Harvard, and Stanford

• Public release of open source SRSIS code and ontology

• Public release of open source ActiveTriples Hydra Component

• Create public demonstration of SRSIS Search-based discovery and access system across the three SRSIS instances

Page 68:

Project Outcomes

• Open source extensible SRSIS ontology compatible with VIVO ontology, BIBFRAME, and other existing library LOD efforts

• Open source SRSIS semantic editing, display, and discovery system

• Project Hydra compatible interface to SRSIS, using ActiveTriples to support Blacklight search across multiple SRSIS instances

Page 69:

For More Information: http://vivoweb.org

@VIVOCollab

Questions?