linked open data_mlanet13

43
An Introduction to the Semantic Web and Linked Open Data Kristi L. Holmes, PhD Twitter: @kristiholmes Layne Mark Johnson, PhD @LayneJohnson The day after May the Fourth, 2013

Upload: kristi-holmes

Post on 11-May-2015

165 views

Category:

Technology


0 download

DESCRIPTION

Presentation at 2013 Medical Library Association Annual Meeting Layne Johnson and Kristi Holmes

TRANSCRIPT

Page 1: Linked Open Data_mlanet13

An Introduction to the Semantic Web and Linked Open Data

Kristi L. Holmes, PhDTwitter: @kristiholmes

Layne Mark Johnson, PhD@LayneJohnson

The day after May the Fourth, 2013

Page 2: Linked Open Data_mlanet13

Information Overload

Page 3: Linked Open Data_mlanet13

We humans have always applied tools to our work to make things

work easier…

Page 4: Linked Open Data_mlanet13

Simple Machines

Page 5: Linked Open Data_mlanet13

“a web of data that can be processed directly and indirectly by machines”

-Tim Berners-Lee

Page 6: Linked Open Data_mlanet13

At its heart, the Semantic Web is really about

extending standard Web technologies to better deal

with data on the Web.

If the WWW is for people, the Semantic Web is for machines

George Thomas and Jim Hendler, http://www.data.gov/communities/node/116/blogs/142

Data modeled as bidirectional relationships

All data has standard form

at

Eve

ryth

ing

has

its o

wn

UR

I

Semantic Web Value Proposition…

Web-based infrastructure of standards and technologies which allows for a distributable, machine readable description of data that allows for stronger data and smart web application linkages

Page 7: Linked Open Data_mlanet13

How the Semantic Web works

Anakin Skywalker is Luke Skywalker's father.

Page 8: Linked Open Data_mlanet13

How the Semantic Web works

XML and RDF are at the heart of the Semantic Web. They give computers a structure in which to look for

information and define relationships between resources.

http://computer.howstuffworks.com/semantic-web

Page 9: Linked Open Data_mlanet13

An ontology is simply a vocabulary that describes objects and how they relate to one another. A schema

is a method for organizing information

http://computer.howstuffworks.com/semantic-web

Page 10: Linked Open Data_mlanet13

Using languages designed for data

RDF | OWL | XML

Page 11: Linked Open Data_mlanet13

Semantic web: describes methods and technologies to allow machines to understand the meaning or "semantics” of information on the web. -- W3C director Sir Tim Berners-Lee

Ontology: a formal representation of the knowledge by a set of concepts within a domain and the relationships between those concepts.-- Wikipedia

Page 12: Linked Open Data_mlanet13

Let’s talk about the data…

The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or

machine can explore the web of data.  With linked data, when you have some

of it, you can find other related data.

http://computer.howstuffworks.com/semantic-web

Page 13: Linked Open Data_mlanet13

The 5 Stars of Linked Open Data

★★★★★★★★★★★★★★★

http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/http://www.w3.org/DesignIssues/LinkedData.html

AVAILABILITY & VALUE

Page 14: Linked Open Data_mlanet13

Is your data 5-star??

The 5 Stars of Linked Open Data

http://5stardata.info

Page 15: Linked Open Data_mlanet13

The growth of Linked Data

20082007

2011

Page 16: Linked Open Data_mlanet13

http://lod-cloud.net

What kind of things are available as

linked data?

The LOD Cloud

Page 17: Linked Open Data_mlanet13

Models and standards that allow for greater data exchange (and flexibility!)

It takes layers and layers of metadata, logic and security to

make the Web machine-readable.

http://computer.howstuffworks.com/semantic-web

Page 18: Linked Open Data_mlanet13

Building a web of data

http://geology.com/articles/night-satellite/satellite-photo-of-europe-at-night-lg.jpg

Data Creators, Data Aggregators, & Data Consumers

Repositories. Tools. Applications.

Workflows

Page 19: Linked Open Data_mlanet13
Page 20: Linked Open Data_mlanet13

Ok! Now let’s dig into a few good examples of how we can put

these things to work

Page 21: Linked Open Data_mlanet13

Linked Open Data and Biomedical Research: A Survey of Current International Efforts

Kristi L. Holmes, PhDTwitter: @kristiholmes

Layne Mark Johnson, PhD@LayneJohnson

May 5, 2013

Page 22: Linked Open Data_mlanet13

The

evo

lvin

g ec

osys

tem

of

info

rmat

ion

Courtesy Mike Conlon, U Florida

Page 23: Linked Open Data_mlanet13

Projects.

Research NetworkingOntology

Page 24: Linked Open Data_mlanet13

Research NetworkingInformation about scholars is optimized using a Web-based infrastructure of

standards and technologies which allows for a distributable, machine readable description of data that allows for stronger data and smart web application linkages across many universities, agencies, societies both

within the US and abroad.

Why is this important?

Linked data infrastructure allows for • Visualizations, research and clinical data integration, and deep

semantic searching across multiple types and sources of data• By breaking data out of traditional database silos, research

networking platforms promote a network effect within a single site and across multiple sites– The value of the network increases with the amount of linked data

and applications that are available to consume the linked data.

Page 25: Linked Open Data_mlanet13

The Semantic Web & Researcher Networking

• Increasing recognition of the value of semantic web standards • Increasing momentum in support of semantic web

technologies to facilitate research discovery• Recommendations for researcher networking recently

endorsed by the CTSA Consortium Steering Committee represent a new standard in researcher networking.

• Examples of applications that consume these rich data include: visualizations, enhanced multi-site search. Other utilities are in development across a wide range of topic areas.

Page 26: Linked Open Data_mlanet13

Recommendations and Best Practices for Research Networking

The Research Networking Recommendations were approved by the CTSA Consortium Executive and Steering Committee on October 25, 2011.

Recommendations for Research Networking:• Recommendation: All CTSAs should encourage their institution(s) to implement

research networking tool(s) institution-wide that utilize RDF triples and an ontology compatible with the VIVO ontology. 

• Recommendation: Information in people profiles at institutions should be publicly available as data as a general principle, specifically as Linked Open Data.  To ensure quality of information, authoritative electronic data sources versus manual entry should be emphasized.  Institutions will vary in the amount of information that they will include and make publicly available but the value is enhanced by the quality and quantity of information.

• Recommendation: Monitoring of the research networking landscape, technology, and tools should continue to be overseen by experts from the CTSA consortium (e.g., the Research Networking group of the Informatics KFC).

https://www.ctsacentral.org/recommendations-and-best-practices-research-networking

Page 27: Linked Open Data_mlanet13

Research Networking Systems

• VIVO, Profiles, SciVal Experts, Stanford’s CAP, Iowa’s Loki

• Encourage your RN provider to meet the recommendations for Researcher networking– Better visibility – Enhanced utility

Page 28: Linked Open Data_mlanet13

Profiles

• text

http://catalyst.harvard.edu/spotlights/profiles.html

Page 29: Linked Open Data_mlanet13

VIVO

This work is funded by the National Institutes of Health, U24 RR029822. 

VIVO enjoys a robust open source, open community space to support implementation, adoption, and development efforts around the

world. See http://vivo.sourceforge.net

Page 30: Linked Open Data_mlanet13

www.ctsaconnect.org CTSAconnectReveal Connections. Realize

Potential.

  

CTSAConnect ProjectGoals:

– Identify potential collaborators, relevant resources, and expertise across scientific disciplines

– Assemble translational teams of scientists to address specific research questions

Approach:Create a semantic representation of clinician and basic science researcher expertise to enable– Broad and computable representation of translational

expertise– Publication of expertise as Linked Data (LD) for use in

other applications

Page 31: Linked Open Data_mlanet13

2/21/14 31www.ctsaconnect.org CTSAconnectReveal Connections. Realize

Potential.

  

Merging VIVO and eagle-i

eagle-i is an ontology-driven application for collecting and searching research resources.

VIVO is an ontology-driven application for collecting anddisplaying information about people.

Both publish Linked Data. Neither addresses clinical expertise.

CTSAconnect will produce a single Integrated Semantic Framework, a modular collection of ontologies — that also includes clinical expertise

eagle-i

Resources

VIVO

People

Coordinationeagle-i

VIVO

Inte

grat

ed

Framew

ork

Semantic

Clinical activities

Page 32: Linked Open Data_mlanet13

OpenPHACTS

Open PHACTS Project• To reduce the barriers to drug discovery in industry,

academia and for small businesses, the Open PHACTS consortium is building the Open PHACTS Discovery Platform. This will be freely available, integrating pharmacological data from a variety of information resources and providing tools and services to question this integrated data to support pharmacological research. 

Guiding principle is open access, open usage, open source- Key to standards adoption -

http://www.openphacts.org/

Page 33: Linked Open Data_mlanet13

OpenPHACTS

Open PHACTS Project• Develop a set of robust standards…• Implement the standards in a semantic integration hub• Deliver services to support drug discovery programs in pharma

and public domain• 22 partners, 8 pharmaceutical companies, 3 biotechs• 36 months project, through March 2014

Guiding principle is open access, open usage, open source- Key to standards adoption -

http://www.openphacts.org/

Page 34: Linked Open Data_mlanet13

http://skr3.nlm.nih.gov/SemMed/index.html

Page 35: Linked Open Data_mlanet13

Outreach and adoption activities

Education and

training

Ontology and

controlled vocabulary expertise

Relationships with

vendors/data providers

Programming & technical support

Understand data structure

Libraries

Libraries are supporting (& contributing!) to work areas in a variety of ways related to core

mission and service areas

Page 36: Linked Open Data_mlanet13

Tools & Apps.

SearchVisualizations

Work efficienciesAnalysis and evaluation

Page 37: Linked Open Data_mlanet13

Search

• VIVOsearch and CTSAsearch• VIVOsearchlight

• AgriVIVO – FAO of the UN

• Search across – Land Grant institutions– CTSA Consortium Schools– State university systems; Big 10, Big 12, etc.

Page 38: Linked Open Data_mlanet13

http://vivosearchlight.org/@mileswortho

Page 39: Linked Open Data_mlanet13

Visualizations!

http://xcite.hackerceo.org/VIVOviz/@hackerceo

Inte

r-In

stitu

tiona

l Col

labo

ratio

n E

xplo

rer

Page 40: Linked Open Data_mlanet13

Make work easier

Page 41: Linked Open Data_mlanet13

SPARQL Query Builder

Page 42: Linked Open Data_mlanet13

Are you using Linked Open Data?

What are your hopes for this collection of technologies?

How can you get involved?

Page 43: Linked Open Data_mlanet13

Open data, open tools, open process

Thank you!

Acknowledgements:• Carlo Torniai & Melissa Haendel – OHSU• Tony Williams – OpenPHACTS, RSC• CTSA Research Networking Affinity workgroup• VIVO Project