linked data approach for integration of human health & environmental data

46
Linked Data Approach for Integration of Human Health and Environmental Data Building for Better Decisions: Multi-scale Integration of Human Health and Environmental Data 8-11 May 2012 By: Bernadette Hyland, Chair, W3C Government Linked Data WG CEO, 3 Round Stones, Inc Email. [email protected] Twitter: @BernHyland This presentation: http://slideshare.net/3roundstones 1 Tuesday, May 8, 12

Upload: 3-round-stones

Post on 07-May-2015

614 views

Category:

Technology


0 download

DESCRIPTION

Best practices and platforms for access and reuse of scientific data and models. We explore a Linked Data approach for data integration, modeling and interoperability. Delivered by Bernadette Hyland at EPA & Society of Toxicology Scientific Workshop titled: "Building for Better Decisions: Multi-scale Integration of Human Health and Environmental Data.. Delivered 8-May-2012 at EPA Research Triangle Park, NC USA.

TRANSCRIPT

Page 1: Linked Data Approach for Integration of Human Health & Environmental Data

Linked Data Approach for Integration of Human Health and

Environmental DataBuilding for Better Decisions: Multi-scale Integration of Human Health

and Environmental Data 8-11 May 2012

By: Bernadette Hyland, Chair, W3C Government Linked Data WG

CEO, 3 Round Stones, Inc

Email. [email protected]: @BernHyland

This presentation: http://slideshare.net/3roundstones

1Tuesday, May 8, 12

Page 2: Linked Data Approach for Integration of Human Health & Environmental Data

• Linked Data is about publishing and consuming data using international data standards

• Based on 20 year old idea

• A system of linked information systems

2Tuesday, May 8, 12

Page 3: Linked Data Approach for Integration of Human Health & Environmental Data

3Tuesday, May 8, 12

Page 4: Linked Data Approach for Integration of Human Health & Environmental Data

Photo credit: http://www.flickr.com/photos/sjungling/5974860/

4Tuesday, May 8, 12

Page 5: Linked Data Approach for Integration of Human Health & Environmental Data

1970s 1980s 1990s

$ cat foo.txt | grep blah | sort

A neat little package Client-Server The Early Web

A HISTORY OF SILOS

5Tuesday, May 8, 12

Page 6: Linked Data Approach for Integration of Human Health & Environmental Data

There is a better way to connect data ...

• No one vendor owns it• It scales ... to Web-scale• Doesn’t require a super model• Based on International Data Exchange Standards (RDF, SPARQL)

6Tuesday, May 8, 12

Page 7: Linked Data Approach for Integration of Human Health & Environmental Data

• What is next for Open Data on the Web

• Structured data on the Web is quickly becoming mainstream

• Authorities beginning to appreciate a new way to publish and consume content

What is next for Data in the Web?

7Tuesday, May 8, 12

Page 8: Linked Data Approach for Integration of Human Health & Environmental Data

8Tuesday, May 8, 12

Page 9: Linked Data Approach for Integration of Human Health & Environmental Data

9Tuesday, May 8, 12

Page 10: Linked Data Approach for Integration of Human Health & Environmental Data

“Linked Data means

Cooperation without coordination”

-- David Wood, PhD

10Tuesday, May 8, 12

Page 11: Linked Data Approach for Integration of Human Health & Environmental Data

GovernmentsGoals: Governmental transparency and/or improved

internal efficiencies (data warehouses)

11Tuesday, May 8, 12

Page 12: Linked Data Approach for Integration of Human Health & Environmental Data

Hardware/Software Vendors

Goal: Improve interoperability between products and product lines

12Tuesday, May 8, 12

Page 13: Linked Data Approach for Integration of Human Health & Environmental Data

RetailersGoal: Improve click-throughs on search results

13Tuesday, May 8, 12

Page 14: Linked Data Approach for Integration of Human Health & Environmental Data

Book PublishersGoals: Improve internal manuscript pipelines, expose

additional ways of finding and using content

14Tuesday, May 8, 12

Page 15: Linked Data Approach for Integration of Human Health & Environmental Data

New Media

15Tuesday, May 8, 12

Page 16: Linked Data Approach for Integration of Human Health & Environmental Data

Web

Universal Client

Universal Connection

Universal Database

Logic and interlinking

Ubiquitous,reusable applications

URL Curation

of Data

Linked Data in Context

16Tuesday, May 8, 12

Page 17: Linked Data Approach for Integration of Human Health & Environmental Data

17Tuesday, May 8, 12

Page 18: Linked Data Approach for Integration of Human Health & Environmental Data

18Tuesday, May 8, 12

Page 19: Linked Data Approach for Integration of Human Health & Environmental Data

19Tuesday, May 8, 12

Page 20: Linked Data Approach for Integration of Human Health & Environmental Data

Why is RDF important?• It is an international standard for publishing data on

the Web (public and private)

• Data exchange model

• Serializations include RDF/XML, N-triples, N3, Turtle ...

• It is the future of using the Web

20Tuesday, May 8, 12

Page 21: Linked Data Approach for Integration of Human Health & Environmental Data

What you can do ...

• Good = Use Data Standards (RDF) to publish metadata about data and models, at a minimum

• Better = Use RDF to publish all your data

• Best = Link your data + models

• Web architecture, Web-scale

21Tuesday, May 8, 12

Page 22: Linked Data Approach for Integration of Human Health & Environmental Data

22Tuesday, May 8, 12

Page 23: Linked Data Approach for Integration of Human Health & Environmental Data

23Tuesday, May 8, 12

Page 24: Linked Data Approach for Integration of Human Health & Environmental Data

WE’VE SEEN THIS BEFORE

24Tuesday, May 8, 12

Page 25: Linked Data Approach for Integration of Human Health & Environmental Data

25Tuesday, May 8, 12

Page 26: Linked Data Approach for Integration of Human Health & Environmental Data

26Tuesday, May 8, 12

Page 27: Linked Data Approach for Integration of Human Health & Environmental Data

27Tuesday, May 8, 12

Page 28: Linked Data Approach for Integration of Human Health & Environmental Data

28Tuesday, May 8, 12

Page 29: Linked Data Approach for Integration of Human Health & Environmental Data

29Tuesday, May 8, 12

Page 30: Linked Data Approach for Integration of Human Health & Environmental Data

30Tuesday, May 8, 12

Page 31: Linked Data Approach for Integration of Human Health & Environmental Data

31Tuesday, May 8, 12

Page 32: Linked Data Approach for Integration of Human Health & Environmental Data

32Tuesday, May 8, 12

Page 33: Linked Data Approach for Integration of Human Health & Environmental Data

33Tuesday, May 8, 12

Page 34: Linked Data Approach for Integration of Human Health & Environmental Data

EMRData

InternalPortal  Data

Linked  DataCloud

Open  Government  Data

Social  Media

Clinical  Condi*on  Specific

PhysiciansServicesLoca*ons

DBpediaPub  MedNLM

CDCEPA

US  Census

FacebookTwiCer

ClinicalOntology

BusinessOntology

34Tuesday, May 8, 12

Page 35: Linked Data Approach for Integration of Human Health & Environmental Data

•Decrease costly emergency department visits

•Reduce hospital re-admissions after treatment

• Improved self-care and medication compliance

•Education of triggers and disease management

Value Proposition

35Tuesday, May 8, 12

Page 36: Linked Data Approach for Integration of Human Health & Environmental Data

Func*onal  Model

1.  Define  target  popula*on  and  clinical  data  from  electronic  medical  record

2.  Iden*fy  sources  of  open  government  data  related  to  environmental,  weather,  and  other  variables  related  to  chronic  pulmonary  disease  exacerba*ons

3.  Combine  open  content  from  NLM,  PubMed,  Medline  to  support  educa*on

4.  Leverage  a  Linked  Data  approach,  using  Open  Source  and  interna*onal  data  exchange  standards  (RDF)

5.    Alert  pa*ent  of  possible  hazardous  condi*ons  and  recommend  appropriate  ac*ons

36Tuesday, May 8, 12

Page 37: Linked Data Approach for Integration of Human Health & Environmental Data

CA-­‐email-­‐message.jpg

Leverage  Linked  Data,  Open  Source  &  Standards

CDCEPA

US  Census

DBpediaPub  MedNLM

Web  of  Data

EMR

SMS

Email

Web

37Tuesday, May 8, 12

Page 38: Linked Data Approach for Integration of Human Health & Environmental Data

38Tuesday, May 8, 12

Page 39: Linked Data Approach for Integration of Human Health & Environmental Data

Shows:

1) Air Quality data from US EPA

2) Anonymized EMR data

3) Doctor’s details from CSV file

Uses Callimachus,a Linked Data Management Platform

39Tuesday, May 8, 12

Page 40: Linked Data Approach for Integration of Human Health & Environmental Data

• Large and small vendors are involved in Linked Data

• From Oracle, IBM to 3 Round Stones

• Listing of active projects, companies and research See http://dir.w3.org/

• Best practices, see http://www.w3.org/2011/gld/charter

Tools & best practices?

40Tuesday, May 8, 12

Page 41: Linked Data Approach for Integration of Human Health & Environmental Data

•Callimachus is a framework for data-driven applications based on Linked Data principles

•Callimachus allows Web developers to easily create data driven applications for the Web

• It is Open Source (FLOSS)

•http://callimachusproject.org

41Tuesday, May 8, 12

Page 42: Linked Data Approach for Integration of Human Health & Environmental Data

http://www.w3.org/2011/gld/charter42Tuesday, May 8, 12

Page 43: Linked Data Approach for Integration of Human Health & Environmental Data

DELIVERABLES

Community Directory

Best Practices for Publishing Linked Data

Procurement, vocabulary selection, URI construction, versioning, stability, legacy data issues

Cookbook for Linked Open Data

Standard Vocabularies

Metadata, Statistical “Cube” Data, People, Organizational structures

43Tuesday, May 8, 12

Page 44: Linked Data Approach for Integration of Human Health & Environmental Data

44Tuesday, May 8, 12

Page 45: Linked Data Approach for Integration of Human Health & Environmental Data

• Be prepared for the scientific community & public to demand that your data be

published in re-usable format (RDF)

• Demand your vendors use Open Source whenever possible

• Incentivize industry & STM publishers to do the right thing

• Open vs. proprietary technologies & data formats ... be OPEN• Beware of semantic “pixie dust” - be “an educated consumer” (and scientist!)

• Solutions must embrace International Standards and published Best Practices (W3C, OMG, IETF)

• Define a URI Policy and Strategy, document it and ensure scientists use it!

• Leverage the work of others and work cooperatively...

• Our future is all connected through your work...

Recommendations

45Tuesday, May 8, 12

Page 46: Linked Data Approach for Integration of Human Health & Environmental Data

This work is Copyright © 2011-2012 3 Round Stones Inc.It is licensed under the Creative Commons Attribution 3.0 Unported LicenseFull details at: http://creativecommons.org/licenses/by/3.0/

You are free:

to Share — to copy, distribute and transmit the work

to Remix — to adapt the work

Under the following conditions:Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).

Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one.

46Tuesday, May 8, 12