gt health2stat 7-22-2010

Data.gov: Web, Data Web, Social Data Web [email protected] 7/22/2010 #health2stat

Upload: george-thomas

Post on 02-Nov-2014

TRANSCRIPT

Page 1: Gt health2stat 7-22-2010

Data.gov: Web, Data Web, Social Data Web

[email protected] 7/22/2010 #health2stat

Page 2: Gt health2stat 7-22-2010

The Web is evolving…

From a Web of Linked Documents,

To a Web of Linked Data!

Page 3: Gt health2stat 7-22-2010

OK…

What does that mean?

Bear with me for a quick overview -

Page 4: Gt health2stat 7-22-2010

There’s only one type of link in HTML

SourcePage.htm contains markup;

<a href='target.html'> hyperlink </a>

Page 5: Gt health2stat 7-22-2010

XML gave us custom tags

<myTag> typed data </myTag>

Good, but more importantly...

Page 6: Gt health2stat 7-22-2010

The Architecture of the Web

(one of the most important and successful inventions of the 20th Century!)

is RESTful

(Info) Resource Representations (URIs vs URLs), Uniform Interface (CRUD via HTTP methods), server Resource state and client App state (via links)...

Page 7: Gt health2stat 7-22-2010

Web + Data = Data Web

The Data Web (aka Web of Data) is an innovation that extends the existing Web of Documents

Beginning with the Resource Description Framework (RDF)

Which is a Web based data modeling language

Page 8: Gt health2stat 7-22-2010

RDF gives us custom link types:

node <arc> node

thing <relationship> thing

topic <property> topic

entity <attribute> value

subject <predicate> object

the grammar of ‘triples’
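A minimal sketch of the triple grammar in Python, assuming plain strings stand in for URIs. The `Triple` type and the `objects_of` helper are hypothetical names introduced only for illustration, and the example URIs are placeholders rather than real resources.

```python
from typing import List, NamedTuple


class Triple(NamedTuple):
    """One subject <predicate> object statement."""
    subject: str
    predicate: str
    obj: str


# A tiny graph: a list of triples. The URIs are illustrative placeholders.
graph = [
    Triple("http://agency.gov/Subject", "http://my.org/predicate", "http://your.com/Object"),
]


def objects_of(triples: List[Triple], subject: str, predicate: str) -> List[str]:
    """Follow one typed link: return every object reached from subject via predicate."""
    return [t.obj for t in triples if t.subject == subject and t.predicate == predicate]


linked = objects_of(graph, "http://agency.gov/Subject", "http://my.org/predicate")
```

The point of the sketch is that the predicate is itself data, so a link carries a type instead of being a bare hyperlink.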

Page 9: Gt health2stat 7-22-2010

HTTP GET ‘dereferences’ RDF triples in multiple serialization formats (.htm, .rdf, .json)

So, the URI

http://agency.gov/Subject

has a URL representation that returns

– <http://my.org/predicate> <http://your.com/Object>
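One common way to ask for a particular serialization is HTTP content negotiation through the Accept header. The sketch below only builds the request and does not send it, since the slide's URI is a placeholder. The mapping from file extensions to media types and the `build_dereference_request` helper are assumptions made for illustration, not something the slides specify.

```python
from urllib.request import Request

# Assumed mapping from the serializations named on the slide to media types.
MEDIA_TYPES = {
    ".htm": "text/html",
    ".rdf": "application/rdf+xml",
    ".json": "application/json",
}


def build_dereference_request(uri: str, extension: str) -> Request:
    """Build (but do not send) a GET request asking for one serialization of a URI."""
    return Request(uri, headers={"Accept": MEDIA_TYPES[extension]}, method="GET")


request = build_dereference_request("http://agency.gov/Subject", ".rdf")
```

Against a real server, passing `request` to `urllib.request.urlopen` would perform the GET; whether the server honors the Accept header depends on how it is configured.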

Page 10: Gt health2stat 7-22-2010

Grammar needs vocabularies...

Vocabularies can be metadata:

Objective quantifies Goal

So http://standards.org/vocab/Objective returns;

<quantifies> <http://standards.org/vocab/Goal>

Page 11: Gt health2stat 7-22-2010

Instance data (re)uses vocabularies:

Meaningful Use 'is a' (type of) Objective

http://onc.hhs.gov/Meaningful_Use

<type> <http://standards.org/Objective>

EHR Exchange 'is a' (type of) Goal

http://onc.hhs.gov/EHR_Exchange

<type> <http://standards.org/Goal>

Page 12: Gt health2stat 7-22-2010

OK…

Now we’ve been introduced to the what, but -

WHY should we do this Linked (Open Gov) Data stuff?

Page 13: Gt health2stat 7-22-2010

We can (automatically) infer things like; Meaningful Use quantifies EHR Exchange

We can traverse (via apps and browsers) the data graph with no a priori domain knowledge…

We/others just make links to correlate disparately owned/managed/published data across distinct (some say federated) domains…
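The inference example can be sketched as a toy rule: if a vocabulary says class A relates to class B, then every instance of A is taken to relate to every instance of B in the same way. This mirrors the slide's example only; it is not a standard RDFS or OWL entailment rule, and the short names below stand in for the full URIs.

```python
TYPE = "type"

# Vocabulary (metadata): Objective quantifies Goal.
vocabulary = {("Objective", "quantifies", "Goal")}

# Instance data that reuses the vocabulary.
instances = {
    ("Meaningful_Use", TYPE, "Objective"),
    ("EHR_Exchange", TYPE, "Goal"),
}


def infer(vocabulary, instances):
    """Lift each class-level relationship down to the instances of those classes."""
    members = {}  # class name -> set of instances declared with that type
    for subject, predicate, obj in instances:
        if predicate == TYPE:
            members.setdefault(obj, set()).add(subject)

    inferred = set()
    for subject_class, predicate, object_class in vocabulary:
        for subject in members.get(subject_class, ()):
            for obj in members.get(object_class, ()):
                inferred.add((subject, predicate, obj))
    return inferred


inferred = infer(vocabulary, instances)
```

A real deployment would more likely lean on a reasoner and explicit rules, but the sketch shows why shared vocabularies make this kind of automation possible.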

Page 14: Gt health2stat 7-22-2010

We can (automatically) integrate disparate data sites/sources via graph merging!

If http://hhs.gov/about returns;

<owl:sameAs> <http://dbpedia.org/DHHS>, <http://www.freebase.com/HHS> .

All the data from each site are then seen as one dataset, substantially lowering the coordination costs of integration!
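Graph merging can be sketched as a set union followed by collapsing every group of sameAs-linked URIs onto one representative. The `merge_graphs` helper, the `ex:` property names, and the literal values are hypothetical; only the three URIs come from the slide.

```python
SAME_AS = "owl:sameAs"


def merge_graphs(*graphs):
    """Union several triple sets and rewrite sameAs-equivalent URIs to one representative."""
    triples = set().union(*graphs)
    parent = {}

    def find(node):
        # Union-find lookup with path halving.
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    for subject, predicate, obj in triples:
        if predicate == SAME_AS:
            root_a, root_b = find(subject), find(obj)
            if root_a != root_b:
                # The lexicographically smaller URI becomes the representative.
                smaller, larger = sorted((root_a, root_b))
                parent[larger] = smaller

    return {(find(s), p, find(o)) for s, p, o in triples if p != SAME_AS}


hhs = {
    ("http://hhs.gov/about", SAME_AS, "http://dbpedia.org/DHHS"),
    ("http://hhs.gov/about", SAME_AS, "http://www.freebase.com/HHS"),
    ("http://hhs.gov/about", "ex:propertyFromHHS", "placeholder value 1"),
}
dbpedia = {("http://dbpedia.org/DHHS", "ex:propertyFromDBpedia", "placeholder value 2")}
freebase = {("http://www.freebase.com/HHS", "ex:propertyFromFreebase", "placeholder value 3")}

merged = merge_graphs(hhs, dbpedia, freebase)
```

After merging, all three placeholder properties hang off a single subject, which is the "one dataset" effect described above.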

Page 15: Gt health2stat 7-22-2010

Who is doing this in Industry?

Google, Facebook, Yahoo

NY Times, Newsweek

Best Buy, …

What about Government Agencies?

Library of Congress Subject Headings

http://id.loc.gov – pace-setting example!

Page 16: Gt health2stat 7-22-2010

So what about Data.gov?

(Homepage: ‘Linking Open Government Data’)

Data.gov PMO SemWeb and Linked Data Team

Collaboration with tw.rpi.edu – SemWeb inventors

Federal CIOC-AIC Data Architecture Subcommittee Open Government Vocabulary WG

Page 17: Gt health2stat 7-22-2010

How about at HHS?

NLM: UMLS ‘SPARQL endpoint’

http://mor.nlm.nih.gov/sparql

CMS: Data.gov PMO SemWeb Team members

CMS Dashboard vocabulary WIP

CHDI: vocabularies and URI schemes

Page 18: Gt health2stat 7-22-2010

Rich linking scenarios to explore;

What are successful community intervention tactics to combat childhood obesity?

For these geographic health stats, what evidence-based provider payment/performance trends emerge?

What other Gov data is relevant? Is there something in the air or water?

Page 19: Gt health2stat 7-22-2010

Web + Data = Data Web; Data Web + Social = Social Data Web!

Consider metadata creation (by domain SMEs) and instance data curation (by info workers) as objects of social collaboration.

Activity-stream history feeds, viral expert networking (etc.) will all contribute to enhanced data quality.

Think structured data wikis, where tags are suggested and come from SME-designed ontologies (vocabularies) instead of user-generated folksonomies.

Page 20: Gt health2stat 7-22-2010

What can YOU do?

Get involved with the Data.gov PMO and/or the FCIOC-AIC-DAS-OGV-WG!

(Come to DC Semantic Web meetups too...)

Contact me

THANKS!!!