mit csail linked data ventures class: linked open data for entrepreneurs 2013

53
Dr. David Wood [email protected] @prototypo 12 March 2013 Linked Data: Opportunities for Entrepreneurs

Upload: 3-round-stones

Post on 07-May-2015

2.562 views

Category:

Education


1 download

DESCRIPTION

A presentation to MIT CSAIL's Linked Data Ventures class 20130312.

TRANSCRIPT

Page 1: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Dr. David [email protected]

@prototypo12 March 2013

Linked Data: Opportunities for Entrepreneurs

Page 2: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

David WoodB.S. Mechanical Engineering

B.S. Electrical Engineering (equivalency)M.S. Astronautical EngineeringAeronautical & Astronautical Engineer

Ph.D. Software Engineering

Page 3: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

David Wood

ongoing

ongoing

company founded products disposition

2002

2005

@𝛑Plugged In Software

Page 4: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

David Wood

RDF Database

RDF Database Management

RDF Usage ongoing

Linked Data Management

ongoing

company founded products disposition

2002

2005

@𝛑Plugged In Software

Page 5: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 6: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 7: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 8: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

“more anterior sectors of the prefrontal cortex are distinctively recruited when altruistic choices prevail over selfish material interests”

- Jorge Moll et al

Page 9: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

“For it is in giving that we receive.”

- Saint Francis of Assisi

Page 10: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 11: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 12: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Consistently late to rapidly changing markets (music, electronics, cafés, e-books)

Page 13: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Pop Quiz

Page 14: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Pop Quiz

Page 15: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Innovators Dilemma

Page 16: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Innovators Dilemma

Page 17: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

May 2001

Page 18: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

08 Oct 2007 07 Nov 2007 10 Nov 2007 28 Feb 2008 31 Mar 2008

18 Sep 2008 05 Mar 2009 27 Mar 2009 14 Jul 2009 22 Sep 2010

Page 19: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Sep 2011

Page 20: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 21: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 22: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

We’ve Seen This Before

Page 23: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 24: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 25: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

YouTube HDTV

watch videos watch Better videos

Publish videos

Share videos

Rate videos

Discuss videos

Page 26: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Linked Data RDBMS

Use data Use data

Publish data

Share data

Rate data

Discuss data

Page 27: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 28: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 29: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 31: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

CONTENTMANAGEMENT

SYSTEM

LINKED DATAMANAGEMENT

SYSTEM

Callimachus

UNSTRUCTURED

TEXT

TEXT

STRUCTURED

DATA

DATA

Page 32: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

32

Page 33: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 34: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 35: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 36: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Publishing

Page 37: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Credit: Bradley P. Allen, Elsevier Labs

Page 38: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Credit: Bradley P. Allen, Elsevier Labs

XHTML 5

DocBook 5

ePub 3

LaTex✔

Page 39: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Open Government

Page 40: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

US EPA• Cloud-based Linked Data provision of 3 core programs:

• 2.9M Facilities• 100K substances• 25 years of toxic pollution reports• FISMA compliant• 16 Callimachus templates• Official launch Feb 2013

Page 41: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 42: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 43: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 44: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

From WikipediaFrom EPA

Open Street Map

Page 45: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Life Sciences

Page 46: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

HTTP-accessible endpoints capable of returning XML or textual content

Convert XML or textual results to RDF

Render RDF to HTML via templateUser resolves asingle URI to anActive PURL

Multiple targets queriedindependently

1

David Wood1 and Tom [email protected], [email protected]

Active PURLs for Clinical Study Aggregation

The problem: No coordinated view of clinical study information. Information is distributed across departments, subsidiaries and government data sources.

The solution: Gather, convert, aggregate and format for display

Challenges

Next steps

How semantic technologies help

3 Round Stones and AstraZeneca created a system to allow coordinated views of distributed clinical trial information. The system extended the CallimachusProject, an Open Source management system for Linked Data. Persistent URLs, or PURLs, were used to provide globally unique and resolvable identifiers for each clinical study. The PURL concept was extended to enablePURLs to have multiple targets and for the results of each target to undergo arbitrary transformation. PURLs which have such capabilities are called Active PURLs. Information sources relevant to clinical studies were identified, regardless of whether their location was internal or external to the pharmaceutical company'snetwork. Active PURLs were used to resolve data sources having HTTP endpoints capable of returning XML or textual results. Each information source isdynamically transformed into Resource Description Framework (RDF) formats and all sources' results then merged into a single, temporary graph of RDF data.Information is rendered to end users as coordinated HTML descriptions regarding each clinical trial using the Callimachus template engine. Machine-readableversions of the data are also available.

Linked Data techniques can help to address both the availability of clinical trial information and provide a means to build effective information systems using it.Linked Data techniques allow for "cooperation without coordination". Publishers of data provide context for use by third parties in other portions of a distributedenterprise. Users of Linked Data can combine information from multiple sources. Subsequent publication can create a virtuous circle of positive feedback, allowingresearchers, informaticists and support staff to collaboratively and distributively build a reusable knowledge base.

Distributed queries have many knownlimitations, such as the introduction ofmultiple single points of failure in anygiven PURL resolution. HTTP timeouts,auth/auth errors or other network failurescan slow or stop a pipeline from returningcorrectly. Similarly, distributed queries can resultin variant query-time performance due tocomplex network and endpoint perform-ance variances. Proactive caching and cache manage-meant strategies can improve runtimeperformance and protect end users fromthe limitations inherent in a distributedquery architecture. Caching ofintermediate results from endpoints hasnot yet been implemented.

We intend to continue to addressReferences

1. Callimachus Project,

User experience

Users resolve a URL thatprovides a unique identifier fora clinical study, drug, chemicalor other concept managed bythis system. The user maybe presented with the URL onHTML pages, search it via full-text techniques or discover itvia semantic search.

1

2 Users are presented with adynamically generated Webpage representing aggregatedclinical study information. Usersare isolated from the complexand distributed informationenvironment.

Page 47: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

• Linked Data warehouses 10B USD annually.

• Linked Data supply chains205M USD annually (Web analytics)6B USD annually (enterprise)

• Linked Data analytics16B USD annually

Your Opportunity?

Page 48: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
Page 49: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

http://www.manning.com/dwood/

Page 50: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

CreditsBatman Treaty Signing

(public domain)http://upload.wikimedia.org/wikipedia/commons/d/dc/Batman_signs_treaty_artist_impression.jpg)

Centro Universitario de Ciencias Exactas e Ingenierías, Universidad de Guadalajara

(public domain)

http://proton.ucting.udg.mx/galeria/3D/WEB.jpg

Spreadsheet PhotoCasey Serin

(CC-BY licensed)http://www.flickr.com/photos/sercasey/351617208/sizes/l/in/photostream/

LOD Cloud DiagramsRichard Cyganiak, Anja Jentzsch, (CC-BY-SA)

http://lod-cloud.net/

Earth weather analysis imageNASA Goddard SFC

CC-BYhttp://www.flickr.com/photos/gsfc/4662884851/

Publisher emerging content architecture

Copyright (c) 2011 Elsevier, used with permission.

Corporate logos, Darkon Movie Poster, BBC screenshots, CAMC credit card image and book covers © their respective owners and used under Fair Use for educational purposes

Corporate logos, Darkon Movie Poster, BBC screenshots, CAMC credit card image and book covers © their respective owners and used under Fair Use for educational purposes

Page 51: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

CreditsMundaneum images Copyright © Collection Mundaneum - Mons, Courtesy of the Mundaneum Archives Centre.

Chasm PhotoTravis S.

(CC-BY-NC licensed)http://www.flickr.com/photos/baggis/3860802929/

Supply Chain ImageKevin Krejci

(CC-BY licensed)http://www.flickr.com/photos/kevinkrejci/6141829763/

Sharing Squirrels Imageleezie5

CC-BY-NC-ND licensed)http://www.flickr.com/photos/leeziet/5912219625/

Envirofacts screenshot A US Government Work of the US EPA. Used with permission.

Linked Data book cover Copyright (c) 2012-13 Manning Publications Inc. Used with permission.

All other photos and drawings © 2010-13 3 Round Stones Inc or David Wood, released under a CC-BY-SA licenseAll other photos and drawings © 2010-13 3 Round Stones Inc or David Wood, released under a CC-BY-SA license

Page 52: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

This work is Copyright © 2011 3 Round Stones Inc.It is licensed under the Creative Commons Attribution 3.0 Unported LicenseFull details at: http://creativecommons.org/licenses/by/3.0/

You are free:

to Share — to copy, distribute and transmit the work

to Remix — to adapt the work

Under the following conditions:Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).

Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one.

Page 53: MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013

Dr. David [email protected]

@prototypo12 March 2013

Linked Data: Opportunities for Entrepreneurs

http://purl.org/net/prototypo/lod-entrepreneur