how sadi & share help restore the scientific method to in silico science

73
From BioMoby to SADI The Quest for the Holy Grail!

Upload: mark-wilkinson

Post on 08-May-2015

1.190 views

Category:

Technology


0 download

DESCRIPTION

This is my presentation to the Bio Open Source Convention (BOSC) in Boston, July 2010. I start with a brief status-update on the BioMoby project and then launch into a series of demonstrations of it's successor - SADI + SHARE. Rather than discussing how SADI/SHARE work, I focus the discussion on what role I think these technologies can play in bringing the traditional "scientific method" back into in silico biology.

TRANSCRIPT

Page 1: How SADI & SHARE help restore the Scientific Method to in silico science

From BioMoby to SADI

The Quest for the Holy Grail!

Page 2: How SADI & SHARE help restore the Scientific Method to in silico science

BioMoby Stats in a nutshell• >1800 services worldwide (~1300 “alive” at any given time)• 4 major installations of the Moby Service registry

– Genome Canada, SUN Center of Excellence, Calgary– Genome España, Barcelona Supercomputing Center– International Rice Research Institute, Philippines – Max Planck, Cologne

• Canadian service registry brokers ~400,000 requests/month• Canadian BioMoby services receive ~700,000 uses/month• Canadian server just had a significant memory upgrade to

improve performance

“The report of my death was an exaggeration”-- Mark Twain

Page 3: How SADI & SHARE help restore the Scientific Method to in silico science

Model Organism Bring Your-Own Database Interface Conference

“MOBY-DIC”

Emma Lake, SaskatchewanSept 21, 2001

Page 4: How SADI & SHARE help restore the Scientific Method to in silico science
Page 5: How SADI & SHARE help restore the Scientific Method to in silico science

Are we going after The Holy Grail

here?

Page 6: How SADI & SHARE help restore the Scientific Method to in silico science

The Holy Grail:(this slide created circa 2002)

Align the promoters of all serine threonine kinases involved exclusively in the regulation of cell sorting during wound healing in blood vessels.

Retrieve and align 2000nt 5' from every serine/threonine kinase in Mus musculus expressed exclusively in the tunica [I | M |A] whose expression increases 5X or more within 5 hours of wounding but is not activated during the normal development of blood vessels, and is <40% homologous in the active site to kinases known to be involved in cell-cycle regulation in any other species.

Page 7: How SADI & SHARE help restore the Scientific Method to in silico science

http://sadiframework.org

Founding partner

MicrosoftResearch

Page 8: How SADI & SHARE help restore the Scientific Method to in silico science

Holy Grail Demo #1

Page 9: How SADI & SHARE help restore the Scientific Method to in silico science

Imagine there is a “virtual database” containing all of the data from all of the databases,together with the output of

every conceivable analysis

Page 10: How SADI & SHARE help restore the Scientific Method to in silico science

How do we query that database?

Page 11: How SADI & SHARE help restore the Scientific Method to in silico science

“SHARE”Semantic Health And Research Environment

SADI client application

http://biordf.net/cardioSHARE (Pellet)

http://dev.biordf.net/cardioSHARE (Pellet 2)

Page 12: How SADI & SHARE help restore the Scientific Method to in silico science

What pathways does UniProt protein P47989 belong to?

PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#>PREFIX ont: <http://ontology.dumontierlab.com/>PREFIX uniprot: <http://lsrn.org/UniProt:>SELECT ?gene ?pathway WHERE {

uniprot:P47989 pred:isEncodedBy ?gene . ?gene ont:isParticipantIn ?pathway .

}

Page 13: How SADI & SHARE help restore the Scientific Method to in silico science
Page 14: How SADI & SHARE help restore the Scientific Method to in silico science
Page 15: How SADI & SHARE help restore the Scientific Method to in silico science
Page 16: How SADI & SHARE help restore the Scientific Method to in silico science

Recapwhat we just saw

A standard SPARQL query was entered into SHARE, a SADI-aware query engine

Page 17: How SADI & SHARE help restore the Scientific Method to in silico science

Recapwhat we just saw

The query was interpreted to extract the “triple” patterns

subject, predicate, object

being requested

Page 18: How SADI & SHARE help restore the Scientific Method to in silico science

Recapwhat we just saw

Triple-patterns are passed to SADI for Web Service discovery

Page 19: How SADI & SHARE help restore the Scientific Method to in silico science

Recapwhat we just saw

Services capable of generating those triple-patterns are automatically executed,

the triples are stored, and the query is resolved.

Page 20: How SADI & SHARE help restore the Scientific Method to in silico science

Recapwhat we just saw

We posed, and answered a ~complex database query

WITHOUT A DATABASE

(in fact, the data didn’t even have to exist...)

Page 21: How SADI & SHARE help restore the Scientific Method to in silico science

Recapwhat we just saw

Note that there is no centralized ontology

Unlike BioMoby, SADI supports all (OWL) ontologies and

does not invent any of its own

Page 22: How SADI & SHARE help restore the Scientific Method to in silico science

Holy Grail Demo #1

Align the promoters of all serine threonine kinases involved exclusively in the regulation of cell sorting during wound healing in blood vessels.

Retrieve and align 2000nt 5' from every serine/threonine kinase in Mus musculus expressed exclusively in the tunica [I | M |A] whose expression increases 5X or more within 5 hours of wounding but is not activated during the normal development of blood vessels, and is <40% homologous in the active site to kinases known to be involved in cell-cycle regulation in any other species.

Page 23: How SADI & SHARE help restore the Scientific Method to in silico science

Holy Grail Demo #2

Page 24: How SADI & SHARE help restore the Scientific Method to in silico science

Show me the latest Blood Urea Nitrogen and Creatinine levelsof patients who appear to be rejecting their transplants

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX patient: <http://sadiframework.org/ontologies/patients.owl#> PREFIX l: <http://sadiframework.org/ontologies/predicates.owl#> SELECT ?patient ?bun ?creatFROM <http://sadiframework.org/ontologies/patients.rdf>WHERE {

?patient rdf:type patient:LikelyRejecter .?patient l:latestBUN ?bun . ?patient l:latestCreatinine ?creat .

}

Page 25: How SADI & SHARE help restore the Scientific Method to in silico science

Start burrowing through the LikelyRejector OWL class find that we need a regression model OWL class

Page 26: How SADI & SHARE help restore the Scientific Method to in silico science

Regression models have features like slopes and intercepts, and so on.The class is completely decomposed until a set of required Services are discoveredcapable of creating all these necessary properties

Page 27: How SADI & SHARE help restore the Scientific Method to in silico science

Decomposition of the OWL class uncovers the need for a Linear Regression analysis on the patient blood chemistry data

Page 28: How SADI & SHARE help restore the Scientific Method to in silico science

VOILA!

Page 29: How SADI & SHARE help restore the Scientific Method to in silico science

We just dynamically evaluated if individuals matching a particular high-level concept definition exist

…or can exist

Page 30: How SADI & SHARE help restore the Scientific Method to in silico science

Holy Grail Demo #2

Align the promoters of all serine threonine kinases involved exclusively in the regulation of cell sorting during wound healing in blood vessels.

Retrieve and align 2000nt 5' from every serine/threonine kinase in Mus musculus expressed exclusively in the tunica [I | M |A] whose expression increases 5X or more within 5 hours of wounding but is not activated during the normal development of blood vessels, and is <40% homologous in the active site to kinases known to be involved in cell-cycle regulation in any other species.

Page 31: How SADI & SHARE help restore the Scientific Method to in silico science

How does

SADI + SHARE

do that?

Page 32: How SADI & SHARE help restore the Scientific Method to in silico science

Please see other presentations uploaded to SlideShare for a full explanation

of SADI Functionality

See also the Taverna and Protégé plug-insfor discovering, running and creating services

Taverna

Page 33: How SADI & SHARE help restore the Scientific Method to in silico science

The Holy Grail may not yet be in-handbut I think we can at least see it from here!

So… now what?

Page 34: How SADI & SHARE help restore the Scientific Method to in silico science

Mark’s Manifesto

What is my next “Holy Grail”?

Page 35: How SADI & SHARE help restore the Scientific Method to in silico science

Science

Support for the in silico Scientific Method

Page 36: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Clarity (hypothesis)

Discourse

Disagreement

Clarity (experiment)

Page 37: How SADI & SHARE help restore the Scientific Method to in silico science

The Scientific Method

Discourse: What do you believe? What do I believe?

Disagreement: You’re wrong! And I’m gonna prove it!

Clarity: This is the experiment I am going to do

Reproducibility: This is how I did it (“provenance”)

Clarity: This is my new hypothesis

Page 38: How SADI & SHARE help restore the Scientific Method to in silico science

The Scientific Method

Discourse: What do you believe? What do I believe?

Disagreement: You’re wrong! And I’m gonna prove it!

Clarity: This is the experiment I am going to do

Reproducibility: This is how I did it (“provenance”)

Clarity: This is my new hypothesis

Workflows (e.g. myExperiment)

Page 39: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Clarity (hypothesis)

Discourse

Disagreement

Clarity (experiment)

Page 40: How SADI & SHARE help restore the Scientific Method to in silico science

In opposition to the lessons we learnt from Web 2.0

The Semantic Web in Healthcare and Life Sciences

is solving the problems of science…

…by forming institutions

Page 41: How SADI & SHARE help restore the Scientific Method to in silico science

Result:

Large, centrally-designed and centrally-curated ontologies

that enforce “community agreement” about “biological reality”

Page 42: How SADI & SHARE help restore the Scientific Method to in silico science

Science ≠ Consensus

Page 43: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Clarity (hypothesis)

Discourse

Disagreement

Clarity (experiment)

Page 44: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Clarity (hypothesis)

Institutions & Consortia

Disagreement

Clarity (experiment)

Page 45: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Clarity (hypothesis)

Institutions & Consortia

Consensus

Clarity (experiment)

Page 46: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

????

Institutions & Consortia

Consensus

Clarity (experiment)

Page 47: How SADI & SHARE help restore the Scientific Method to in silico science

To bring the “traditions of Science”

to in silico Science

Web 3.0 must encourage and facilitate

personal opinion and debate

Page 48: How SADI & SHARE help restore the Scientific Method to in silico science

What has this got to do with SADI and SHARE?

Page 49: How SADI & SHARE help restore the Scientific Method to in silico science

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX patient: <http://sadiframework.org/ontologies/patients.owl#> PREFIX l: <http://sadiframework.org/ontologies/predicates.owl#> SELECT ?patient ?bun ?creatFROM <http://sadiframework.org/ontologies/patients.rdf>WHERE {

?patient rdf:type patient:LikelyRejecter .?patient l:latestBUN ?bun . ?patient l:latestCreatinine ?creat .

}

Page 50: How SADI & SHARE help restore the Scientific Method to in silico science

Likely Rejecter

Page 51: How SADI & SHARE help restore the Scientific Method to in silico science

I created a small ontologydescribing my definition of

a Likely Rejecter

Page 52: How SADI & SHARE help restore the Scientific Method to in silico science

… it was MY ontology!

Page 53: How SADI & SHARE help restore the Scientific Method to in silico science

I can re-use it

Page 54: How SADI & SHARE help restore the Scientific Method to in silico science

I can modify it as I change my world-view

Page 55: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Clarity (hypothesis)

Discourse

Disagreement

Clarity (experiment)

I can publish it for others to use

Page 56: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Clarity (hypothesis)

Discourse

Disagreement

Clarity (experiment)Others can modify it and/or

compare it to THEIR world-view

Page 57: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Clarity (hypothesis)

Discourse

Disagreement

Clarity (experiment)

Sharing my ontology also gives opportunities for micro-attribution;

“Citation” of me is transparent and automatic when someone extends my ontology

Page 58: How SADI & SHARE help restore the Scientific Method to in silico science

Using SADI and SHAREmy personal world-view is

explicitly expressed and can bedynamically evaluated against

global data and knowledge

Page 59: How SADI & SHARE help restore the Scientific Method to in silico science

Ontology development is distributed and personal rather than centralized

no institutions

Page 60: How SADI & SHARE help restore the Scientific Method to in silico science
Page 61: How SADI & SHARE help restore the Scientific Method to in silico science

…but there’s more…

Page 62: How SADI & SHARE help restore the Scientific Method to in silico science

“Likely Rejecter”

Page 63: How SADI & SHARE help restore the Scientific Method to in silico science

I made that up! It came out of my head!

Page 64: How SADI & SHARE help restore the Scientific Method to in silico science

What’s another word for a world-view that you make-up?

Hypothesis

Page 65: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Hypotheses

Discourse

Disagreement

Clarity (experiment)The “Likely Rejecter” OWL Classis an explicitly-expressed hypothesis;

Members of that class may or may not exist!

Page 66: How SADI & SHARE help restore the Scientific Method to in silico science

Reproducibility

Hypotheses

Discourse

Disagreement

Experiment

Page 67: How SADI & SHARE help restore the Scientific Method to in silico science
Page 68: How SADI & SHARE help restore the Scientific Method to in silico science

Ontologically-expressed Hypotheses drive the discovery, assembly, and analysis of data capable of evaluating their validity

Blood Pressure

Hypertension

Ischemia

Hypothesis

Database 1 Database 2

SADI+

SHARE

Analytical Algorithm

Page 69: How SADI & SHARE help restore the Scientific Method to in silico science

Join us!

SADI and CardioSHARE are Open-Source projects

Come join us – we’re having a lot of fun!!

http://sadiframework.org

#SADIFrameworkSADI SemanticWeb Services Page

Page 70: How SADI & SHARE help restore the Scientific Method to in silico science

C-BRASS: Canadian Bioinformatics Resources As Semantic Services

together with Michel Dumontier, Chris Baker

~$1M funding to help us deploy SADI services and provide training for new service providers

We can help you get started!

“C-BRASS” is on Facebook! Like

Page 71: How SADI & SHARE help restore the Scientific Method to in silico science

Credits

Benjamin VanderValk (SADI & SHARE)

Luke McCarthy (SADI & SHARE)

Soroush Samadian (CardioSHARE)

Page 72: How SADI & SHARE help restore the Scientific Method to in silico science

Microsoft Research

Page 73: How SADI & SHARE help restore the Scientific Method to in silico science

Fin

This presentation available on SlideShare: keywords ‘wilkinson’ ‘bosc’