bioit 2009 biocatalogue slides by carole goble

http://www.biocatalogue.org http://beta.biocatalogue.org Professor Carole Goble University of Manchester, UK Director myGrid Consortium BioIT Alliance Lunch, 28 April 2009, Boston MA


Post on 18-Jun-2015


DESCRIPTION

Slides presented by Carole Goble at the BioIT Alliance Lunch 2009 in Boston

TRANSCRIPT

Page 1: BioIT 2009 BioCatalogue slides by Carole Goble

http://www.biocatalogue.org

http://beta.biocatalogue.org

Professor Carole Goble

University of Manchester, UK

Director myGrid Consortium

BioIT Alliance Lunch, 28 April 2009, Boston MA

Page 2: BioIT 2009 BioCatalogue slides by Carole Goble

Bottom Line

• Public, Curated Catalogue of Life Science Web Services
• Register, Find, Curate Web Services
• Community-sourced annotation, expert oversight
• Open content

• Open platform with open REST interfaces
• Web 2.0 site and development
• Open source code base

• Started June 2008. In first beta phase.
• Launch June 2009 at ISMB
• beta.biocatalogue.org
• MoU with BioIT Alliance under discussion

Page 3: BioIT 2009 BioCatalogue slides by Carole Goble

Why?

Guesstimate: 3000+ Web Services in Life Science publicly available

Where… can I find them? advertise?

What… do they do? Can I use them?

How… do they work? operational profile? up to date?

Who… provides them? recommends them?

Page 4: BioIT 2009 BioCatalogue slides by Carole Goble

Why?

Scientific Workflow Management System

Open Source

Open Services

Open Disciplines

Data pipelines of web services in the wild

3500+ service operations

http://www.taverna.org.uk

Page 5: BioIT 2009 BioCatalogue slides by Carole Goble

Why?

Socially share, discover and reuse workflows

Poster #30

Crowd sourced content

Social curation of scientific assets

http://myexperiment.org

Page 6: BioIT 2009 BioCatalogue slides by Carole Goble
Page 7: BioIT 2009 BioCatalogue slides by Carole Goble

http://beta.biocatalogue.org

Page 8: BioIT 2009 BioCatalogue slides by Carole Goble

Content

• Community contributed
  – Service providers
  – Third Parties

• Automated crawling

• Sourced from partners and registries

• Chiefly public services

Page 10: BioIT 2009 BioCatalogue slides by Carole Goble

Content

• Community contributed
  – Service providers
  – Third Parties

• Automated crawling

• Sourced from partners and registries

• Chiefly public services

SOAP, REST, SoapLab, BioMOBY, DAS

Beta today:
• 465 Services (4 REST)
• 2691 SOAP operations
• 51 Providers
• Perpetual take-on

Page 11: BioIT 2009 BioCatalogue slides by Carole Goble

Curation Model

[Diagram. Labels: Service Model; Interfaces; Versioning; Functional Capabilities; Operational Capabilities; Operational Metrics; Usage Policy; Provenance; Attribution; Community Standing; Ratings; Usage Statistics; Searching Statistics; Quantitative Content; Semantic Content; Ontologies; Controlled vocabs; Tags; Free text; Usable and Useful; Understandable]

Page 12: BioIT 2009 BioCatalogue slides by Carole Goble

Curation

• Just enough, just in time
• Universal annotation scheme
• Mixed: Free text, Tags, controlled vocabs, community ontologies

• Community sourced tags, comments, recommendations

• Expert curation: ontology-based annotation (myGrid OWL Ontology)

• Automated WSDL ripping and analytics

• Automated monitoring & testing
• Partner feeds (e.g. myExperiment)

• Update feeds to users

twitter

blog

comments

ontologies

tags

recommendations

syndicated feeds
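
As a rough illustration of the "Automated WSDL ripping" bullet above, the sketch below pulls operation names and their documentation out of a WSDL 1.1 document using only the Python standard library. The sample WSDL, the service and operation names, and the `rip_operations` helper are all hypothetical; the slides do not describe how BioCatalogue actually implements this step.

```python
import xml.etree.ElementTree as ET

# Namespace of WSDL 1.1 elements.
WSDL_NS = "http://schemas.xmlsoap.org/wsdl/"

# Hypothetical, heavily trimmed WSDL used only for illustration.
SAMPLE_WSDL = """
<definitions name="ExampleService"
             xmlns="http://schemas.xmlsoap.org/wsdl/">
  <portType name="ExamplePortType">
    <operation name="runSearch">
      <documentation>Run a sequence search.</documentation>
    </operation>
    <operation name="getStatus"/>
  </portType>
</definitions>
"""


def rip_operations(wsdl_text):
    """Return (port type, operation, documentation) tuples from WSDL 1.1 text."""
    root = ET.fromstring(wsdl_text.strip())
    found = []
    for port_type in root.findall(f"{{{WSDL_NS}}}portType"):
        for op in port_type.findall(f"{{{WSDL_NS}}}operation"):
            # Operations without a <documentation> child yield an empty string.
            doc = op.findtext(f"{{{WSDL_NS}}}documentation", default="")
            found.append((port_type.get("name"), op.get("name"), doc.strip()))
    return found


if __name__ == "__main__":
    for port_type, operation, doc in rip_operations(SAMPLE_WSDL):
        print(port_type, operation, doc or "(undocumented)")
```

Operations that come back with empty documentation are the kind of gap that community or expert annotation would then need to fill.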

Page 13: BioIT 2009 BioCatalogue slides by Carole Goble

Curation

• Just enough, just in time
• Universal annotation scheme
• Mixed: Free text, Tags, controlled vocabs, community ontologies

• Community sourced tags, comments, recommendations

• Expert curation: ontology-based annotation (myGrid OWL Ontology)

• Automated WSDL ripping and analytics

• Automated monitoring & testing
• Partner feeds (e.g. myExperiment)

• Update feeds to users

Today: 14902 annotations (provider, user, registries)
KEGG: 1433 annotations
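
One way to picture a "universal annotation scheme" that mixes free text, tags and ontology terms from several sources is a single record type with an attribute, a value and a source. The sketch below is an assumption made for illustration only: the `Annotation` fields, the attribute names and the example records are hypothetical and are not taken from BioCatalogue's data model.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    service: str    # catalogue entry being annotated (hypothetical name)
    attribute: str  # kind of annotation, e.g. "tag" or "description"
    value: str      # free text, tag, or controlled-vocabulary term
    source: str     # who supplied it: "provider", "user" or "registry"


def count_by_source(annotations):
    """Count annotations per source, as in a 'provider, user, registries' tally."""
    return Counter(a.source for a in annotations)


# Made-up example records, not real catalogue content.
EXAMPLES = [
    Annotation("ExampleService", "description", "Runs a sequence search", "provider"),
    Annotation("ExampleService", "tag", "alignment", "user"),
    Annotation("ExampleService", "tag", "sequence", "user"),
    Annotation("ExampleService", "category", "Sequence Analysis", "registry"),
]

if __name__ == "__main__":
    for source, total in sorted(count_by_source(EXAMPLES).items()):
        print(source, total)
```

Keeping every kind of annotation in one shape means free text, tags and ontology terms can be stored, searched and attributed in the same way.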

Page 14: BioIT 2009 BioCatalogue slides by Carole Goble

Open Platform

• Export & import standards
  – WSDL, SAWSDL, SA-REST, WSMO …
  – RDF and SPARQL

• Web 2.0
  – Open REST interface
  – Plugin & Mash up

• Open to Google
• URLs for Bookmarking
• Development model
  – Perpetual beta
  – User driven
  – Biocatalogue Friends Google Gadgets
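
To make the "Open REST interface" and "URLs for Bookmarking" bullets concrete, here is a minimal sketch of a client building a bookmarkable search URL. The `/search.xml` path and the `q` and `page` parameters are assumptions chosen for illustration; the slides do not specify the actual endpoints, so check the catalogue's own API documentation before relying on them.

```python
from urllib.parse import urlencode, urlunsplit


def build_search_url(query, page=1, host="beta.biocatalogue.org"):
    """Build a search URL for the catalogue.

    The "/search.xml" path and the "q"/"page" parameter names are
    hypothetical; only the host name comes from the slides.
    """
    params = urlencode({"q": query, "page": page})
    return urlunsplit(("http", host, "/search.xml", params, ""))


if __name__ == "__main__":
    print(build_search_url("multiple alignment", page=2))
```

Because the whole query lives in the URL, the same string can be bookmarked, shared, or fetched by a plugin or mash-up.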

Page 15: BioIT 2009 BioCatalogue slides by Carole Goble

Governance

• Submission

• Content

• Service update

• Metadata update

• Withdrawal

• Take-down

• Preservation

Page 16: BioIT 2009 BioCatalogue slides by Carole Goble

When?

• Now:
  – Silent Beta beta.biocatalogue.org
  – Content take-on
  – Performance and reliability testing
  – Service testing framework commissioning
  – Friends and family review

• Launch:
  – At ISMB, June 2009

• Roadmap
  – www.biocatalogue.org wiki

Page 17: BioIT 2009 BioCatalogue slides by Carole Goble

Who?

• Three years' guaranteed funding (June 2008 to May 2011)

• Sustainability guarantee by EMBL-EBI

Page 18: BioIT 2009 BioCatalogue slides by Carole Goble

Why BioIT Alliance?

• Mutual benefits

• Content

• Penetration

• Sustainability route

Page 19: BioIT 2009 BioCatalogue slides by Carole Goble

Credits

Thomas Laurent

Hamish McWilliams

Franck Tanoh

Jiten Bhagat

Carole Goble

Rodrigo Lopez

Eric Nzuobontane

Steve Pettifer

Katy Wolstencroft

Robert Stevens

David De Roure