exframe: a semantic web platform for genomic experiments

10
A Semantic Web Platform for Genomics Experiments Emily Merrill 1 , Stéphane Corlosquet 1 , Paolo Ciccarese 1,2 , Tim Clark 1,2,3 & Sudeshna Das 1,2 1 Massachusetts General Hospital 2 Harvard Medical School 3 University of Manchester †Authors contributed equally

Upload: twclark

Post on 29-Aug-2014

175 views

Category:

Technology


1 download

DESCRIPTION

 

TRANSCRIPT

Page 1: eXframe: A Semantic Web Platform for Genomic Experiments

A Semantic Web Platform for Genomics

ExperimentsEmily Merrill1, Stéphane Corlosquet1, Paolo Ciccarese1,2,

Tim Clark1,2,3 & Sudeshna Das1,2

1Massachusetts General Hospital 2Harvard Medical School

3University of Manchester

†Authors contributed equally

Page 2: eXframe: A Semantic Web Platform for Genomic Experiments

What is eXframe?• reusable framework for creating online data

repositories• upgraded version based on Drupal 7 (drupal.org)• structured annotation of experiments, bio-

materials and assays• publishes Semantic Web data automatically (RDF

& SPARQL endpoint)• first instance of upgraded version - Stem Cell

Commons (http://stemcellcommons.org)

Page 3: eXframe: A Semantic Web Platform for Genomic Experiments
Page 4: eXframe: A Semantic Web Platform for Genomic Experiments

Architecture

• re-factored second generation of eXframe

• updated experimental model mapped to ontologies

• Drupal RDF modules used to generate RDF

• RDF store (SPARQL endpoint) powered by ARC2 PHP library

Page 5: eXframe: A Semantic Web Platform for Genomic Experiments

Data Model• experiment

(obi:investigation); meta-data mapped to Dublin Core (dc)

• researchers & citations mapped to foaf & bibo respectively

• experiments are comprised of bioassays (obo:assay)

• bioassays have replicates (efo:replicate)

• replicates are associated with biomaterials (obo:specimen)

Page 6: eXframe: A Semantic Web Platform for Genomic Experiments

Biomaterials• biomaterials are deeply annotated, key examples:- organism (NCBI Taxonomy)- tissue (FMA: Foundation Model of Anatomy)- cell type (CL: Cell Type ontology)- disease state (DO: Disease ontology)- treatment compound (CheBI: Chemical Entities of

Biological Interest)• )

Page 7: eXframe: A Semantic Web Platform for Genomic Experiments

Sample RDF from Stem Cell Commons

Page 8: eXframe: A Semantic Web Platform for Genomic Experiments

SPARQL queries• flexible query system with

SPARQL

• allows graph queries

• integration with other endpoints

• sample query on right: “find experiments done on mouse, hematopoietic stem cells”.

Page 9: eXframe: A Semantic Web Platform for Genomic Experiments

Security• Stem Cell Commons: selected experiments

are accessible only to researchers from Harvard Stem Cell Institute

• created two stores: - public with limited data- private with all data

Page 10: eXframe: A Semantic Web Platform for Genomic Experiments

Conclusion• groups & institutions can create databases

simply by configuring eXframe• structured repository serves as institutional

memory and facilitates publication• automatic RDF generation & SPARQL

endpoint lowers barrier to Semantic Web adoption