an extensible platform for variome data integration

16
PEDRO LOPES [email protected] ITAB2010 - Corfu, Greece November 2 nd , 2010 VARIOME DATA INTEGRATION AN EXTENSIBLE PLATFORM FOR

Upload: pedro-lopes

Post on 12-Mar-2016

220 views

Category:

Documents


1 download

DESCRIPTION

WAVe talk at ITAB2010 Corfu, Greece

TRANSCRIPT

Page 1: An Extensible Platform for Variome Data Integration

PEDRO LOPES [email protected] - Corfu, Greece

November 2nd, 2010

VARIOME DATA INTEGRATIONAN EXTENSIBLE PLATFORM FOR

Page 2: An Extensible Platform for Variome Data Integration

PEDRO LOPES [email protected] - Corfu, Greece

November 2nd, 2010

Page 3: An Extensible Platform for Variome Data Integration

WHAT IS WAVe?

http://bioinformatics.ua.pt/

Page 4: An Extensible Platform for Variome Data Integration

‣ BACKGROUND

‣ CHALLENGES

‣ SOLUTIONS

‣ STRATEGY

‣ DEMO

‣ HIGHLIGHTS

• Applications & Resources, Features

‣ CONCLUSION

OUTLINE

Page 5: An Extensible Platform for Variome Data Integration

‣ PERSONALIZED MEDICINE

• Custom drug design

• Improved patient specific healthcare

‣ GENOTYPE TO PHENOTYPE

• Understanding changes in our genetic sequence

‣ Causes

‣ Consequences

‣ HUMAN VARIOME

• Genome Wide Association Studies, GWAS

‣ Huge databases, huge statistics

• Locus-specific Databases, LSDBs

‣ Publish genomic variation datasets

BACKGROUND

http://bioinformatics.ua.pt/

Page 6: An Extensible Platform for Variome Data Integration

‣ LSDB

• Independent & heterogeneous systems

‣ LOVD, UMD, MUTbase, legacy...

‣ VARIANT

• Distributed through multiple systems

• Described with distinct formats

‣ RESOURCES

• Link genomic variation datasets with original external resources

Enable agile access to integrated & enriched human variome research datasets?

CHALLENGES

http://bioinformatics.ua.pt/

Page 7: An Extensible Platform for Variome Data Integration

‣ LSDB

• Manually curated LSDB

‣ List from HGVS

‣ VARIANT

• Web crawling engine

• LOVD API

Genes * [LSDBs + Variants + Original Resources]!

SOLUTIONS

‣ RESOURCES

• Include

‣ Original applications/content

‣ Miscellaneous data types

• Sources

‣ GeNS warehouse

‣ UniProt

http://bioinformatics.ua.pt/

Page 8: An Extensible Platform for Variome Data Integration

‣ CORE + EXTENSIONS

Gene Variant

Disease LSDB

...

ProteinPharma Pathway

‣ HIGHLIGHTS

• Dynamic

‣ Easily extensible

‣ Update connections on-the-fly

• Original

‣ Pointers to original resources

• Centralized

‣ One-stop-shop for relevant information

STRATEGY

An extensible lightweight integration & enrichment platform for genomic variation datasets☺

http://bioinformatics.ua.pt/

Page 9: An Extensible Platform for Variome Data Integration

DEMO | http://bioinformatics.ua.pt/WAVe

Page 10: An Extensible Platform for Variome Data Integration

DEMO | http://bioinformatics.ua.pt/WAVe

Page 11: An Extensible Platform for Variome Data Integration

‣ LSDB

• LOVD + MUTbase + UMD + misc legacy

‣ GENE

• GeneCards + GeneNames + Entrez

‣ PUBLICATION

• QuExT

‣ DISEASE

• OMIM

‣ PHARMACOGENOMICS

• PharmGKB

‣ LOCUS

• MapViewer + Ensembl

‣ PATHWAY

• KEGG + Reactome

‣ PROTEIN

• UniProt + PDB + Expasy + InterPro

‣ GENE ONTOLOGY

• AmiGO

HIGHLIGHT | RESOURCES

http://bioinformatics.ua.pt/

Page 12: An Extensible Platform for Variome Data Integration

‣ LSDB

• LOVD + MUTbase + UMD + misc legacy

‣ GENE

• GeneCards + GeneNames + Entrez

‣ PUBLICATION

• QuExT

‣ DISEASE

• OMIM

‣ PHARMACOGENOMICS

• PharmGKB

‣ LOCUS

• MapViewer + Ensembl

‣ PATHWAY

• KEGG + Reactome

‣ PROTEIN

• UniProt + PDB + Expasy + InterPro

‣ GENE ONTOLOGY

• AmiGO

HIGHLIGHT | RESOURCES

~ 1350 Genes, 1550 LSDBs, 80k Variants, 100k Links!

http://bioinformatics.ua.pt/

Page 13: An Extensible Platform for Variome Data Integration

‣ GENE SEARCH

• Direct access to genes

‣ Auto-suggest engine

• Curated genes

‣ GENE ANALYSIS WORKSPACE

• Navigation tree

‣ Holistic perspective on all data

• “Live view” mode

‣ Shows original applications/content

HIGHLIGHT | FEATURES

http://bioinformatics.ua.pt/

Page 14: An Extensible Platform for Variome Data Integration

‣ GENE SEARCH

• Direct access to genes

‣ Auto-suggest engine

• Curated genes

‣ GENE ANALYSIS WORKSPACE

• Navigation tree

‣ Holistic perspective on all data

• “Live view” mode

‣ Shows original applications/content

HIGHLIGHT | FEATURES

‣ API

• RSS/XML access to data

‣ Usable in any framework

• Genes

‣ Access navigation tree data

‣ Google Chrome Extension

• Variants

‣ Only platform that publishes variants from multiple sources

http://bioinformatics.ua.pt/

Page 15: An Extensible Platform for Variome Data Integration

‣ INTEGRATE

• Integrate genomic variation datasets from multiple distributed and heterogeneous sources

‣ ENRICH

• Enrich available data with connections to miscellaneous (yet relevant) resources

• Display original applications/content to maintain authorship and ownership

‣ INNOVATE

• Use “card” metaphor to provide a holistic view over human variome research

‣ ADD VALUE

• Extract and combine true added value from LSDBs

• One step forward for personalized medicine research

CONCLUSION

http://bioinformatics.ua.pt/

Page 16: An Extensible Platform for Variome Data Integration

YOUR FEEDBACK IS HIGHLY APPRECIATED

THANK YOU!

QUESTIONS?