data integration - integration of functional associations using string

107
Data integration Integration of functional associations using STRING Lars Juhl Jensen

Upload: lars-juhl-jensen

Post on 10-May-2015

670 views

Category:

Documents


0 download

DESCRIPTION

EMBO World Practical Course on Computational Biology, Shanghai Jiao Tong University, Shanghai, China, August 22, 2009.

TRANSCRIPT

Page 1: Data integration - Integration of functional associations using STRING

Data integrationIntegration of functional associations using STRING

Lars Juhl Jensen

Page 2: Data integration - Integration of functional associations using STRING

Jensen, Kuhn et al., Nucleic Acids Research, 2009

Page 3: Data integration - Integration of functional associations using STRING

functional associations

Page 4: Data integration - Integration of functional associations using STRING

confidence scores

Page 5: Data integration - Integration of functional associations using STRING

cross-species integration

Page 6: Data integration - Integration of functional associations using STRING

630 genomes

Page 7: Data integration - Integration of functional associations using STRING

model organism databases

Page 8: Data integration - Integration of functional associations using STRING

Ensembl

Page 9: Data integration - Integration of functional associations using STRING

RefSeq

Page 10: Data integration - Integration of functional associations using STRING

defining orthology

Page 11: Data integration - Integration of functional associations using STRING

two modes

Page 12: Data integration - Integration of functional associations using STRING

protein mode

Page 13: Data integration - Integration of functional associations using STRING

von Mering et al., Nucleic Acids Research, 2005

Page 14: Data integration - Integration of functional associations using STRING

COG mode

Page 15: Data integration - Integration of functional associations using STRING

von Mering et al., Nucleic Acids Research, 2005

Page 16: Data integration - Integration of functional associations using STRING

genomic context

Page 17: Data integration - Integration of functional associations using STRING

gene fusion

Page 18: Data integration - Integration of functional associations using STRING

Korbel et al., Nature Biotechnology, 2004

Page 19: Data integration - Integration of functional associations using STRING

conserved neighborhood

Page 20: Data integration - Integration of functional associations using STRING

operons

Page 21: Data integration - Integration of functional associations using STRING

Korbel et al., Nature Biotechnology, 2004

Page 22: Data integration - Integration of functional associations using STRING

bidirectional promoters

Page 23: Data integration - Integration of functional associations using STRING

Korbel et al., Nature Biotechnology, 2004

Page 24: Data integration - Integration of functional associations using STRING

phylogenetic profiles

Page 25: Data integration - Integration of functional associations using STRING

Korbel et al., Nature Biotechnology, 2004

Page 26: Data integration - Integration of functional associations using STRING

examples

Page 27: Data integration - Integration of functional associations using STRING

bacterial Cox assembly

Page 28: Data integration - Integration of functional associations using STRING
Page 29: Data integration - Integration of functional associations using STRING

Banci et al., PNAS, 2005

Page 30: Data integration - Integration of functional associations using STRING

Banci et al., PNAS, 2005

Page 31: Data integration - Integration of functional associations using STRING

cellulose degradation

Page 32: Data integration - Integration of functional associations using STRING
Page 33: Data integration - Integration of functional associations using STRING
Page 34: Data integration - Integration of functional associations using STRING
Page 35: Data integration - Integration of functional associations using STRING

Cell

Cellulosomes

Cellulose

Page 36: Data integration - Integration of functional associations using STRING

experimental data

Page 37: Data integration - Integration of functional associations using STRING

protein interactions

Page 38: Data integration - Integration of functional associations using STRING

yeast two-hybrid

Page 39: Data integration - Integration of functional associations using STRING

affinity purification

Page 40: Data integration - Integration of functional associations using STRING

fragment complementation

Page 41: Data integration - Integration of functional associations using STRING

Jensen & Bork, Science, 2008

Page 42: Data integration - Integration of functional associations using STRING

genetic interactions

Page 43: Data integration - Integration of functional associations using STRING

Beyer et al., Nature Reviews Genetics, 2007

Page 44: Data integration - Integration of functional associations using STRING

BINDBiomolecular Interaction Network Database

Page 45: Data integration - Integration of functional associations using STRING

BioGRIDGeneral Repository for Interaction Datasets

Page 46: Data integration - Integration of functional associations using STRING

DIPDatabase of Interacting Proteins

Page 47: Data integration - Integration of functional associations using STRING

IntAct

Page 48: Data integration - Integration of functional associations using STRING

MINTMolecular Interactions Database

Page 49: Data integration - Integration of functional associations using STRING

HPRDHuman Protein Reference Database

Page 50: Data integration - Integration of functional associations using STRING

PDBProtein Data Bank

Page 51: Data integration - Integration of functional associations using STRING

inferred associations

Page 52: Data integration - Integration of functional associations using STRING

gene coexpression

Page 53: Data integration - Integration of functional associations using STRING
Page 54: Data integration - Integration of functional associations using STRING

GEOGene Expression Omnibus

Page 55: Data integration - Integration of functional associations using STRING

expression compendia

Page 56: Data integration - Integration of functional associations using STRING

curated knowledge

Page 57: Data integration - Integration of functional associations using STRING

complexes

Page 58: Data integration - Integration of functional associations using STRING

MIPSMunich Information center

for Protein Sequences

Page 59: Data integration - Integration of functional associations using STRING

Gene Ontology

Page 60: Data integration - Integration of functional associations using STRING

pathways

Page 61: Data integration - Integration of functional associations using STRING

Letunic & Bork, Trends in Biochemical Sciences, 2008

Page 62: Data integration - Integration of functional associations using STRING

KEGGKyoto Encyclopedia of Genes and Genomes

Page 63: Data integration - Integration of functional associations using STRING

MetaCyc

Page 64: Data integration - Integration of functional associations using STRING

Reactome

Page 65: Data integration - Integration of functional associations using STRING

PIDNCI-Nature Pathway Interaction Database

Page 66: Data integration - Integration of functional associations using STRING

literature mining

Page 67: Data integration - Integration of functional associations using STRING

>10 km

Page 68: Data integration - Integration of functional associations using STRING

MEDLINE

Page 69: Data integration - Integration of functional associations using STRING

SGDSaccharomyces Genome Database

Page 70: Data integration - Integration of functional associations using STRING

The Interactive Fly

Page 71: Data integration - Integration of functional associations using STRING

OMIMOnline Mendelian Inheritance in Man

Page 72: Data integration - Integration of functional associations using STRING

co-mentioning

Page 73: Data integration - Integration of functional associations using STRING

NLPNatural Language Processing

Page 74: Data integration - Integration of functional associations using STRING

Gene and protein namesCue words for entity recognitionVerbs for relation extraction

[nxgene The GAL4 gene]

[nxexpr The expression of [nxgene the cytochrome genes [nxpg CYC1 and CYC7]]]is controlled by[nxpg HAP1]

Page 75: Data integration - Integration of functional associations using STRING
Page 76: Data integration - Integration of functional associations using STRING

easy in theory …

Page 77: Data integration - Integration of functional associations using STRING

… but not in practice

Page 78: Data integration - Integration of functional associations using STRING

many data types

Page 79: Data integration - Integration of functional associations using STRING

not comparable

Page 80: Data integration - Integration of functional associations using STRING

variable quality

Page 81: Data integration - Integration of functional associations using STRING

many sources

Page 82: Data integration - Integration of functional associations using STRING

different file formats

Page 83: Data integration - Integration of functional associations using STRING

different gene identifiers

Page 84: Data integration - Integration of functional associations using STRING

partially redundant

Page 85: Data integration - Integration of functional associations using STRING

spread over 630 genomes

Page 86: Data integration - Integration of functional associations using STRING

quality scores

Page 87: Data integration - Integration of functional associations using STRING

reproducibility

Page 88: Data integration - Integration of functional associations using STRING

von Mering et al., Nucleic Acids Research, 2005

Page 89: Data integration - Integration of functional associations using STRING

intergenic distances

Page 90: Data integration - Integration of functional associations using STRING
Page 91: Data integration - Integration of functional associations using STRING

benchmarking

Page 92: Data integration - Integration of functional associations using STRING

calibrate vs. gold standard

Page 93: Data integration - Integration of functional associations using STRING

von Mering et al., Nucleic Acids Research, 2005

Page 94: Data integration - Integration of functional associations using STRING

raw quality scores

Page 95: Data integration - Integration of functional associations using STRING

probabilistic scores

Page 96: Data integration - Integration of functional associations using STRING

integrate over orthologs

Page 97: Data integration - Integration of functional associations using STRING

protein mode

Page 98: Data integration - Integration of functional associations using STRING

von Mering et al., Nucleic Acids Research, 2005

Page 99: Data integration - Integration of functional associations using STRING

COG mode

Page 100: Data integration - Integration of functional associations using STRING

von Mering et al., Nucleic Acids Research, 2005

Page 101: Data integration - Integration of functional associations using STRING

combine all evidence

Page 102: Data integration - Integration of functional associations using STRING

Frishman et al., Modern Genome Annotation, 2009

Page 103: Data integration - Integration of functional associations using STRING

small molecules

Page 104: Data integration - Integration of functional associations using STRING

Kuhn et al., Nucleic Acids Research, 2008

Page 105: Data integration - Integration of functional associations using STRING

metametabolomics

Page 106: Data integration - Integration of functional associations using STRING

Acknowledgments

Christian von Mering

Michael Kuhn

Manuel Stark

Samuel Chaffron

Philippe Julien

Monica Campillos

Tobias Doerks

Jan Korbel

Berend Snel

Martijn Huynen

Peer Bork

Page 107: Data integration - Integration of functional associations using STRING

larsjuhljensen