intogen, integrative oncogenomics for personal cancer genomes
DESCRIPTION
IntOGen was presented September, 11th at the CSHL Meeting on Personal Genomes. The talk was given by Christian Perez-Llamas and he presented the main features of the current version and the advances of IntOGen 2.0 to store, analyze and visualize next generation sequencing data from cancer samples. CSHL Meeting on Personal Cancer Genomes web: http://meetings.cshl.edu/meetings/person10.shtmlTRANSCRIPT
![Page 1: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/1.jpg)
IntOGen, Integrative OncoGenomics for personal cancer genomes
Christian Pérez-Llamas
Biomedical Genomics LabPompeu Fabra University
Biomedical Research Park at Barcelona
![Page 2: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/2.jpg)
IntOGen, Integrative OncoGenomics for personal cancer genomes
Christian Pérez-Llamas
Biomedical Genomics LabPompeu Fabra University
Biomedical Research Park at Barcelona
![Page 3: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/3.jpg)
![Page 4: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/4.jpg)
![Page 5: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/5.jpg)
Oncogenomics data Clinical annotations Biological modules
Transcriptomic alterationsCopy Number alterationsMutations...
InternationalClassificationof Diseasesfor Oncology
FunctionalRegulatoryCancer related...
Integrative methodologies
Cancer related genes identificationCancer related modules identificationCombinations of experiments by ICDOGeneration of cancer specific modules
Web discovery tool Gitools
www.gitools.org
Biomart services
biomart.intogen.orgwww.intogen.org
DA
TAS
TAT
IST
ICS
EX
PL
OR
AT
ION
Data management
Overview
![Page 6: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/6.jpg)
Copy Number Analysisfrom Sanger Institute
Copy number alterationsTranscriptomic alterations Mutations
Selection of experiments
Public dataExperiment design: cancer vs normalAt least 20 samples
Annotation of tumour type
International Classification of Diseases for Oncology (ICD-O)Manual curation from publication or descriptionProgenetix already annotated with ICD-O
More than 800 experimentsMore than 25000 samplesAlmost 150 ICD-O tumor types
Data
![Page 7: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/7.jpg)
identification of driver alterations
STEP 1
exp.
1
samples
genes
not alteredaltered
genes
experiment 1
corrected p-value
0.05 10
Cancer related genes identificationStatistics
![Page 8: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/8.jpg)
identification of driver alterations
STEP 1
exp.
1
+
combination of experiments
STEP 2
exp.
2
exp.
3
exp.
n
Cance
r ty
pe A
samples
genes
not alteredaltered
genes
experiment 1
...
corrected p-value
0.05 10
Cancer related genes identificationStatistics
![Page 9: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/9.jpg)
Statistics Cancer related modules identification
![Page 10: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/10.jpg)
Web discovery tool Gitools
www.gitools.org
Biomart services
biomart.intogen.orgwww.intogen.org
Exploration
![Page 11: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/11.jpg)
![Page 12: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/12.jpg)
![Page 13: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/13.jpg)
![Page 14: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/14.jpg)
![Page 15: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/15.jpg)
READS
TUMOURSAMPLE
LONG LISTOF ALTERED
GENES
Cancer gene prioritization with personal genomes
MutationsINDELSDif. Expr.
![Page 16: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/16.jpg)
biomart.intogen.org biomart.intogen.org/martservice
RESTfulWeb service
MartView
biomaRt perl python curl
Web discovery tool Gitools
www.gitools.org
Biomart services
biomart.intogen.orgwww.intogen.org
Exploration
![Page 17: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/17.jpg)
Web discovery tool Gitools
www.gitools.org
Biomart services
biomart.intogen.orgwww.intogen.org
Exploration
![Page 18: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/18.jpg)
Web discovery tool Gitools
www.gitools.org
Biomart services
biomart.intogen.orgwww.intogen.org
Exploration
![Page 19: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/19.jpg)
Web discovery tool Gitools
www.gitools.org
Biomart services
biomart.intogen.orgwww.intogen.org
Exploration
![Page 20: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/20.jpg)
IntOGen: Integration and data-mining of multidimensional oncogenomic data
Gundem G, Perez-Llamas C, Jene-Sanz A, Kedzierska A,Islam A,
Deu-Pons J, Furney S and Lopez-Bigas N.
Nature Methods, 7, 92-93 (2010)
More details...
www.gitools.org
biomart.intogen.org
www.intogen.org
![Page 21: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/21.jpg)
International Cancer Genome Consortium
50 cancer types
500 samples each cancer type
About 25000 genomes in total
![Page 22: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/22.jpg)
Data Storage, Analysis & Management
International Cancer Genome Consortium
50 cancer types
500 samples each cancer type
About 25000 genomes in total
![Page 23: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/23.jpg)
samples
not altered
altered
ICGC-CLL genome project
genes
Cancer genomes in the context of IntOGen
Samples
Technology
Alteration
RNA-seq
Dif. Expression:- Upregulated- Downregulated
7 CLL7 normal
(Roderic Guigo lab)
![Page 24: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/24.jpg)
samples
not altered
altered
genes
Cancer genomes in the context of IntOGen
tumours / experiments
genes
IntOGen
corrected p-value
0.05 10
Samples
Technology
Alteration
RNA-seq
Dif. Expression:- Upregulated- Downregulated
7 CLL7 normal
(Roderic Guigo lab)
ICGC-CLL genome project
![Page 25: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/25.jpg)
samples
not altered
altered
genes
Cancer genomes in the context of IntOGen
tumours
genes
IntOGen
corrected p-value
0.05 10
Samples
Technology
Alteration
RNA-seq
Dif. Expression:- Upregulated- Downregulated
7 CLL7 normal
(Roderic Guigo lab)
ICGC-CLL genome project
![Page 26: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/26.jpg)
samples
not altered
altered
genes
samples
path
way
s
Cancer genomes in the context of IntOGen
corrected p-value
0.05 10
tumours
genes
IntOGen
path
way
s
tumours
corrected p-value
0.05 10
corrected p-value
0.05 10
Enrichmentanalysis
Samples
Technology
Alteration
RNA-seq
Dif. Expression:- Upregulated- Downregulated
7 CLL7 normal
(Roderic Guigo lab)
ICGC-CLL genome project
![Page 27: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/27.jpg)
samples
not altered
altered
genes
samples
Cancer genomes in the context of IntOGen
corrected p-value
0.05 10
tumours
genes
IntOGen
tumours
corrected p-value
0.05 10
corrected p-value
0.05 10
Enrichmentanalysis
Samples
Technology
Alteration
RNA-seq
Dif. Expression:- Upregulated- Downregulated
7 CLL7 normal
(Roderic Guigo lab)
ICGC-CLL genome project
path
way
s
path
way
s
![Page 28: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/28.jpg)
Considerations for the next version
Ethical
Technological
![Page 29: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/29.jpg)
Ethical considerations
openaccess
controlledaccess
Data that cannot be usedto identify individuals:age, normalized gene expression, ...
Germline genomic data anddetailed clinical informationassociated to a unique individual
![Page 30: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/30.jpg)
openaccess
controlledaccess
Data that cannot be usedto identify individuals:age, normalized gene expression, ...
Germline genomic data anddetailed clinical informationassociated to a unique individual
Ethical considerations
![Page 31: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/31.jpg)
Technical considerations
User interfaces
Infrastructure
Web servicesBrowserGitools BiomartManagement
HadoopMap-Reduce
HadoopDFS Cascading PIG
Grid Engine Plain files MySQL MongoDBBioinformatics
software
IntOGen core
Dataimporters
Analysismanagement
Datamanagement
Experimentsmanagement
Analysisworkflows
Datamodels
Amazon / Eucalyptus
![Page 32: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/32.jpg)
Technical considerations
User interfaces
Infrastructure
Web servicesBrowserGitools BiomartManagement
HadoopMap-Reduce
HadoopDFS Cascading PIG
Grid Engine Plain files MySQL MongoDBBioinformatics
software
IntOGen core
Dataimporters
Analysismanagement
Datamanagement
Experimentsmanagement
Analysisworkflows
Datamodels
Amazon / Eucalyptus
Genome view
NGS workflows
Web management
![Page 33: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/33.jpg)
Technical considerations
User interfaces
Infrastructure
Web servicesBrowserGitools BiomartManagement
HadoopMap-Reduce
HadoopDFS Cascading PIG
Grid Engine Plain files MySQL MongoDBBioinformatics
software
IntOGen core
Dataimporters
Analysismanagement
Datamanagement
Experimentsmanagement
Analysisworkflows
Datamodels
Amazon / Eucalyptus
Genome view
NGS workflows
Web management
Flexibility●Different ways to access the data●Methods constantly evolving●Methods impl. different languages and infrastructure requirements
●Quantity of data increases●And also the number and complexity of calculations
Scalability
![Page 34: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/34.jpg)
Summary
IntOGen is a novel framework for oncogenomics data integration and analysis
It integrates many tumor types and different types of alterations in a common framework
It explores the data at different levels, from individual experiments to combinations of experiments, and from individual genes to biological modules
It incorporates an intuitive web system designed to be a discovery tool for cancer researchers
I have presented some examples on how to use IntOGen and Gitools to prioritize and compare personal genomes data.
We are adapting IntOGen to store, analyze and visualize next generation sequencing data, which will allow to incorporate data from the ICGC, starting by the Chronic Lymphocytic Leukemia data.
Ethical and technological considerations has to be addressed.
![Page 35: IntOGen, Integrative Oncogenomics for Personal Cancer Genomes](https://reader034.vdocuments.site/reader034/viewer/2022051817/5483f1d2b47959ec0c8b4ad7/html5/thumbnails/35.jpg)
Acknowledgements
Nuria López-Bigas
Gunes Gundem
Jordi Deu-Pons
Khademul Islam
Alba Jené-Sanz
Michael Schroeder
Xavier Rafael
Sophia Derdak
Abel Gonzalez-Pérez
Armand Gutierrez
Biomedical Genomics