atelier grand graphes et bioinformatiques
TRANSCRIPT
![Page 1: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/1.jpg)
Urszula Czeriwnska, Laurence Calzone, Emmanuel Barillot and Andrei Zinovyev
Atelier Grands Graphes et Bioinformatique - EGC 2016 REIMS
DEDAL CYTOSCAPE 3 APP FOR PRODUCING AND MORPHING
DATA-DRIVEN AND STRUCTURE-DRIVEN NETWORK LAYOUTS
U900 Computational Systems Biology for Cancer
![Page 2: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/2.jpg)
OUTLINEFROM EXTRATION OF KNOWLEDGE TO INTELLIGENT LAYOUT
COMBING MUTIDIMENTIONAL DATA AND NETWORK STRUCTURE
SUMMARY
DEDAL CYTOSCAPE 3.0 APP.
![Page 3: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/3.jpg)
FROM EXTRACTION OF KNOWLEDGE TO INTELLIGENT LAYOUT
![Page 4: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/4.jpg)
EXTRACTION OF KNOWLEDGEIN BIOLOGY
molecular biology interactions -> networks
: the creation of knowledge from structured and unstructured sources; the resulting knowledge needs to be in a machine-readable format;
![Page 5: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/5.jpg)
protein 1 protein 2interaction
node nodeedge
NETWORKS
![Page 6: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/6.jpg)
NETWORKS
Moldovan and D’Andrea 2009Peri et al., 2004
![Page 7: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/7.jpg)
LAYOUTS
circular
hierarchical
organic
![Page 8: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/8.jpg)
COMBINING MULTIDIMENSIONAL DATA AND NETWORK
![Page 9: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/9.jpg)
THERE IS NOT A SINGLE LAYOUT
mapping data on the top of pre-defined biological network layout
identyfing subnetworks from a global network processing certain properities computed from the data
using biological network structure for pre-processing the high throughput data
1
2
3
![Page 10: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/10.jpg)
T-test statistics
1. MAPPING DATA ON THE TOP OF THE NETWORK
Moldovan and D’Andrea 2009Peri et al., 2004TGCA, 2012
![Page 11: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/11.jpg)
Ulitsky and Shamir, 2007
2. IDENTYFING SUBNETWORKS
![Page 12: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/12.jpg)
3. USING BIOLOGICAL NETWORK STRUCTURE FOR PRE-PROCESSING THE HIGH THROUGHPUT DATA
Hofree et al, 2013
![Page 13: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/13.jpg)
MULTIDIMENTIONAL SCALING
represent (dis)similarties as distances
dimension reduction
i.e. Principal Components Analysis
![Page 14: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/14.jpg)
EXAMPLE: GOlorize Cytoscape App.
Garcia et al, 2007
![Page 15: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/15.jpg)
DeDaL Cytoscape 3.0 App
DATA DRIVEN NETWORK LAYOUTS& MORPHING
![Page 16: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/16.jpg)
Moldovan and D’Andrea 2009Peri et al., 2004TGCA, 2012
CYTOSCAPE: ORGANIC LAYOUT WITH MAPPED EXPRESSION DATA
T-test statistics
![Page 17: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/17.jpg)
PCA
PCA layout
PC2
PC1
T-test statistics
DEDAL:
Moldovan and D’Andrea 2009Peri et al., 2004TGCA, 2012
PRINCIPAL COMPONENT ANALYSIS DRIVEN LAYOUT
![Page 18: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/18.jpg)
PMS
Pure network structure based layout
Purely Data Driven Layout
PCA or Elmap
Combination of network structure and data layout
MORPHING DATA-DRIVEN AND STRUCTURE-BASED LAYOUT
![Page 19: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/19.jpg)
PMS
Pure network structure based layout
Purely Data Driven Layout
PCA or Elmap
Combination of network structure and data layout
MORPHING DATA-DRIVEN AND STRUCTURE-BASED LAYOUT
![Page 20: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/20.jpg)
T-test statistics
MORPHING DATA-DRIVEN AND STRUCTURE-BASED LAYOUT
![Page 21: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/21.jpg)
T-test statistics
MORPHING DATA-DRIVEN AND STRUCTURE-BASED LAYOUT
![Page 22: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/22.jpg)
MORE ADVANCED FEATURES OF DEDAL
![Page 23: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/23.jpg)
ADVANCED FEATURES OF DEDAL Pre-processing of the data:o Smoothingo Double centeringo Quality check
Post-processing of the layout:o Alignmento Overlappingo Missing valueso Outliers
Data-driven layout: PCA or nPMsMorphing
![Page 24: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/24.jpg)
NONLINEAR PRINCIPAL MANIFOLDS
Elastic map algorithms Principal manifolds approximation
Linear PCA vs nonlinear Principal Manifolds for visalisation of breast cancer microarray data
a) 3D PCA linear manifold.
b) ELMap2D
c) PCA2D
Gorban A.N., Zinovyev A. 2010
-+
![Page 25: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/25.jpg)
NETWORK SMOOTHING
datasample
initial space
subspace offunctions smoothon a gene network
basis vectors areeigenvectors of thegraph Laplacian
projectedsample
Rapaport et al.,2007
courtesy of A.Zinovyev
![Page 26: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/26.jpg)
A B
CD
E
F
GH
I
K
A B
CD
E
F
GH
I
K
Smooth distribution,balanced dosage
Non-smooth distribution,unbalanced dosage
courtesy of A.Zinovyev
NETWORK SMOOTHING
![Page 27: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/27.jpg)
p=10-23 p=10-5
BIG NETWORK
Czerwinska et al., 2015
1047 nodes1986 edges
pearson coef.= 0.3 pearson coef.= 0.1
![Page 28: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/28.jpg)
SCALABILITY
Czerwinska et al., 2015
network smoothing
network smoothing, reusing matrix
principal manifold
PCA 10 components
106
104
102
100
10-2
102 103 104
Number of nodes
Com
puta
tion
time
[sec
]
![Page 29: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/29.jpg)
TUTORIALbioinfo-out.curie.fr/projects/dedal/
![Page 30: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/30.jpg)
TUTORIALbioinfo-out.curie.fr/projects/dedal/
![Page 31: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/31.jpg)
PUBLICATION
![Page 32: Atelier Grand Graphes et Bioinformatiques](https://reader037.vdocuments.site/reader037/viewer/2022102811/58eeb6281a28abaf628b464f/html5/thumbnails/32.jpg)
There is a need to combine networks and -omics data in biology
Network layout should be adapted to the analysis
DeDaL – Cytoscape App. performs o different types of data-driven layoutso morphing between strucure-based and
data-driven layouto pre-processing of data as double
centering and network smoothing
SUMMARY