modularity in biological networks

49
Modularity in Biological networks

Upload: jakeem-mooney

Post on 31-Dec-2015

41 views

Category:

Documents


1 download

DESCRIPTION

Modularity in Biological networks. Traditional view of modularity:. Modularity in Cellular Networks. Hypothesis: Biological function are carried by discrete functional modules. Hartwell, L.-H., Hopfield, J. J., Leibler, S., & Murray, A. W., Nature , 1999. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Modularity in Biological networks

Modularity in Biological networks

Page 2: Modularity in Biological networks

Hypothesis: Biological function are carried by discrete functional modules.

Hartwell, L.-H., Hopfield, J. J., Leibler, S., & Murray, A. W., Nature, 1999.

Question: Is modularity a myth, or a structural property of biological networks?(are biological networks fundamentally modular?)

Modularity in Cellular Networks

Traditional view of modularity:

Page 3: Modularity in Biological networks
Page 4: Modularity in Biological networks

Modularity in cell biology

Page 5: Modularity in Biological networks

Definition of a module

• Loosely linked island of densely connected nodes

• Groups of co-expressed genes

Page 6: Modularity in Biological networks

Concept of modules in a network

Page 7: Modularity in Biological networks

Concept of modules in a network

Page 8: Modularity in Biological networks

Definition of a module

Page 9: Modularity in Biological networks

Computational analysis of modular structures

Data clustering approach

Page 10: Modularity in Biological networks

Concept of data clustering analysis

• Partitioning a data set into groups so that points in one group are similar to each other and are as different as possible from the points in other groups.

• The validity of a clustering is often in the eye of beholder.

Page 11: Modularity in Biological networks

Concept of data clustering analysis

• In order to describe two data points are similar or not, we need to define a similarity measure.

• We also need a score function for our objectives.

• A clustering algorithm can be used to partition the data set with optimized score function.

Page 12: Modularity in Biological networks

Types of clustering algorithms

• Partition-based clustering algorithms

• Hierarchical clustering algorithms

• Probabilistic model-based clustering algorithms

Page 13: Modularity in Biological networks

Partitioning problem

• Given the set of n nodes network D={x(1),x(2), ,x(n)}, our task is to find K cluste∙∙∙rs C={C1,C2, ,C∙∙∙ K} such that each node x(i) is assigned to a unique cluster Ck with optimized score function S(C1,C2, ,C∙∙∙

K).

Page 14: Modularity in Biological networks

Community structure of biological network

Community 1

Community 2

Community 3

Page 15: Modularity in Biological networks

Score function for network clustering

• To maximize the intra group connections as many as possible and to minimize the inter group connection as few as possible.

Page 16: Modularity in Biological networks

Spectral analysis clustering algorithm

Page 17: Modularity in Biological networks

Adjacency Matrix

• Aij= 1 if ith protein interacts with jth protein

• Aij=0 otherwise

• Aij=Aji (undirected graph)

• Aij is a sparse matrix, most elements of Aij are zero

Page 18: Modularity in Biological networks

0

0

Spectral analysis

Page 19: Modularity in Biological networks

Algorithm (Spectral analysis)

• Randomly assign a vector X=(X1,X2,…,Xn)

• Iterate X(k+1)=AX(k) untill it converges

• Try another vector which is perpendicular to previous found eigenspace

Page 20: Modularity in Biological networks

Topological Structure

Original Network Hidden Topological Structure

Page 21: Modularity in Biological networks

An example

Protein-protein interaction network of Saccharomyces cerevisiae

Page 22: Modularity in Biological networks

Assign 80000 interactions of 5400 yeast proteins a confidence

valueWe take 11855 interactions with high and medium confidence among 2617 proteins with 353 unknown function

proteins.

Data source

Page 23: Modularity in Biological networks

Quasi-cliqueQuasi-bipartite

Positive eigenvalue negative eigenvalue

Page 24: Modularity in Biological networks

• With the spectral analysis, we obtain 48 quasi-cliques and 6 quasi-bipartites.

• There are annotated proteins, unannotated and unknown proteins within a quasi-clique

Page 25: Modularity in Biological networks

Application—function prediction

Page 26: Modularity in Biological networks

Hierarchical clustering algorithm

• A similarity distance measure between node i and j, d(i,j)

• The similarity measure can be let the network to be a weighted network Wij.

Page 27: Modularity in Biological networks

Types of hierarchical clustering

• Agglomerative hierarchical clustering

• Divisive hierarchical clustering

Page 28: Modularity in Biological networks

Properties of similarity measure

• d(i,j)≥0

• d(i,j)=d(j,i)

• d(i,j)≤d(i,k)+d(k,j)

Page 29: Modularity in Biological networks

Similarity measure for agglomerative clustering

• Correlation

• Shortest path length

• Edge betweenness

Page 30: Modularity in Biological networks

How good is agglomerative clustering ?

Page 31: Modularity in Biological networks

Hierarchical tree (Dendrogram)

threshold

Page 32: Modularity in Biological networks

Cluster 1Cluster 2

Single link

Distance between clusters

Page 33: Modularity in Biological networks

Cluster 1Cluster 2

Complete link

Distance between clusters

Page 34: Modularity in Biological networks

0203.429.55

205.3539.5

03.45.305.15.2

29.555.102

539.55.220

D

x2

x3

x1

x4

x5

1.5 2.0 2.2 3.5

Single link

Page 35: Modularity in Biological networks

Divisive hierarchical clustering

M.E.J., Newman and M. Girvan, Phys. Rev. E 69, 026113, (2004)

Page 36: Modularity in Biological networks

Definition of edge betweeness

i

j

5

2

and i node connectingpath ofnumber

k edge through passingpath ofnumber ),(

jjiBk

Page 37: Modularity in Biological networks

Definition of edge betweeness

ji

k jiBk,

),( edge of sbetweennes edge

jik jiB

NNk

,

),()3)(2(

2 edge of sbetweennes edge scaled

Page 38: Modularity in Biological networks

Calculation of edge betweenness

Page 39: Modularity in Biological networks

Quantitative measurement of network modularity

Modularity Q

ji

e

ea

aeQ

ij

jiji

iiii

and module

connectingnetwork in edges offraction theis

2

Page 40: Modularity in Biological networks

Threshold selection

Page 41: Modularity in Biological networks

Karate club network

Page 42: Modularity in Biological networks

Karate club network

Page 43: Modularity in Biological networks

Examples of agglomerative hierarchical

clustering

Page 44: Modularity in Biological networks

Can we identify the modules?

),min(

),(),(

jiT kk

jiJjiO J(i,j): # of nodes both i and j link to; +1 if there is a direct (i,j) link

Page 45: Modularity in Biological networks

Modules in the E. coli metabolismE. Ravasz et al., Science, 2002

Pyrimidine metabolism

Page 46: Modularity in Biological networks

Yeast signaling proteins in MIPS

i,jl

lA

ij

ij

ij

proteinbetween path shortest :

12

PNAS, vol.100, pp.1128, (2003).

Page 47: Modularity in Biological networks

Spotted microarray for Saccharomyces cerevisiae

Similarity measure

Page 48: Modularity in Biological networks

Regulatory module network

Page 49: Modularity in Biological networks

Genome Biology, 9, R2, (2008).