analysis of biological networks part iii shalev itzkovitz uri alon’s group

41
Analysis of biological Analysis of biological networks networks Part III Part III Shalev Itzkovitz Shalev Itzkovitz Uri Alon’s group Uri Alon’s group July 2005 July 2005

Upload: toril

Post on 05-Jan-2016

28 views

Category:

Documents


1 download

DESCRIPTION

Analysis of biological networks Part III Shalev Itzkovitz Uri Alon’s group July 2005. What is a suitable random ensemble?. Reminder - Network motifs definition. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Analysis of biological networksAnalysis of biological networks

Part IIIPart III

Shalev ItzkovitzShalev Itzkovitz Uri Alon’s groupUri Alon’s group July 2005July 2005

Page 2: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

What is a suitable random ensemble?What is a suitable random ensemble?

Subgraphs which occur many times in the networks, significantly more than in a

suitable random ensemble.

Reminder - Network motifs definitionReminder - Network motifs definition

Page 3: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Types of random ensemblesTypes of random ensembles

Erdos Networks

For a given network with N nodes and E edges define : p=E/N2, the

probability of an edge existing between any one of the N2 possible

directed edges.

Erdos & Renyi, 1960

N

Ek

Page 4: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

UMAN ensemble

a canonical version. All networks have the same numbers of Mutual, Antisymetric and Null edges as the real network, Uniformly distributed.

Used in sociology, analytically solvable for subgraph distributions.

Holland & Leinhardt, american journal of sociology 1970

Antisymetric edge

Mutual edge

Page 5: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

The configuration model

All networks preserve the same degree sequence of the real network

All networks preserve the same degree sequence of the real network, and multiple edges between two nodes are not allowed

The configuration model+no multiple edges

Bollobas, Random graphs 1985, Molloy & Reed, Random structures and algorithms 1995, Chung et.al. PNAS 1999

Maslov & Sneppen, science 2002, Newman Phys. Rev.Lett. 2002, Milo science 2002

Page 6: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Stubs method for generating random Stubs method for generating random networksnetworks

Problem – multiple edges between nodesSolution – “Go with the winner” algorithm

Page 7: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

A

B D

C A

B D

C

Markov chain Monte-Carlo algorithmMarkov chain Monte-Carlo algorithm

Uniform sampling issues : ergodicity, detailed balance, mixing time

Page 8: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Random networks which do not preserve Random networks which do not preserve the degree sequence are not suitablethe degree sequence are not suitable

Network hub

This v-shaped subgraph appears many timeswould be a network motif when comparing with Erdos networks

It is important to filter out subgraphs which appearIt is important to filter out subgraphs which appearin high numbers only due to the degree sequencein high numbers only due to the degree sequence

Page 9: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

More stringent ensembles

•Preserve the number of all subgraphs of sizes 3,4..,n-1 when

counting n-node subgraphs [Milo 2002]

•Can be combined with the markov chain algorithm by using

simulated annealing

•Filters out subgraphs which appear many times only because

they contain significant smaller subgraphs

Will appear many times if Is a motif

Page 10: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

A

B D

C A

B D

C

Simulated annealing algorithmSimulated annealing algorithm

•Randomize network by making X switches

•Make switches with a metropolis probability exp(-E/T)

•E is the deviation of any characteristic of the real network you

want to preserve (# 3-node subgraphs, clustering sequence etc)

Page 11: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Erdos Networks

UMAN

Degree distribution

Degree sequence

Degreesequence+triads

Page 12: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Subgraphs in Erdos networks: Subgraphs in Erdos networks: exact solutionexact solution

122

N

k

N

kN

N

Ep

NN nodes (8) nodes (8)

EE edges (8) edges (8)

<<kk> > mean degree (1)

Page 13: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Subgraphs in Erdos networks: Subgraphs in Erdos networks: exact solutionexact solution

3

NPossible tripletsPossible triplets

3p Probability of forming a ffl Probability of forming a ffl given specific 3 nodesgiven specific 3 nodes

3333

3

33 ~3

kNkN

kNp

N

# nodes# nodes # edges# edges

Number of ffls does not Number of ffls does not change with network sizechange with network size!!!!!!

Page 14: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

• The expectancy of a subgraph with n nodes and g edges is analytically solvable. Scales as N(n-g)

gng N

N

k

n

N ~))((

N

kp

Subgraphs on Erdos NetworksSubgraphs on Erdos Networks

n=3g=3

Select n nodes

place g edges

Page 15: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

n=3, g=2, G~O(N3-2)=O(N) n=3, g=3, G~N3-3=O(1)

n=3, g=4, G~N3-4=O(N-1) n=3, g=6, G~N3-6=O(N-3)

Subgraph scaling familiesSubgraph scaling families

Page 16: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

P(K)~K-

Natural networks often have scale-freeNatural networks often have scale-free

outdegreeoutdegree

Page 17: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Erdos networkErdos network Scale free networkScale free network

P(k

)P(K)~K

-

2<<3

Page 18: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

P(K)~K-

=3

=2

Scale-free networks have hubsScale-free networks have hubs

Page 19: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Edge probability in the configuration modelEdge probability in the configuration model

edgesnetwork #

indegree nodeP(edge)

high edge probabilityhigh edge probability low edge probabilitylow edge probability

Page 20: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Edge probability in the configuration modelEdge probability in the configuration model

edgesnetwork #

indegree) (node2*outdegree) (node1P(edge)

1122

10

2*2P

Page 21: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Networks with E (~N) edges, and arbitrary indegree (Ri ) and outdegree (Ki ) sequences.

Subgraphs in networks that preserve degree Subgraphs in networks that preserve degree sequence: approximate solutionsequence: approximate solution

K1, R1

K2, R2

K3, R3

E

R21KP(edge1) E

R31 )1(K)edge1P(edge2

E

R )1(K,2)edge1P(edge3 32

E

RP(edge)

Page 22: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Subgraph scaling depends on exact Subgraph scaling depends on exact topologytopology

3332211 )1(**)1(KK

)P(subgraphE

RRRK

3

)1()1(~#

K

RRKRKKffl

Subgraph topology effectsSubgraph topology effectsIts expected numbersIts expected numbers

Page 23: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Subgraph scaling depends on exact Subgraph scaling depends on exact topology – as opposed to Erdos networkstopology – as opposed to Erdos networks

Example

O(1)

O(1) O(1)

Erdos Networks

γO( N)

Directed networks with power-law out-degree, compact in-degree :

P(K)~K-

Scale-free Networks ( =2.5)

2K

K

Real networks

O( 1)

O( N)>

Itzkovitz et. al., PRE 2003

Page 24: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Network motifs – a new extensive variableNetwork motifs – a new extensive variable

Milo et. al., science 2002

Page 25: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Global constraints on network structure Global constraints on network structure can create network motifscan create network motifs

• Subgraphs which appear many times in a network (more than random)

• Might stem from evolutionary constraints of selection for some function, or be a result of other global constraints

• Degree sequence is a global constraint with a profound effect on subgraph content

• Are there other global constraints which might result in network motifs?

Page 26: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

How do geometrical constraints How do geometrical constraints influence the local structure?influence the local structure?

Page 27: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Examples of geometrically constrained Examples of geometrically constrained systemssystems

• Transportation networks (highways, trains)

• Internet layout

• Neuronal networks, brain layout

• Abstract spaces (www, social, gene-array data)

Page 28: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

The neuron network of The neuron network of C. elegansC. elegans

"The abundance of triangular connections in the nervous system of C. elegans may thus simply be a consequence of the high levels of connectivity that are present within neighbourhoods“ (White et. al.)

Page 29: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

The geometric modelThe geometric model

• N nodes arranged on d-dimensional lattice

• Connections made only to neighbors within range R

Page 30: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Erdos networks – every node can Erdos networks – every node can connect to every other nodeconnect to every other node

Probability of closing triangles - small

Page 31: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Geometric networks – every node can Geometric networks – every node can connect only to its neighborhoodconnect only to its neighborhood

Probability of closing triangles - large

Page 32: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

All subgraphs in geometric networks All subgraphs in geometric networks scale as network sizescale as network size

N/Rd ‘sub-networks’, each one an Erdos network of size Rd

All subgraphs scale as network size

Erdos sub-network

Page 33: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

All subgraphs scale as NAll subgraphs scale as N

Page 34: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

The Erdos scaling laws determine the The Erdos scaling laws determine the network motifsnetwork motifs

Page 35: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

All subgraphs with more edges than All subgraphs with more edges than nodes are motifsnodes are motifs

Motifs – scale as N in geometric networksConstant number in random networks

Not motifs – scale as N in both random and real networks

Page 36: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Feedbacks in neuronal network are much Feedbacks in neuronal network are much more rare than expected from geometrymore rare than expected from geometry

= 1 : 3 = 1 : 3

= 0 : 40 = 0 : 40

geometric model

C elegans neuronal network

Page 37: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

Imposing a field changes subgraph ratiosImposing a field changes subgraph ratios

inputsoutputs

Itzkovitz et. al., PRE 2005

Page 38: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

A simple model of geometry + directional A simple model of geometry + directional bias is not enoughbias is not enough

abundant in C elegans

Mutual edges rare in geometric networks + directional bias

Page 39: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

The mapping of network models and The mapping of network models and resulting network motifs is not a 1-1 resulting network motifs is not a 1-1

mappingmapping

Motif set 1Motif set 1 Motif set 2Motif set 2 Motif set 3Motif set 3

Model 1Model 1 Model 2Model 2 Model 3Model 3 Model 4Model 4

X

Page 40: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

conclusionsconclusions

•Biological networks are highly optimized systems aimed at information processing computations.

•These networks contain network motifs – subgraphs that appear significantly more than in suitable random networks.

•The hypothesized functional advantage of each network motif can be tested experimentally.

•Network motifs may be selected modules of information processing, or results of global network constraints.

•The network motif approach can be used to reverse-engineer complex biological networks, and unravel their basic computational building blocks.

Page 41: Analysis of biological networks Part III Shalev Itzkovitz              Uri Alon’s group

AcknowledgmentsAcknowledgments

Ron MiloRon MiloNadav KashtanNadav Kashtan

Uri AlonUri Alon

More information :More information :http://www.weizmann.ac.il/mcb/UriAlon/

PapersPapers

mfinder – network motif detection softwaremfinder – network motif detection software

Collection of complex networksCollection of complex networks