the seed bank model - École polytechniqueanr-manege/aussois2013/... · 2013. 4. 26. · 1/10 1/10...

Post on 19-Jul-2021

0 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Introduction On the ancestral process of long-range seed bank models Application to biology

The Seed Bank Model

Joint work with J. Blath, N. Kurt and D. Spano

RTG1845Berlin Mathematical School

Technische Universitat BerlinUNAM

05-04-2013

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 1/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Content

1 Introduction

2 On the ancestral process of long-range seed bank models

3 Application to Biology

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 2/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Description

I A generation consists of N individuals. Each individual in generation iselects a parent uniformly in the generation i − 1.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 3/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Description

I A generation consists of N individuals. Each individual in generation iselects a parent uniformly in the generation i − 1.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 3/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Description

I A generation consists of N individuals. Each individual in generation iselects a parent uniformly in the generation i − 1.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 3/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Description

I A generation consists of N individuals. Each individual in generation iselects a parent uniformly in the generation i − 1.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 3/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

The Wright-Fisher Model

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 4/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

Scaling

Fix a sample of size n. Let the number of individuals go to infinity.Measure the time in terms of the number of individuals per generation.

BN([Nt])⇒ K (t)

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 5/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

(The block counting process of) The Kingman coalescent

Description

Each pair of blocks coalesce at rate 1, independently of the others.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 6/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

(The block counting process of) The Kingman coalescent

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 7/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

(The block counting process of) The Kingman coalescent

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 7/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

Time to the most recent common ancestor

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 8/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Classical Models

Time to the most recent common ancestor

E [TMRCA] =n−1∑i=1

E [Ti ] =n−1∑i=1

1(n+1−i2

) =n∑

u=2

2

u(u − 1)= 2(1− 1

n

)

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 9/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The Seed Bank Model

The work of Kaj, Krone and Lascoux (2001)

I The seed bank model introduced by Kaj, Krone and Lascoux is ageneralization of the Wright-Fisher model. Its biological motivationare species that reproduce using seeds. (Like a cactus.)

I Dynamics: Let µ be a bounded measure on N. Each individual selectsits parent independently by the following 2 steps:

1 Select the generation of the parent by performing a µ distributed jump.2 Select a parent uniformly among the members of the selected

generation.

I Problem: We lose the Markov property.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 10/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The Seed Bank Model

The work of Kaj, Krone and Lascoux (2001)

I The seed bank model introduced by Kaj, Krone and Lascoux is ageneralization of the Wright-Fisher model. Its biological motivationare species that reproduce using seeds. (Like a cactus.)

I Dynamics: Let µ be a bounded measure on N. Each individual selectsits parent independently by the following 2 steps:

1 Select the generation of the parent by performing a µ distributed jump.2 Select a parent uniformly among the members of the selected

generation.

I Problem: We lose the Markov property.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 10/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The Seed Bank Model

m=4

N=10

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 11/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The Seed Bank Model

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

m=4

N=10

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 11/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The Seed Bank Model

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

m=4

N=10

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 11/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The Seed Bank Model

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

m=4

N=10

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 11/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The Seed Bank Model

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

1/10

m=4

N=10

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 11/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The Seed Bank Model

The work of Kaj, Krone and Lascoux (2001)

I The ancestral process can be described in terms of a finite stateMarkov Chain.

I Main result: The scaling limit is the Kingman coalescent, under aconstant time change.

I Limitation: µ must be bounded.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 12/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The model

The seed bank model with long-range dependence

I What happens if we remove the boundedness condition of the jumpmeasure µ in the seed bank model?

I Motivation.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 13/25

Introduction On the ancestral process of long-range seed bank models Application to biology

The model

The seed bank model with long-range dependence

I Answer: It depends on µ.

I We say that µ ∈ Γα, if

µ({n, ...}) = n−αL(n), α > 0,

where L(n) is a slowly varying function.

I The qualitative behaviour of the model changes drastically dependingon α.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 14/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Construction of the Renewal Process

The ancestral line A(v) of an individual v is given by a renewal processwith interarrival law µ and an additional uniform choice of an individual.The renewal times correspond to the generation of an ancestor. Ancestrallines of a sample of individuals are coupled renewal processes.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 15/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

J. Blath, AGC, N. Kurt, D. Spano (2011)

Let µ,N be fixed and let v ,w denote two individuals living at time 0.

(a) If α > 1, then E[TMRCA] <∞(b) If α ∈ (1/2, 1), then P(A(v) ∩ A(w) 6= ∅) = 1 and E[T ] =∞(c) If α ∈ (0, 1/2), then P(A(v) ∩ A(w) 6= ∅) < 1

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 16/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

J. Blath, AGC, N. Kurt, D. Spano (2011)

If Eµ[ν] <∞ the block counting process induced by our model convergesweakly to the Kingman coalescent (constantly time changed),i.e. BN(Nt)⇒ K (γ(1)2t)

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 17/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 1: a Markov process

n=10

m=4

X10

(0)=(10, 0, 0, 0)

(X1(0), X2(0), X3(0), X4(0))

The configuration process in level 10

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 18/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 1: a Markov process

n=10

m=4

X10

(0)=(10, 0, 0, 0)

X10

(1)=(3, 3, 2, 2)

(X1(1), X2(1), X3(1), X4(1))

The configuration process in level 10

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 18/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 1: a Markov process

n=10

m=4

X10

(0)=(10, 0, 0, 0)

The configuration process in level 10

X10

(1)=(3, 3, 2, 2)

X10

(2)=(4, 3, 2, 1)

Finite States Irreducible Markov

Chain

(X1(2), X2(2), X3(2), X4(2))

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 18/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

Proof

Let X n be the configuration process in level n.

I There exists a stationary distribution for X n if and only if Eµ[X ] <∞.

The stationary distribution is

γ = mult( 1

Eµ[X ],µ(i > 1)

Eµ[X ],µ(i > 2)

Eµ[X ], ...)

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 19/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 2: stationary distribution

{ball 1 visits urn k} = {X 1k = (1, 0, 0, ...)}

Pγ({ball 1 visits urn k}) = 1Eµ[X ]

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 20/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 3: coupling argument

Idea

If particles are always in the stationary distribution

P(coalesce in generation k) =1

N(Eµ[X ])2=

1

Nγ(1)2

Then consider an artificial system where particles are always in thestationary distribution and couple it with the ancestral process of the seedbank model.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 21/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Results

Ingredient 3: Coupling argument

The coupling is fast

Let τ be the first time particles labeled 1 in each system are in the samegeneration.

E [τ ] <∞

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 22/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Current work

J. Blath, AGC, N. Kurt, D. Spano (2011)

Let µ,N be fixed and let v ,w denote two individuals living at time 0.

(a) If α > 1, then E[TMRCA] <∞(b) If α ∈ (1/2, 1), then P(A(v) ∩ A(w) 6= ∅) = 1 and E[T ] =∞(c) If α ∈ (0, 1/2), then P(A(v) ∩ A(w) 6= ∅) < 1

Common ancestor will be close to the seed-mutation.Genetic drift will not cause fixation nor extinction.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 23/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Current work

Azotobacter vinelandii

I The seed effect is very important in evolution of bacteria.

I Azotobacter vinelandii makes very big jumps. (endospores)

I Azotobacter vinelandii has too many genes of Pseudomonas not to bea Pseudomonas and too less to be one.

I Our model partially explains this phenomena.

I Challenges.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 24/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Current work

Azotobacter vinelandii

I The seed effect is very important in evolution of bacteria.

I Azotobacter vinelandii makes very big jumps. (endospores)

I Azotobacter vinelandii has too many genes of Pseudomonas not to bea Pseudomonas and too less to be one.

I Our model partially explains this phenomena.

I Challenges.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 24/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Current work

Azotobacter vinelandii

I The seed effect is very important in evolution of bacteria.

I Azotobacter vinelandii makes very big jumps. (endospores)

I Azotobacter vinelandii has too many genes of Pseudomonas not to bea Pseudomonas and too less to be one.

I Our model partially explains this phenomena.

I Challenges.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 24/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Current work

Azotobacter vinelandii

I The seed effect is very important in evolution of bacteria.

I Azotobacter vinelandii makes very big jumps. (endospores)

I Azotobacter vinelandii has too many genes of Pseudomonas not to bea Pseudomonas and too less to be one.

I Our model partially explains this phenomena.

I Challenges.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 24/25

Introduction On the ancestral process of long-range seed bank models Application to biology

Current work

Azotobacter vinelandii

I The seed effect is very important in evolution of bacteria.

I Azotobacter vinelandii makes very big jumps. (endospores)

I Azotobacter vinelandii has too many genes of Pseudomonas not to bea Pseudomonas and too less to be one.

I Our model partially explains this phenomena.

I Challenges.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 24/25

Introduction On the ancestral process of long-range seed bank models Application to biology

References

References

I Kaj, I., Krone, S., Lascoux, M. 2001. Coalescent theory for seed bankmodels. J. Appl. Prob. 38:285-300

I J. Blath, A. Gonzalez Casanova, N. Kurt and D. Spano. On theancestral process of long-range seed bank models. To appear in J. ofAppl. Prob.

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 25/25

Introduction On the ancestral process of long-range seed bank models Application to biology

References

Adrian Gonzalez Casanova On the Ancestral process of the Seed Bank Model 26/25

top related