investigating the evolution of developmental …

1
References To model multi-cellular organism dynamics, the avida world is divided into subpopulation - or demes. Gene 1 Gene 2 Gene 3 Gene 1 Gene 2 Gene 3 Gene 1 Gene 2 Gene 3 Gene 4 Genome length Fitness Genome length Fitness Genome length Fitness INVESTIGATING THE EVOLUTION OF DEVELOPMENTAL MECHANISMS IN DIGITAL MULTICELLULAR ORGANISMS BHASKAR KUMAWAT, RAMRAY BHAT Abstract Terminology Avida Conclusion Future work Development in metazoan organisms is composed of a series of steps that give rise to intercellular heterogeneities and signalling capabilities. We have used Avida – an articial life software platform to study how development associated processes change during evolution. Avida uses an agent-based model wherein autoreplicating evolvable computer programs compete for resources through innovation and deployment of a variety of mathematical tasks. In order to simulate multicellular clusters, we introduce spatially discrete and isogenic multicellular sub-populations in the world - called demes. Such a framework allows us to track the evolutionary dynamics of a series of quantiable metrics, such as the number and diversity of distinct unicellular phenotypes, the developmental time as well as the degree to which organisms in the deme communicate and sense their surroundings. Our preliminary experiments indicate - perhaps unsurprisingly - towards a nonlinear disparate relationship between the evolution of the genomic complexity and what can be encapsulated by considering all the above phenotypic metrics as, developmental complexity. Development Development of organisms is the process wherein cells divide, differentiate and proliferate in tissues, set up a body axis, and form organs. It is a well regulated process and this is usually summed up as the presence of a developmental program for the organism. Complexity The complexity of a system/object can be dened in various ways. A. Kolomogorov Complexity : Length of the shortest computer program that produces the object as an output. (Used for sequences, Kolomogorov 1968) B. Physical Complexity : Summation of amount of information stored at all sites in the genome based on their substitution frequencies. (Used for the genome, Adami 2000) C. McShea Structural Complexity : Biologically inspired but only a qualitative description without any way integrate different terms. (McShea, 1996) > > Object Complexity Process Complexity In our system, we estimate object and process complexities through multiple gross measures. - Object Complexity : Number of cells, degree of differentiation. - Process Complexity : Number of communication channels. Agent based simulation framework with genotype-phenotype discretization (A - articial, -vida - life) Genome is made of instructions from an instruction set. (Turing complete) Organisms that perform certain mathematical tasks acquire resources and reproduce faster. RESOURCE 1 Type : AND Inputs : (11011,10110) Check Output Organism Organism Inputs(11011,10110) AND Output (10010) We assay deme properties by isolating them in a test environment. The world geometry is periodic. Deme Runs World size in X : 5 World size in Y : 1000 Deme size : 25 (5x5) - total max 200 demes 1. A single organism with an ancestral genotype is injected into an empty deme. 2. The organisms can incorporate instructions that allow them to communicate with neighbors, sense their position in the deme, and even exit or join the germline of their deme. 3. Phenotypic heterogeneity can arise due to formation of information processing pipelines. Unicellular runs World size in X : 5 World size in Y : 1000 Number of organisms : max 5000 These organisms reproduce as soon as they genome is completely executed. . Results Division of Biological Sciences, Indian Institute of Science, Bangalore, India - 560012 Population tness structure The population tness structure at a particular update is the scatter plot of a particular property of genotypes in the population against their tness. Genome length (genomic structural complexity) RESOURCE ABUNDANCE = 10,000 RESOURCE ABUNDANCE = 1,000 Messaging RESOURCE ABUNDANCE = 10,000 RESOURCE ABUNDANCE = 1,000 Hypothesis 1 : High resource abundance allows appearance of high complexity outliers. Low resource condition is highly selective and restrictive to changes in complexity. Genomic structural complexity is relatively tightly bound in both cases. Hypothesis 2 : Multicellular organisms in low resource environments develop higher instances of messaging and sensing required for division of labor (also seen previously in [2]). The messaging processes correlate well with tness of these organisms. VARYING RESOURCE CONDITIONS Messaging UNICELLULAR MULTICELLULAR Regulation UNICELLULAR MULTICELLULAR No population shift is observed in the case of unicellular populations. The tness of these populations increase over time but they do not incorporate new regulation/ sensing instructions in a stable fashion while at the same time increasing their tness. Hypothesis 3 : Unicellular organisms rely majorly on changes in genomic complexity to bring about evolutionary innovations as compared to multicellular organisms - that rely on regulation and signalling to innovate new solutions. Complexity Utility Use metrics to quantify the contribution of the genome and non-genome to tness. Develop a concise methodology that takes in account these rough complexity measures and integrates them into a single "developmental complexity". Study how network topologies evolve in demes where messaging plays a major role in tness acquistion. Use the physical complexity metric (sensu Adami) to see if developmental features allow major transitions without increase in genomic complexity. In this work, we present a new framework to answer questions involving the evolution of developmental pathways in multi-cellular organisms and present preliminary results indicating the relative importance of genomic complexity and extra-genomic pathways in unicellular and multicellular organisms. Our results indicate that low resource conditions hinder the appearance of high complexity genomes and facilitates messaging to be incorporated as a functional process. They also hint towards the relatively important but disparate roles of genomic complexity and other developmental functions in unicellular and multicellular evolving organisms. [1] McShea, D. W. (1996). Perspective: Metazoan Complexity and Evolution: Is There a Trend? Evolution, 50(2), 477. doi:10.2307/2410824 [2] Goldsby, Heather J., et al. "Task-switching costs promote the evolution of division of labor and shifts in individuality." Proceedings of the National Academy of Sciences 109.34 (2012): 13686-13691. [3] Adami, Christoph, Charles Ofria, and Travis C. Collier. "Evolution of biological complexity." Proceedings of the National Academy of Sciences 97.9 (2000): 4463-4468. [4] Duclos, Kevin K., Jesse L. Hendrikse, and Heather A. Jamniczky. "Investigating the evolution and development of biological complexity under the framework of epigenetics." Evolution & development (2019): e12301. [5] Kolmogorov, Andrei Nikolaevich. "Three approaches to the quantitative denition of information." International journal of computer mathematics 2.1-4 (1968): 157-168. [6] Adami, Christoph. "What is complexity?." BioEssays 24.12 (2002): 1085-1094. [7] Bar-Yam, Yaneer. "Multiscale complexity/entropy." Advances in Complex Systems 7.01 (2004): 47-63. + Hierarchial complexities .Organisms in a deme reproduce independently of organisms in other demes and place offspring only within the same deme. There is no mutation during this process. Organisms can donate their accumulated resources to the deme. A deme that reaches a particular resource threshold reproduces by replacing a neighbor deme. Method Measurements A single long evolutionary run is conducted for 1.5 million updates. The demes organisms from each update are taken and analysed in a separate isolated environment where different properties are measured. Fitness of an organism : Metabolic rate divided by gestation time of the organism Fitness of a deme : Average rate of resource acquistion over lifetime. Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Genome length Fitness Genome length Fitness Genome length Fitness Genome length Genome length Fitness Genome length Genome length Genome length T = 200k T = 400k T = 600k T = 800k T = 1500k T = 1400k T = 1200k T = 1000k T = 200k T = 400k T = 600k T = 800k T = 1500k T = 1400k T = 1200k T = 1000k T = 200k T = 400k T = 600k T = 800k T = 1500k T = 1400k T = 1200k T = 1000k Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging UNICELLULAR VS MULTICELLULAR UNICELLULAR MULTICELLULAR Genome length (genomic structural complexity) Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness T = 200k T = 400k T = 600k T = 800k T = 1500k T = 1400k T = 1200k T = 1000k Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Genome length Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness T = 200k T = 400k T = 600k T = 800k T = 1500k T = 1400k T = 1200k T = 1000k T = 200k T = 400k T = 600k T = 800k T = 1500k T = 1400k T = 1200k T = 1000k Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness Fitness T = 4k T = 8k T = 12k T = 16k T = 20k T = 24k T = 28k T = 30k T = 4k T = 8k T = 12k T = 16k T = 20k T = 24k T = 28k T = 30k Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Messaging Regulation Regulation Regulation Regulation Regulation Regulation Regulation Regulation Regulation Regulation Regulation Regulation Regulation Regulation Regulation Regulation T = 200k T = 400k T = 600k T = 800k T = 1500k T = 1400k T = 1200k T = 1000k T = 4k T = 8k T = 12k T = 16k T = 20k T = 24k T = 28k T = 30k

Upload: others

Post on 07-Apr-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

References

To model multi-cellular organism dynamics, the avida world is divided into subpopulation - or demes.

Gene 1 Gene 2 Gene 3 Gene 1 Gene 2 Gene 3 Gene 1 Gene 2 Gene 3 Gene 4

Genome length

Fit

nes

s

Genome length

Fit

nes

s

Genome length

Fit

nes

s

INVESTIGATING THE EVOLUTION OF DEVELOPMENTAL MECHANISMS IN DIGITAL

MULTICELLULAR ORGANISMSBHASKAR KUMAWAT, RAMRAY BHAT

Abstract

Terminology

Avida

Conclusion

Future work

Development in metazoan organisms is composed of a series of steps that give rise to intercellular heterogeneities and signalling capabilities. We have used Avida – an artificial life software platform to study how development associated processes change during evolution. Avida uses an agent-based model wherein autoreplicating evolvable computer programs compete for resources through innovation and deployment of a variety of mathematical tasks. In order to simulate multicellular clusters, we introduce spatially discrete and isogenic multicellular sub-populations in the world - called demes. Such a framework allows us to track the evolutionary dynamics of a series of quantifiable metrics, such as the number and diversity of distinct unicellular phenotypes, the developmental time as well as the degree to which organisms in the deme communicate and sense their surroundings. Our preliminary experiments indicate - perhaps unsurprisingly - towards a nonlinear disparate relationship between the evolution of the genomic complexity and what can be encapsulated by considering all the above phenotypic metrics as, developmental complexity.

DevelopmentDevelopment of organisms is the process wherein cells divide, differentiate and proliferate in tissues, set up a body axis, and form organs. It is a well regulated processand this is usually summed up as the presence of a developmental program for the organism.

ComplexityThe complexity of a system/object can be defined in various ways.

A. Kolomogorov Complexity : Length of the shortest computer program that produces the object as an output. (Used for sequences, Kolomogorov 1968)

B. Physical Complexity : Summation of amount of information stored at all sites in the genome based on their substitution frequencies. (Used for the genome, Adami 2000)

C. McShea Structural Complexity : Biologically inspired but only a qualitative description without any way integrate different terms. (McShea, 1996)

> >Object Complexity Process Complexity

In our system, we estimate object and process complexities through multiple gross measures.

- Object Complexity : Number of cells, degree of differentiation.

- Process Complexity : Number of communication channels.

Agent based simulation framework with genotype-phenotype discretization (A - artificial, -vida - life)

Genome is made of instructions from an instruction set. (Turing complete)

Organisms that perform certain mathematical tasks acquire resources and reproduce faster.

RESOURCE 1Type : ANDInputs : (11011,10110)

Check Output

Organism

Organism

Inputs(11011,10110)

AND

Output(10010)

We assay deme properties by isolating them in a test environment. The world geometry is periodic.

Deme RunsWorld size in X : 5World size in Y : 1000Deme size : 25 (5x5) - total max 200 demes1. A single organism with an ancestral genotype is injected into an empty deme. 2. The organisms can incorporate instructions that allow them to communicate with neighbors, sense their position in the deme, and even exit or join the germline of their deme. 3. Phenotypic heterogeneity can arise due to formation of information processing pipelines.

Unicellular runsWorld size in X : 5World size in Y : 1000Number of organisms : max 5000These organisms reproduce as soon as they genome is completely executed.

.

Results

Division of Biological Sciences, Indian Institute of Science, Bangalore, India - 560012

Population fitness structureThe population fitness structure at a particular update is the scatter plot of a particular property of genotypes in the population against their fitness.

Genome length (genomic structural complexity)RESOURCE ABUNDANCE = 10,000 RESOURCE ABUNDANCE = 1,000

MessagingRESOURCE ABUNDANCE = 10,000 RESOURCE ABUNDANCE = 1,000

Hypothesis 1 : High resource abundance allows appearance of high complexity outliers. Low resource condition is highly selective and restrictive to changes in complexity. Genomic structural complexity is relatively tightly bound in both cases.

Hypothesis 2 : Multicellular organisms in low resource environments develop higher instances of messaging and sensing required for division of labor (also seen previously in [2]). The messaging processes correlate well with fitness of these organisms.

VARYING RESOURCE CONDITIONS

MessagingUNICELLULAR MULTICELLULAR

RegulationUNICELLULAR MULTICELLULAR

No population shift is observed in the case of unicellular populations. The fitness of these populations increase over time but they do not incorporate new regulation/sensing instructions in a stable fashion while at the same time increasing their fitness.

Hypothesis 3 : Unicellular organisms rely majorly on changes in genomic complexity to bring about evolutionary innovations as compared to multicellular organisms - that rely on regulation and signalling to innovate new solutions.

Complexity ≠ Utility

Use metrics to quantify the contribution of the genome and non-genome to fitness.

Develop a concise methodology that takes in account these rough complexity measures and integrates them into a single "developmental complexity".

Study how network topologies evolve in demes where messaging plays a major role in fitness acquistion.

Use the physical complexity metric (sensu Adami) to see if developmental features allow major transitions without increase in genomic complexity.

In this work, we present a new framework to answer questions involving the evolution of developmental pathways in multi-cellular organisms and present preliminary results indicating the relative importance of genomic complexity and extra-genomic pathways in unicellular and multicellular organisms.

Our results indicate that low resource conditions hinder the appearance of high complexity genomes and facilitates messaging to be incorporated as a functional process.

They also hint towards the relatively important but disparate roles of genomic complexity and other developmental functions in unicellular and multicellular evolving organisms.

[1] McShea, D. W. (1996). Perspective: Metazoan Complexity and Evolution: Is There a Trend? Evolution, 50(2), 477. doi:10.2307/2410824

[2] Goldsby, Heather J., et al. "Task-switching costs promote the evolution of division of labor and shifts in individuality." Proceedings of the National Academy of Sciences 109.34 (2012): 13686-13691.

[3] Adami, Christoph, Charles Ofria, and Travis C. Collier. "Evolution of biological complexity." Proceedings of the National Academy of Sciences 97.9 (2000): 4463-4468.

[4] Duclos, Kevin K., Jesse L. Hendrikse, and Heather A. Jamniczky. "Investigating the evolution and development of biological complexity under the framework of epigenetics." Evolution & development (2019): e12301.

[5] Kolmogorov, Andrei Nikolaevich. "Three approaches to the quantitative definition of information." International journal of computer mathematics 2.1-4 (1968): 157-168.

[6] Adami, Christoph. "What is complexity?." BioEssays 24.12 (2002): 1085-1094.

[7] Bar-Yam, Yaneer. "Multiscale complexity/entropy." Advances in Complex Systems 7.01 (2004): 47-63.

+ Hierarchial complexities

.Organisms in a deme reproduce independently of organisms in other demes and place offspring only within the same deme. There is no mutation during this process.

Organisms can donate their accumulated resources to the deme. A deme that reaches a particular resource threshold reproduces by replacing a neighbor deme.

Method

MeasurementsA single long evolutionary run is conducted for 1.5 million updates. The demesorganisms from each update are taken and analysed in a separate isolated environment where different properties are measured.

Fitness of an organism : Metabolic rate divided by gestation time of the organism

Fitness of a deme : Average rate of resource acquistion over lifetime.

Genome length Genome length

Genome length Genome length

Genome length

Genome lengthGenome length

Genome length

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

s

Genome length

Fit

nes

s

Genome length

Fit

nes

s

Genome length

Fit

nes

s

Genome lengthGenome length

Fit

nes

s

Genome length

Genome length

Genome length

T = 200k

T = 400k

T = 600k

T = 800k T = 1500k

T = 1400k

T = 1200k

T = 1000k T = 200k

T = 400k

T = 600k

T = 800k T = 1500k

T = 1400k

T = 1200k

T = 1000k

T = 200k

T = 400k

T = 600k

T = 800k T = 1500k

T = 1400k

T = 1200k

T = 1000k

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

s

Messaging

Messaging

Messaging Messaging

Messaging

MessagingMessaging

Messaging Messaging Messaging

Messaging Messaging

MessagingMessaging

MessagingMessaging

UNICELLULAR VS MULTICELLULAR

UNICELLULAR MULTICELLULARGenome length (genomic structural complexity)

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

T = 200k

T = 400k

T = 600k

T = 800k T = 1500k

T = 1400k

T = 1200k

T = 1000k

Genome length Genome length

Genome lengthGenome length

Genome length Genome length

Genome lengthGenome length Genome length Genome length

Genome lengthGenome length

Genome lengthGenome length

Genome lengthGenome length

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

s

T = 200k

T = 400k

T = 600k

T = 800k T = 1500k

T = 1400k

T = 1200k

T = 1000k

T = 200k

T = 400k

T = 600k

T = 800k T = 1500k

T = 1400k

T = 1200k

T = 1000k

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

s

Fit

nes

sF

itn

ess

Fit

nes

sF

itn

ess

Fit

nes

s

T = 4k

T = 8k

T = 12k

T = 16k

T = 20k

T = 24k

T = 28k

T = 30k

T = 4k

T = 8k

T = 12k

T = 16k

T = 20k

T = 24k

T = 28k

T = 30k

Messaging Messaging

MessagingMessaging

Messaging Messaging

MessagingMessaging Messaging Messaging

MessagingMessaging

Messaging Messaging

MessagingMessaging

Regulation Regulation

Regulation

Regulation

RegulationRegulation

Regulation

Regulation

Regulation

Regulation

Regulation

RegulationRegulation

Regulation

Regulation

Regulation

T = 200k

T = 400k

T = 600k

T = 800k T = 1500k

T = 1400k

T = 1200k

T = 1000kT = 4k

T = 8k

T = 12k

T = 16k

T = 20k

T = 24k

T = 28k

T = 30k