of mice and men learning from genome reversal findings genome rearrangements in mammalian evolution:...

37
Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming Men into Mice: the Nadeau- Taylor Chromosomal Breakage Model Revisited both papers written by Pavel Pevzner and Glenn Tesler

Post on 21-Dec-2015

223 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Of Mice and MenLearning from genome reversal findings

Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes

and

Transforming Men into Mice: the Nadeau-Taylor Chromosomal Breakage Model Revisited

both papers written by Pavel Pevzner and Glenn Tesler

Page 2: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Reversal Distance – The minimum number of reversals to translate from one genome to another

Syntney Block – region in which the same gene order is observed between species

Ortholog – corresponding gene in two different species

Basic, Basic Terms

Page 3: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Overview

● Theory of reversal distance calculation

● A new model for presenting reversal information (primary topic of Genome Rearrangements in Mammalian Evolution)

● Evidence of “fragile” genome regions (primary topic of (Transforming Men into Mice)

Page 4: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

What are we solving?

Find d(), the reversal distance, from permutation

to permutation where

is L 4 5 2 1 3 6 R

is L 1 2 3 4 5 6 R

Page 5: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Reversals

A reversal operation, , is defined as follows:

= [ i , j ]

(k) = { ( i + j - k ) if i < k < j, (k) otherwise}

Page 6: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Breakpoints

L 1 3 2 4 5 6 R

A breakpoint of with respect to is a pair x, y of elements of Lº such that xy appears in the extended version of , but neither xy nor the reverse pair yx

appear in the extended .

Page 7: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Reversal Distance: Guess #1

d() > b() / 2

...we can do better than that!

Page 8: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Reality and Desire Construction

Extended

Reality and Desire Edges

Terminals

Reality Edges

Page 9: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Reality and Desire Diagram

Reality and Desire Diagram - RD()

c() = # of Cycles

Page 10: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Reversal Distance: Guess #2

d() > n + 1 - c()

Try taking a closer look...

Page 11: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Components

Component – set of interleaving cycles (cycles which cross in a reality and desire diagram)

This reality and desire diagram has six components.

Page 12: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

You Are Here

Page 13: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Converging and Diverging

● Edges A, C, and E, converge

● Edges D and F diverge● Edges B and D diverge● Edges F and B converge

Page 14: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Converge? Diverge? So what?

Let = [ e , f ] and act on RD()...

If edges e and f belong to different cycles,then c() = c() – 1

If edges e and f belong to the same cycle and converge,then c() = c()

If edges e and f belong to the same cycle and diverge,then c() = c() + 1

Page 15: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

The Good and the Bad

Good Components contain at least one Good Cycle.

Bad Components contain only Bad Cycles.

Good Cycles contain at least one pair of diverging edges.

Bad Cycles contain only converging edges.

Page 16: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Some “Bad” Examples

This reality and desire diagram has five bad components and only one good component (bottom).

The good component has one good cycle and one bad cycle.

Page 17: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

You Are Here

Page 18: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Hurdles

Hurdle – a bad component that does not separate any other two bad components

Nonhurdle – a bad component that does separate at least two bad components

Page 19: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Example Hurdles

In this example...● A, F, C, and D are

hurdles.● E and B are nonhurdles.● h() = 4

Page 20: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

You Are Here

Page 21: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Super/Simple Hurdles

In this example...● Hurdle F protects

nonhurdle E● F is a super hurdle● A, C, and D are simple

hurdles● h() = 4

Page 22: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

The Fortress

Fortress – A permutation whose reality and desire diagram contains an odd number of hurdles and all of them are super hurdles.

f( = 1{ is a fortress}

Page 23: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Example Fortress

Smallest Possible fortress:

Page 24: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Reversal Distance: Guess #3

d() = n + 1 – c() + h() + f()

Finally!!!

Page 25: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

References

The preceding material was taken from Introduction to Computational Molecular Biology by Setubal and Meidanis, based on the following papers:

●V. Bafna and P. A. Pevzner – Genome rearrangements and sorting by reversals.●S. Hannenhalli and P. A. Pevzner – Transforming cabbage into turnip (polynomial algorithm for sorting signed permutations by reversals) (this paper referenced in text of Transforming Men into Mice for definitions of hurdles and fortresses)●J. D. Kececioglu and D. Sankoff – Exact and approximate algorithms for sorting by reversals with application to genome rearrangement

Page 26: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

First Paper

Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes

Pavel Pevzner and Glenn Tesler

Page 27: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

First Paper Overview

This paper presents a new kind of graph which achieves the usefulness of reality and desire

diagrams on simple genome comparison graphs.

Page 28: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

The ProcessGRIMM-Synteny Algorithm

Useful features:

● Same cycle count as reality desire diagram!

● Cycles of more than for edges indicate reused breakpoints!

Page 29: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

WholeGenome

ResultsSynteny Blocks: 281Reversal Distance: 245

Page 30: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Second Paper

Transforming Men into Mice: the Nadeau-Taylor Chromosomal Breakage Model Revisited

Pavel Pevzner and Glenn Tesler

Page 31: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Second Paper Overview

Are breakpoints random or are some sections of the genome more “fragile” than others?

Page 32: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Conventional Wisdom

“Since the [random breakage] model was first introduced in [paper cited]..., it has been analyzed by Nadeau and others [more papers cited]... and has become widely accepted”

To test, simply plot the lengths of known conserved segments and compare to an exponential distribution...

Page 33: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Do we have a match?

Too many short segments!

Page 34: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Micro-rearrangement Evidence

● There is evidence of at least 3,170 micro-rearrangements (reversals) within the synteny blocks (though many may be artifacts of incorrect assemblies)

● 41 out of 281 synteny blocks do not show any evidence of micro-rearrangements, while 10 synteny blocks are extremely rearranged (40 or more rearrangements within a block)

Page 35: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Calculating Breakpoint Reuse

Theorem 1: “If all reversals are delimited by pairs of breakpoints, the number of breakpoint re-uses in any parsimonious reversal scenario is 2d - br. This is the lower bound for non-optimal reversal scenarios.”

2 x 245 (Distance) – 300 Breakpoints = = 190 breakpoint reuses

281 Synteny Blocks – 23 Chromosomes +190 Breakpoint Reuses = 448 Breakages

Page 36: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Statistical Evidence

Expected number of “clumps” (pairs of points within a space w, which is a fraction of genome length) is (n – 1)(1 – (1 – w)n), where n is the number of breakages.

For w = 0.668Mb/2,983Mb, the number of expected “clumps” is about 43, far less than the 190 number of reused breakpoints!

Page 37: Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming

Questions?