Phase Bias in Overlapping Genes
Niv Sabath, University of Houston



Page 1:

Phase Bias in Overlapping Genes

Niv Sabath
University of Houston

Page 2:

Introduction

• Overlapping genes are ubiquitous, particularly in bacteria and viruses

• Genes can overlap
  - On the same strand (→ →)
  - On opposite strands (→ ← or ← →)

• In bacteria
  - ~30% of the genes overlap
  - ~70% of the overlaps are on the same strand

Page 3-12:

[Figure, built up over ten slides: "Same-Strand Overlaps". Histogram of count against overlap length, divided into short overlaps and long overlaps. A 5’→3’ diagram shows the stop codons TGA, TAA, and TAG with reading frames labelled Phase 0, Phase 1, and Phase 2. Successive builds are labelled 1, 2, 4, 5, 7, 8, 10, and 11 (apparently overlap lengths in nucleotides).]
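The link between overlap length and phase in the figure above can be sketched in a few lines of Python. This is an illustration only, not the author's method: the function name `overlap_phase` is hypothetical, and the mapping from length to phase label is an assumption, chosen so that the 4-nucleotide overlap ATGA falls in phase 2, which is how a later slide lists it.

```python
def overlap_phase(overlap_length: int) -> int:
    """Return the assumed phase (0, 1, or 2) of a same-strand overlap.

    Assumption: the frame shift of the downstream gene relative to the
    upstream gene is (-overlap_length) mod 3, so lengths 1, 4, 7, ... map
    to phase 2 and lengths 2, 5, 8, ... map to phase 1.
    """
    if overlap_length <= 0:
        raise ValueError("overlap length must be positive")
    return (-overlap_length) % 3


# The lengths that label the builds of the figure.
for length in (1, 2, 4, 5, 7, 8, 10, 11):
    print(length, overlap_phase(length))
```

Under this assumed convention, lengths divisible by 3 would be in-frame (phase 0), which matches their absence from the list of labels.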

Page 13:

[Figure: count against overlap length for long and short overlaps, with Phase 0, Phase 1, and Phase 2 marked on a 5’→3’ diagram.]

T A G C A T G T A T C A T G G

“…the phase bias must be a property of gene locations.”

“We propose that through some mechanism yet to be determined, the creation of unidirectional gene overlaps of phase +1 confers some advantage.” (Cock and Whitworth 2007)

Page 14:

170 bacterial genomes
15,298 long overlaps in phase 1
4,153 long overlaps in phase 2
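The size of the bias follows directly from the two counts on this slide. A minimal sketch of the arithmetic, with variable names chosen here for illustration:

```python
phase1_long = 15298  # long overlaps in phase 1 (from the slide)
phase2_long = 4153   # long overlaps in phase 2 (from the slide)

# Ratio of phase 1 to phase 2 long overlaps.
ratio = phase1_long / phase2_long

# Fraction of all long overlaps that are in phase 1.
phase1_share = phase1_long / (phase1_long + phase2_long)

print(f"phase 1 : phase 2 = {ratio:.2f}")
print(f"phase 1 share of long overlaps = {phase1_share:.1%}")
```

The ratio comes out at roughly 3.7 phase 1 overlaps for every phase 2 overlap.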

Page 15:

Composition?

Page 16:

Scenarios of overlap creation

[Figure: three 5’→3’ diagrams; details not recoverable from the transcript.]

Page 17:

Hypothesis

The phase bias in overlap frequency is a result of the difference between the frequencies of initiation/termination codons in the phase 1 and phase 2 reading frames.

Page 18:

We examined the frequencies of ATG, the most common start codon, and of stop codons in the phase 1 and phase 2 reading frames.
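One way to picture this count is to scan a coding sequence and tally ATG and stop triplets by where they start relative to the annotated frame. The sketch below is an assumption-laden illustration, not the procedure used in the talk: `count_by_offset` is a hypothetical helper, and treating start offsets 1 and 2 as the phase 1 and phase 2 frames is an assumed labelling.

```python
from collections import Counter

STOP_CODONS = {"TAA", "TAG", "TGA"}


def count_by_offset(seq: str, targets: set[str]) -> Counter:
    """Count occurrences of any target triplet, keyed by start index mod 3.

    Offset 0 is the annotated reading frame of `seq`; offsets 1 and 2 are
    taken here to stand for the phase 1 and phase 2 frames (an assumption).
    """
    seq = seq.upper()
    counts = Counter({0: 0, 1: 0, 2: 0})
    for i in range(len(seq) - 2):
        if seq[i:i + 3] in targets:
            counts[i % 3] += 1
    return counts


example = "TAGCATGTATCATGG"  # the sequence shown on the slides
atg_counts = count_by_offset(example, {"ATG"})
stop_counts = count_by_offset(example, STOP_CODONS)
print(dict(atg_counts))
print(dict(stop_counts))
```

In the slide's example sequence, one ATG starts at each of the two shifted offsets, and the only stop triplet (TAG) is in frame.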


Page 20:

What causes the difference in ATG frequencies?

[Figure: the sequence T A G C A T G T A T C A T G G drawn 5’→3’ with reading frames Phase 0, Phase 1, and Phase 2, two Met labels, and two codon families listed with their amino acids:]

TGT  Cys
TGC  Cys
TGA  Stop
TGG  Trp

TAT  Tyr
CAT  His
AAT  Asn
GAT  Asp

Page 21:

What causes the difference in ATG frequencies?

Relative abundance of amino acids

[Figure: the codon families from the previous slide (TGT/TGC/TGA/TGG and TAT/CAT/AAT/GAT), with the amino acids Phe, Leu, Ile, Met, Val, Ser, Pro, Thr, Ala, Gln, Lys, Glu, Arg, and Gly shown under the labels Phase 1 and Phase 2.]

Page 22:

The frequencies of ATG are correlated with the expected frequencies

$f^{0}_{NAT} \cdot f^{0}_{GNN}$

$f^{0}_{NNA} \cdot f^{0}_{TGN}$

r² = 0.93

r² = 0.80
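The expected frequencies on this slide appear to be products of phase-0 codon-family frequencies: an out-of-frame ATG needs either a codon ending in AT followed by one starting with G, or a codon ending in A followed by one starting with TG. A minimal sketch under that reading, assuming adjacent codons are independent; the helper names and the uniform toy input are hypothetical, not data from the talk:

```python
from itertools import product

BASES = "ACGT"


def family_frequency(codon_freq: dict[str, float], pattern: str) -> float:
    """Sum the frequencies of all codons matching a pattern such as 'NAT'."""
    return sum(
        freq for codon, freq in codon_freq.items()
        if all(p in ("N", c) for p, c in zip(pattern, codon))
    )


def expected_atg(codon_freq: dict[str, float]) -> dict[str, float]:
    """Expected out-of-frame ATG frequencies from phase-0 codon frequencies.

    Assumption: adjacent codons are independent, so each expectation is a
    simple product of two codon-family frequencies.
    """
    return {
        "NAT|GNN": family_frequency(codon_freq, "NAT")
        * family_frequency(codon_freq, "GNN"),
        "NNA|TGN": family_frequency(codon_freq, "NNA")
        * family_frequency(codon_freq, "TGN"),
    }


# Toy input: every codon equally frequent (not real data).
uniform = {"".join(c): 1 / 64 for c in product(BASES, repeat=3)}
print(expected_atg(uniform))
```

With uniform codon usage the two expectations are equal, so any difference between them in a real genome has to come from its codon, and hence amino-acid, composition.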

Page 23:

Amino-acid frequency → ATG frequency → Phase bias

Page 24:

Acknowledgments

Dr. Dan Graur

Dr. Giddy Landan

NSF

© Marko Posavec

Page 25:

Why was the correlation between overlap frequency and GC content unnoticed?

Long + short overlaps:
Phase 1 + phase 2
Phase 1
Phase 2 (64% ATGA)

Page 26:

Stop codon usage:
TGA TAA TAG

Page 27:

What proportion (P) of overlaps is created by each scenario, after accounting for ATG frequencies?

Page 28:

$$R_{genes} = \frac{\text{phase 1 overlap frequency}}{\text{phase 2 overlap frequency}}$$

$$R_{ATG} = \frac{\text{phase 1 ATG frequency}}{\text{phase 2 ATG frequency}}$$

$$R_{stop} = \frac{\text{phase 1 stop codon frequency}}{\text{phase 2 stop codon frequency}}$$

Page 29:

$$R_{genes} = P \cdot R_{ATG} + (1 - P) \cdot R_{stop}$$

Page 30:

$$3.7 = P \cdot 6.4 + (1 - P) \cdot 1 \quad\Rightarrow\quad P = 0.5$$

The expected value under no bias for overlaps in the two scenarios
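The value of P can be checked by solving the mixing relation for P. The sketch below assumes the relation on the slide reads R_genes = P · R_ATG + (1 − P) · R_stop, with R_genes = 3.7, R_ATG = 6.4, and R_stop = 1 as read from the transcript; the function name `scenario_proportion` is hypothetical.

```python
def scenario_proportion(r_genes: float, r_atg: float, r_stop: float) -> float:
    """Solve R_genes = P * R_ATG + (1 - P) * R_stop for P."""
    if r_atg == r_stop:
        raise ValueError("P is undetermined when R_ATG equals R_stop")
    return (r_genes - r_stop) / (r_atg - r_stop)


# Values as read from the slide: R_genes = 3.7, R_ATG = 6.4, R_stop = 1.
p = scenario_proportion(3.7, 6.4, 1.0)
print(f"P = {p:.2f}")
```

Rearranged, P = (R_genes − R_stop) / (R_ATG − R_stop) = 2.7 / 5.4, which gives the 0.5 shown on the slide.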