http://cs273a.stanford.edu [bejerano fall09/10] 1 milestones due today. anything to report?
Post on 21-Dec-2015
215 views
TRANSCRIPT
http://cs273a.stanford.edu [Bejerano Fall09/10] 2
Lecture 17
Ultraconservation evolutionary data
Finish early – come hear the talk with us?
http://cs273a.stanford.edu [Bejerano Fall09/10] 3
Sequence Conservation implies Function
(but whichwhich function/s?...)
human
mouse
mammalianancestor
...CTTTGCGA-TGAGTAGCATCTACTATTT...
...ACGTGGGACTGACTA-CATCGACTACGA...
functional region!
Comparative Genomics of Distantly related species:
http://cs273a.stanford.edu [Bejerano Fall09/10] 4
HumanGenome:
3*109 letters
Human Genome full of Conserved Non-Coding Elements
1.5%known
function >50%junk
3x more functional DNA than known!
compare to other species
>5% human genome functional
~106 substrings do not code for protein
What do they do then?
http://cs273a.stanford.edu [Bejerano Fall09/10] 5
Conserved elements in the Human Genome
all human-mouse alignmentshuman-mouse ancestral repeats alignment
Difference: 5% of
Human Genome
[Mouse consortium, Nature 2002]
election
human-mouse ancestral repeats alignment
85%id on average
http://cs273a.stanford.edu [Bejerano Fall09/10] 6
Conserved elements in the Human Genome
all human-mouse alignmentshuman-mouse ancestral repeats alignment
Difference: 5% of
Human Genome
[Mouse consortium, Nature 2002]
election
human-mouse ancestral repeats alignment
85%id on average
UltraconservationUltraconservation
Simple
but
Unexpected
(the lure of Bioinormtaics)
http://cs273a.stanford.edu [Bejerano Fall09/10] 7
Typical DNA Conservation levels
Conserved elements between human and mouse are on average 85% identical. [mouse consortium, 2002]
(dot = base identical to human)
http://cs273a.stanford.edu [Bejerano Fall09/10] 8
Ultraconserved Elements
[Bejerano et al., Science 2004]
fish
481 elements perfectly conserved (100%id) over
200bp or more between human, mouse and rat.
http://cs273a.stanford.edu [Bejerano Fall09/10] 9
Ultraconserved Elements: Why?
Hundreds of long substrings identical between amniotes they must have rejected many different changes.
But... all functions we understand in our genome are encoded using redundant codes.
**
*
**
CDS ncRNA TFBS
seq.
http://cs273a.stanford.edu [Bejerano Fall09/10] 10
Ultras are Functional
Back in 2004 we hypothesized:
481 ultraconserved elements
exonic subset –
post transcriptional regulation
[Ni et al., Genes Dev.; Lareau et al., Nature, 2007]
“nonexonic” subset –
transcriptional regulators
[Pennacchio et al., Nature, 2006]
http://cs273a.stanford.edu [Bejerano Fall09/10] 11
Genomic Distribution of Ultraconserved Elements
•exonic•non•possibly
http://cs273a.stanford.edu [Bejerano Fall09/10] 12
UC.338 comes from an ancient repeat
novelcoelacanthrepeat
ultraconservedexon
enhancer
[Bejerano et al, Nature ,2006]
LF-SINE
http://cs273a.stanford.edu [Bejerano Fall09/10] 13
Ultras are Under Strong Human Selection
Ultra DAF NonSyn DAF
[Katzman et al, Science ,2007]
Mutational cold spots? NO. Rare (new) mutations are introduced to the population.
Fierce purifying selection? YES. Very few of these get anywhere near fixation.
chimpA
humansA A G A
http://cs273a.stanford.edu [Bejerano Fall09/10] 14
Touch an Ultra And You - DIY
[Ahituv et al., PLoS Biology, 2007]
http://cs273a.stanford.edu [Bejerano Fall09/10] 15
What can’t we measure in the lab?
sN
s
e ee
esN 21
1),|fixationPr(
Ne is population size, s selective dis/advantage.Both of which are VERY wrong in the lab.
http://cs273a.stanford.edu [Bejerano Fall09/10] 16
So it can happen – but does it FIX?
tDNA element
mouse
http://cs273a.stanford.edu [Bejerano Fall09/10] 17
Count Fraction Lost, Binned by %id
t human
macaque
dog
mouse
rat
100bp
sliding
window
count_all
count_hole
bin
by
%id
humandog rat mouse
macaque
http://cs273a.stanford.edu [Bejerano Fall09/10] 19
Pragmatic Genomics
define goalrun sensible approachwhile (results full of artefacts*){ characterize artefact write handler into code rerun}
bio
bio
cs
cs
* eg: sequencing errors, assembly errors
contaminating sequence, ambiguous situations, etc.
http://cs273a.stanford.edu [Bejerano Fall09/10] 20
Ultras are Fiercely Retained through Evolution
Ultras are
>300 fold
more
persistent
than
neutral DNA(25% deleted)
the genomic deletiongenomic deletion is
100%id primates-dog: 1,691,090bp
rodents deleted: 1,447bp (0.086%)
sN
s
e ee
esN 21
1),|fixationPr(
http://cs273a.stanford.edu [Bejerano Fall09/10] 21
How special are the Ultras?
election
UltraconservationUltraconservation
http://cs273a.stanford.edu [Bejerano Fall09/10] 23
Adding More Species
Few species More and more species
Hmmm….
http://cs273a.stanford.edu [Bejerano Fall09/10] 24
Most Non-Coding Elements likely work in cis…
9Mb
“IRX1 is a member of the Iroquois homeobox gene family.
Members of this family appear to play multiple roles
during pattern formation of vertebrate embryos.”
gene deserts
regulatory jungles