2014 whitney-public-talk
TRANSCRIPT
{
The Genomic Revolution; How Sequencing Anythingand Everything Is Changing the Way We Do Science
C. Titus Brown
Reed College, BA in Mathematics Caltech, PhD and post-doctoral fellow in
Biology; Michigan State University, Assistant
Professor in Biology and Computer Science.
My background
I’m still confused by almost everything, but in some cases I have a lot more detail to be confused about.
So if you ask questions, I may say “I don’t know!” (but may then guess).
Please! Ask questions!
On “Expertise”
First, genetic investigation of fetuses in utero;
Second, tracking hospital infections;
Third, investigating global nutrient cycling.
Three stories:
Genome sequencing!!
The ability to cheaply sequence DNA is an
extremely exciting and fairly new technique; all three
stories used this extensively.
Why these stories?
1. DNA, genomes, and sequencing;2. Story 1: genetics of unborn fetuses3. Story 2: staph transmission in hospitals4. Story 3: global nutrient cycling in the
oceans5. My research, briefly!
Outline
A C G T
DNA.
AGTCCA is different from CCAAGT!
DNA is combinatorial.
This means that for a string of 10 DNA bases, there are over 1
million combinations!
AAAAAAAAATAAAAAAAATAAAAAAAATAA
…
DNA is combinatorial.
This combinatorial property matters because it means that DNA is an alphabet and can be used as a language – you can build “words” and “sentences” in it.
(Just in case you’re wondering, we still don’t really understand the language in detail, although we know a lot about it.)
DNA is a language.
Every cell in your body contains about 6 billion bases of DNA, in a particular order.
This is your genome.
Almost every one of your cells contains the same 6 billion bases of DNA.
…and it’s what you pass on to your children.
DNA underlies heredity
Since your genome has 6 billion bases of DNA, it would take up about 1.5 million pages in a book --
This book would be the architectural plans for you!
DNA is a language.
Sequencing your genome is the same thing as digitizing it – putting the sequences of bases into a format that computers can read.
Analogy: scanning in old photos.
Important side note: just because you can digitize it, doesn’t mean you understand it!
“Sequencing” the genome.
You can look for known words and sentences, to diagnose disease susceptibility.
You can compare with other genomes, to find out what words and sentences might be responsible for disease.
Why is it useful to sequence your genome?
The first human genome cost between $300m and $3bn dollars. That was in ~2002.
Today, you can sequence your genome for under $5000!
This decrease in cost lets us look at a lot more genomes!
…and the price is dropping fast.
How much does it cost?!
Knowing a particular genome sequence lets us look for known disease susceptibility, as well as helping us find “words” associated with unknown diseases.
We can do this for around $5000 per person.
Summary of DNA:
Questions at this point?
Inheritance of traits.
You have two near-copies of each string of DNA, or “chromosome”.
Inheritance of traits.
These two copies are a bit different.
Inheritance of traits.
One copy may carry a particular trait – say, albinism, or wet earwax.
Inheritance of traits.
This trait may not show up if you have only one copy (albinism).
Non-albino
Inheritance traits.
But if it’s on both copies, it may have an effect.
Albino
Albinism occurs only with two copies of albino trait.
Non-albino Albino
…many diseases work the same way.
OK Very badly ill
Can we diagnose fetuses?
OK Very badly ill
Amniocentisis is invasive.
http://www.reproduccionasistida.org/evitar-amniocentesis/
Mother’s genome Father’s genome
Child’s genome
Heredity and crossover.
Thomas Hunt Morgan, 1916
Mother’s genome Father’s genome
Children’s genomes
(Only 1 in 4 will have trait.)
Sequence plasma, mother, and father – then count.
Pregnant mother
+ Father
* Complication: between 1/10 and ½ of cells are fetal.
(+ Fetal cells)
Good accuracy!Fan et al., 2012
So, it’s now possible (if not yet really cheap!) to non-invasively figure out the genotype of a fetus, by sampling parents + blood.
Instead of one genome, sequence three!
DNA sequencing shows a lot of promise for diagnosing rare diseases.
http://www.forbes.com/sites/matthewherper/2011/01/05/the-first-child-saved-by-dna-sequencing/
Questions?
Methicillin-resistant Staph (“MRSA”)
Wikipedia
Staph tends to attack soft tissue in people who are already ill.
Correlation between staph infections and hospitals/assisted care.
Staph infections are a problem!
Hypothesis 1: broad transmission
Hypothesis 2: deep transmission
Does it spread within facilities?
or
Does it spread between facilities?
How does staph spread?
Sequence staph strains from within hospitals.
If transmission is within hospital, all the strains will look alike.
If transmission is mainly from outside, strains will be spread across hospitals.
Approach:
Tracking transmission by mutations in the genome
AncestorPresent strains
Do staph strains cluster by hospital?
Prosperi et al., 2013. Nature.
Hospitals:
Strain relatedness
Do staph strains cluster by hospital? No!
Prosperi et al., 2013. Nature.
Hospitals:
Strain relatedness
More than 80% of staph infections were newly acquired from non-patients!
Implications for prevention: focus on isolation from outsiders, not just patients.
Conclusions: mostly from outside.
Questions?
Exploring the microbial unknown!
The Great Plate Count Anomaly: most microbes cannot be studied in the lab
http://schaechter.asmblog.org/schaechter/2010/07/the-uncultured-bacteria.html
Distribution of microbial archaea off of Hawaii; why so many, so deep?
Depth
Location
Karner et al., Nature, 2001.
Distribution of microbial archaea off of Hawaii; why so many, so deep?
Measurements + extrapolation suggest:
1/3 of cells in ocean are archaeal;2/3 of cells in ocean are bacterial.
Approximately 20% of cells are from one group of archaea.
Hints came from just sequencing “seawater” in 2004:
“an ammonium monooxygenase gene was found on an archaeal-associated” section of genome.
What are all these archaea doing!?
Venter et al., 2004. Science.
“Primary pump” – CO2 sequestration in deep ocean
Wikipedia
Current theory is that a majority of the nitrification in the ocean (a driver of this CO2 sequestration pump) occurs via these archaeal cells.
What are all these archaea doing!?
More emphasis on analysis rather than just data gathering.
More exploration – “just sequence it”. More unexpected results!
How is cheap sequencing changing research?
We can generate a lot of data quite easily.
How do we interpret the data correctly, and efficiently? How do we correlate between data sets?
How can we do good biology in the face of these technical challenges?
My research is:
Great Prairie Grand Challenge --SAMPLING
LOCATIONS
2008
Sea lamprey in the Great Lakes
Non-native Parasite of
medium to large fishes
Caused populations of host fishes to crash
Li Lab / Y-W C-D
Tail loss and notochord genes
a) M. oculata b) hybrid (occulta egg x oculata sperm) c) M. occultaNotochord cells in orange Swalla, B. et al. Science, Vol 274, Issue 5290, 1205-1208 , 15 November 1996
You do!
Via:National Science Foundation (NSF);National Institutes of Health (NIH);
US Department of Agriculture (USDA);US Department of Energy (DOE)
Who funds all this research (including mine)?