the eukaryotic chromosome (chapter 16) friday, november 5, 2010 genomics 260.605.01 j. pevsner...

132
The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner [email protected]

Upload: hilary-hodges

Post on 01-Jan-2016

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

The eukaryotic chromosome(Chapter 16)

Friday, November 5, 2010

Genomics260.605.01J. Pevsner

[email protected]

Page 2: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Many of the images in this powerpoint presentationare from Bioinformatics and Functional Genomicsby J Pevsner, © 2009 by Wiley-Blackwell.

These images and materials may not be usedwithout permission.

Visit http://www.bioinfbook.org

Copyright notice

Page 3: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Today: The eukaryotic chromosome

Monday Nov. 8: The fungi (Chapter 17)

Wednesday Nov. 16: yeast genomics (Jef Boeke)

Friday November 20: computer lab (chapters 16, 17)

Schedule

Page 4: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Gene content of eukaryotic chromosomes

Regulatory regions of eukaryotic chromosomes

Comparison of eukaryotic DNA

Variation in chromosomal DNA

Techniques to measure chromosomal change

Page 5: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

C value paradox and genome sizes

Organization of genomes into chromosomes

Genome browsers

The ENCODE project

Page 6: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Introduction to the eukaryotes

Eukaryotes are single-celled or multicellular organismsthat are distinguished from prokaryotes by the presenceof a membrane-bound nucleus, an extensive system ofintracellular organelles, and a cytoskeleton.

We will explore the eukaryotes using a phylogenetic treeby Baldauf et al. (Science, 2000). This tree was made byconcatenating four protein sequences: elongation factor 1a, actin, -tubulin, and -tubulin.

Page 640

Page 7: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Eukaryotes(after Baldauf et al., 2000)

Page 730

Page 8: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

General features of the eukaryotes

Some of the general features of eukaryotes that distinguishthem from prokaryotes are:

• eukaryotes include many multicellular organisms, in addition to unicellular organisms.• eukaryotes have [1] a membrane-bound nucleus, [2] intracellular organelles, and [3] a cytoskeleton• Most eukaryotes undergo sexual reproduction• The genome size of eukaryotes spans a wider range than that of most prokaryotes• Eukaryotic genomes have a lower density of genes• Prokaryotes are haploid; eukaryotes have varying ploidy• Eukaryotic genomes tend to be organized into linear chromosomes with a centromere and telomeres.

Page 641

Page 9: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Questions about eukaryotic chromosomes

What are the sizes of eukaryotic genomes, and how are they organized into chromosomes?

What are the types of repetitive DNA elements? What are their properties and amounts?

What are the types of genes? How can they be identified?

What is the mutation rate across the genome; what are the selective forces affecting genome evolution?

What is the spectrum of variation between species (comparative genomics) and within species?

Page 10: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

C value paradox and genome sizes

Organization of genomes into chromosomes

Genome browsers

The ENCODE project

Page 11: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

C value paradox: why eukaryotic genome sizes vary

The haploid genome size of eukaryotes, called the C value,varies enormously.

Small genomes include:Encephalitozoon cuniculi (2.9 Mb)A variety of fungi (10-40 Mb)Takifugu rubripes (pufferfish)(365 Mb)(same number of genes as other fish or as the human genome, but 1/10th the size)

Large genomes include:Pinus resinosa (Canadian red pine)(68 Gb)Protopterus aethiopicus (Marbled lungfish)(140 Gb)Amoeba dubia (amoeba)(690 Gb)

Page 643

Page 12: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

C value paradox: why eukaryotic genome sizes vary

The range in C values does not correlate well with thecomplexity of the organism. This phenomenon is calledthe C value paradox.

The solution to this “paradox” is that genomes are filledwith large tracts of noncoding, often repetitive DNA sequences.

Page 643

Page 13: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Eukaryotic genomes are organized into chromosomes

Genomic DNA is organized in chromosomes. The diploid number of chromosomes is constant in each species(e.g. 46 in human). Chromosomes are distinguished by a centromere and telomeres.

The chromosomes are routinely visualized by karyotyping(imaging the chromosomes during metaphase, when each chromosome is a pair of sister chromatids).

Page 644

Page 14: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Fig. 16.1Page 645

Page 15: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Plate II. First P.G. mitosis in polar view. Tradescantia virginiana, Commelinaceae, n = 9 (from aberrrant plant with 22 chromosomes). 2 BE - CV smears. x 1200. Printed on multigrade paper. Darlington.

Page 16: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Mitosis in Paris quadrifolia, Liliaceae, showing all stages from prophase to telophase. n = 10 (Darlington).

Page 17: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Root tip squashes showing anaphase separation. Fritillaria pudica, 3x = 39, spiral structure of chromatids revealed by pressure after cold treatment. Darlington.

Page 18: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Cleavage mitosis in the teleostean fish, Coregonus clupeoides, in the middle of anaphase. Spindle structure revealed by slow fixation. Darlington.

Page 19: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

C value paradox and genome sizes

Organization of genomes into chromosomes

Genome browsers

The ENCODE project

Page 20: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

The eukaryotic chromosome: the centromere

The centromere is a primary constriction where the chromosome attaches to the spindle fibers; here the boundary between sister chromatids is not clear. It may be in the middle (metacentric) or the end (acrocentric).

If a chromosome has two centromeres spaced apart (dicentric) then at anaphase there is a 50% chance that a single chromatid would be pulled to opposite poles of the mitotic spindle. This would result in a bridge formation and chromosome breakage.

Page 644

Page 21: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

The eukaryotic chromosome: the centromere

The short arm of the acrocentric autosomes has a secondary constriction usually containing a nucleolar organizer. This contains the genes for 18S and 28S ribosomal RNA.

Page 644

Page 22: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

The eukaryotic chromosome: the telomere

The telomere is a region of highly repetitive DNA at either end of a linear chromosome. Telomeres include nucleoprotein complexes that function in the protection, replication, and stabilization of chromosome ends. Telomeres of many eukaryotes have tandemly repeated DNA sequences (discussed below).

Page 644

Page 23: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

C value paradox and genome sizes

Organization of genomes into chromosomes

Genome browsers

The ENCODE project

Page 24: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Three main genome browsers

There are three principal genome browsers for eukaryotes:

(1) NCBI offers Map Viewer (less commonly used)

(2) Ensembl (www.ensembl.org) offers browsers for dozens of genomes

(3) UCSC (http://genome.ucsc.edu) offers genome and table browsers for dozens of organisms. We will focus on this browser.

Page 645

Page 25: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Example #1 of a human genome web browser:human chromosome 21 at NCBI

nucleolar organizing center

centromere

Page 26: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

nucleolar organizing center

centromere

Example #2 of a human genome web browser:human chromosome 21at www.ensembl.org

Page 27: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

centromere

Example #3 of a human genome web browser:human chromosome 21 at UCSC

Page 28: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

C value paradox and genome sizes

Organization of genomes into chromosomes

Genome browsers

The ENCODE project

Page 647

Page 29: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

The ENCODE project

►The ENCyclopedia Of DNA Elements (ENCODE) project was launched in 2003

► Pilot phase (completed): devise and test high-throughput approaches to identify functional elements. 44 DNA targets: 1 percent of the human genome, ~30 million base pairs (Mb).

► Second phase (simultaneous): technology development.

► Third phase: production. Expand the ENCODE project to analyze the remaining 99 percent of the human genome.

Key reference: Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature (2007) 447:799-816. PMID: 17571346

Page 647

Page 30: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

The ENCODE project

Goal of ENCODE: build a list of all sequence-based functional elements in human DNA. This includes:

► protein-coding genes

► non-protein-coding genes

► regulatory elements involved in the control of gene transcription

► DNA sequences that mediate chromosomal structure and dynamics.

Page 647

Page 31: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

http://genome.cse.ucsc.edu/ENCODE/encode.hg17.html

ENCODE data at the UCSC Genome Browser

Page 32: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

http://genome.cse.ucsc.edu/ENCODE/encode.hg17.html

ENCODE data at the UCSC Genome Browser

Page 33: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

ENCODE data at the UCSC Genome Browser: beta globin

HBB, HBD, HBG1,HBG2, HBE1

Page 34: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

ENCODE data at the UCSC Genome Browser: beta globin(50,000 base pairs including HBB, HBD, HBG1, HBG2, HBE1)

Page 35: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

ENCODE tracks available at the UCSC Genome Browser

Page 36: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

ENCODE tracks available at the UCSC Genome Browser

Page 37: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Noncoding and repetitive sequences

1. Interspersed repeats

2. Processed pseudogenes

3. Simple sequence repeats

4. Segmental duplications

5. Blocks of tandem repeats

Page 38: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Britten & Kohne’s analysis of repetitive DNA

In the 1960s, Britten and Kohne defined the repetitivenature of genomic DNA in a variety of organisms.They isolated genomic DNA, sheared it, dissociatedthe DNA strands, and measured the rates of DNAreassociation.

For dozens of eukaryotes—but not bacteria or viruses—large amount of DNA reassociates extremely rapidly.This represents repetitive DNA.

Page 650

Page 39: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Fig. 16.5Page 651

Britten and Kohne (1968) identified repetitive DNA classes

Page 40: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Software to detect repetitive DNA

It is essential to identify repetitive DNA in eukaryoticgenomes. RepBase Update is a database of knownrepeats and low-complexity regions.

RepeatMasker is a program that searches DNA queriesagainst RepBase. There are many RepeatMasker sitesavailable on-line.

We will use 100,000 base pairs from human chromosome10 as an example. This region (from NT_008769)includes the retinol-binding protein 4 gene.

Page 653

Page 41: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

http://www.repeatmasker.org/current 11/09

Repeatmasker software screens DNA for repeats

Page 42: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu
Page 43: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu
Page 44: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

RepeatMaskermasksrepetitive DNA(FASTA format)

Page 45: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Five main classes of repetitive DNA

Page 652

1. Interspersed repeats (transposon-derived repeats)constitute ~45% of the human genome. They involveRNA intermediates (retroelements) or DNA intermediates(DNA transposons).

• Long-terminal repeat transposons (RNA-mediated)• Long interspersed elements (LINEs);

these encode a reverse transcriptase• Short interspersed elements (SINEs)(RNA-mediated);

these include Alu repeats• DNA transposons (3% of human genome)

Page 46: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

RepeatMasker identifies common Alu repeats

Page 47: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Five main classes of repetitive DNA

Table 16.6Page 653

1. Interspersed repeats (transposon-derived repeats)

Examples include retrotransposed genes that lack introns,such as:

ADAM20 NM_003814 14q (original gene on 8p)Cetn1 NM_004066 18p (original gene on Xq)Glud2 NM_012084 Xq (original gene on 10q)Pdha2 NM_005390 4q (original gene on Xp)

Page 48: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Interspersed repeats in the UCSC genome browser

Retrotransposon Insertion Polymorphisms

http://genome.ucsc.edu

Page 49: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

blue = in reference human genomered = not in reference

Interspersed repeats in the UCSC genome browser

Page 50: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Noncoding and repetitive sequences

1. Interspersed repeats

2. Processed pseudogenes

3. Simple sequence repeats

4. Segmental duplications

5. Blocks of tandem repeats

Page 51: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Five main classes of repetitive DNA

Page 653

2. Processed pseudogenes

These genes have a stop codon or frameshift mutationand do not encode a functional protein. They commonly arise from retrotransposition, or following gene duplication and subsequent gene loss.

For a superb on-line resource, visit Mark Gerstein’s website, http://www.pseudogene.org. Gerstein and colleagues (2006) suggest that there are ~19,000 pseudogenes in the human genome, slightly fewer than the number of functional protein-coding genes. (11,000 non-processed, 8,000 processed [lack introns].)

Page 52: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Pseudogenes in the UCSC genome browser

Yale pseudogenes

VEGA pseudogenes

Page 53: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Pseudogenes in the HOX cluster ENCODE region

Page 54: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

A pseudogene upstream of the HOX cluster

Page 55: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

HOX genes

From the Entrez Gene entry for human HOXA1:

In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation.

Page 56: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Vertebrate Genome Annotation (VEGA) database

From the VEGA home page (http://vega.sanger.ac.uk):

"The Vertebrate Genome Annotation (VEGA) database build 30 is designed to be a central repository for manual annotation of different vertebrate finished genome sequence. In collaboration with the genome sequencing centres Vega attempts to present consistent high-quality curation of the published chromosome sequences."

"Finished genomic sequence is analysed on a clone by clone basis using a combination of similarity searches against DNA and protein databases as well as a series of ab initio gene predictions (GENSCAN, Fgenes)."

"In addition, comparative analysis using vertebrate datasets such as the Riken mouse cDNAs and Genoscope Tetraodon nigroviridis Ecores (Evolutionary Conserved Regions) are used for novel gene discovery."

Page 57: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Vertebrate Genome Annotation (VEGA) database

VEGA definition of pseudogenes (http://vega.sanger.ac.uk):

Page 58: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Yale pseudogene database

http://www.pseudogene.org

Page 59: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Pseudogenes: example

Mouse GULO, required for vitamin C biosynthesis, has become a pseudogene in the primate lineage (GULO). Here is an output for GULO on the human genome:

Page 60: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Pseudogenes: example

GULO pseudogene in Entrez nucleotide:

Page 61: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Pseudogenes: example

Mouse GULO in Entrez protein:

Page 62: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Noncoding and repetitive sequences

1. Interspersed repeats

2. Processed pseudogenes

3. Simple sequence repeats

4. Segmental duplications

5. Blocks of tandem repeats

Page 63: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Five main classes of repetitive DNA

Page 657

3. Simple sequence repeats

Microsatellites: from one to a dozen base pairsExamples: (A)n, (CA)n, (CGG)n

These may be formed by replication slippage.Minisatellites: a dozen to 500 base pairs

Simple sequence repeats of a particular length andcomposition occur preferentially in different species.In humans, an expansion of triplet repeats such as CAGis associated with at least 14 disorders (includingHuntington’s disease).

Page 64: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Example of a simple sequence repeat (CCCA or GGGT) in human genomic DNA

Page 657

Page 65: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

RepeatMasker identifies simple sequence repeats

Page 66: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Repetitive DNA via the UCSC genome browser (May 2004 assembly)

Page 67: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Repetitive DNA in the UCSC genome browser

G. Benson, "Tandem repeats finder: a program to analyze DNA sequences“ Nucleic Acids Research (1999) 27:573-580.

Page 68: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

20 copies of CA 23 copies of CAT48

Page 69: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Repetitive DNA in the UCSC genome browser

Sequences of at least 15 perfect di-nucleotide and tri-nucleotide repeats identified by TRF. These tend to be highly polymorphic in the population.

Page 70: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Beta globin locus: tandem repeats, microsatellites, and RepeatMasker

Page 71: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Noncoding and repetitive sequences

1. Interspersed repeats

2. Processed pseudogenes

3. Simple sequence repeats

4. Segmental duplications

5. Blocks of tandem repeats

Page 72: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Five main classes of repetitive DNA

Page 658

4. Segmental duplications

These are blocks of about 1 kilobase to 300 kb that arecopied intra- or interchromosomally. Evan Eichler andcolleagues estimate that about 5% of the human genomeconsists of segmental duplications. Duplicated regionsoften share very high (99%) sequence identity.

As an example, consider a group of lipocalin geneson human chromosome 9.

Page 73: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Fig. 16.12Page 659

Successive tandem gene duplications(after Lacazette et al., 2000)

observed today

Page 74: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Successive tandem gene duplications(after Lacazette et al., 2000)

Fig. 16.12Page 659

Page 75: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Successive tandem gene duplications(after Lacazette et al., 2000)

Fig. 16.12Page 659

Page 76: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Successive tandem gene duplications(after Lacazette et al., 2000)

Fig. 16.12Page 659

Page 77: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Beta globin locus: segmental duplicationsinvolving HBG1 and HBG2 genes

Page 660

Page 78: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Beta globin locus: segmental duplications(pairwise alignment of duplicated regions at UCSC)

Page 79: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Beta globin locus: segmental duplications(one region is duplicated at dozens of loci)

Page 80: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Beta globin locus: segmental duplications

Page 81: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Noncoding and repetitive sequences

1. Interspersed repeats

2. Processed pseudogenes

3. Simple sequence repeats

4. Segmental duplications

5. Blocks of tandem repeats

Page 82: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Five main classes of repetitive DNA

Page 660

5. Blocks of tandem repeats

These include telomeric repeats (e.g. TTAGGG inhumans) and centromeric repeats (e.g. a 171 base pairrepeat of satellite DNA in humans).

Such repetitive DNA can span millions of base pairs, and it is often species-specific.

Page 83: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Fig. 16.14Page 661

Example of telomeric repeats (obtained by blastn searching TTAGGG4)

Page 84: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Five main classes of repetitive DNA

Page 660

5. Blocks of tandem repeats

In two exceptional cases, chromosomes lack satellite DNA:

• Saccharomyces cerevisiae (very small centromeres)• Neocentromeres (an ectopic centromere; 60 have been described in human, often associated with disease)

Page 85: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Gene content of eukaryotic chromosomes

Regulatory regions of eukaryotic chromosomes

Comparison of eukaryotic DNA

Variation in chromosomal DNA

Techniques to measure chromosomal change

Page 86: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Finding genes in eukaryotic DNA

Two of the biggest challenges in understanding anyeukaryotic genome are

• defining what a gene is, and • identifying genes within genomic DNA

Page 662

Page 87: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Finding genes in eukaryotic DNA

Types of genes include

• protein-coding genes• pseudogenes• functional RNA genes

--tRNA transfer RNA--rRNA ribosomal RNA--snoRNA small nucleolar RNA--snRNA small nuclear RNA--miRNA microRNA

Page 662

Page 88: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Finding genes in eukaryotic DNA

RNA genes have diverse and important functions.However, they can be difficult to identify in genomic DNA,because they can be very small, and lack open reading frames that are characteristic of protein-coding genes.

tRNAscan-SE identifies 99 to 100% of tRNA molecules,with a rate of 1 false positive per 15 gigabases.Visit http://lowelab.ucsc.edu/tRNAscan-SE/

Page 662

Page 89: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu
Page 90: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Finding genes in eukaryotic DNA

Protein-coding genes are relatively easy to find in prokaryotes, because the gene density is high (aboutone gene per kilobase). In eukaryotes, gene density is lower, and exons are interrupted by introns.

There are several kinds of exons:-- noncoding-- initial coding exons-- internal exons-- terminal exons-- some single-exon genes are intronless

Page 663

Page 91: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Fig. 16.16Page 664

Eukaryotic gene prediction algorithms distinguish several kinds of exons

Page 92: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Finding genes in eukaryotic DNA

Algorithms that find protein-coding genes areextrinsic or intrinsic (refer to Chapter 13).

Page 664

Page 93: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Gene-finding algorithms

Homology-based searches (“extrinsic”) Rely on previously identified genes

Algorithm-based searches (“intrinsic”)Investigate nucleotide composition, open-reading frames, and other intrinsic properties of genomic DNA

Page 664

Page 94: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

DNA

RNA

Mature RNA

protein

intron

Page 664

Page 95: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

DNA

RNA

RNA

protein

Extrinsic, homology-based searching: compare genomic DNA to expressed genes (ESTs)

intron

Page 664

Page 96: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

DNA

RNA

Intrinsic, algorithm-based searching: Identify open reading frames (ORFs). Compare DNA in exons (unique codon usage) to DNA in introns (unique splices sites)and to noncoding DNA.

Page 664

Page 97: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

chimpanzeeDNA

Comparative genomics: Compare gene models between species. (For annotation of the chimpanzee genome reported in 2005, BLAT and BLASTZ searches were used to align the two genomes.)

human DNA

Page 98: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Finding genes in eukaryotic DNA

While ESTs are very helpful in finding genes, beware ofseveral caveats.

-- The quality of EST sequence is sometimes low-- Highly expressed genes are disproportionately represented in many cDNA libraries-- ESTs provide no information on genomic location

Page 665

Page 99: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Finding genes in eukaryotic DNA

Both intrinsic and extrinsic algorithms vary in their ratesof false-positive and false-negative gene identification. Programs such as GENSCAN and Grail account for features such as the nucleotide composition of codingregions, and the presence of signals such as promoter elements. Try using the on-line genome annotation pipelineoffered by Oak Ridge National Laboratory. Google ORNL pipeline, or visithttp://compbio.ornl.gov/tools/pipeline/

Page 665

Page 100: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Oak Ridge National Laboratory (ORNL)offers an on-line annotation pipeline

Page 101: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu
Page 102: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu
Page 103: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu
Page 104: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Finding genes in eukaryotic DNA

We used 100,000 base pairs of human DNA. The pipelinecorrectly identified several exons of RBP4, but failed togenerate a complete gene model.

As another example, initial annotation of the rice genomeyielded over 75,000 gene predictions, only 53,000 of whichwere complete (having initial and terminal exons). Also,it is very difficult to accurately identify exon-intron boundaries.

Estimates of gene content improve dramatically whenfinished (rather than draft) sequence is analyzed.

Page 105: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

EGASP: the human ENCODE Genome Annotation Assessment Project

EGASP goals:

[1] Assess of the accuracy of computational methods to predict protein coding genes. 18 groups competed to make gene predictions, blind; these were evaluated relative to reference annotations generated by the GENCODE project.

[2] Assess of the completeness of the current human genome annotations as represented in the ENCODE regions.

Page 666

Page 106: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

UCSC: tracks for Gencode and for various gene prediction algorithms(focus on 50 kb encompassing five globin genes)

JIGSAW

Gencode

Page 667

Page 107: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

EGASP: the human ENCODE Genome Annotation Assessment Project

“RESULTS: The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified.”

Guigo R et al., Genome Biology (2006) 7 Suppl 1: S2.1-31

Page 667

Page 108: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Protein-coding genes in eukaryotic DNA:a new paradox

The C value paradox is answered by the presence of noncoding DNA.

Why are the number of protein-coding genes about the samefor worms, flies, plants, and humans?

This has been called the N-value paradox (number of genes)or the G value paradox (number of genes).

Page 668

Page 109: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Gene content of eukaryotic chromosomes

Regulatory regions of eukaryotic chromosomes

Comparison of eukaryotic DNA

Variation in chromosomal DNA

Techniques to measure chromosomal change

Page 110: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Transcription factor databases

In addition to identifying repetitive elements and genes,it is also of interest to predict the presence of genomicDNA features such as promoter elements and GC content.

See Table 16.10 (p. 670) for a list of websites that predicttranscription factor binding sites and related sequences.

Page 669

Page 111: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

http://www.sanger.ac.uk/Users/td2/eponine

Eponine predicts transcription start sites in promoter regions. The algorithm uses a set of DNA weight matrices recognizing sequence motifs that are associated with a position distribution relative to the transcription start site. The model is as follows:

The specificity is good (~70%), and the positional accuracy is excellent. The program identifies ~50% of TSSs—although it does not always know the direction of transcription.

Page 112: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Gene content of eukaryotic chromosomes

Regulatory regions of eukaryotic chromosomes

Comparison of eukaryotic DNA

Variation in chromosomal DNA

Techniques to measure chromosomal change

Page 113: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Comparison of eukaryotic DNA: PipMaker and VISTA

In studying genomes, it is important to align large segments of DNA.

PipMaker and VISTA are two tools for sequence alignmentand visualization. They show conserved segments, including the order and orientation of conserved elements. They also display large-scale genomic changes (inversions, rearrangements, duplications).

Try VISTA (http://www-gsd.lbl.gov/vista) or PipMaker(http://bio.cse.psu.edu/pipmaker) with genomic DNAfrom Hs10 and Mm19 (containing RBP4).

Page 673

Page 114: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Fig. ~16.22Page 674

VISTA output for an alignment of human and mousegenomic DNA (including RBP4)

Page 115: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

VISTA output for an alignment of human and mousegenomic DNA (including RBP4)

Page 674

Page 116: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Gene content of eukaryotic chromosomes

Regulatory regions of eukaryotic chromosomes

Comparison of eukaryotic DNA

Variation in chromosomal DNA

Techniques to measure chromosomal change

Page 117: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

The spectrum of variation

Category of variation Size typeSingle base pair changes 1 bp SNPs,

point mutationsSmall insertions/deletions 1 – 50 bpShort tandem repeats 1 – 500 bp microsatellitesFine-scale structural var. 50 bp – 5 kb del, dup, inv

tandem repeatsRetroelement insertions 0.3 – 10 kb SINEs, LINEs

LTRs, ERVsIntermediate-scale struct. 5 kb – 50 kb del, dup, inv,

tandem repeatsLarge-scale structural var. 50 kb – 5 Mb del, dup, inv, large

tandem repeatsChromosomal variation >>5Mb aneuploidy

Adapted from Sharp AJ et al. (2006) Annu Rev Genomics Hum Genet 7:407-42

Page 118: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Eukaryotic chromosomes can be dynamic

Chromosomes can be highly dynamic, in several ways.

• Whole genome duplication (autopolyploidy) can occur, as in yeast (Chapter 15) and some plants.• The genomes of two distinct species can merge, as in the mule (male donkey, 2n = 62 and female horse, 2n = 64)• An individual can acquire an extra copy of a chromosome (e.g. Down syndrome, TS13, TS18)• Chromosomes can fuse; e.g. human chromosome 2 derives from a fusion of two ancestral primate chromosomes• Chromosomal regions can be inverted (hemophilia A)• Portions of chromosomes can be deleted (e.g. 11q syndrome)• Segmental and other duplications occur• Chromatin diminution can occur (Ascaris)

Page 675

Page 119: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Conservative nature of chromosome evolution

Among placental mammals, the number of diploid chromosomes is:

84 in black rhinoceros46 in Homo sapiens17 in two rodent species

The process of chromosome evolution tends to remain conservative. Heterozygous carriers of most types of chromosomal rearrangements are semisterile. Thus many chromosomal changes cannot be fixed.

Ohno (1970) p. 41

Page 120: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Inversions in chromosome evolution

Chromosomal inversions occur when a fragment of a chromosome breaks at two places, inverts, and is reinserted. This is a useful mechanism for producing a sterility barrier during speciation. An example is in deer mice; another example is in Anopheles gambiae.

Ohno (1970) p. 42

Page 121: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

The eukaryotic chromosome: Robertsonian fusioncreates one metacentric by fusion of two acrocentrics

Translocations occur when chromosomal material is exchanged between two non-homologous chromosomes. Roberstonian fusion, which often accompanies speciation, is the creation of one metacentric chromosome by the centric fusion of two acrocentrics.

Robertsonian fusions are often tolerated and may sometimes be considered selectively neutral. An example is the house mouse (Mus musculus, 2n = 40) and a small group of tobacco mice in Switzerland (Mus poschiavinus, 2n = 26). Mus poschiavinus is homozygous for seven Robertsonian fusions.

Page 675; Ohno (1970) p. 43

Page 122: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

The eukaryotic chromosome: Robertsonian fusioncreates one metacentric by fusion of two acrocentrics

Ohno (1970) Plate II

ordinary male house mouse (Mus musculus, 2n = 40)

male tobacco mouse (Mus poschiavinus, 2n = 26)

Male first meiotic metaphase from an interspecific F1-hbrid. Note seven trivalents (each from one poschiavinus metacentric and two musculus acrocentrics)

Page 123: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Diploidization of the tetraploid

Ohno (1970) pp 98- 101

A species can become tetraploid. All loci are duplicated, and what was formerly the diploid chromosome complement is now the haploid set of the genome.

Polyploid evolution occurs commonly in plants. For example, in the cereal plant SorghumS. versicolor (diploid) 2n = 2 x 5; 10 chromosomesS. sudanense (tetraploid) 4n = 4 x 5; 20 chromosomesS. halepense (octoplooid) 8n = 8 x 5; 40 chromosomes

In plants, the male sex organ (stamen) and female organ (pistil or carpel) is present in the same flower; they are hermaphroditic.

Page 124: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Diploidization of the tetraploid

Ohno (1970) pp 98- 101

Polyploid evolution occurs rarely in vertebrates and other metazoans. For diploid organisms with XY/XX sex determination, in tetraploidy the male must maintain 4AXXYY and the female 4AXXXX. But during meiosis of the 4AXXYY male, the four sex elements may pair off as the XX bivalent and the YY bivalent such that every gamete is 2AXY. All offspring of the tetraploid male and tetraploid female would be 4AXXXY. If this were male, there would be no females. The 4AXXYY male cannot produce the necessary two classes of gametes, 2AXX and 2AYY.

Mammals, birds, and reptiles are thus not polyploid.

Page 125: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Tetraploidy: Odontophyrynus americanus is a newly arisen bisexual autotetraploid vertebrate

Ohno (1970) pp 98- 101

Polyploid evolution can occur in fish and amphibians, because of differences in sex determination (X and Y in males, Z and W in females).

Autopolyploidy: in a diploid organism, two daughter cells at the end of mitotic telophase may fuse into one cell, forming a tetraploid cell. Two diploid gametes may produce a tetraploid zygote.

Allopolyploidy: interspecies polyploidy.

Page 126: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Tetraploidy: Odontophyrynus americanus is a newly arisen bisexual autotetraploid vertebrate

Ohno (1970) pp 98- 101

South American frogs species of the family Ceratophyrydidae may be autopolyploids. The diploid chromosome number has a wide range:

O. cultripes 22 chromosomes 11 bivalents in meiosis

O. americanus 44 chromosomes 11 quadrivalents in meiosis

other species of this family110 chromosomes

Page 127: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Ohno (1970) plate III; p. 100

Karyotype of the tetraploid frog Odontophrynus americanus (4n = 44)

44 chromosomes: 11 sets offour homologs

sperm head

two bivalents

ten quadrivalents

Page 128: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Diploidization of the tetraploid

Ohno (1970) p. 102

As an autotetraploid arises, it has four homologous chromosomes for each linkage group. These must change to a disomic state to allow functional diversification of the loci. If the four homologues form a quadrivalent, there cannot be functional diversification. Two distinct, separate bivalents must form, for example via a pericentric inversion.

In fish of the suborder Salmonoidea, trout, salmon, whitefish and graylings are probably autotetraploid species.

Page 129: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Ohno (1970) plate IV; p. 102

Karyotypes of a species in the process of diploidization:rainbow trout Salmo irideus

Liver cell61 chromosomes

43 metacentrics18 acrocentrics

104 chrom. arms

Spleen cell59 chromosomes

45 metacentrics14 acrocentrics

104 chrom. arms

4 quadrivalents 1 quadrivalent

Page 130: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Trisomy and polysomy

Ohno (1970) p. 107

Nondisjunction results in two chromatids of one chromosome moving to the same division pole. In diploid species, one daughter cell receives three homologous chromosomes (trisomy). If this occurs in germ cells, the progeny may be trisomic.

In the Jimson weed (Datura stramonium) trisomy for each of the 12 chromosomes was observed by Blakeslee (1930). A mating between trisomic individuals may produce tetrasomic progeny having two homologous chromosomes (thus duplicating an entire chromosome).

Page 131: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Trisomy and polysomy

Ohno (1970) p. 107

For vertebrates, this mechanism is too severe. Generally, only trisomy of chromosomes 13, 18, or 21 are compatible with postnatal survival in humans.

In rainbow trout that have become tetraploid, trisomy (i.e. from four to five copies) and monosomy (i.e. from four to three copies) may be tolerated.

Page 132: The eukaryotic chromosome (Chapter 16) Friday, November 5, 2010 Genomics 260.605.01 J. Pevsner pevsner@jhmi.edu

Outline: eukaryotic chromosomes

General features of eukaryotic chromosomes

Repetitive DNA content of eukaryotic genomes

Gene content of eukaryotic chromosomes

Regulatory regions of eukaryotic chromosomes

Comparison of eukaryotic DNA

Variation in chromosomal DNA

Techniques to measure chromosomal change