bio 2, lecture 7 life’s information molecule ii: transcription

BIO 2, Lecture 7BIO 2, Lecture 7BIO 2, Lecture 7BIO 2, Lecture 7LIFE’S INFORMATION LIFE’S INFORMATION

MOLECULE II: TRANSCRIPTIONMOLECULE II: TRANSCRIPTION

Overview: The Flow of Genetic Information

• The information content of DNA is in the form of specific sequences of nucleotides

• The DNA inherited by an organism leads to specific traits by dictating the synthesis of proteins

• Gene expression, the process by which DNA directs protein synthesis, includes two stages: transcription and translation

Copyright © 2008 Pearson Education Inc., publishing as Pearson Benjamin Cummings

• Changes in the nucleotide sequence of DNA can lead to changes in the amino acid sequence of proteins

• The genotype of an organism is comprised of the genes that it carries

• The phenotype of an organism is comprised of its physical and behavioral traits

• An organism’s phenotype is dictated, to a large extent, by its genotype

Copyright © 2008 Pearson Education Inc., publishing as Pearson Benjamin Cummings

Genotype: 47(+21) XYPhenotype: Down

Syndrome

Genes do not control every aspect of phenotype ...

• In 1909, British physician Archibald Garrod first suggested that genes dictate phenotypes through enzymes that catalyze specific chemical reactions

• He thought symptoms of an inherited disease reflect an inability to synthesize a certain enzyme

• Linking genes to enzymes required understanding that cells synthesize and degrade molecules in a series of steps, a metabolic pathway

• George Beadle and Edward Tatum exposed bread mold to X-rays, creating mutants that were unable to survive on minimal medium as a result of inability to synthesize certain molecules

• Using crosses, they identified three classes of arginine-deficient mutants, each lacking a different enzyme necessary for synthesizing arginine

• They developed a one gene–one enzyme hypothesis, which states that each gene dictates production of a specific enzyme

EXPERIMENT

Growth:Wild-typecells growingand dividing

No growth:Mutant cellscannot growand divide

Minimal medium

RESULTSClasses of Neurospora crassa

Wild type Class I mutantsClass II mutantsClass III mutants

Minimalmedium(MM)(control)

MM +ornithine

MM +citrulline

MM +arginine(control)

Con

dit

ion

CONCLUSION Class I mutants(mutation in

gene A)

Class II mutants(mutation in

gene B)

Class III mutants(mutation in

gene C)Wild type

Precursor Precursor Precursor PrecursorEnzyme AEnzyme AEnzyme AEnzyme A

Ornithine Ornithine Ornithine OrnithineEnzyme BEnzyme B Enzyme BEnzyme B

Citrulline Citrulline Citrulline CitrullineEnzyme CEnzyme CEnzyme CEnzyme C

Arginine Arginine Arginine Arginine

Gene A

Gene B

Gene C

• Some proteins aren’t enzymes, so researchers later revised the hypothesis: one gene–one protein

• But proteins with quarternary structure are encoded by more than one gene, so Beadle and Tatum’s hypothesis was again revised as the one gene–one polypeptide hypothesis

• But now we know that each gene can encode more than one polypeptide due to a phenomenon called alternative splicing ...

• So now we have the one gene–one or more polypeptides hypothesis

• RNA is the intermediate between genes and the proteins for which they code

• Transcription is the copying of one strand of the double-stranded DNA into a single-stranded RNA molecule

• Transcription produces messenger RNA (mRNA)

• Translation is the synthesis of a polypeptide from the mRNA on a ribosome

RNA polymerase

Chromosomes are like a library of books (in the form of DNA molecules) that cannot be checked

out

But an mRNA copy of some of the pages of

some of the books (genes) are made in a

process called transcription

Ribosomes

Ribosomes then read the

instructions in the RNA molecules to build proteins in a

process called translation

• In prokaryotes, transcription and translation take place in the same space (the cytosol) and the mRNA produced by transcription is immediately translated without more processing

• In a eukaryotic cell, the nuclear envelope separates transcription from translation

• Eukaryotic RNA transcripts are modified through RNA processing to yield finished mRNA

The central dogma is the concept that cells are governed by a cellular chain of command:

DNA RNA protein

(a) Bacterial cell

TRANSCRIPTION DNA

mRNA

TRANSLATIONRibosome

Polypeptide

(b) Eukaryotic cell

TRANSCRIPTION

Nuclearenvelope

DNA

Pre-mRNARNA PROCESSING

mRNA

TRANSLATION Ribosome

Polypeptide

• How are the instructions for

assembling amino acids into proteins encoded by the DNA?

• There are 20 amino acids, but there are only four nucleotide bases in DNA

• How many bases correspond to an amino acid?

• The flow of information from gene to protein is based on a triplet code: a series of non-overlapping, three-nucleotide “words” called codons

• Example: The triplet 5’-AGT-3’ in a gene results in the placement of the amino acid serine in the polypeptide coded by the gene

• Another example: 5’-GGG-3’ codes for the animo acid glycine

• During transcription, one of the two DNA strands is copied into mRNA

• During translation, the codons in the mRNA are read in the 5 to 3 direction

• Each codon specifies the amino acid to be placed at the corresponding position along a polypeptide

DNAmolecule

Gene 1

Gene 2

Gene 3

DNAtemplatestrand

TRANSCRIPTION

TRANSLATION

mRNA

Protein

Codon

Amino acid

DNAcodingstrand

• All 64 codons were deciphered by the mid-1960s (43 = 64)

• Of the 64 triplets, 61 code for amino acids; 3 triplets are “stop” signals to end translation of the mRNA

• The genetic code is redundant but not ambiguous• More than one codon can code for one

amino acid• No codon specifies more than one amino

acid

Second mRNA base

Fir

st

mR

NA

base (

5 e

nd

of

cod

on

)

Th

ird

mR

NA

base (

3 e

nd

of

cod

on

)

• The genetic code is universal, shared by the simplest bacteria to the most complex animals

• This is why genes can be transcribed and translated after being transplanted from one species to another (recombinant DNA technology)

Pig expressing a jellyfish gene

• Transcription, the first stage of gene expression, has been examined in great detail

• RNA synthesis is catalyzed by RNA polymerase, which pries the DNA strands apart and hooks together the RNA nucleotides

• RNA synthesis follows the same base-pairing rules as DNA, except uracil substitutes for thymine

• The DNA sequence where RNA polymerase attaches to a gene is called a promoter because the presence of this sequence “promotes” the recognition and transcription of the gene

• Areas of the DNA lacking promoters are not transcribed

• The stretch of DNA that is transcribed is called a transcription unit

Promoter Transcription unit

DNAStart point

RNA polymerase

553

3


DNAStart point

RNA polymerase

553

3

Initiation

33

1

RNAtranscript

5 5

UnwoundDNA

Template strandof DNA


DNAStart point

RNA polymerase

553

3

Initiation

33

1

RNAtranscript

5 5

UnwoundDNA


2 Elongation

RewoundDNA

5

5 5 3 3 3

RNAtranscript


DNAStart point

RNA polymerase

553

3

Initiation

33

1

RNAtranscript

5 5

UnwoundDNA


2 Elongation

RewoundDNA

5

5 5 3 3 3

RNAtranscript

3 Termination

5

5

5 33

3Completed RNA transcript

Elongation

RNApolymerase

Nontemplatestrand of DNA

RNA nucleotides

3 end

Direction oftranscription(“downstream”) Template

strand of DNA

Newly madeRNA

3

5

5

• Transcription can be broken down into 3 stages:

– Initiation– Elongation– Termination

• Promoters attract proteins called transcription factors to the gene

• Transcription factors then attract RNA polymerase so that transcription can be initiated

• The completed assembly of transcription factors and RNA polymerase II bound to a promoter is called a transcription initiation complex

• Promoters contain A-T rich regions, making it easier for RNA polymerase to pry apart (“melt”) the DNA strands

A eukaryotic promoterincludes a TATA box

3

1

2

3

Promoter

TATA box Start point

Template

TemplateDNA strand

535

Transcriptionfactors

Several transcription factors mustbind to the DNA before RNApolymerase II can do so.

5533

Additional transcription factors bind tothe DNA along with RNA polymerase II,forming the transcription initiation complex.

RNA polymerase IITranscription factors

55 53

3

RNA transcript

Transcription initiation complex

• As RNA polymerase moves along the DNA, it untwists the double helix, 10 to 20 bases at a time

• Transcription progresses at a rate of 40 nucleotides per second in eukaryotes

• A gene can be transcribed simultaneously by several RNA polymerases

• The mechanisms of termination are different in bacteria and eukaryotes

• In bacteria, the polymerase stops transcription at the end of the terminator

• In eukaryotes, the polymerase continues transcription after the pre-mRNA is cleaved from the growing RNA chain; the polymerase eventually falls off the DNA

• Enzymes in the eukaryotic nucleus modify pre-mRNA before the genetic messages are dispatched to the cytoplasm

• During RNA processing, both ends of the primary transcript are usually altered

• Also, usually some interior parts of the molecule are cut out, and the other parts spliced together

• Each end of a pre-mRNA molecule is modified in a particular way:– The 5 end receives a modified nucleotide

5 cap– The 3 end gets a poly-A tail

• These modifications share several functions:– They seem to facilitate the export of

mRNA– They protect mRNA from hydrolytic

enzymes– They help ribosomes attach to the 5 end

Protein-coding segmentPolyadenylation signal3

3’ UTR5’ UTR

5

5’ Cap Start codon Stop codon Poly-A tail

G P PP AAUAAA AAA AAA…

Structure of a eukaryotic mRNA

• Most eukaryotic genes have long noncoding stretches of nucleotides that lie between coding regions

• These noncoding regions are called intervening sequences, or introns

• The other regions are called exons because they are eventually expressed, usually translated into amino acid sequences

• RNA splicing removes introns and joins exons, creating an mRNA molecule with a continuous coding sequence

• Watson and Crick reasoned that the pairing was more specific, dictated by the base structures

• They determined that adenine (A) paired only with thymine (T), and guanine (G) paired only with cytosine (C)

• The Watson-Crick model explains Chargaff’s rules: in any organism the amount of A = T, and the amount of G = C

Pre-mRNA

mRNA

Codingsegment

Introns cut out andexons spliced together

5’ Cap

Exon Intron5’

1 30 31 104

Exon Intron

105

Exon

146

3’Poly-A tail

Poly-A tail5’ Cap

5’ UTR 3’ UTR1 146

• Some genes can encode more than one kind of polypeptide, depending on which segments are treated as exons during RNA splicing

• Such variations are called alternative RNA splicing

• Because of alternative splicing, the number of different proteins an organism can produce is much greater than its number of genes

• Proteins often have a modular architecture consisting of discrete regions called domains

• In many cases, different exons code for the different domains in a protein

• Exon shuffling may result in the evolution of new proteins

• Since the two strands of DNA are complementary, each strand acts as a template for building a new strand in replication

• In DNA replication, the parent molecule unwinds, and two new daughter strands are built based on base-pairing rules

GeneDNA

Exon 1 Exon 2 Exon 3Intron Intron

Transcription

RNA processing

Translation

Domain 2

Domain 3

Domain 1

Polypeptide

bio 2, lecture 7 life’s information molecule ii: transcription

Documents

transcription slide

specific enzyme slide

gene b gene c slide

ribosome slide

syndrome slide

minimal medium slide

polypeptides hypothesis

gene b class