rna processing data analysis lisa bloomer green april 26, 2010

23
RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Upload: ewa

Post on 24-Feb-2016

32 views

Category:

Documents


0 download

DESCRIPTION

RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010. RNA Processing Data Analysis. Real Time PCR Splicing The Problem Solving the Simplified Problem Making it Complicated Again. Real-time PCR. DNA is copied again and again for exponential growth in quantity present. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

RNA Processing Data Analysis

Lisa Bloomer GreenApril 26, 2010

Page 2: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

RNA Processing Data Analysis

● Real Time PCR● Splicing● The Problem● Solving the Simplified Problem● Making it Complicated Again

Page 3: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Real-time PCR DNA is copied

again and again for exponential growth in quantity present.

http://pathmicro.med.sc.edu/pcr/realtime-home.htm

Cycle Number

Amount of DNA

24 1677721625 3355443226 6710886427 13421772828 26843545629 53687091230 107374182431 140000000032 150000000033 155000000034 1580000000

Cycle Number

Amount of DNA

0 11 22 43 84 165 326 647 1288 2569 512

10 1024

Page 4: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Real-time PCR Output is the

number of cycles it takes to pass a certain threshold.

http://pathmicro.med.sc.edu/pcr/realtime-home.htm

Page 5: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

The Problem Use real-time PCR to help discover how often

alternative splicing occurs in a given region of RNA.

Alternative Splicing: A mechanism by which different forms of mature mRNAs (messengers RNAs) are generated from the same gene.

http://www.medterms.com/script/main/art.asp?articlekey=16831

Page 6: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

rpoB rpoC1 rpoC2

rpoB rpoC1 rpoC2

rpoB rpoC1

rpoC1 rpoC2

Rpo chloroplast operon

mRNA

Page 7: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Preliminary Data

rpoB rpoC1 rpoC2

rpoB

IR1r

rpoC1IR2r

rpoc2

0.00

5.00

10.00

15.00

20.00

25.00

30.00

0 1 2 3 4 5 6

Series1

rpoB IR1r rpoC1 IR2r rpoc219.84 26.88 22.52 23.77 15.71

Page 8: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

psbI psbK

psbI psbK

psb chloroplast operon

mRNA

Page 9: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

psbI psbK

Preliminary Data

psbI IRp psbK14.86 20.02 14.94

0

5

10

15

20

25

Page 10: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

A Simplified Version

Model the curve using an exponential function.

tCt NN 20

Page 11: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

A Simplified Version

Because we use the same threshold for each quantity, we will assume that Nt is the same for each quantity.

tCt NN 20

t

t

CtC

t NNN 220

Page 12: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

ExamplepsbL IRp psbK

Cycle Count 14.86 20.02 14.94Initial Count (is proportional to) 3.36 * 10-5 9.41 * 10-7 3.18 * 10-5

psbL IRp psbKCycle Count 14.86 20.02 14.94Initial Count (is proportional to) 3.36 * 10-5 9.41 * 10-7 3.18 * 10-5

Percentage 50.7% 1.4% 47.9%

Page 13: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

ExamplerpoB IR1r rpoC1 IR2r rpoC2

Cycle Count 19.84 26.88 22.52 23.77 15.71

Initial Count 1.07 * 10-6 8.10 * 10-9 1.66 * 10-7 6.99 * 10-8 1.87 * 10-5

Percentage 5.34% 0.04% 0.83% 0.35% 93.44%

Page 14: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Adding Complexity

● Need to estimate variability for N0

● The process may not be 100% efficient. We probably get less than double the amount with each cycle.

● The shape of the curve shows behavior that does not fit the exponential model.

● Nt might be different for the different quantities.

Page 15: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Variability Estimation

Standard deviation of Ct is estimated between 0.036 cycles and 0.367 cycles, with an average of 0.183 cycles. (Rutledge and Cote)

psbL IRp psbKCycle Count 14.86 20.02 14.94Confidence Interval (using 0.183 for st.dev.)

(14.50, 15.22) (19.66, 20.38) (14.58, 15.30)

Page 16: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Variability Estimation

To get confidence intervals for the percentages, we need to know how these cycle numbers interact.

psbL IRp psbKCycle CountConfidence Interval

(14.50, 15.22) (19.66, 20.38) (14.58, 15.30)

2-Ct Confidence Interval (2.6*10-5, 4.3*10-5) (7.3*10-7, 1.2*10-6) (2.5*10-5, 4.1*10-5)

Page 17: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Efficiency

E is a number between 0 and 1 that quantifies the efficiency of the doubling process.

E can be estimated from the standard curve.● Fit a line to the log curve.● Es=e^(-slope)+1

Estimates of E range from 0.85 to 1. (Fronhoffs. et. al.)

tCt NN 20 tCt ENN )1(0

Page 18: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

ExamplepsbL IRp psbK

Cycle Count 14.86 20.02 14.94Initial Count E=1 3.36 * 10-5 9.41 * 10-7 3.18 * 10-5

E=0.95 4.90 * 10-5 1.56 * 10-6 4.64 * 10-5

E=0.90 7.21 * 10-5 2.63 * 10-6 6.85 * 10-5

E=0.85 1.07 * 10-4 4.48 * 10-6 1.02 * 10-4

psbL IRp psbKCycle Count 14.86 20.02 14.94Percentage E=1 50.7% 1.4% 47.9%

E=0.95 50.5% 1.6% 47.9%

E=0.90 50.3% 1.8% 47.8%

E=0.85 50.2% 2.1% 47.7%

Page 19: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Efficiency

Can we assume that the efficiency is the same for each quantity?

How does efficiency affect variability?

(There is evidence that efficiency and threshold cycle are dependent.)

Page 20: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

The Curve is Not Exponential

Logistic growth? Does this change the percentages?

Page 21: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Is Nt the same for the different quantities?

Probably not.

The flourescence measured is affected by mass as well as number.

Are the masses known? If not, can we assume that the masses are similar?

Page 22: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

Summary

A simplified version of the problem has a straight-forward solution, which may be enough for general purposes.

Reinserting the complexity into the problem leads to interesting statistical issues.

Page 23: RNA Processing Data Analysis Lisa Bloomer Green April 26, 2010

References

● Fronhoffs, et. al. “A method for the rapid construction of cRNA standard curves in quantitative real-time reverse transcription polymerase chain reaction,” Molecular and Cellular Probes (2002) 16, 99-110.

● Rutledge and Côté “Mathematics of quantitative kinetic PCR and the application of standard curves,” Nucleic Acids Research (2003) 31, no. 16.

● Swillens, et. al. “Instant evaluation of the absolute initial nuber of cDNA copies from a single real-time PCR curve,” Nucleic Acids Research (2004) 32, no. 6.