predicting rna structure and function. ribozyme the ribosome : the protein factory of the cell...

45
Predicting RNA Structure and Function

Post on 21-Dec-2015

218 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

PredictingRNA Structure and Function

Page 2: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Ribozyme

Page 3: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

The Ribosome : The protein factory of the cell mainly made of RNA

Page 4: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Non coding DNA (98.5% human genome)

• Intergenic

• Repetitive elements

• Promoters

• Introns

• untranslated region (UTR)

Page 5: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Some biological functions of ncRNA

• Control of mRNA stability (UTR)

• Control of splicing (snRNP)

• Control of translation (microRNA)

The function of the RNA molecule depends on its folded structure

Page 6: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Example:Control of Iron levels by mRNA structure

G U A GC N N N’ N N’ N N’ N N’C N N’ N N’ N N’ N N’ N N’ 5’ 3’

conserved

Iron Responsive ElementIRE

Recognized byIRP1, IRP2

Page 7: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

IRP1/2

5’ 3’F mRNA

5’ 3’TR mRNA

IRP1/2

F: Ferritin = iron storageTR: Transferin receptor = iron uptake

IRE

Low Iron IRE-IRP inhibits translation of ferritinIRE-IRP Inhibition of degradation of TR

High IronIRE-IRP off -> ferritin translated

Transferin receptor degradated

Page 8: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

RNA Structural levels

tRNA

Secondary Structure Tertiary Structure

Page 9: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

RNA Secondary Structure

U U

C G U A A UG C

5’ 3’

5’G A U C U U G A U C

3’

STEM

LOOP• The RNA molecule folds on itself. • The base pairing is as follows: G C A U G U hydrogen bond.

Page 10: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

RNA Secondary structureShort Range Interactions

G G A U

U GC C GG A U A A U G CA G C U U

INTERNAL LOOP

HAIRPIN LOOP

BULGE

STEM

DANGLING ENDS5’ 3’

Page 11: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

long range interactions of RNA secondary structural elements

Pseudo-knot

Kissing hairpins

Hairpin-bulge contact

These patterns are excluded from the prediction schemes as their computation is too intensive.

Page 12: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Predicting RNA secondary Structure

• Searching for a structure with Minimal

Free Energy (MFE)

Page 13: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Free energy model• Free energy of structure (at fixed temperature, ionic

concentration) = sum of loop energies

• Standard model uses experimentally determined thermodynamic parameters

Page 14: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Why is MFE secondary structure prediction hard?

• MFE structure can be found by calculating free energy of all possible structures

• but, number of potential structures grows exponentially with the number, n, of bases

• structures can be arbitrarily complex

Page 15: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

RNA folding with Dynamic programming (Zuker and Steigler)

• W(i,j): MFE structure of substrand from i to j

i j

W(i,j)

Page 16: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

RNA folding with dynamic programming

• Assume a function W(i,j) which is the MFE for the sequence starting at i and ending at j (i<j)

• Define scores, for example a base pair’s score is less than a non-pair

• Consider 4 recursion possibilities:– i,j are a base pair, added to the structure for i+1..j-1

• Define this as V(i,j)– i is unpaired, added to the structure for i+1..j– j is unpaired, added to the structure for i..j-1– i,j are paired, but not to each other; the structure for i..j adds

together sub-structures for 2 sub-sequences: i..k and k+1..j a bifurcation (i<k<j)

• Choose the minimal energy possibility

Page 17: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Simplifying Assumptions for Structure Prediction

• RNA folds into one minimum free-energy structure.

• There are no knots (base pairs never cross).

• The energy of a particular base pair in a double stranded regions is calculated independently– Neighbors do not influence the energy.

Page 18: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Sequence dependent free-energy Nearest Neighbor Model

U U

C G G C A UG CA UCGAC 3’5’

U U

C G U A A UG CA UCGAC 3’5’

Assign negative energies to interactions between base pair regions.Energy is influenced by the previous base pair (not by the base pairs further down).

Page 19: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Sequence dependent free-energy values of the base pairs

(nearest neighbor model) U U

C G G C A UG CA UCGAC 3’5’

U U

C G U A A UG CA UCGAC 3’5’

Example values:GC GC GC GCAU GC CG UA -2.3 -2.9 -3.4 -2.1

These energies are estimated experimentally from small synthetic RNAs.

Page 20: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Mfold :Adding Complexity to Energy Calculations

• Positive energy - added for destabilizing regions such as bulges, loops, etc.

• More than one structure can be predicted

Page 21: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Free energy computation

U UA A G C G C A G C U A A U C G A U A 3’A5’

-0.3

-0.3

-1.1 mismatch of hairpin-2.9 stacking

+3.3 1nt bulge -2.9 stacking

-1.8 stacking

5’ dangling

-0.9 stacking-1.8 stacking

-2.1 stacking

G= -4.6 KCAL/MOL

+5.9 4 nt loop

Page 22: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Prediction Tools based on Energy Calculation

Fold, Mfold Zucker & Stiegler (1981) Nuc. Acids Res.

9:133-148Zucker (1989) Science 244:48-52

RNAfoldVienna RNA secondary structure serverHofacker (2003) Nuc. Acids Res. 31:3429-3431

Page 23: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Insight from Multiple Alignment

Information from multiple sequence alignment (MSA) can help to predict the probability of positions i,j to be base-paired.

G C C U U C G G G CG A C U U C G G U CG G C U U C G G C C

Page 24: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Compensatory Substitutions

U U

C G U A A UG CA UCGAC 3’

G C

5’

Mutations that maintain the secondary structure

Page 25: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

RNA secondary structure can be revealed by

identification of compensatory mutations

G C C U U C G G G CG A C U U C G G U CG G C U U C G G C C

U CU GC GN N’G C

Page 26: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Insight from Multiple Alignment

Information from multiple sequence alignment (MSA) can help to predict theprobability of positions i,j to be base-paired.

•Conservation – no additional information•Consistent mutations (GC GU) – support stem•Inconsistent mutations – does not support stem.•Compensatory mutations – support stem.

Page 27: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

RNAalifold (Hofacker 2002)From the vienna RNA package

Predicts the consensus secondarystructure for a set of aligned RNA sequences by using modified dynamic programming algorithm that addalignment information to the standardenergy model

Improvement in prediction accuracy

Page 28: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Other related programs

• COVE

RNA structure analysis using the covariance model (implementation of the stochastic free grammar method)

• QRNA (Rivas and Eddy 2001)

Searching for conserved RNA structures

• tRNAscan-SE tRNA detection in genome sequences

Sean Eddy’s Lab WUhttp://www.genetics.wustl.edu/eddy

Page 29: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

RNA families

• Rfam : General non-coding RNA database

(most of the data is taken from specific databases)

http://www.sanger.ac.uk/Software/Rfam/

Includes many families of non coding RNAs and functionalmotifs, as well as their alignment and their secondary structures

Page 30: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Rfam /Pfam

• Pfam uses the HMMER

(based on Hidden Markov Models)

• Rfam uses the INFERNAL

(based on Covariation Model)

Page 31: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Rfam (currently version 7.0)

• Different RNA families or functional

Motifs from mRNA, UTRs etc.

View and download multiple sequence alignments Read family annotation Examine species distribution of family members Follow links to otherdatabases

Page 32: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

An example of an RNA family miR-1 MicroRNAs

mir-1 microRNA precursor family This family represents the microRNA (miRNA) mir-1 family. miRNAs are transcribed as ~70nt precursors (modelled here) and subsequently processed by the Dicer enzyme to give a ~22nt product. The products are thought to have regulatory roles through complementarity to mRNA.

Page 33: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Seed alignment (based on 7 sequences)

Page 34: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Predicting microRNA target

Page 35: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Predicting microRNA target genes

• Why is it hard??– Lots of known miRNAs– Mostly unknown target genes

• Initial method outline– Look at conserved miRNAs– Look for conserved target sites

Page 36: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

miRNAs in animals

• 0.5%-1.0% of predicted genes encode miRNA (!!)– One of the more abundant regulatory classes

• Tissue-specific or developmental stage-specific expression

• High evolutionary conservation

Page 37: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

TargetScan Algorithm by Lewis et al 2003

The Goal – a ranked list of candidate target genes

• Stage 1: Search UTRs in one organism

– Bases 2-8 from miRNA = “miRNA seed”

– Perfect Watson-Crick complementarity

– No wobble pairs (G-U)

– 7nt matches = “seed matches”

Page 38: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

TargetScan Algorithm

• Stage 2: Extend seed matches

– Allow G-U (wobble) pairs

– Both directions

– Stop at mismatches

Page 39: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

TargetScan Algorithm

• Stage 3: Optimize basepairing

– Remaining 3’ region of miRNA

– 35 bases of UTR 5’ to each seed match

– RNAfold program (Hofacker et al 1994)

Page 40: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

• Stage 4: Folding free energy (G) assigned to each putative miRNA:target interaction

• Assign rank to each UTR

• Repeat this process for each of the other organisms with UTR datasets

TargetScan Algorithm

Page 41: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Predicting RNA-binding protein (RBP) targets

Page 42: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Predicting RBPs target• Different types of RBPs

– Proteins that regulate RNA stability (bind usually at the 3’UTR)

– Splicing Factors (bind exonic and intronic regions) – ……

• Why is it hard– Different proteins bind different sequences – Most RBP sites are short and degenertaive (e.g.

CTCTCT )

Page 43: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Predicting Exon Splicing Enhancers ESE-finder (Krainer)

1. Built PSSM for ESE, based on experimental data (SELEX)

Page 44: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

ESE-finder

2. A given sequence is tested against5 PSSM in overlapping windows

3. Each position in the sequence is given a score

4. Position which fit a PSSM (score above a cutoff) are predicted asESEs

Page 45: Predicting RNA Structure and Function. Ribozyme The Ribosome : The protein factory of the cell mainly made of RNA

Predicting RBPs target• RNA binding sites can be predicted by

general motif finders

- MEME http://metameme.sdsc.edu/mhmm-overview.html

- DRIM http://bioinfo.cs.technion.ac.il/drim/