Predicting Protein Properties and Structure Rui Alves

Upload: landis

Post on 23-Jan-2016



TRANSCRIPT

Page 1: Predicting Protein Properties and Structure

Predicting Protein Properties and Structure

Rui Alves

Page 2: Predicting Protein Properties and Structure

Organization of the Talk

• From cDNA sequence to protein sequence.

• Analyzing the information in the protein sequence

• Predicting the fold (secondary structure) of a protein

• Predicting the (tertiary) structure of a protein

Page 3: Predicting Protein Properties and Structure

Predicting protein sequence from DNA sequence

• Protein sequence can be predicted by translating the cDNA and using the genetic code.

Page 4: Predicting Protein Properties and Structure

Translating cDNA into protein sequence

ATGTCTCTTATATGA…

MetSerLeuIleTer

No Gene!!!!!
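The translation on this slide can be reproduced with a minimal sketch, assuming the standard (universal) genetic code and reading frame 1. The function name `translate` and the compact table layout are illustrative choices, not part of the talk.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order; "*" marks a stop (Ter).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cdna, table=CODON_TABLE):
    """Translate a cDNA string codon by codon, stopping at the first stop codon."""
    cdna = cdna.upper()
    protein = []
    for i in range(0, len(cdna) - 2, 3):
        aa = table[cdna[i:i + 3]]
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


print(translate("ATGTCTCTTATATGA"))  # MSLI*
```

In one-letter code, MSLI* is MetSerLeuIleTer: four residues and a stop, far too short to be a gene.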

Page 5: Predicting Protein Properties and Structure

Translating cDNA to Protein

Page 6: Predicting Protein Properties and Structure

Translating yeast mitochondrial cDNA into protein sequence

ATGTCTCTTATATGA………SECIS sequence

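Universal genetic code: MetSerLeuIleTer

Yeast mitochondrial code: MetSerThrMetTrp (CTT is read as Thr, ATA as Met, TGA as Trp)

With a SECIS sequence, TGA can instead be read as selenocysteine (sCys)

There is a gene, with a considerably different protein sequence from the one we would predict from the universal genetic code!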
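A minimal sketch of the same translation under the yeast mitochondrial code, assuming only the three codon reassignments relevant to this example (CTN to Thr, ATA to Met, TGA to Trp). The names `YEAST_MITO_TABLE` and `translate` are illustrative, and SECIS-dependent selenocysteine insertion is not modelled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG codon order; "*" marks a stop (Ter).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
STANDARD_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed reassignments for yeast mitochondria: CTN -> Thr, ATA -> Met, TGA -> Trp.
YEAST_MITO_TABLE = dict(STANDARD_TABLE)
YEAST_MITO_TABLE.update({"CTT": "T", "CTC": "T", "CTA": "T", "CTG": "T"})
YEAST_MITO_TABLE.update({"ATA": "M", "TGA": "W"})


def translate(cdna, table):
    """Translate codon by codon, stopping at the first stop codon in `table`."""
    protein = []
    for i in range(0, len(cdna) - 2, 3):
        aa = table[cdna[i:i + 3].upper()]
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


cdna = "ATGTCTCTTATATGA"
print(translate(cdna, STANDARD_TABLE))    # MSLI*
print(translate(cdna, YEAST_MITO_TABLE))  # MSTMW
```

Passing the codon table as a parameter keeps the translation loop identical for both codes, so only the table encodes the organelle-specific differences.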

Page 7: Predicting Protein Properties and Structure

Organization of the Talk

• From cDNA sequence to protein sequence.

• Analyzing the information in the protein sequence

• Predicting the fold (secondary structure) of a protein

• Predicting the (tertiary) structure of a protein

Page 8: Predicting Protein Properties and Structure

Inferring function from sequence

Your Sequence

Protein Sequence Database

No Known Homologues in the Database

Oh, $#!¥!!!

Go to the Protein Data Bank to get structure

&

Live happily ever after

Page 9: Predicting Protein Properties and Structure

Analyzing the information in the protein sequence

• Physical-Chemical Information

Page 10: Predicting Protein Properties and Structure

Why are these properties useful?

For example, they help identify your protein on an electrophoresis gel

Analyzing the physical chemical information in the protein sequence

Page 11: Predicting Protein Properties and Structure

How to predict hydrophobicity

Page 12: Predicting Protein Properties and Structure

How to predict molecular mass

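Ala residue mass: 71.09

Cys residue mass: 103.15

Ala-Cys dipeptide: 71.09 + 103.15 + 18.02 = 192.26

Residue masses already exclude the H2O lost on forming each peptide bond (-H2O), so one H2O (18.02) is added back for the free termini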
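The mass calculation can be sketched as follows, assuming per-residue average masses from which the water lost in peptide-bond formation has already been subtracted, plus one water for the free N- and C-termini. The values for Ala and Cys are the ones on the slide; the rest of the table holds approximate average residue masses, and `molecular_mass` is an illustrative name.

```python
# Approximate average residue masses in Da (amino acid minus one H2O).
# A and C use the slide's values; the others are rounded textbook averages.
RESIDUE_MASS = {
    "G": 57.05, "A": 71.09, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.15, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER = 18.02  # one H2O for the free amino and carboxyl termini


def molecular_mass(protein):
    """Sum the residue masses and add one water for the chain termini."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


print(round(molecular_mass("AC"), 2))  # 192.26
```

The result ignores post-translational modifications and disulfide bonds, so it is only a first estimate of where a band should run on a gel.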

Page 13: Predicting Protein Properties and Structure

How to predict isoelectric point

[Figure: protein charge (+ to -) plotted against pH for the peptide Ala-Cys…; axis values 0 and 16, with the values 9.3 and ~10 marked]

Isoelectric point is the pH at which the protein has no net charge

At each value of pH, calculate the protonation state of each residue and thus the charge of the whole protein

Does not work very well:

Amino acid pKa is dependent upon environment

Buried amino acids do not gain/lose protons as easily as exposed amino acids
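The procedure described here (compute the net charge at each pH and find where it is zero) can be sketched with the Henderson-Hasselbalch equation and a bisection search. The pKa values below are one commonly quoted set and are an assumption of this sketch; as the slide notes, real pKa values shift with the environment of each residue, so the result is only approximate.

```python
# Assumed pKa values for ionizable groups (environment effects are ignored).
PKA_POSITIVE = {"Nterm": 8.6, "K": 10.8, "R": 12.5, "H": 6.5}
PKA_NEGATIVE = {"Cterm": 3.6, "D": 3.9, "E": 4.1, "C": 8.5, "Y": 10.1}


def net_charge(protein, ph):
    """Net charge at a given pH from the protonated fraction of each group."""
    charge = 1.0 / (1.0 + 10 ** (ph - PKA_POSITIVE["Nterm"]))
    charge -= 1.0 / (1.0 + 10 ** (PKA_NEGATIVE["Cterm"] - ph))
    for aa in protein.upper():
        if aa in PKA_POSITIVE:
            charge += 1.0 / (1.0 + 10 ** (ph - PKA_POSITIVE[aa]))
        elif aa in PKA_NEGATIVE:
            charge -= 1.0 / (1.0 + 10 ** (PKA_NEGATIVE[aa] - ph))
    return charge


def isoelectric_point(protein, low=0.0, high=14.0, tolerance=1e-4):
    """Bisection on pH: the net charge falls monotonically as pH rises."""
    while high - low > tolerance:
        mid = (low + high) / 2.0
        if net_charge(protein, mid) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0


print(round(isoelectric_point("ACDEFGTYAEE"), 2))
```

Bisection works here because every term in the charge sum decreases as pH increases, so the net charge crosses zero exactly once between pH 0 and 14.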

Page 14: Predicting Protein Properties and Structure

Analyzing the information in the protein sequence

• Physical-Chemical Information

• e.g. http://prowl.rockefeller.edu/prowl-cgi/sequence.exe/.fsa

• Localization, modifications & secondary structure Information

• e.g. http://seq.cbrc.jp/proteinLocalizationResources/localizationLinks.html

Page 15: Predicting Protein Properties and Structure

Predicting the localization of your protein

Page 16: Predicting Protein Properties and Structure

How is the localization of a protein predicted?

• Search for homology to the relevant targeting sequence (TS) in your protein

• Complications:

•Small sequences, divergence, change between organisms

• Signal Peptides

•Nuclear localization signals at the N-terminus

•Mitochondrial TS

•Peroxisomal TS

•…
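Searching for a targeting sequence can be sketched as a pattern match. The example below assumes two well-known C-terminal signals, the PTS1-like peroxisomal tripeptide (canonically SKL) and the KDEL ER-retention motif; the relaxed PTS1 pattern, the dictionary, and the function name are illustrative, and real predictors use far richer models because such signals are short and divergent.

```python
import re

# Hypothetical minimal motif table; both patterns are anchored at the C-terminus.
C_TERMINAL_SIGNALS = {
    "peroxisome (PTS1-like)": re.compile(r"[SAC][KRH][LM]$"),
    "ER retention (KDEL)": re.compile(r"KDEL$"),
}


def c_terminal_signals(protein):
    """Return the names of the C-terminal motifs found in the sequence."""
    protein = protein.upper()
    return [
        name
        for name, pattern in C_TERMINAL_SIGNALS.items()
        if pattern.search(protein)
    ]


print(c_terminal_signals("MAAAASKL"))  # ['peroxisome (PTS1-like)']
print(c_terminal_signals("MAAAKDEL"))  # ['ER retention (KDEL)']
```

A three-residue pattern matches many unrelated proteins by chance, which is the "small sequences" complication listed on the slide.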

Page 17: Predicting Protein Properties and Structure

Predicting post-translational modifications to your protein

Page 18: Predicting Protein Properties and Structure

How are post-translational modifications to a protein predicted?

• Signal sequences

• Search for homology to pattern peptides
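As one example of a pattern peptide, the sketch below scans for the N-glycosylation sequon N-{P}-[ST]-{P} (Asn, any residue except Pro, Ser or Thr, any residue except Pro). The function name is illustrative, and a match only marks a candidate site, not a confirmed modification.

```python
import re

# N-glycosylation sequon; the lookahead lets overlapping sites be reported.
N_GLYCOSYLATION = re.compile(r"(?=N[^P][ST][^P])")


def n_glycosylation_sites(protein):
    """Return 0-based positions of Asn residues that start a sequon."""
    return [match.start() for match in N_GLYCOSYLATION.finditer(protein.upper())]


print(n_glycosylation_sites("AANGSAA"))      # [2]
print(n_glycosylation_sites("MNGTANPSNAS"))  # [1]
```

In the second sequence the Asn at position 5 is rejected because Pro follows it, and the Asn at position 8 is rejected because the pattern needs a fourth residue after Ser.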

Page 19: Predicting Protein Properties and Structure

How is 2ndary structure predicted?

Database of known structures

Database of corresponding sequences

ACDEFGTYAEE……, with each residue labelled α-helix, coil or β-strand

• Split both databases into a training set (known structures and corresponding sequences) and a test set (known structures and corresponding sequences)

• From the training set, tabulate the probability of finding each amino acid in each state:

     p(α-helix)   p(coil)   p(β-strand)
A    0.23         0.28      0.5

• Tabulate the same probabilities for pairs of amino acids, p(aa1-coil), p(aa1-helix), p(aa1-strand), …:

     p(α-helix)   p(coil)       p(β-strand)
     A…C…         A…C…          A…C…
A    0.1…0.03     0.04…0.002    0.1…0.21

• Predict 2ndary structure for the test set sequences and compare with the known structures

• Bad Predictions: Reshuffle training set and test set and repeat until predictions are correct

• Good Predictions: Method ready for new sequence 2ndary structure prediction
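The train, predict, and compare loop can be sketched with the simplest version of the table, one probability per amino acid and state. The toy training data, the state letters (H for helix, E for strand, C for coil), and all function names are hypothetical; real methods also use neighbouring residues and much larger databases.

```python
from collections import Counter, defaultdict


def train(examples):
    """Estimate p(state | amino acid) from (sequence, state string) pairs."""
    counts = defaultdict(Counter)
    for sequence, states in examples:
        for aa, state in zip(sequence, states):
            counts[aa][state] += 1
    return {
        aa: {state: n / sum(c.values()) for state, n in c.items()}
        for aa, c in counts.items()
    }


def predict(sequence, probabilities, default="C"):
    """Assign each residue its most probable state (coil if never seen)."""
    return "".join(
        max(probabilities[aa], key=probabilities[aa].get)
        if aa in probabilities else default
        for aa in sequence
    )


def accuracy(predicted, observed):
    """Fraction of residues whose predicted state matches the known one."""
    return sum(p == o for p, o in zip(predicted, observed)) / len(observed)


# Hypothetical toy data standing in for the training and test sets.
training_set = [("AAEELLGG", "HHHHHHCC"), ("VVIIGGPP", "EEEECCCC")]
test_sequence, test_states = "AEVIG", "HHEEC"

table = train(training_set)
prediction = predict(test_sequence, table)
print(prediction, accuracy(prediction, test_states))  # HHEEC 1.0
```

Keeping the test set out of `train` is what makes the comparison meaningful; scoring on the training sequences would overstate the accuracy.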

Page 20: Predicting Protein Properties and Structure

Predicting transmembrane helices

Page 21: Predicting Protein Properties and Structure

How are transmembrane regions predicted?

• Transmembrane segments are stretches of about 17 hydrophobic residues

[Figure: sequence with two hydrophobic stretches of 17 aa residues each, forming two transmembrane helices]
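A minimal sliding-window sketch of this idea, assuming the Kyte-Doolittle hydropathy scale, the 17-residue window from the slide, and a mean-hydropathy threshold of 1.6 (a commonly used cutoff, taken here as an assumption). The function names are illustrative.

```python
# Kyte-Doolittle hydropathy values; positive means hydrophobic.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=17):
    """Mean hydropathy of every window of `window` consecutive residues."""
    return [
        sum(KYTE_DOOLITTLE[aa] for aa in protein[i:i + window]) / window
        for i in range(len(protein) - window + 1)
    ]


def candidate_tm_segments(protein, window=17, threshold=1.6):
    """Merge overlapping windows above the threshold into (start, end) spans."""
    segments = []
    for start, score in enumerate(hydropathy_profile(protein, window)):
        if score < threshold:
            continue
        end = start + window  # end index is exclusive
        if segments and start <= segments[-1][1]:
            segments[-1] = (segments[-1][0], end)
        else:
            segments.append((start, end))
    return segments


example = "DDDDDKKKKK" + "L" * 20 + "KKKKKDDDDD"
print(candidate_tm_segments(example))
```

Because windows that only partly overlap the hydrophobic core can still pass the threshold, the reported span is usually somewhat wider than the helix itself.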

Page 22: Predicting Protein Properties and Structure

How is membrane orientation predicted?

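[Figure: membrane orientation; labels: Outside, Cytosol, N-terminus (NH), Signal Peptide, a 17 aa transmembrane segment with 15 aa flanking regions on each side, flanks marked +++ and ---]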

Page 23: Predicting Protein Properties and Structure

Organization of the Talk

• From cDNA sequence to protein sequence.

• Analyzing the information in the protein sequence

• Predicting the fold (secondary structure) of a protein

• Predicting the (tertiary) structure of a protein

Page 24: Predicting Protein Properties and Structure

What is a fold?

• Fold can be roughly defined as the succession of α-helix, β-strand and coil structures in a protein

Page 25: Predicting Protein Properties and Structure

Predicting protein folding

Page 26: Predicting Protein Properties and Structure

How is fold predicted?

Database of known structures

Database of corresponding sequences

Database of probabilities of aa in 2ndary structure

YOUR SEQUENCE

Homology-based helix-coil-strand profile folds database

Server

Strong Homology: … Fold Prediction

Weak/No Homology: Helix-coil-strand profile prediction, then … Fold Prediction

Page 27: Predicting Protein Properties and Structure

Organization of the Talk

• From cDNA sequence to protein sequence.

• Analyzing the information in the protein sequence

• Predicting the fold (secondary structure) of a protein

• Predicting the (tertiary) structure of a protein

Page 28: Predicting Protein Properties and Structure

Predicting protein structure

• Homology Modeling: 3D-JIGSAW, SWISSMODEL

• Ab initio Modeling: ROBETTA

Page 29: Predicting Protein Properties and Structure

Predicting protein structure by homology

Page 30: Predicting Protein Properties and Structure

How does homology modeling work?

Database of known structures

Database of corresponding sequences

…YDVRSEQVENCE…

Server/Program

Strong Homologues

Best possible Sequence alignment

…YDVR-SEQVENCE…

…YDVRMSD-VDNCD…

Thread the sequence to be predicted over the known structure according to the alignment

… Optimization via energy minimization, etc…
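The sequence-alignment step can be sketched with a basic Needleman-Wunsch global alignment. The scoring values (match +1, mismatch -1, gap -2) are assumptions of this sketch; modeling servers typically use substitution matrices and more elaborate gap penalties, so the gaps may fall in different places than in the alignment shown on the slide.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment of two sequences; returns (aligned_a, aligned_b, score)."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    # Fill the dynamic-programming matrix.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + pair,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,       # gap in b
                score[i][j - 1] + gap,       # gap in a
            )
    # Trace back from the bottom-right corner to recover one optimal alignment.
    aligned_a, aligned_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        pair = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + pair:
            aligned_a.append(a[i - 1])
            aligned_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            aligned_a.append(a[i - 1])
            aligned_b.append("-")
            i -= 1
        else:
            aligned_a.append("-")
            aligned_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(aligned_a)), "".join(reversed(aligned_b)), score[n][m]


query, template = "YDVRSEQVENCE", "YDVRMSDVDNCD"
top, bottom, total = needleman_wunsch(query, template)
print(top)
print(bottom)
print(total)
```

Each aligned column tells the modeling program which template residue supplies the backbone coordinates for a given query residue; gap columns are the regions that have to be built or refined separately.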

Page 31: Predicting Protein Properties and Structure

Predicting protein structure

• Homology Modeling: 3D-JIGSAW, SWISSMODEL

• Ab initio Modeling: ROBETTA

Page 32: Predicting Protein Properties and Structure

Predicting protein structure by ab initio methods

Database of corresponding sequences

…YDVRSEQVENCE…

Server/Program

NO Homologues

Database of structures for smaller amino acid runs

…YDVR-SEQ
…YDVRMSD-…

…YDVR-SEQ
…YPVRMSD-…

…VENCE…
…YDNCD…

…VENCE…
…VEQCE…

… Assemble

Energy minimization & optimization

Page 33: Predicting Protein Properties and Structure

Summary

• From cDNA sequence to protein sequence.

• Analyzing the information in the protein sequence

• Predicting the fold (secondary structure) of a protein

• Predicting the (tertiary) structure of a protein