part ii. prediction of functional regions within disordered proteins zsuzsanna dosztányi mta-elte...

25
PART II. PART II. Prediction of Prediction of functional regions functional regions within disordered within disordered proteins proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry Eotvos Lorand University, Budapest, Hungary [email protected]

Upload: eunice-perry

Post on 31-Dec-2015

224 views

Category:

Documents


5 download

TRANSCRIPT

Page 1: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

PART II.PART II.Prediction of functional Prediction of functional regions within disordered regions within disordered proteinsproteins

Zsuzsanna DosztányiMTA-ELTE Momentum Bioinformatics GroupDepartment of Biochemistry Eotvos Lorand University, Budapest, [email protected]

Page 2: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Protein disorder is prevalent

20

40

60

0

LD

R (

40<

) p

rote

in, %

kingdom

B

A

E

Page 3: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Protein disorder is functional

Xie H et al. J Proteome Res. 2007, 6, 1882-1898

Page 4: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Protein disorder is important

Prion protein Prion disease

CFTR Cystic fibrosis

Alzheimer’s

-synuclein Parkinson’s

p53, BRCA1 cancer

Page 5: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Functions of intrinsically disordered proteins

I Entropic chains

II Linkers

III Molecular recognition

IV Protein modifications (e.g. phosphorylation)

V Assembly of large multiprotein complexes

Page 6: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Interactions of IDPsComplex between p53 and MDM2

Coupled folding and binding

Page 7: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Interactions of proteins

Page 8: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Interactions of proteins

Page 9: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Coupled folding and binding

Functional advantages

Weak transient, yet specific interactions Post-translational modifications Flexible binding regions that can overlap Evolutionary plasticity

SignalingRegulation

Page 10: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Coupled folding and binding Experimental difficulties

Highly flexible Weak interactions Short half-life

Complexes of IDPs in the PDB: ~ 200 Known instances: ~ 2,000 Estimated number of such interactions in the

human proteome: ~ 100,000

Page 11: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Interactions of IDPs

F19, W23 and L26

Page 12: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

p27

Cyclin-dependent kinase (Cdk) inhibitor, p27Kip1 (p27) Cell cycle regulation Binds to cdk-cyclin komplex and inhibitis their activity Fully disordered protein

Page 13: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

p27

Page 14: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

p27

Partial unfolding enables the phosphorylation of Tyr88, starting a series of signaling events that leads to the beginning of S phase.

Page 15: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Prediction of functional regions within IDPs

Disordered binding regions (ANCHOR) Linear motifs (ELM, SlimPred) Morfs (Morfpred)

Specific conservation pattern

Page 16: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Disordered protein complexes

Complex between p53 and MDM2

• Interaction sites are usually linear (consist of only 1 part)

• enrichment of interaction prone amino acids

Sequence

Binding sites

No need for structure, binding sites can be

predicted from sequence alone

Page 17: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Prediction of disordered binding regions – ANCHOR

What discriminates disordered binding regions? A cannot form enough favorable interactions with their

sequential environment It is favorable for them to interact with a globular protein

Based on simplified physical model Based on an energy estimation method using statistical

potentials Captures sequential context

Page 18: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

ANCHOR

Human p53 C –terminal region

Page 19: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

ANCHOR and linear motifsNCOA2 transcription co-activator

The regions between 600-800 is disordered Contains 3 receptor binding motifs: xLxxLLx (LIG_NRBOX)

Page 20: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

LMs and Disordered Binding Regions

Linear motifs and disordered binding regions often overlap

Complementary approaches

Prediction of disordered binding regions can help to increase likelihood of true instances

Page 21: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

LMs and Disordered Binding Regions

Linear motifs and disordered binding regions often overlap

Complementary approaches

Prediction of disordered binding regions can help to increase likelihood of true instances

Page 22: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Machine learning approaches

SlimPred: trained on ELM database

Morfpred: trained on short chains in complex

Very small datasets

Negative datasets

Page 23: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Conservation

Page 24: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

Conservation patterns of linear motifs

No evolutionary constraints to keep the structure Strong constraints on functional site

Island-like conservation

Page 25: PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry

SlimPrints

Generates sequence alignments of orthologous sequences

Relative conservation score per position Filters out less reliable regions Fails if sequences are too divergent, or too similar http://bioware.ucd.ie/slimprints.html.