structure-based modelling of protein-protein interactions · re-docking known protein complexes n =...

22
Structure-Based Modelling of Protein-Protein Interactions Dave Ritchie Department of Computing Science King’s College, University of Aberdeen

Upload: others

Post on 13-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Structure-Based Modelling of Protein-Protein Interactions

Dave RitchieDepartment of Computing Science

King’s College, University of Aberdeen

Page 2: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Structure-Based Modelling of Protein-Protein Interactions

(Hex Docking Algorithm)

Contents

• Spherical Polar Fourier Expansions

• Spherical Polar Correlations

• Scoring Using Soft OPLS Potentials

• Docking Very Large Molecules

• CAPRI: International Blind Docking Trial

• Conclusions & Ongoing/Future Work

Page 3: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Real Spherical Harmonics: ylm(θ, φ)

Orthogonality:

ylm(θ, φ)yl′m′(θ, φ)dΩ = δll′δmm′

Rotation: ylm(θ′, φ′) =l

m′=−l

R(l)m′m(α, β, γ)ylm′(θ, φ)

Page 4: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Spherical Harmonic Surfaces

Example: 2D Radial Expansions (256 Basis Functions)

r(θ, φ) =15

l=0

l∑

m=−l

almylm(θ, φ)

• Good for matching similar shapes, not so good for docking (complementary shapes) ...

Page 5: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Radial Basis Functions: Rnl(r)

HO (shape): Rnl(r) = N(q)nl e−ρ/2ρl/2L

(l+1/2)n−l−1 (ρ); ρ = r2/q, q = 20.

Coulomb (electro): Rnl(r) = N(Λ)nl e−ρ/2ρlL

(2l+2)n−l−1(ρ); ρ = 2Λr, Λ = 1/2.

Orthogonality:

∫ ∞

0

Rnl(r)Rn′l(r)r2dr = δnn′

!"#$% $ &' (

Page 6: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Spherical Polar Shape Reconstruction

(Looking at MCV Antibody CDR Loops)

τ (r) =N

nlm

aτnlmRnl(r)ylm(θ, φ)

Image Expansion Coefficients

A Gaussians -

B N = 16 1,496

C N = 25 5,525

D N = 30 9,455

Page 7: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Protein Shape Complementarity using Density Representations

τσ(r)

(r)

Surface Density: σ(r) =

1; r ∈ surface skin volume

0; otherwise

Interior Density: τ (r) =

1; r ∈ VDW volume

0; otherwise

Favourable:

(σA(rA)τB(rB) + τA(rA)σB(rB))dV

Unfavourable:

τA(rA)τB(rB)dV

Score: SAB =

(σAτB + τAσB − QτAτB)dV ; Penalty Factor: Q = 11

Page 8: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Correlations – Overlap as a Function of Coordinate Operations

Basic Overlap:

σA(r)τB(r)dV =

N∑

nlm

aσnlmbτ

nlm

Rotation: R(α, β, γ)σA(r) =N

nlm

aσ′nlmRnl(r)ylm(θ, φ)

Translation: T (R)σA(r) =N

nlm

aσ′′nlmRnl(r)ylm(θ, φ)

Rotated Coeffients: aσ′nlm =

l∑

m′=−l

R(l)mm′(α, β, γ)aσ

nlm′

Translated Coefficients: aσ′′nlm =

N∑

n′l′

T(|m|)nl,n′l′(R)aσ

n′l′m

Hence:

σ′A(r)τ ′′

B(r)dV =N

nlm

aσ′nlmbτ ′′

nlm etc.

Page 9: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

6D Docking Search as a Nested Sequence of Transformations

Get 4 rotations from icosahedral tessellations ...A

(β2,γ2)(β1,γ1)

z

α2

R

βΑ B

Rotate A (×492 @ 9.9): A′(r) = R(0, β1, γ1)A(r)

Translate A (×50 @ 0.75A): A′′(r) = T (−R)A′(r)

Rotate B (×642 @ 8.5): B′(r) = R(0, β2, γ2)B(r)

Twist B (×64 @ 5.6): B′′(r) = R(α2, 0, 0)B′(r)

Search Space: 492 × 50 × 642 × 64 ' 109 ∼ 106/s on a 1GHz PIII Xeon

Page 10: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Re-Docking Known Protein ComplexesN = 16 N = 20 N = 25

Case Top RMS Top RMS Top RMS

SIC 3,407 0.00 2 0.22 1 0.82

KAI 17 0.41 3 0.69 7 0.81PTC 132 0.52 2 0.48 1 0.48

CGI 1 0.38 1 0.38 1 0.38CHO 1 0.45 1 0.55 1 0.55BGS 1 0.82 1 0.82 1 0.88

GGI 1 2.47 1 0.90 1 0.90TET 5 1.48 1 1.16 1 1.03

FPT 102 1.04 1 0.42 1 0.42IGF 3 0.71 1 0.77 1 0.77

JEL 4,867 0.81 1,060 0.81 2 0.81BQL 524 1.85 12 0.96 1 0.39

HFL 318 1.01 5 1.00 1 1.00HFM 7 2.19 27 1.09 10 1.09VFB 8,344 1.49 216 0.20 9 0.20

MLC 1,401 0.00 116 0.00 187 0.84MEL 9,898 1.03 27 1.03 3 1.03

JHL 385 0.62 8 0.38 1 1.08FBI 14 1.09 1 1.09 1 0.38

NCA 68 1.53 1 0.32 1 0.32NMB 160 2.43 1,630 1.39 1,009 1.39

NSN 19,992 1.11 716 0.75 1,130 2.29IAI 1,381 1.48 111 0.37 20 1.39DVF 11,145 0.00 88 1.38 49 0.44

KB5 140 0.34 1 0.34 78 1.38IGC 1,328 1.74 269 0.81 1 0.34

Page 11: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Re-Scoring With Soft OPLS Potentials

Classic 12-6 Potential: E(r) =A

r12−

B

r6

Softened OPLS Potential: ELJ(r) =3

n=1

enRn0(ρ); ρ = r2/(r0/2)2

• Represent 12-10 hydrogen bond potentials similarly

Page 12: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Summary of Hex Docking Strategy

• Contour VDW & SAS surface onto (0.6A)3 grid

• Calculate shape & electrostatic coefficients – once

• Scan search space with N=16 correlations (shape only)

• (Optionally constrain search to one or both epitopes)

• Re-score top 10,000 with N=30 (+electrostatics)

• Refine top 500 by soft OPLS minimisation & cluster solutions

• Output best member of each cluster

• Pray!

Page 13: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Adapting Hex to Dock Very Large Molecules

Page 14: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

CAPRI – Critical Assessment of Predicted Interactions

• CASP did have a Docking Section

• Too few targets, too easy/hard; Docking was a side-show ?

• Hence CAPRI

• Janin, Wodak, Sternberg, Moult, Ten Eyck, Vajda, Henrick

• To assess the state of the art

• To promote developments in the field

• Hosted by EBI (Henrick, Tate)

• The CAPRI Challenge

• During 2001/2: Seven targets made available

• ∼ 20 International groups attempted predictions...

Page 15: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

The 7 CAPRI Docking Targets

Target Receptor Ligand Type Complex Lab

1 HPr Kinase HPr U/U Fieulaine et al. Janin

2 Rotavirus VP6 MCV U/B Vaney et al. Rey

3 Hemagglutinin HC63 U/B Barbey-Martin et al. Knossow

4 α -Amylase AMD10 U/B Desmyter et al. Cambillau

5 α -Amylase AMB7 U/B Desmyter et al. Cambillau

6 α -Amylase AMD9 U/B Desmyter et al. Cambillau

7 SpeA TCR 14.3.D U/U Sundberg et al. Mariuzza

• At least one docking partner presented in its unbound form

• Participants permitted 5 attempts for each target

• Any predictive approach allowed: homology/literature, etc.

Page 16: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

CAPRI Results (Unofficial)

Predictor Software Algorithm T1 T2 T3 T4 T5 T6 T7

Abagyan ICM FF + + +

Camacho CHARMM FF + +

Gardiner GAPDOCK GA +

Sternberg FTDOCK FFT +

Bates Guided Docking FF +

Ten Eyck DOT FFT +

Vakser GRAMM FFT

Olson Harmony ?

Weng ZDOCK FFT + + +

Eisenstein MolFit FFT + + +

Wolfson BUDDA/PPD GH +

Iwadate TSCF FF

Ritchie Hex SPF + +

Palma BIGGER GF +

Gray ? MC + +

NB. Some predictions were disallowed: “slamming” or > limit of 5 entries

Page 17: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Hex 3.1 Results for CAPRI Targets 1–7

Case Description Constraints Samples Rank Contacts RMS Time/h

1 HPrK/HPr β1 ≤ 45 1.5 × 108 −/1 2/56 − 0.2

2 VP6/MCV β1, β2 ≤ 45 5.6 × 108 −/5 0/52 − 5.3

3 HA/HC63 β1, β2 ≤ 45 6.3 × 108 4/5 43/63 7.43 6.5

4 Amylase/AMD10 β1, β2 ≤ 45 5.6 × 108 −/5 0/58 − 4.7

5 Amylase/AMB7 β1, β2 ≤ 45 5.6 × 108 −/5 0/64 − 4.7

6 Amylase/AMD9 β1, β2 ≤ 45 5.6 × 108 5/5 53/65 2.16 4.7

7 SpeA/14.3.D β1 ≤ 45 1.5 × 108 −/5 2/37 − 0.25

Page 18: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Docked Orientation for Target 3

Page 19: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Docked Orientation for Target 6

Page 20: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Target 7 – What Went Wrong?

• We found a homologue for Target 7 in the PDB ...

• But Hex didn’t find a high-ranking orientation that was similar!

Page 21: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Conclusions

• Hex Docking Algorithm:

• Novel, Very Fast, & (Fairly) Accurate Docking Algorithm

• Hex Performed Well in CAPRI: 2 Out Of 7 Ain’t Bad !

• But We Need a Better Model of Protein Flexibility...

• Ongoing/Future Work:

• Essential Dynamics Model of Flexibility During Docking

• 3D Database Search for Small (Drug-Like) Ligands

• Modelling Fish Cytokines & Their Receptors

• Modelling CCR5/HIV Interactions (starting soon)

Page 22: Structure-Based Modelling of Protein-Protein Interactions · Re-Docking Known Protein Complexes N = 16 N = 20 N = 25 Case Top RMS Top RMS Top RMS SIC 3,407 0.00 2 0.22 1 0.82 KAI

Acknowledgements

BBSRC 1996 – 2000: DWR

Dr Graham KempProf John FothergillDr Daan van Aalten

Diana MustardAndreas AthanasopoulosAntonis KoussounadisGuillaume ValadonRussell Hamilton

Software & Preprints: http://www.biochem.abdn.ac.uk/hex/

CAPRI Call For Targets: http://capri.ebi.ac.uk