docking - virginia techchekhov.cs.vt.edu/2007/presentations/docking.pdf•docking algorithms need to...

25
GBCB 5874: Problem Solving in GBCB Docking

Upload: others

Post on 24-May-2020

5 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Docking

Page 2: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Benzamidine Docking to Trypsin

Page 3: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Relationship to Drug Design

• Ligand-based design– QSAR– Pharmacophore modeling– Can be done without 3-D structure of protein

• Receptor/Structure-based design– Molecular docking

• Hit identification• Lead identification and optimization• ADMET (e.g., cytochrome P450 docking)

– De novo design– Requires 3-D model of protein

Page 4: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Overview

• Protein-smallmolecule docking

• Protein-proteindocking

Page 5: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Basic Principles

Kitchen et al. (2004) Nature Rev. Drug Disc. 3: 935-949.

Page 6: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Aspects of Docking

• Docking algorithms need to generateposes of ligands in binding site

• Then score the poses - simple energyfunction

• Then rank the poses - more complexenergy function -> ΔG– Desolvation– Entropy

Page 7: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Additional Challenges in Docking

• Limited resolution of receptor structures• Flexibility of receptor• Conformational changes in ligand

and/or receptor upon binding• Role of water molecules in binding

– Desolvation of ligand and binding site– Waters bridging protein and ligand

Page 8: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Receptor Representations for Docking

• Atomic - usually only in final ranking inconjunction with potential energy function

• Surface - Connolly’s MS program• Grid

Kitchen et al. (2004) Nature Rev. Drug Disc. 3: 935-949.

Page 9: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Ligand Flexibility

• Systematic search• Random (Stochastic) search• Simulation methods

Page 10: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Systematic Search

• Conformational search methods(combinatorial explosion)

• Fragmentation methods– Dock molecular fragments and link them

covalently• Database methods

– Libraries of pre-generated conformers– e.g., OMEGA - eliminates those that are

close in RMSD or have high internal strain

Page 11: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Random (Stochastic) Search

• Monte Carlo methods• Tabu search - take into consideration

areas of conformational space alreadyexplored (using RMSD)

• Genetic algorithms - adapt principles ofbiological competition and populationdynamics

Page 12: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Genetic Algorithm in Docking

Rotation 1 Translation 1 Conformation 1

Rotation 2 Translation 2 Conformation 2

Rotation n Translation n Conformation n

Evaluate energies and take best scoring

R1 T1 C50

Crossover

R1 T5 C50Mutation

Page 13: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Simulation Methods

• Molecular dynamics• Elevated temperatures• Different starting positions for ligand• Generate ensemble of conformations

Page 14: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Protein Flexibility

• MD/MC methods - time-consuming• Rotamer libraries - neglects changes in

backbone• Protein ensemble grids - multiple

structures from crystallography, NMR,or simulation

• Soft-receptor modeling - “energy-weighted” average grid based onmultiple conformations

Page 15: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Scoring

• Need to differentiate correct poses from“incorrect” ones– RMSD between experimentally observed

and docked positions• Calculates an “energy”• Ideal is to have lowest energy pose be

the one with the lowest RMSD relativeto experimental pose

Page 16: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Scoring Functions

• Force field-based• Empirical scoring• Knowledge-based scoring• Consensus scoring

Page 17: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Force Field-Based Scoring

• Calculates interaction energy betweenprotein and ligand and the internalenergy of ligand

• Pairwise interactions for van der Waalsand electrostatics

• May also have explicit term for H-bonds• Does not inherently include solvation

and entropy

Page 18: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Empirical Scoring Functions

• Designed to reproduce experimental data(e.g., binding energies)

• Sum of several uncorrelated terms(e.g., H-bond, ionic, hydrophobic, aromatic)

• Coefficients of terms calculated fromregression analysis using experimental data=> dependent on quality and quantity of data

• Can include terms for entropy and solvation

Page 19: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Knowledge-Based Scoring Functions

• Designed to reproduce experimentalstructures rather than binding energies

• Simple atomic interaction-pair potentialsbased on frequency of occurrence ofdifferent pair contacts in dataset

• Simple, fast to calculate• Limited by quantity and quality of data

Page 20: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Consensus Scoring

• Combine information from differentscoring methods

• Need to consider correlation of differentscoring functions

Page 21: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Evaluating Scoring Schemes

• Several recent studies• Ideal result is correct redocking of ligand

(< 2Å RMSD) and having lowest energy poseas best docked

• Most programs able to predict accuracy of~1.5-2Å with 70-80% success

• No single scoring function superior to others• Scoring functions respond differently to

features in binding sites• Consensus scoring gaining favor

Page 22: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Posing vs Scoring

• Where are calculation errors greater?• No comprehensive study yet available• Consensus is that scoring represents

the major limiting factor

Page 23: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Improving Scoring Functions

• Include solvation and rotational entropy contributions• Tune scoring schemes on case-by-case basis

– Example: cAMP-dependent kinase

Staurosporine binds at Mg-ATP site => had to balanceenergy functions for hydrophobic, H-bond, ionic interactions

Kitchen et al. (2004) Nature Rev. Drug Disc. 3: 935-949.

Page 24: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

Examples of Docking Programs

Sousa et al. (2006) Proteins: Struct. Func. Bioinfo. 65: 15-26.

Page 25: docking - Virginia Techchekhov.cs.vt.edu/2007/presentations/docking.pdf•Docking algorithms need to generate poses of ligands in binding site •Then score the poses - simple energy

GBCB 5874: Problem Solving in GBCB

De Novo Design

Schneider and Fechner (2005) Nature Rev. Drug Disc. 4: 649-663.