mark mackey, cresset, 'meet molecular architect, a new product for understanding sar and...
DESCRIPTION
TRANSCRIPT
![Page 1: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/1.jpg)
Meet Molecular Architect
Dr Mark MackeyChief Scientifc Officer
![Page 2: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/2.jpg)
Outline
> Fields, Field points and the good things you can do with them
> The alignment problem
> 3D-QSAR using Fields
> Examples> SARS PLpro – small data set, known xtal structure
> NK3 – large data set, unknown xtal structure
![Page 3: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/3.jpg)
NN
Br
F FF
SH2NO
O
Field Points
Condensed representation of electrostatic, hydrophobic and shape properties (“protein’s view”)
> Molecular Field Extrema (“Field Points”)
3D Molecular Electrostatic
Potential (MEP)
Field Points= Positive = Negative= Shape= Hydrophobic
2D
![Page 4: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/4.jpg)
Field Points have lots of applications
> Virtual screening
> Alignment
> Pharmacophore elucidation
> Bioisosteres
> etc
![Page 5: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/5.jpg)
Field Points have lots of applications
> Virtual screening
> Alignment
> Pharmacophore elucidation
> Bioisosteres
> etc
> What about 3D QSAR?
![Page 6: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/6.jpg)
The Alignment Problem
> Historically very difficult
> Early approaches template-based> Issues with side chain orientations
> Some success with docked data sets
> Easy to fool yourself> Correlation/causation
![Page 7: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/7.jpg)
Alignment issues
> Ligand-centric view vs protein-centric
Cramer, JCAMD, 2010, DOI 10.1007/s10822-010-9403-z
NN
Br
F FF
SH2NO
O
NN
Br
F FF
SH2NO
O
![Page 8: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/8.jpg)
Which is better?
> “The superior statistical qualities of 3D-QSAR models based on poses that superimpose presumably critical ligand features, rather than docked conformations.” Clark R., JCAMD 2007, p587
Doweyko, J. Comp-Aided Mol. Des., 2004, p 587
Free alignment adds signal, but also noise. Worse statistics, better predictability?
![Page 9: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/9.jpg)
N-methyl acetamideImidazole
Field Alignment
H3N
N
OO
HN
O
NH
NH2H2N
O
NH
NH
HN
SOO
![Page 10: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/10.jpg)
N-methyl acetamideImidazole
Field Scoring
Cheeseright et al, J. Chem Inf. Mod., 2006, 665
To score a particular alignment, we use the field points of molecule 1 to sample the actual field of molecule 2
![Page 11: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/11.jpg)
Field Scoring
N-methyl acetamideImidazole
Cheeseright et al, J. Chem Inf. Mod., 2006, 665
To score a particular alignment, we use the field points of molecule 1 to sample the actual field of molecule 2 and vice-versa
![Page 12: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/12.jpg)
Field Sampling
Field-point based QSAR descriptors
![Page 13: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/13.jpg)
Field Sampling
Field-point based QSAR descriptors
![Page 14: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/14.jpg)
Advantages
> Many fewer sample points than grid-based methods
> E.g. Vegfr2 data set
Field Sampling Grid Sampling Filtered Grid Sampling
q2 0.62 0.42 (0.48) 0.40
Number of descriptors
466 3940 1243
Du et al., J Mol Graph Model. 27 (2009) 642-652
![Page 15: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/15.jpg)
Advantages
> Many fewer sample points than grid-based methods
> Sample points physically rather than statistically chosen
> Gauge invariant
> Consistent framework for alignment and QSAR
![Page 16: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/16.jpg)
Initial validation
> Tested against literature CoMFA datasets> 15 datasets with alignments available
> CoMFA average cross-validated RMSE is 0.72
> Field QSAR using CoMFA alignments is 0.74
> Simple model (volume indicator variable) is 0.83
> Data sets re-aligned using field alignment> RMSE 1.00
![Page 17: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/17.jpg)
Interpretability
Electrostatic Steric Variance
![Page 18: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/18.jpg)
SARS PLpro
![Page 19: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/19.jpg)
The target
> PLpro (Papain-like protease) is a DUB target which is critical for the replication of the coronavirus responsible for SARS
> Crystal structures available with bound ligands from 2 series of compounds: structurally related (PDB entries 3E9S and 3MJ5)
> Small number of analogues – challenge to see if we can use 3D-QSAR for small data sets
![Page 20: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/20.jpg)
Alignment
![Page 21: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/21.jpg)
Sampling points
![Page 22: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/22.jpg)
Model
3.50 4.00 4.50 5.00 5.50 6.00 6.50 7.003.50
4.00
4.50
5.00
5.50
6.00
6.50
7.00
R² = 0.838410392100944
R² = 0.987121600034644Training SetLinear (Training Set)Test SetLinear (Test Set)
PLS Components = 5 RMSE = 0.09 RMSEP = 0.38
![Page 23: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/23.jpg)
Summary
> Able to build a predictive 3D-QSAR model based on small number of analogues
> Guided (by volume of Xtal structure) alignment worked best. Free alignment was OK, but noisier.
![Page 24: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/24.jpg)
NK3 antagonists
![Page 25: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/25.jpg)
NK3 example
> GPCR target (Tachykinin receptor 3) – selectively binds Neurokinin B – target for treatment of neurological disorders such as schizophrenia
> Three series of inhibitors from Euroscreen> Scaffold-1 – 81 compounds with pIC50 (radioligand
binding) in range 4.6-8.7
> Scaffold-2 – 80 compounds with pIC50 in range 4.8-7.7
> Errors in radioligand binding data c. ± 0.4
![Page 26: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/26.jpg)
NK3 binding mode
> For a 3D method you need a 3D alignment
> FieldAlign can align to a reference
> FieldTemplater generates the reference
O
N+
NH
N
HO
O
O
OHO
H
N
N+
N
O
N
N
O
H
F
F F
F
F
O
HN N+
H
H
N N
NH
FieldTemplater
![Page 27: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/27.jpg)
NK3 binding mode prediction
> FieldTemplater> Selection of 3 highly active scaffold-1 compounds
plus 2 structurally dissimilar literature NK3 actives (Talnetant and SB-218795).
> Generated Templates filtered and candidate selected
> Conformation of most active scaff-1 structure then used as alignment target for other structures
![Page 28: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/28.jpg)
3D-QSAR details
> Alignment> Free alignment to template conformation
> Field selection> Generated Field points for both steric and electrostatic
fields, with both sets at independent locations.
> 80/20 training/test split > Most active and least active training set
> 2nd most active, 2nd least active test set
> Random distribution of remaining compounds
![Page 29: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/29.jpg)
Initial models problematic
> When all else fails, talk to the chemists
> “Are you using the right tautomer?”
N N
N
N
H
N N
N
N
![Page 30: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/30.jpg)
NK3 Series 1
RMSE 0.19, RMSEpred 0.64
4.5 5 5.5 6 6.5 7 7.5 8 8.5 94.5
5
5.5
6
6.5
7
7.5
8
8.5
9
Training Set
Test Set
![Page 31: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/31.jpg)
NK3 Series 1
Electrostatics Sterics
![Page 32: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/32.jpg)
Extend to scaff-2?
4 5 6 7 8 9 10 114
5
6
7
8
9
10
11
Actual vs Predicted - scaff2 compounds on scaff1 model
> Complete lack of predictivity
> Visual analysis suggests a shift in binding mode for scaff-2
> Cross-series QSAR difficult
> Requires consistent binding modes!
![Page 33: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/33.jpg)
NK1 Scaffold 2
4 4.5 5 5.5 6 6.5 7 7.5 8 8.54
4.5
5
5.5
6
6.5
7
7.5
8
8.5
TrainingTest
Activity
Pred
icted
Acti
vity
RMSEpred 0.60
![Page 34: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/34.jpg)
Summary
> Able to generate models based on alignment to predicted active conformation by templating
> Independent models within each of two series show reasonable predictivity and can be used to guide further work
> Cross-series analysis suggests different binding modes for the two series
![Page 35: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/35.jpg)
Molecular Architect
![Page 36: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/36.jpg)
Molecular Architect
> Initially FieldAlign + QSAR
> Align your molecules
> Build models
> Test models
> Fit new compounds to models
> Interactive feedback
> Add additional alignment options
![Page 37: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/37.jpg)
Molecular Architect
> One tool for molecule designers> Align
> QSAR
> Pharmacophore elucidation
> Bioisosteres
> What do I make next?
> Beta Q4 2011
![Page 38: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions](https://reader033.vdocuments.site/reader033/viewer/2022051819/54c637914a7959c9388b463c/html5/thumbnails/38.jpg)
Acknowledgements
> Cresset> Andy Vinter
> Tim Cheeseright
> James Melville
> Chris Earnshaw
> Euroscreen> Hamid Hoveyda
> Julien Parcq