interactive visualization in decision meetings · groups of selected compounds by color coding . ps...
TRANSCRIPT
PS April 2002 carlssonresearch
interactive visualization in interactive visualization in decision meetingsdecision meetings
a way to improve the a way to improve the compound selection processcompound selection process
Peder Svensson
carlssonresearch
PS April 2002 carlssonresearch
the ”standard” drug discovery processthe ”standard” drug discovery processthe ”standard” drug discovery process
Genetics/GenomicsBioinformaticsMolecular Biology
ToxicologyBioanalysis
Medicinal Chemistry/Computational ChemistryBiochemistry & Cell Biology
PharmacologyPharmacokinetics
Lead Discovery
LeadOptimization
CD to INDTarget and ConceptDiscovery
Candidate DrugInvestigational New DrugCandidate DrugInvestigational New Drug
bioinformaticsstructural chemistry/biology
Structure Based Drug Designcombinatorial library design
ADME modellingLigand Based Drug Design
A Continuum of Selections & Decisions
A Continuum of Selections & Decisions
documentation & patenting
PS April 2002 carlssonresearch
philosophyphilosophyphilosophydrug design is a multivariate problem
a drug is often not optimal in all relevant aspects but the best compromise of properties
one should always try to make as informative experiments/compounds as possible
dogmas should be challenged once in a while
interdisciplinary discussions are very important for success
PS April 2002 carlssonresearch
En Variabel I Taget - EVITE(one variable at the time – ETERNITY)
En n VVariabel ariabel II TTaget aget -- EVITEVIT(one variable at the time (one variable at the time –– ETERNITY)ETERNITY)
?
PS April 2002 carlssonresearch
statistical designstatistical designstatistical design
PS April 2002 carlssonresearch
multivariate analysis multivariate analysis
chemical properties biological response
Des
crip
tor
2
Descriptor 1
Descri
ptor 3
PC2
PCA
Res
pons
e 2
Response 1
Response
3
PC2
compoundcompound or building blockPC1 PC1
PLS
PS April 2002 carlssonresearch
molecular descriptorsmolecular descriptorsmolecular descriptors
lipophilicity (LogP)size and shapeatom & fragment countsmolecular surface based descriptorselectronic properties, atomic charges, dipole moments , ionization potentialstopological indicesgeometries – distances and anglesrotational barriers around bondsvibrational frequencies, NMR chemical shifts, etc.
.... in principle, any physical observable!
PS April 2002 carlssonresearch
experimental design & QSARexperimental design & QSARexperimental design & QSAR
Principal Component Analysis, PCA
selection of a representative subsetusing filtering & experimental design methods
synthesis/testing
data table
QSAR
PS April 2002 carlssonresearch
visualization and interactivity for compound selection & experimental designvisualization and interactivity visualization and interactivity for compound selection & experimental designfor compound selection & experimental design
low barriers for testing and examining different alternativesany type of information can be included in the process:
chemical structureschemical and biological propertiesresults from simulations – e.g. in silico dockingcosts, logistics, etc.
all aspects can be considered at once:synthetic feasibilitytoxicitycoverage & balancenovelty & patent situation
well suited for group sessions using a screen projector
improves chance to reach the best compromise
PS April 2002 carlssonresearch
prepare the optimal dataset for visualization
prepare the optimal dataset for prepare the optimal dataset for visualizationvisualization
as rich as possible – include all available information that could become useful as basis for decisioncombine different types of data
model results – principal components, model diagnostics, scoring results, ...raw data – descriptors, experimental propertiesclassification and labeling information – chemical types, source, ...additional filtering information - cost, availability, ...
make links to other types of informationchemical structuresgraphs from experimentsexternal information on the webgraphical docking results
PS April 2002 carlssonresearch
historyhistoryhistory
PS April 2002 carlssonresearch
the starting point ~96property based selection using dummy 3D molecule in ISIS and lists on
paper with structures for design of small combinatorial libraries
the starting point ~96the starting point ~96property based selection using dummy 3D molecule in ISIS and lisproperty based selection using dummy 3D molecule in ISIS and lists on ts on
paper with structures for design of small combinatorial librariepaper with structures for design of small combinatorial librariess
h1 h4
h6 h7
h15
h16
h17
h19
h20
h21
h22
h23h24
h25
h27
h28
h29
h30
h31
h32
h33
h34
h35 h36h37
h38
h41
h42
h43
h44
h47
h53
h54h55
h59
h60
h92
h95
h96
h97
h98
h99h100
h101
h102
h103 h104
h105h106
h107
h108
h109h110
h111
h112h113 h114
h115h116
h117
h118
h119
h120
h121
h122h123
h124
h125
h126
h127h128
h129
h130
h131
h132
h133h134
h135
h136h137h138
h139
h140
h141
h142
h143
h144h145
h146
h147 h148
h149
h150
h151
i1
i2i3
i4
i5
i6
i7
i8i9
i10
i11
i12
i13
i14
i15
i16
i17
i18i19
i20 i21
i22
i23
i24
i25
i26
i27
i28
i29
i30i31
i32i33
i34
i35i36i37
k1
k2
k3 k4
k5
k6
k7
k8
k9
k10k11
k12
k13k14
PC1
PC2
PC3
rotation – additional distance and cluster information
keeping track of selections
enables efficient modifications to the selected set
distinguish between groups of selected compounds by color coding
PS April 2002 carlssonresearch
the c3-project at the cthe c33--project at project at objective:
improve the quality of combinatorial library designs andcompound selections:
overall workflowtools for filtering, selection, enumeration, visualization and knowledge managementusability (user friendliness)
PS April 2002 carlssonresearch
SaSA tool for combinatorial SaSA tool for combinatorial library generationlibrary generation
SStructuretructure aandnd SSynthesisynthesis AAdministratordministrator
generates new structures to be synthesized or analyzed virtually from reaction descriptions and lists of starting materials
registers structures and synthesis documentation
analyzes possible starting materials and virtual libraries with respect to a number of different descriptors, e.g. molecular weight, lipophilicity, hydrogen bonding….
selects a subset of compounds for synthesis, based on diversity (or similarity)
filters out compounds with undesired properties (MW, lipophilicity, etc. as above)
exports the results for further processing using Excel or Spotfire...
very useful but not interactive ...very useful but not interactive ...
PS April 2002 carlssonresearch
c3-vision cc33--vision vision reagents productsvisualization
of structures and property space
chemical filtersto accommodate restrictions
imposed by reaction conditions
property and/or structure based filters
to introduce the desired bias
chemGPSto generate the property space and enable a straight forward comparison with previously
positioned reagents
reagent selectionproperty based selection using D-optimal and/or space-filling
design
enumeration
chemical filters
p&s based filters
scoring functionsto score lead and/or drug
likeness or oral bioavailability
save&reuse
chemGPS
designed libraryby selecting final products
PS April 2002 carlssonresearch
the present set of toolsthe present set of toolsthe present set of tools
PS April 2002 carlssonresearch
the visualization environmentthe visualization environmentthe visualization environment
SpotfireSpotfire
Spotfire Structure Visualizer
connected to ISIS
molecular databases
Spotfire Structure Visualizer
connected to ISIS
molecular databases
Structure spaceStructure space
property spaceproperty space
3 dim3 dim
12 dim12 dim
PS April 2002 carlssonresearch
multidimensional distance filtersmultidimensional distance filters
filtering in 8 PC dimensionsfiltering in 8 PC dimensions
3D neighbourhood zooming3D neighbourhood zooming
Labels only on selected molecules Labels only on selected molecules
PS April 2002 carlssonresearch
grouping by substructure searches grouping by substructure searches
substructure filteringsubstructure filtering
PS April 2002 carlssonresearch
property based selection property based selection
original library
selected subsetexclusion sphere
PS April 2002 carlssonresearch
ISSS an Interactive SubSet Selection toolISSSISSS an an IInteractive nteractive SSububSSetet SSelection toolelection tool
very fast space filling selectionon large datasets over many dimensions in just a few seconds
supports a flexible and constructive compound/experiment selection workflow
works on any property space modeleasy to include preselected compoundseasy to filter out compounds based on any property criteria
designed for interactive use e.g. in group sessionsfast and easy to reiterate with new preselections and filtersgenerates query devices for selection status and inclusion orderautomatic color coding of selection status
density biased selection ability to select a truly representative subsetability to avoid regions with only outliers
PS April 2002 carlssonresearch
ISSS-Interactive SubSet SelectionISSSISSS--IInteractive nteractive SSububSSet et SSelectionelection
.jx
hS
1−hS
Complexity: ( )nNΟ
A fast MaxMin algorithm
R eal tim e fo r th e calcu latio n s (17213 m o lecu les in S can d , 10 co lu m n s, E u clid ean
d ist.)
0
2
4
6
8
1 0
0 1 0 0 2 0 0 3 0 0 4 0 0
N r o f m o le cu le s in su b se t
time [s] W ith a d d in g
su b se t in c. N rco lu m n
W ith o u t a d d in gsu b se t in c. N rco lu m n
R eal tim e fo r th e calcu latio n s (17213 m o lecu les in S can d , 10 co lu m n s, E u clid ean
d ist.)
0
2
4
6
8
1 0
0 1 0 0 2 0 0 3 0 0 4 0 0
N r o f m o le cu le s in su b se t
time [s] W ith a d d in g
su b se t in c. N rco lu m n
W ith o u t a d d in gsu b se t in c. N rco lu m n
PS April 2002 carlssonresearch
visualization of the selectionvisualization of the selectionvisualization of the selection
inspection of selectioninspection of selection
PS April 2002 carlssonresearch
density biased space filling selectiondensity biased space filling selectiondensity biased space filling selection
space fillingselection
space fillingselection
density biasedselection
density biasedselection
all compoundsall compounds
exampleselection of 100
compounds out of 2900 reference compounds with CNS activity from MDDR based on a 3 dimensional property
space
exampleexampleselection of 100
compounds out of 2900 reference compounds with CNS activity from MDDR based on a 3 dimensional property
spacedensity bins density bins
PS April 2002 carlssonresearch
database docking – in silicoscreening
database docking database docking –– in silicoin silicoscreeningscreening
3600 commercially available small basic compounds docked into the “basic pocket” of an protein receptor of interest
two different docking algorithms were used:
Slide (Volker Schnecke, AZ-M & Leslie Kuhn, MSU)Fast!Protein flexibilityLimited ligand flexibility
FlexX (Thomas Lengauer & Matthias Rarey, GMD)Moderately fastRigid proteinFull ligand flexibility
PS April 2002 carlssonresearch
filtering based on database filtering based on database docking resultsdocking results
protein structure based filtersprotein structure based filters
inspection of docking results inspection of docking results
PS April 2002 carlssonresearch
enumeration exampleenumeration example20 sulphonyl chlorides x 20 benzylic amines20 sulphonyl chlorides x 20 benzylic amines
filtering and cherry picking (1 plate filtering and cherry picking (1 plate -- 80 cmpds)80 cmpds)
R1S
Cl
OONH2
R2 NH
SR1
OO
R2
+
20 x 20 400
PS April 2002 carlssonresearch
selection on productsselection on productsselection on productsR-group presentation with activity colour codingR-group presentation with activity colour coding
R-group colour codingR-group colour coding
property space with selection status colour coded
property space with selection status colour coded
PS April 2002 carlssonresearch
QSAR Example Observed vs. PredictedQSAR Example Observed vs. Predicted
filteringfiltering
inspection of outliersinspection of outlierscolor codingcolor coding
PS April 2002 carlssonresearch
QSAR example QSAR example -- PLSPLS--loadingsloadings
responses & origo in redresponses & origo in red
filtering by variable importancefiltering by variable importance
PS April 2002 carlssonresearch
conclusionsconclusionsconclusionsmaking design decisions in meetings using interactive visualization leads to:
increased probability of finding “the optimal compromise”
a much faster drug design process
the combination of property and protein structure based combinatorial library design adds value
PS April 2002 carlssonresearch
thanks!thanks!Andreas Stansvik, presently at SpotfireThomas Olsson, AZ MölndalFredrik Rosell, SpotfireVolker Schnecke, AZ MölndalVladimir Sherbukhin, AZ -> Serono
Discovery GenevaJohan Gottfries, AZ MölndalTudor Oprea, AZ MölndalBo Nordén, AZ LundMarie Berghult, AZ MölndalThomas Kühler, Swedish MPA
Andreas Stansvik, presently at SpotfireThomas Olsson, AZ MölndalFredrik Rosell, SpotfireVolker Schnecke, AZ MölndalVladimir Sherbukhin, AZ -> Serono
Discovery GenevaJohan Gottfries, AZ MölndalTudor Oprea, AZ MölndalBo Nordén, AZ LundMarie Berghult, AZ MölndalThomas Kühler, Swedish MPA