joseph persichetti, david wolfe, ajeetk. sharma, and ... · joseph persichetti, david wolfe,...

1
00 Theoretically Describing Ensembles of Pathways between Disconnected Markov State Models Joseph Persichetti, David Wolfe, Ajeet K. Sharma, and Edward P. O’Brien Department of Chemistry, Penn State University References Introduction and Motivation Future Work Build a Markov State Model for candidate enzyme systems and use QM/MM finite-temperature string method to obtain paths in collective variable space. Use umbrella sampling and WHAM to obtain free energy profiles for these paths. Predict trends in the observed rate of an enzyme at different temperatures and for different mutations. (1) Ovchinnikov, V.; Karplus, M. J. Chem. Phys. 2014, 140 (17), 175103. (2) Vanden-Eijnden, E.; Venturoli, M. J. Chem. Phys. 2009, 130 (19), 194103. (3) Dhoke, G. V; Davari, M. D.; Schwaneberg, U.; Bocola, M. ACS Catal. 2015, 5 (6), 3207–3215. (4) Benkovic, S. J.; Hammes, G. G.; Hammes-Schiffer, S. Biochemistry 2008, 47 (11), 3317–3321. (5) Buchete, N.; Hummer, G. J. Phys. Chem. B 2008, 112, 6057-6069 (6) Noé, F.; Schütte, C.; Vanden-Eijnden, E.; Reich, L.; Weikl, T. R.; PNAS. 2009, 106 (45), 19011-19016 The high-dimensional space and non-zero temperature at which chemical reactions take place ensures that for most systems there will be multiple, disparate reaction pathways connecting the ensemble of reactant states to the ensemble of product states. However, most quantum mechanical/molecular mechanical (QM/MM) path-sampling methods for modeling reactions do not account for this fact. We are developing a theoretical framework which combines finite-temperature string method (FTSM) with Master equation modeling to predict observable rates when considering the ensemble of pathways. To begin, we have modeled the alanine, proline and glycine dipeptide using classical path sampling at different temperatures to develop the foundation for our method. In the future, we will apply our framework to an enzyme. This will require both QM and MM levels of theory for the FTSM simulations. A subset of the active site of the enzyme will be modeled with a QM Hamiltonian so that the chemical reaction is accurately represented. The remainder of the enzyme and explicit water will be modeled classically to maintain efficiency and feasibility. Question 1) How can we combine the kinetics obtained from Markov States and chain of states simulations to predict observable rates? Question 2) How is the observable rate affected by changes in temperature or mutations to the structure? Can we characterize these trends with regard to perturbations to the ensemble of pathways? Chain of States Method Connecting Markov States Rate Prediction from Multiple Pathways Project Workflow We have constructed a Markov State Model for alanine, proline and glycine dipeptide in vacuo from long timescale classical MD simulations. This provides the rate ( ) of transitioning between Markov states in either the reactants or products. Finite-temperature string method provides the final path through dihedral space connecting two Markov states. We then seed umbrella sampling simulations from this path to obtain the free energy profile describing transitions between reactant and product Markov states from which we obtain the rate ( ). Finally, we construct a Master equation for our system which incorporates kinetic information from disconnected Markov State Models to predict the observed rate of transitioning from reactants to products through an ensemble of pathways. We analyzed the flux through each of the observed pathways which have significant flux for proline dipeptide. This analysis shows that methods which only consider the pathway which is lowest in energy only captures ~ 20% of the total flux for this test case. It is necessary to include all pathways in order to account for the total flux from reactants to products. The observed trends of calculated rate as a function of temperature reveal that the best approximation (k NET ) of the true rate includes contributions from all observed pathways. = = , =−

Upload: others

Post on 26-Apr-2020

12 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Joseph Persichetti, David Wolfe, AjeetK. Sharma, and ... · Joseph Persichetti, David Wolfe, AjeetK. Sharma, and Edward P. O’Brien Department of Chemistry, Penn State University

00

Theoretically Describing Ensembles of Pathways between Disconnected Markov State ModelsJoseph Persichetti, David Wolfe, Ajeet K. Sharma, and Edward P. O’Brien

Department of Chemistry, Penn State University

References

Introduction and Motivation

Future Work

• Build a Markov State Model for candidate enzyme systems and useQM/MM finite-temperature string method to obtain paths in collectivevariable space. Use umbrella sampling and WHAM to obtain free energyprofiles for these paths.

• Predict trends in the observed rate of an enzyme at differenttemperatures and for different mutations.

(1) Ovchinnikov, V.; Karplus, M. J. Chem. Phys. 2014, 140 (17), 175103.(2) Vanden-Eijnden, E.; Venturoli, M. J. Chem. Phys. 2009, 130 (19), 194103.(3) Dhoke, G. V; Davari, M. D.; Schwaneberg, U.; Bocola, M. ACS Catal. 2015, 5

(6), 3207–3215.(4) Benkovic, S. J.; Hammes, G. G.; Hammes-Schiffer, S. Biochemistry 2008, 47

(11), 3317–3321.(5) Buchete, N.; Hummer, G. J. Phys. Chem. B 2008, 112, 6057-6069(6) Noé, F.; Schütte, C.; Vanden-Eijnden, E.; Reich, L.; Weikl, T. R.; PNAS. 2009,

106 (45), 19011-19016

The high-dimensional space and non-zero temperature at which chemicalreactions take place ensures that for most systems there will be multiple,disparate reaction pathways connecting the ensemble of reactant states tothe ensemble of product states. However, most quantummechanical/molecular mechanical (QM/MM) path-sampling methods formodeling reactions do not account for this fact. We are developing atheoretical framework which combines finite-temperature string method(FTSM) with Master equation modeling to predict observable rates whenconsidering the ensemble of pathways. To begin, we have modeled thealanine, proline and glycine dipeptide using classical path sampling atdifferent temperatures to develop the foundation for our method. In thefuture, we will apply our framework to an enzyme. This will require both QMand MM levels of theory for the FTSM simulations. A subset of the activesite of the enzyme will be modeled with a QM Hamiltonian so that thechemical reaction is accurately represented. The remainder of the enzymeand explicit water will be modeled classically to maintain efficiency andfeasibility.

Question 1) How can we combine the kinetics obtained from Markov

States and chain of states simulations to predict observable rates?

Question 2) How is the observable rate affected by changes in

temperature or mutations to the structure? Can we characterize these

trends with regard to perturbations to the ensemble of pathways?

Chain of States Method Connecting Markov States Rate Prediction from Multiple Pathways

Project Workflow

We have constructed a Markov State Model for alanine, proline and glycinedipeptide in vacuo from long timescale classical MD simulations. Thisprovides the rate (𝜔) of transitioning between Markov states in either thereactants or products. Finite-temperature string method provides the finalpath through dihedral space connecting two Markov states. We then seedumbrella sampling simulations from this path to obtain the free energyprofile describing transitions between reactant and product Markov statesfrom which we obtain the rate (𝑘). Finally, we construct a Master equationfor our system which incorporates kinetic information from disconnectedMarkov State Models to predict the observed rate of transitioning fromreactants to products through an ensemble of pathways.

We analyzed the flux througheach of the observedpathways which havesignificant flux for proline

dipeptide. This analysisshows that methods whichonly consider the pathwaywhich is lowest in energy onlycaptures ~ 20% of the totalflux for this test case. It isnecessary to include allpathways in order to accountfor the total flux fromreactants to products.

The observed trends ofcalculated rate as a functionof temperature reveal that thebest approximation (kNET) ofthe true rate includescontributions from allobserved pathways.

𝒌𝒊𝒋 ∝ 𝐞−∆𝐄‡

𝐤𝐁𝐓

𝐊 =

−𝜶𝟏 𝛚𝟏𝟐 𝒌𝟏𝟑 𝒌𝟏𝟒𝛚𝟐𝟏 −𝜶𝟐 𝒌𝟐𝟑 𝒌𝟐𝟒𝟎 𝟎 −𝜶𝟑 𝛚𝟑𝟒

𝟎 𝟎 𝝎𝟒𝟑 −𝜶𝟒

𝒌𝒐𝒃𝒔 =𝟏

𝛕, 𝛕 = −𝝅𝑻𝑲𝑹

−𝟏𝟏