hybrid qm/mm and related electronic structure methods · jnh / lecture 1 2 ... lengths.12,13 it has...

31
Hybrid QM/MM and Related Electronic Structure Methods Jeremy Harvey Winter School in Multiscale Modeling December 1-12 2014, KTH Stockholm

Upload: others

Post on 08-Oct-2019

2 views

Category:

Documents


0 download

TRANSCRIPT

Hybrid QM/MM and Related Electronic Structure Methods Jeremy Harvey Winter School in Multiscale Modeling December 1-12 2014, KTH Stockholm

Lectures 1-2: Introduction, ONIOM, QM/MM

JNH / Lecture 1 2

•  Introduction: need for hybrid methods •  Hybrid methods: Generalities •  Hybrid ways to calculate potential energy •  ONIOM •  Other subtractive methods •  QM/MM with electronic embedding •  Polarizable QM/MM

Introduction: Electronic Structure for Large Systems

JNH / Lecture 1 3

•  Electronic structure theory methods scale steeply in terms of computational effort: N3 or worse in simple implementations

•  There are several ways to side-step this problem: •  For large periodic systems, one can use translational symmetry

and carry out periodic calculations •  For large systems where the contribution of most of the atoms

to the potential energy is considered a small perturbation, this environment can be left out, or treated as a continuum

•  One can use advanced implementations of electronic structure theory methods in order to lower the scaling

Low-Scaling Electronic Structure Theory

JNH / Lecture 1 4

•  There has been considerable progress in making semiempirical, HF, DFT and correlated methods scale better with system size

•  For correlated methods, this exploits the locality of correlation •  In DFT, linear-scaling methods are obtained using a range of

methods, addressing the issue of Coulombic interactions, the exchange-correlation functional, and matrix diagonization

•  E.g. the Coulombic interactions between contributions from different atomic orbitals can be split into terms coming from near-field interactions and those with a far-field character

•  The former are small in number – the latter can be accurately approximated.

Why Use Hybrid Methods

JNH / Lecture 1 5

•  It is often desirable to treat (with full atomistic detail) the electronic structure of large, aperiodic, systems for which even low-scaling electronic structure methods are too expensive

•  Note that “too expensive” depends a lot on what you are trying to do, and what your resources are

•  To calculate a single-point energy for a large system as a proof of principle, it may be possible to do the calculation even for systems with 10,000 atoms or more

•  To carry out geometry optimization or to calculate properties, the limit may be much lower

•  To map out the thermally populated region, or to carry out simulations will be even harder

Systems Where Hybrid Methods are Used

JNH / Lecture 1 6

•  The range of systems of interest is quite large: large molecular systems (organic/inorganic molecules with over 100 atoms or so), or supermolecular systems (e.g. proteins), or special centres in a solid or on a surface

•  In a hybrid method, the electronic structure of the core or active region is studied with one level of theory – the structure of the environment is treated either with a simpler method or indeed with a molecular mechanics approach

•  It is possible to study systems with many thousands of atoms provided that the active region is much smaller.

Hybrid Methods: Hamiltonian

JNH / Lecture 1 7

The overall energy includes an energy term for both subsystems and a coupling term:

B A

V = V A

high

+ V B

low

+ V A-B

High-level energy of A Low-level energy of B

This also yields (in principle) an electronic wave-function for the whole system:

= A| A

high

⇥ B

low

>

Hybrid Methods: Diversity

JNH / Lecture 1 8

Within the above framework, there are very many variations on how to treat A and B, and the coupling between them. One frequently use distinction is between additive methods and subtractive ones. Another distinction is between QM/QM and QM/MM schemes. For QM/MM methods, there can be mechanical embedding or electronic embedding. Finally the treatment of boundary region bonds can be quite diverse.

B A

V = V A

high

+ V B

low

+ V A-B

Additive Methods

JNH / Lecture 1 9

In additive methods, the energy of A is computed at one level of theory, that of B at another, and the coupling between them at a third level This is the usual approach for QM/MM methods The diversity of such methods lies in the nature of the coupling term (and to some extent in the treatment of the environment B) These aspects will be discussed in detail

B A

Subtractive Methods

JNH / Lecture 1 10

Subtractive methods treat the whole system at the lower level of theory, and correct this at the high level for the active region: This can also be viewed as a high level calculation on the model, to which is added a corrective environment term:

B A

V = V A

high

+ (V AB

low

� V A

low

)

V = V AB

low

+ (V A

high

� V A

low

)

Subtractive Methods – 2

JNH / Lecture 1 11

The subtractive method appears initially to be very different, but can be re-written as:

The term is intended to refer to the low-level calculation of the parts of the energy of AB which relate to B, whereas the term: refers to the low-level calculation of the parts of the energy of AB which relate to the interaction between A and B. If one thinks of it this way, then and one gets a similar expression to the additive approach:

V = V A

high

+ (V AB

low

[B] + V AB

low

[A] + V AB

low

[A-B]� V A

low

)

V AB

low

[A-B]

V AB

low

[A] = V A

low

V = V A

high

+ (V AB

low

[B] + V AB

low

[A-B])

V AB

low

[B]

ONIOM (NB: original form)

JNH / Lecture 1 12

One of the most commonly used subtractive methods is ONIOM: “Our own N-layered Integrated molecular Orbital and molecular Mechanics” method.

ONIOM: A Multilayered Integrated MO + MM Method for Geometry Optimizations and Single Point Energy Predictions. A Test for Diels-Alder Reactions and Pt(P(t-Bu)3)2 + H2 Oxidative Addition M. Svensson, S. Humbel, R. D. J. Froese, T. Matsubara, S. Sieber and K. Morokuma J. Phys. Chem. 1996, 100, 19357-19363.

ONIOM is potentially n-layered, hence the acronym: like an onion, there can be many successive layers

ONIOM: Background

JNH / Lecture 1 13

The philosophy behind ONIOM is to add corrections to a given reference method. This is similar to the use of extrapolation methods like G2 where corrections for basis set and correlation are used

ONIOM: Gradients

JNH / Lecture 1 14

The gradient (and higher derivatives) of the ONIOM energy can be straightforwardly computed from the gradients of the different methods

@V

@X=

@V A

high

@X+ (

@V AB

low

@X+

�@V A

low

@X)

ONIOM: Application

JNH / Lecture 1 15

J. Phys. Chem. 1996, 100, 19357-19363.

transition state has been predicted theoretically,11 while forasymmetric reactants, the transition states are also asymmetricwith the two forming bonds often differing distinctly in theirlengths.12,13 It has been shown that high levels of electroncorrelation are required to obtain activation energies quantita-tively in agreement with experimental values.13 Owing to strongelectronic and correlation effects, the Hartree-Fock methodbadly overestimates these activation energies, and MP2 resultsare often far too low. Our interest in using the ONIOM schemefor this type of reaction comes from the difficulty in obtaininggood agreement with experimental results at modest levels ofelectron correlation and the possible quantitative evaluation ofthe activation energy for larger systems where steric effects areimportant.In this section, we will apply the ONIOM3 method, as well

as IMOMM and IMOMO methods, to three Diels-Alderreactions: acrolein + isoprene (2-methyl-1,3-butadiene), ac-rolein + 2-tert-butyl-1,3-butadiene, and ethylene + 1,4-di-tert-butyl-1,3-butadiene. For the first two reactions, the results arecompared with experimental results and for the third withbenchmark calculations. Geometry optimizations will be per-formed at a variety of combinations of levels in ONIOM3,IMOMO, and IMOMM as well as in pure B3LYP and HF.Higher level single point energies will also be determined. Theintegrated methods should allow us to more easily separateelectronic and steric effects and to achieve high accuracy at afraction of the cost.A. Geometry Optimization of Acrolein + Isoprene Sys-

tem. For acrolein + isoprene and acrolein + 2-tert-butyl-1,3-butadiene reactions, we have divided our real system into threelayers as shown in Figure 2. The small model (SModel),ethylene + butadiene, with six non-H atoms is to describe theelectron reorganization during the reaction and is treated at thehighest level of approximation. The intermediate model (IMo-del), acrolein+ butadiene, with eight non-H atoms is to describethe electronic effects on the electron reorganization and is treatedat a medium level, and the full real system including thesubstituent (methyl or tert-butyl) at the butadiene 2-position isto describe mainly the steric effects and is treated at the lowestlevel.In these reactions, four products and corresponding transition

states are possible with the carbonyl group at the endo or exoposition with respect to the diene and with the alkyl substituenton the diene closer to or further from the carbonyl group.13,14

We examined the four transition states in the Diels-Alder [4+ 2] addition of acrolein to isoprene with the B3LYP method.It was determined that the structure with the lowest energy hasthe endo carbonyl and the isoprene methyl group away fromthe carbonyl group. In the following calculations, we consideredonly this regio- and stereospecific transition state.To test the reliability of the ONIOM scheme for the

optimization of geometries, we compared the B3LYP/6-31Ggeometry (which is considered as our benchmark geometry) tothe ONIOM (B3LYP/6-31G:HF/6-31G:MM3) geometries asshown in Figure 2. Additional IMOMM (B3LYP/6-31G:MM3:MM3) results are also presented here for comparison.The benchmark B3LYP/6-31G geometry is very asymmetric

with the two forming C-C bonds being 2.587 Å (adjacent tothe aldehyde group) and 2.049 Å (next to the methyl group). Inthe ethylene + butadiene case at the same level of theory, thesevalues are 2.265 Å for both sides. The reason for this strongasymmetry clearly is the electronic effects of the aldehyde group.The effects of the methyl group are not significant to thegeometry of this transition state. As a matter of fact, the fourpossible transition states for this reaction all exhibit nearly thesame geometry with these two distances being 2.56 ( 0.03 Åon the side of the aldehyde and 2.06 ( 0.01 Å on the oppositeside, regardless of the position of the methyl group. TheB3LYP/6-31G//B3LYP/6-31G barrier from trans-1,3-butadiene,19.8 kcal/mol, compares well with the experimental value of18.7( 0.8 kcal/mol,15 giving credibility to our benchmark result.The ONIOM3(B3LYP:HF:MM3) optimized structure in

Figure 2 looks quite reasonable as compared to the pure MOone. A single point B3LYP/6-31G energy at this ONIOMoptimized geometry is only 1.3 kcal/mol higher than that at thebenchmark-optimized structure. Since optimized reactants arealso higher by the same value, the B3LYP/6-31G activationenergy obtained at the ONIOM3 optimized geometry is exactlythe same (19.8 kcal/mol) as that at the benchmark-optimizedgeometry. The present results show that the ONIOM 3(B3LYP:HF:MM3) is capable of properly handling the steric effect ofthe “nonactive” part and the electronic effect of the “semiactive”carbonyl group.For comparison, the geometry optimization at the cheapest

IMOMM (B3LYP:MM3:MM3) method appears to almostcompletely miss the strong asymmetry of the transition state,leading to a nearly symmetric transition state with the formingC-C distances of 2.282 and 2.239 Å vs the benchmark valuesof 2.587 and 2.049 Å, respectively. The potential energy surfacenear the transition state is rather flat, and the energy of thisIMOMM optimized structure is 3.3 kcal/mol higher than thebenchmark, leading to a B3LYP/6-31G//IMOMM barrier of 21.8kcal/mol or 2 kcal/mol higher than the benchmark. ThisIMOMM scheme fails to account for the electronic effects ofthe carbonyl group, especially at the transition state.B. Single Point Energy Calculations for Acrolein +

Isoprene Diels-Alder Reaction. As mentioned above, theenergy barriers in Diels-Alder reactions are difficult to evaluatequantitatively for any system involving more than six non-Hatoms, since very high correlation levels, such as CCSD(T),are required, and this is very computer time intensive. However,using the IMOMO scheme,3,4 one can mimic very high levelsof calculation by treating only a small model very accuratelyand assuming the reliability of a modest correlation level, suchas MP2, in describing the real r model difference. In thissection, we will perform and compare the results of single pointenergy calculations with many different ONIOM integration

Figure 2. Small model, intermediate model, and real system for theDiels-Alder reactions between acrolein and 2-alkyl-1,3-butadiene. Theoptimized transition state bond distances (in Å) for the forming bondsare shown for the reaction of acrolein with isoprene and 2-tert-butyl-1,3-butadiene (to the right of /) at the nonintegrated (B3LYP:B3LYP:B3LYP) level in bold, those at the ONIOM3(B3LYP:HF:MM3) innormal, and those at the IMOMM(B3LYP:MM3:MM3) level in italic.

ONIOM J. Phys. Chem., Vol. 100, No. 50, 1996 19359

+ +

+ +

schemes for the activation energy of the acrolein + isoprenereaction, all at the ONIOM3(B3LYP:HF:MM3) optimizedgeometries.As shown in Table 1, compared with the B3LYP/6-31G(d)

benchmark barrier height of 19.2 kcal/mol, the pure HF valueis more than two times too large. So is the pure MM3 value,if we dare to consider that MM3 can handle the early transitionstate assuming the maximum MM conjugation discussed above.A two-layered B3LYP:MM3:MM3 approach results in anactivation energy of 21.9 kcal/mol, which is a drastic improve-ment over the pure HF method, accounting for the importanceof electron correlation effects in describing the reacting regionof the system but still too large by 2.7 kcal/mol relative to thefull B3LYP benchmark. Four schemes, B3LYP:HF:MM3,B3LYP:HF:HF, B3LYP:B3LYP:MM3, and B3LYP:B3LYP:HF,all give barrier heights to within 1.4 kcal/mol of the benchmarkvalue. This indicates clearly that at least the HF level is requiredto describe the electronic effect of the carbonyl group and theMM3 force field method is enough to describe the effect of themethyl substituent. The differences in the activation energyamong these four schemes and the benchmark show that an errorof 0.8 kcal/mol is introduced by going from HF to MM3 forthe low level and an error of 0.5 kcal/mol by going from B3LYPto HF for the medium level. The error introduced when themedium level goes from HF to MM3 is much larger, 4 kcal/mol, and is not acceptable. Considering the computational effortrequired and the accuracy achieved, one can easily recognize,especially for much larger real systems than those studied herein this test, that the ONIOM3(B3LYP:HF:MM3) accomplishesa nearly quantitative agreement with the B3LYP benchmark,with a very small additional cost over that for the medium levelcalculation for the intermediate model.As mentioned in the Introduction, the pure MP2 calculation

gives a barrier too low by 6 kcal/mol, as also seen in Table 1.The integration of various levels of ab initioMOmethods allowsreliable evaluation of the activation barrier. The highest levelone may practically use, the two-layered CCSD(T):MP2:MP2scheme, gives the activation energy within the error limit ofthe experiment. The three-layered integrated MO method,CCSD(T):MP2:HF, and the ONIOM3 (CCSD(T):MP2:MM3)introduce a deviation of 0.3 and 0.6 kcal/mol, respectively, butare still within a chemical accuracy from the best CCSD(T):MP2:MP2 method. Again as expected, the IMOMM schemeCCSD(T):MM3:MM3 is a total failure because of the missing

electronic effect for the carbonyl group. As to the computationaleffort, if ONIOM3(CCSD(T):MP2:MM3) took 19 h of worksta-tion time, CCSD(T):CCSD(T):MP2 would have taken 3 daysand the pure CCSD(T) about 40 days even for this modest 12non-H atom system. For larger systems, the time differencewould be much larger.C. Acrolein + 2-tert-Butyl-1,3-butadiene System. Using

an MM method for the low level in the ONIOM3 scheme, onecan perform geometry optimization and energy calculation atvirtually no additional cost for any large “nonactive” third layer.To examine the effect of increased steric repulsion in and bythe third layer substituents, we have replaced the 2-methyl ofisoprene by 2-tert-butyl and tested the ONIOM schemes forthe Diels-Alder reaction between acrolein and 2-tert-butyl-1,3-butadiene. As shown in Figure 2, the transition state here isslightly less asymmetric than that for 2-methyl. The MM3 usedas the medium level is again totally inadequate.By use of ONIOM3 geometries, single point calculations were

carried out for the activation energy at various integration levels,as shown in Table 1. The performance of various schemes issimilar to that for the acrolein/isoprene system. One requiresCCSD(T) or B3LYP as the high level, MP2 or HF as theintermediate level, and MM3 as acceptable for the low level inthe present dividing scheme.The difference in the barrier heights between the 2-tert-butyl

and 2-methyl systems at the best CCSD(T):MP2:MP2 level is2.7 kcal/mol. In all other acceptable levels of calculation, onefinds that this difference is reproduced within (0.5 kcal/mol.Though there is no experimental value of activation energy forthe acrolein/2-tert-butyl-1,3-butadiene reaction, activation ener-gies for these two dienes reacting with a different olefin, maleicanhydride, are known to be 12.2 and 6.5 kcal/mol for the2-methyl and 2-tert-butyl, respectively. Our prediction for thereactions of acrolein is consistent with this experiment.16Why is the barrier for the 2-tert-butyl system lower than for

the 2-methyl system? It should be noted at first that the cis-isomer is more stable than the trans-isomer for the reactant,2-tert-butyl-1,3-butadiene, and is used as the reference for theactivation energy. For 2-methyl-1,3-butadiene the trans-isomeris more stable and is used as the reference, since the barrier fortrans f cis isomerization, 7.3 kcal/mol, is smaller than thereaction barrier. Since the difference can be described by theMM3 method, it must be steric in origin. If one compares theactivation barrier from the cis-isomer where the steric energyis expected to be smaller, the activation barrier is 16.2 kcal/mol (at CCSD(T):MP2:MP2 level) for 2-methyl-1,3-butadieneand 16.9 kcal/mol for 2-tert-butyl-1,3-butadiene. This is to saythat the steric energy at the transition state relative to the cis-reactant is only 0.7 kcal/mol larger for 2-tert-butyl-1,3-butadienethan for 2-methyl-1,3-butadiene and is not much different. Thus,the reason the barrier is lower for 2-tert-butyl-1,3-butadiene isthat the reaction for this system is initiated from the intrinsicallyless stable cis-isomer, since the trans-isomer of the reactant hasa large steric energy of 4.8 kcal/mol and is not the most stableisomer of the two anymore.D. Ethylene + s-Trans trans,trans-1,4-Di-tert-butyl-1,3-

butadiene System. As a final example of Diels-Alder systems,the reaction of ethylene with s-trans trans,trans-1,4-di-tert-butyl-1,3-butadiene was compared with benchmark predictions at theMP4(SDQ) level. Geometry optimizations were performed atthe B3LYP/6-31G level. We partitioned this system into a smallmodel of ethylene + butadiene, an intermediate model ofethylene+ 2,4-hexadiene (s-trans trans,trans-1,4-di-methyl-1,3-butadiene), and the real system. Table 2 gives the activationenergies for various ONIOM schemes with MP4(SDQ) always

TABLE 1: Activation Barrier (in kcal/mol) for theDiels-Alder Reaction of Acrolein + Isoprene (2-Methyl) and2-tert-Butyl-1,3-butadiene (2-tert-Butyl) Calculated withVarious ONIOM Schemesa,b

high:med:low 2-methyl 2-tert-butylB3LYP:B3LYP:B3LYP 19.2 16.4B3LYP:B3LYP:HF 19.1 17.1B3LYP:B3LYP:MM3 18.3 15.4B3LYP:HF:HF 18.6 16.6B3LYP:HF:MM3 17.8 15.0B3LYP:MM3:MM3 21.9 18.6HF:HF:HF 41.1 39.7MM3:MM3:MM3 42.1 39.9CCSD(T):MP2:MP2 19.6 16.9CCSD(T):MP2:HF 19.9 18.9CCSD(T):MP2:MM3 19.0 17.2CCSD(T):MM3:MM3 24.7 22.6MP2:MP2:MP2 12.7 10.1a Acrolein + trans-dienes were assumed as the reactants. For 2-tert-

butyl-1,3-butadiene, the s-cis-isomer is actually more stable than thes-trans-isomer by 1.4 kcal/mol at the CCSD(T):MP2 level. b Thegeometries optimized at the ONIOM3 (B3LYP:HF:MM3) level areused. The 6-31G(d) basis set is used in all MO calculations.

19360 J. Phys. Chem., Vol. 100, No. 50, 1996 Svensson et al.

+ +

+ +

ONIOM: Application 2

JNH / Lecture 1 16

•  ONIOM is typically aimed at organometallic cases:

J. Phys. Chem. 1996, 100, 19357-19363.

MM3 result is 1.1 kcal/mol better than the two-layered B3LYP:MM3:MM3; the electronic effect of the alkyl group is consid-ered to contribute to this improvement.For the insertion product, the steric repulsion between P(t-

Bu)3 ligands is much larger than in the transition state. Theuse of the HF method as the low level clearly underestimatesthe steric repulsion by several kcal/mol. The MM3 method asthe low level overestimates the steric repulsion and gives theproduct about 3.5 kcal/mol too low.It is noted that the relative computational costs for the most

cost effective two-layered B3LPY:MM3:MM3 method (withoutconsideration of the alkyl electronic effect) and the three-layeredB3LYP:HF:MM3 method (with the alkyl electronic effectincluded) are 80 and 24 times, respectively, smaller than thatfor the pure B3LYP calculation. The last calculation itself isusually considered to be very cost effective but would take 1day on a workstation for the energy only; with the ONIOM orIMOMM scheme, one can perform nearly as reliable a calcula-tion within 1 h or less.As mentioned above, the correlation effect is very important

for the energetics for the oxidative addition. Therefore, tocalculate a reliable value for the activation barrier and the energyof formation, one has to adopt the highest level of electroncorrelation using such a method as CCST(T), which would beprohibitive for the present 83 atom H2 + Pt(P(t-Bu)3)2 reactioneven on the fastest computers. We have performed anONIOM3(CCSD(T)/TZP:MP2/DZP:MM3) calculation on aworkstation, which we expect to predict the energetics ofreaction within a few kcal/mol. The predicted activation barrieris 14.2 kcal/mol, which is lower than those for pure B3LYPand ONIOM3(B3LYP:HF:MM3) calculations by 4.1 and 3.3kcal/mol, respectively. The major difference comes from thedifference between CCSD(T) and B3LYP results for the smallmodel system. The predicted endoergicity of reaction is 4.1kcal/mol, which is 6.4 and 2.0 kcal/mol smaller than the pureB3LYP and ONIOM3(B3LYP:HF:MM3) results. Since CCSD-(T) correlation energy with finite basis sets is usually anunderestimate, the barrier and the endoergicity could be a few

kcal/mol smaller. This single point ONIOM3(CCSD(T):MP2:MM3) calculation costs only 40% of the pure B3LYP calcula-tion.

V. Conclusions

ONIOM is a multilayered approach integrating various levelsof MO and MM methods in optimizing transition state andequilibrium geometries and in performing single point energycalculations. Its three-layer version ONIOM3 divides a systeminto an active, a semiactive, and a nonactive part and considersa small model, an intermediate model, and the real system. Thesmall model would be typically treated with a very high levelof correlation such as CCSD(T), the intermediate model with amodest accuracy MO level such as MP2 and HF, and the realsystem with low accuracy and inexpensive method such as MMforce fields. The motivation behind ONIOM is the costeffectiveness, with the high-level method the most importantpart and handled very accurately, and at the same time theelectronic effects of the semiactive and the steric and electro-static effects of the nonactive part would taken into account ata very small cost.In the example of the Diels-Alder reaction of acrolein +

substituted butadiene it has been shown that the two-layeredIMOMM method using ethylene + butadiene fails because theelectronic effect of the carbonyl group is not taken into account.On the other hand, one would not like to spend a large amountof computer time to include the steric effects of bulky substit-uents. Inserting a third layer at the modest MP2 or HF levelinto the IMOMM approach gives a better description of theelectronic effects and can reproduce the results of a very high-level calculation with an error of a few kcal/mol.With this method, it would become possible to make

predictions of the activation energy or the bond dissociationenergy of a very large system within a chemical accuracy of afew kcal/mol. Using the ONIOM3(CCSD(T):MP2:MM3)method, we have predicted the activation barrier and the energyof reaction for the 83 atom H2 + Pt(P(t-Bu)3)2. The costs ofthe ONIOM3(B3LPY:HF:MM3) and the ONIOM3(CCSD(T):MP2:MM3) calculations for this system are about 4% and 40%,respectively, of the pure B3LYP calculation.

Acknowledgment. The authors are grateful to Dr. JamalMusaev and Qiang Cui for their numerous help and discussions.The use of the Emerson Center computing facilities is acknowl-edged. The present research is in part supported by grants(CHE-9409020 and CHE-9627775) from the National ScienceFoundation. R.D.J.F. acknowledges a postdoctoral fellowshipfrom the Natural Sciences and Engineering Research Councilof Canada. M.S. acknowledges a postdoctoral fellowship fromthe Swedish Natural Science Research Council.

References and Notes(1) Field, M. J.; Bash, P. A.; Karplus, M. J. Comput. Chem. 1990, 11,

700.(2) Maseras, F.; Morokuma, K. J. Comput. Chem. 1995, 16, 1170.(3) Humbel S.; Sieber S.; Morokuma, K. J. Chem. Phys. 1996, 105,

1959.(4) Svensson M.; Humbel S.; Morokuma, K. J. Chem. Phys. 1996, 105,

3654.(5) Matsubara, T.; Maseras, F.; Koga, N.; Morokuma, K. J. Phys. Chem.

1996, 100, 2573.(6) The code we implemented combines the IMOMM code to the

former IMOMO code. For the IMOMM code, see ref 2 and for the IMOMOsee ref 3.

(7) Frisch, M. J.; Trucks, G. W.; Head-Gordon, M.; Gill, P. M. W.;Wong, M. W.; Foresman, J. B.; Johnson, B. G.; Schelgel, H. B.; Robb, M.A.; Replogle, E. S.; Gomperts, R.; Andres, J. L.; Raghavachari, K.; Binkley,

TABLE 3: Activation Barriers Ea, Energies of Reaction Er(in kcal/mol), and Their Errors from the Pure B3LYPBenchmark for the Oxidative Addition of H2 to Pt(P(t-Bu)3)2Calculated with Various ONIOM Schemesa

Ea ¢Ea Er ¢Ercomputertimeb

B3LYP:B3LYP:B3LYPc 18.3 0.0 10.5 0.0 1207HF:HF:HFd 24.6 6.3 18.7 8.2 438B3LYP:B3LYP:HF 19.1 0.8 14.9 4.4 586B3LYP:B3LYP:MM3 16.8 -1.5 7.0 -3.5 148B3LYP:HF:HF 19.8 1.5 14.0 3.4 453B3LYP:HF:MM3 17.5 -0.8 6.1 -4.4 51B3LYP:MM3:MM3 16.4 -1.9 8.0 -2.5 15CCSD(T) e:MP2f:MM3 14.2 4.1 500

a The geometries optimized at the IMOMM(MP2:MM3) level takenfrom ref 5 are used. The energy of reaction is calculated for the cis-product. b The relative computer time required. c Basis set: (5s5p3d1f/3s3p2d1f) with RECP of Hay and Wadt for Pt (Hay, P. J.; Wadt, W.R. J. Chem. Phys. 1985, 82, 299), (4s1p/2s1p) for active H2, and adouble-˙ set for P, C, and H on P(t-Bu)3 (Dunning, T. H.; Hay, P. J.In Modern Theoretical Chemistry; Schaefer, H. F., III, Ed.; Plenum:New York, 1976; p 1). d Basis set: the same as in footnote c abovebut without polarization functions. e Basis set: (5s5p3d1f/4s4p3d1f)with RECP of Hay and Wadt for Pt, 6-311G(2p) for active H2, and thesame as in footnote c augmented with 1d(0.55) on P and 1p(0.75) onH for PH3. f Basis set: (5s5p3d1f/3s3p2d1f) with RECP of Hay andWadt for Pt, 6-31G(p) for active H2, and the same as in footnote c forP (augmented with 1d(0.55)) and C and 6-31G for H for P(t-Bu)3.

19362 J. Phys. Chem., Vol. 100, No. 50, 1996 Svensson et al.

+ +

+ +

ONIOM: Link Atoms

JNH / Lecture 1 17

•  The energy for the sub-system A is not well defined when there is a covalent bond crossing the boundary. Instead, in such cases, A is augmented by an appropriate number of link atoms (typically H atoms).

J. Phys. Chem. 1996, 100, 19357-19363.

Here an H atom is added to convert CH2OH into CH3OH

High-level region

ONIOM: Link Atoms and Gradients

JNH / Lecture 1 18

How do gradients work?

L C

@V

@X=

@V A

high

@X+ (

@V AB

low

@X+

�@V A

low

@X)

Feliu Maseras and Keiji Morokuma, J. Comp. Chem. 1995, 16, 1170-1179

Vhigh implicitly depends on the coordinates XC of atom ‘replaced’ by link atom L, as XL itself depends on XC (e.g. L is positioned along the same bond axis)

=)@V A

high

@XC=

@V Ahigh

@XL⇥ @XL

@XC

ONIOM: Mechanical Embedding

JNH / Lecture 1 19

When the low level in ONIOM is a forcefield method, the only impact the environment has on the subsystem is to distort it according to the coupling terms implicit in the correction term: Especially where the forcefield contains no point charges (ball and spring model typical for hydrocarbons), this is a purely “mechanical” effect.

V = V A

high

+ (V AB

low

� V A

low

)

IMOMM: Another Mechanical Embedding

JNH / Lecture 1 20

ONIOM had a predecessor that was formally an additive method, but also uses mechanical embedding: IMOMM: “Integrated MO and MM”. IMOMM: A new integrated ab initio + molecular mechanics geometry optimization scheme of equilibrium structures and transition states Feliu Maseras and Keiji Morokuma, J. Comp. Chem. 1995, 16, 1170-1179

QM/MM and Electronic Embedding

JNH / Lecture 1 21

For some systems, electrostatic interactions between the QM and MM regions can be quite strong – and the MM environment may be expected to polarize the QM region somewhat. For these systems, it is desirable to allow the QM wavefunction to relax with respect to the MM point charges. This leads to typical QM/MM electronic embedding methods.

V = V AQM + V B

MM + V A-BQM-MM

ρ q

q q

q

q

q

q

q

q

q

q

q

Electronic Embedding in Detail

JNH / Lecture 1 22

In practice the energy expression is rather:

ρ q

q q

q

q

q

q

q

q

q

q

q

V = V ABQM’ + V AB

MM’

V ABQM’ is the QM energy of A in the presence of the point charges

(n electrons, N nuclei, NMM MM point charges)

V ABMM’ is more straightforward:

it contains the MM energy of MM atoms + MM-like interactions between MM and QM atoms

Helec =�1

2

nX

i=1

r2i +

nX

i=1

NX

A=1

�ZA

riA+

nX

i=1

nX

j>i

1

rij+

nX

i=1

NMMX

K=1

�qKriK

Electronic Embedding: MM Peculiarity in Detail

JNH / Lecture 1 23

Consider the MM part of the energy:

Ignoring covalent bonds across the QM-MM boundary, this simply takes into account van der Waals interactions, typically using:

V ABMM’

The Lennard-Jones atom-atom radii and well depths σij and εij are however not expected to be a constant for a given QM atom: if that atom is more polarizable, etc., these numbers change. In practice, it is not easy to change them – so they are fixed.

V ABMM’ =

X

i2A

X

j2B

4✏ij

(✓�ij

rij

◆12

�✓�ij

rij

◆6)

Electronic Polarization

JNH / Lecture 1 24

Point charges indeed lead to similar polarization compared to all-QM calculations:

Analysis of polarization in QM/MM modelling of biologically relevant hydrogen bonds, Senthilkumar, Mujika, Ranaghan, Manby, Mulholland and Harvey, J. Roy. Soc. Interface, 2008, 5, S207 – S216

nX

i=1

NMMX

K=1

�qKriK

7!nX

i=1

NMMX

K=1

(�qK ⇥ f(riK))

Charge Leakage

JNH / Lecture 1 25

Regions of high electron density in the QM region could in principle ‘leak out’ onto positive point charges due to lack of Pauli repulsion by these charges. Not a big problem for modest-sized Gaussian basis sets, but significant with plane waves. Solution: replace 1/r term with a shielded interaction.

See e.g. A regularized and renormalized electrostatic coupling Hamiltonian for hybrid quantum-mechanical–molecular-mechanical calculations, P. K. Biswas and V. Gogonea, J. Chem. Phys. 2005, 123, 164114.

QM/MM Gradients

JNH / Lecture 1 26

There are four types of gradient elements in QM/MM with electronic embedding:

@VQM

@XQM

@VQM

@XMM

@VMM

@XQM

@VMM

@XMM

Like standard QM gradient except for modified w.f.

Not routinely computed by QM codes; but for SCF methods, just equal to –qK × EQM

Standard MM-like terms, for interactions (e.g. vdW) with the QM atoms

Standard MM

QM/MM Geometry Optimization: Microiterations

JNH / Lecture 1 27

Time-consuming for large systems (N atoms). Even efficient algo-rithms tend to take ~N steps until acceptable convergence. Even if each energy and gradient evaluation for the whole QM/MM system is cheap (roughly the cost of the QM energy + gradient for the QM region), this makes optimization expensive. Hence microiterations: carry out many cycles of optimization of MM atom positions for each calculation of QM energy/gradient. Possible because QM and MM degrees of freedom are almost uncoupled: @VMM

@XMM

Does not depend on the QM system

Quite small; depends only weakly on the QM wavefunction – can be appro- ximated by electrostatics.

@VQM

@XMM

See e.g. A Mixed Quantum Mechanics/Molecular Mechanics (QM/MM) Method for Large-Scale Modeling of Chemistry in Protein Environments, Murphy, Philipp and Friesner, J. Comp. Chem. 2000, 21, 1442.

QM/MM: Ab initio versus Semiempirical

JNH / Lecture 1 28

QM/MM calculations are frequently performed with semiempirical (MNDO-type, DFTB), DFT and ab initio methods. The former are cheap enough that the cost of the QM calculation is of similar magnitude to the MM calculation. For typical DFT and for HF, the QM part is usually much more expensive than the whole MM part – but cheap enough to enable geometry optimization, etc. For correlated ab initio methods, e.g. MP2, CASPT2, CCSD(T) etc., the single-point energy is available but the gradient is either not available or too expensie to allow geometry optimization.

Covalent bonds and the QM-MM boundary

JNH / Lecture 1 29

In QM/MM calculations, covalent bonds between the QM and MM boundary can be treated in several different ways: •  Link atoms, as in mechanical embedding •  Pseudo-atoms: instead of an H atom to replace the bonded MM

atom, use some kind of 1-electron atom with a pseudopotential designed to yield the correct bond length and electronic structure. (See discussion in Xiao and Zhang, J. Chem. Phys. 2007, 127, 124102)

•  Frozen orbitals: constrain one hybrid atomic orbital of the QM atom to have exactly the same shape it has in a reference compound, with two electrons – behaves a bit like a lone pair (see e.g. Murphy, Philipp and Friesner, J. Comp. Chem. 2000, 21, 1442.)

Polarizable MM QM/MM

JNH / Lecture 1 30

With standard electronic embedding, the MM region is not polarizable. Hence e.g. in a simple system like the water dimer, with one molecule MM and the other QM, the energy is not the same for the two possible splits. In QM/MM using polarizable MM (see e.g. Schwörer et al, J. Chem. Phys. 2013,

138, 244103) the MM atoms are treated using a polarizable forcefield, with e.g. induced dipoles on atoms. The MM forcefield needs to be iterated for internal convergence and convergence with QM region. This approach requires good polarizable MM force fields… Work in progress (see e.g. Y. Shi, Z. Xia, J. Zhang, R. Best, C. Wu, J. W. Ponder and P. Ren, J. Chem. Theory Comput., 2013, 9, 4046)

QM/MM and Properties

JNH / Lecture 1 31

Many electronic properties can be computed using QM/MM (various spectroscopic properties especially) Whenever the property is determined mainly by the QM region, any method which can be used in QM calculations to compute that property can be applied Care is needed as often long-range effects can be larger than expected (see e.g. studies of redox potentials and NMR in Case Studies part)