arXiv:1909.00028v1 [quant-ph] 30 Aug 2019

Discontinuous Galerkin discretization for quantum simulation of chemistry

Jarrod R. McClean,1 Fabian M. Faulstich,2 Qinyi Zhu,3 Bryan O’Gorman,4, 5

Yiheng Qiu,6 Steven R. White,6 Ryan Babbush,1 and Lin Lin3, 7

1 Google Research, 340 Main Street, Venice, CA 90291, USA
2 Hylleraas Centre for Quantum Molecular Sciences, Department of Chemistry, University of Oslo, Oslo, Norway
3 Department of Mathematics, University of California, Berkeley, CA 94720, USA
4 Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720, USA
5 Quantum Artificial Intelligence Laboratory, NASA Ames Research Center, Moffett Field, CA 94035, USA
6 Department of Physics and Astronomy, University of California, Irvine, CA 92697, USA
7 Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA

(Dated: September 4, 2019)

Methods for electronic structure based on Gaussian and molecular orbital discretizations offer a well established, compact representation that forms much of the foundation of correlated quantum chemistry calculations on both classical and quantum computers. Despite their ability to describe essential physics with relatively few basis functions, these representations can suffer from a quartic growth of the number of integrals. Recent results have shown that, for some quantum and classical algorithms, moving to representations with diagonal two-body operators can result in dramatically lower asymptotic costs, even if the number of functions required increases significantly. We introduce a way to interpolate between the two regimes in a systematic and controllable manner, such that the number of functions is minimized while maintaining a block diagonal structure of the two-body operator and desirable properties of an original, primitive basis. Techniques are analyzed for leveraging the structure of this new representation on quantum computers. Empirical results for hydrogen chains suggest a scaling improvement from O(N^{4.5}) in molecular orbital representations to O(N^{2.6}) in our representation for quantum evolution in a fault-tolerant setting, and exhibit a constant factor crossover at 15 to 20 atoms. Moreover, we test these methods using modern density matrix renormalization group methods classically, and achieve excellent accuracy with respect to the complete basis set limit with a speedup of 1–2 orders of magnitude with respect to using the primitive or Gaussian basis sets alone. These results suggest our representation provides significant cost reductions while maintaining accuracy relative to molecular orbital or strictly diagonal approaches for modest-sized systems in both classical and quantum computation for correlated systems.

I. INTRODUCTION

Predicting properties of both molecular and extended systems from first principles has long been the goal of electronic structure in both correlated classical methods [1], including new approaches based on tensor networks [2–5], and now many approaches based on quantum computing [6–15], some of which have even been implemented on experimental devices [16–25]. A crucial aspect of such simulations is the representation of the problem in a tractable discretization scheme such as a finite-difference method, or (more commonly) a basis set, also known as a Galerkin discretization of the problem. The selection of the basis influences not only the accuracy of the calculation but also the fundamental scaling of the cost of the simulation as a function of the system size.

The design of basis sets for correlated electronic structure has a long and rich history, packing much of the essential physics and chemistry into very compact representations to exploit the power of existing methods. While these basis sets have ranged from general purpose to those optimized for individual computations, a common mainstay has been the use of Gaussian-based molecular orbitals [1, 26–28]. Molecular orbitals tend to offer compact representations of correlated problems and also present an energy ordering of orbitals that facilitates further reduction of the space through active space methods. However, a side effect of this reduction in the number of orbitals is often a Hamiltonian with a quartic number of terms and orbitals that are delocalized in space.

The utility of localizing orbitals in both the occupied and virtual space has been recognized in the development of many linear scaling methods [29], such as those based on pair natural orbitals [30–32]. In these cases the localization helps not only in allowing the screening of some terms in the Hamiltonian, but also reducing the correlations that need to be treated to reach a certain level of accuracy. However, even the most localized orbitals that can be produced from transformations of a standard basis are often not strictly local in the sense that they have heavy tails that can extend throughout the system to enforce orthogonality between orbitals. While reasonably compatible with some methods, these tails negatively impact tensor network methods such as the density matrix renormalization group (DMRG), where an area-law entanglement system that is solvable in modest polynomial time becomes a volume law system with exploding bond dimension in a completely delocalized basis [33]. This effect, in combination with consideration of the number of Hamiltonian terms, led to the recent development of Gausslet basis sets [34, 35], which use properties of wavelet transforms to maintain strict localization, orthogonality, and a diagonal Coulomb operator.

FIG. 1. A cartoon schematic of the general objective of this work depicted in terms of the sparsity pattern of the two-electron integrals. Panels: Active Space (MO / Gaussian); Primitive (Planewave-dual / Gausslet); Discontinuous Galerkin (Blocked primitive). At the top, we depict a dense two-electron integral tensor with O(N_a^4) non-zero two-electron integrals, but relatively few basis functions. This is often the case with molecular orbital or diffuse Gaussian basis sets. In the center we depict a diagonal primitive basis set, such as Gausslets, with a relatively high number of basis functions but overall scaling O(N_p^2). Finally, we depict this for a discontinuous Galerkin (DG) basis set built from a diagonal primitive basis set that interpolates between the two regimes by using fewer basis functions than strictly diagonal sets, while retaining an overall O(N_d^2) scaling through its block diagonal structure. We note that the form of block diagonal structure is a particular sparsity pattern which depends on the arrangement of indices when considering a matrix form; the text defines this sparsity pattern precisely.

FIG. 2. Compact description of the notation used throughout the paper in counting basis functions in different representations for the electronic structure problem: N_p, primitive basis functions in the diagonal basis; N_a, functions in the compact active space basis; N_b, blocks in the DG basis; n_κ, functions in DG block κ; N_d, total DG functions (Σ_κ n_κ). Here discontinuous Galerkin (DG) is the block basis we construct from primitive functions to represent the active space orbitals with a block diagonal Hamiltonian representation.

In the case of quantum computing-based approaches, while basis localization may impact the representational power of an ansatz-based variational approach, the structure and number of terms in the Hamiltonian represent the dominant cost factor [36–39]. Moreover, some traditional density-based truncations and localizations can be difficult to utilize in quantum computers, due to the need to measure the density and rotate the basis to maximize the benefits. Recent advances in quantum-computing approaches have shown that Hamiltonians with a diagonal structure with a quadratic number of terms allow algorithms to execute numerically exact real-time dynamics and perform full configuration interaction (assuming a reference state with non-vanishing overlap on the ground state can be prepared) with a cost in terms of basis size that scales as roughly O(N_p^{1/3}) in first quantization [40] or O(N_p^2) in second quantization [41]. While the first work was done for periodic systems based on basis sets related to discrete variable representations (DVR) [36], these results apply to all basis sets with these properties. This is in contrast to the most advanced methods based on Gaussian molecular orbitals on quantum computers, which have costs more like O(N_a^3) in first quantization [42] or O(N_a^4) in second quantization [38].

While the scaling advantages of basis sets with diagonal representations appear at reasonable system sizes, reduced representational flexibility can mean the cost of switching representations for equivalent accuracy is still higher than current quantum computers with limited resources can afford. Thus, it is desirable to be able to split the difference between strictly diagonal basis sets and molecular orbitals. The adaptive local basis set [43–47] was recently introduced to achieve this tradeoff in the setting of density functional theory (DFT). The adaptive local basis is constructed on-the-fly to capture atomic and environmental effects, by solving eigenvalue problems restricted to local domains called the elements. Each basis function is only supported on one element and is discontinuous on the level of the global domain, and the basis functions are "glued" together to approximate the continuous electron density using the interior penalty Discontinuous Galerkin (DG) formalism [48, 49]. Compared to typical numerical methods for solving DFT, the DG formulation includes extra correction terms to handle the discontinuities of the basis functions at the boundary of the elements. Numerics indicate that roughly 10 to 40 basis functions per atom are often sufficient to achieve chemical accuracy for DFT calculations using pseudopotentials [50, 51].

In this work, we introduce a variation of the DG approach for simulation on quantum computers, which has two main differences compared to previous works [43, 44]. First, our target is to model electronic structure at the correlated level instead of the DFT level, and hence we can afford to obtain the best local basis functions, e.g., by starting from molecular orbitals or approximate natural orbitals. This removes a significant step of approximation in the original DG approach due to the solution of local eigenvalue problems with certain artificial boundary conditions (such as the Dirichlet or periodic boundary conditions). Second, our basis functions are represented as the linear combination of primitive basis functions, which means that the basis functions are no longer strictly discontinuous in the continuous space. We demonstrate that the two-body operator can still maintain a block-diagonal structure for efficient quantum simulation, and for simplicity we will still refer to the basis set as the DG basis set. The use of the primitive basis functions removes the need for correction terms that account for the discontinuity, which is similar in spirit to the discrete discontinuous basis projection method [52] for DFT calculations. The DG basis set enables one to interpolate between cheap diagonal representations and compact non-diagonal basis sets with blocks of specified size.

We begin by introducing the standard discretization of the electronic structure problem in a basis, and define precisely what is meant by a diagonal basis representation and strictly localized functions. We then briefly review basis sets that exhibit spatial locality and diagonal interactions that have already been used in the context of DMRG and quantum computing. This allows us to highlight the cost advantages of these representations within each approach. Then, the discontinuous Galerkin approach is introduced as a general framework for maintaining these properties by using those basis sets as a primitive building block. The block diagonal structure of the resulting Hamiltonians leads us to new cost models based on swap networks for quantum algorithms. We show that the new approach both maintains accuracy in correlated calculations, and demonstrates a crossover to dramatically lower costs at modest system sizes between 15 and 20 atoms. We finish with an outlook on how this approach will influence quantum and classical approaches to correlated electronic structure alike.

II. DISCRETIZING THE ELECTRONIC STRUCTURE PROBLEM

A crucial aspect of essentially all algorithms for the simulation of electronic systems is discretization of the system into some tractable representation. This step takes the electronic Hamiltonian that acts in some continuous space, and maps it to a discrete space. The continuous electronic structure Hamiltonian is given by

H = -\sum_i \frac{\nabla_{r_i}^2}{2} - \sum_{I,j} \frac{Z_I}{|R_I - r_j|} + \sum_{i<j} \frac{1}{|r_i - r_j|} + E_{II},    (1)

where we have assumed atomic units, the Born–Oppenheimer approximation such that the positions of nuclei R_I are constants (giving rise to a constant energy correction E_{II} for the nuclear-nuclear interaction), and the r_i represent the positions of electrons. In the case of Galerkin discretizations, where one chooses a basis set given by some set of orthonormal functions χ_i(r) (for simplicity we assume a spin-restricted formulation), and enforces the anti-symmetry of electrons in the operators, one may express the Hamiltonian in the standard second quantized form. It is possible to select basis functions from a number of complete sets that allow the electron-electron interaction to be represented in a way that is entirely diagonal under a Jordan–Wigner representation in the computational basis and also exhibits a diagonal property under matricization. In this representation, the second quantized Hamiltonian is given by

H^{(p)} = \sum_{\mu,\nu=1}^{N_p} h^{(p)}_{\mu\nu}\, b^\dagger_\mu b_\nu + \frac{1}{2} \sum_{\mu,\nu=1}^{N_p} v^{(p)}_{\mu\nu}\, n_\mu n_\nu,    (2)

where b^\dagger_\mu is a creation operator in the primitive basis, and n_\mu = b^\dagger_\mu b_\mu is a number operator. We refer to Hamiltonians written in this form as "diagonal Hamiltonians", and use such basis sets here as our "primitive" basis (associated with the superscript (p) in H^{(p)}) to efficiently construct compact bases which partially retain this property.

Generally, the coefficients in these expressions are given by the following integrals,

h^{(p)}_{\mu\nu} = \int \mathrm{d}r\, \chi_\mu(r) \left( -\frac{\nabla_r^2}{2} - \sum_I \frac{Z_I}{|R_I - r|} \right) \chi_\nu(r),
v_{\mu\sigma\gamma\nu} = \int \mathrm{d}r\, \mathrm{d}r'\, \frac{\chi_\mu(r)\, \chi_\sigma(r)\, \chi_\gamma(r')\, \chi_\nu(r')}{|r - r'|},    (3)

and the defining property of our primitive basis sets may be written as v_{μσγν} → v^{(p)}_{μν} δ_{μσ} δ_{γν}, i.e., v becomes a diagonal matrix when we view (μ, γ) as the row index and (σ, ν) as the column index, respectively. Note that the relation between v_{μσγν} and v^{(p)}_{μν} δ_{μσ} δ_{γν} is not necessarily an equality: what one requires is that solutions to the Schrödinger equation using the two different forms of the interaction can systematically approach each other as one approaches the complete basis set limit. A grid, defined via finite differences, has the diagonal property, although there is no underlying basis. Here we consider only basis sets, but the basis functions with the diagonal property are naturally associated with a uniform or non-uniform grid, with typical spacings between grid points much smaller than interatomic distances.
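As a concrete check of this matricized diagonal property, the following minimal NumPy sketch (not from the paper; the tensor names are illustrative) views (μ, γ) as the row index and (σ, ν) as the column index and tests whether the resulting matrix is diagonal:

```python
import numpy as np

def is_diagonal_two_body(v, tol=1e-10):
    """Check the diagonal property: v[mu, sigma, gamma, nu] is (numerically)
    of the form v_p[mu, nu] * delta(mu, sigma) * delta(gamma, nu).

    The tensor is matricized with (mu, gamma) as the row index and
    (sigma, nu) as the column index; the property holds when that matrix
    is diagonal."""
    n = v.shape[0]
    m = v.transpose(0, 2, 1, 3).reshape(n * n, n * n)  # rows (mu, gamma), cols (sigma, nu)
    return np.max(np.abs(m - np.diag(np.diag(m)))) < tol

# Example: build a strictly diagonal interaction from some v_p[mu, nu].
n = 4
v_p = np.random.rand(n, n)
eye = np.eye(n)
v = np.einsum('mn,ms,gn->msgn', v_p, eye, eye)   # v[mu, sigma, gamma, nu]
print(is_diagonal_two_body(v))                    # True by construction
```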

For a basis set, a sufficient condition for equality of these expressions is that one has functions with strictly disjoint support, such that (formally) χ_μ(r)χ_σ(r) = 0 for all μ ≠ σ. However, such a requirement would be unrealistic because upon careful inspection it would imply a simultaneously diagonal kinetic and potential operator if evaluated in the Galerkin formulation in Eq. (3) (in contrast to a finite-difference or overlapping finite-element approach). Classical finite element methods can produce near-diagonality by allowing overlapping elements between only spatially neighboring sites, which retains the favorable scaling, but generically introduces non-orthogonality in the basis [53–56]. Accordingly, methods requiring a return to orthogonality, such as quantum-computing methods, suffer a transformation to O(N_p^4) terms on orthogonalization, the introduction of unnecessary orthogonality tails, or both. More recent developments in classical discretization have extended this idea of disjoint cells to allow for truly disjoint basis sets, but at the cost of the introduction of additional surface terms and discontinuity penalties to reintroduce physicality into the problem. These methods are known as Discontinuous Galerkin (DG) methods [48, 49] and were first introduced to electronic structure in the context of density functional theory [43–46]. They represent a general and rigorous framework for constructing problem representations that have this property of strict (block) locality, and we develop a variation of these methods in this work to achieve this goal for correlated electronic structure methods without the need for the introduction of surface terms.

While the condition of spatial disjointness is a sufficient condition to obtain a Hamiltonian with O(N_p^2) terms, it is not necessary. Two basis sets that have the diagonal property without the equality of the individual matrix elements, and thus without the strict spatial disjointness property, are the plane wave dual basis [36] and Gausslet basis [34]. The plane wave dual basis is related to periodic sinc functions and discrete variable representations (DVR) [36] and is constructed from an aliased Fourier transform of plane waves for a given box to yield a diagonal Hamiltonian. Here the diagonality originates from the plane wave dual functions (also called the periodic sinc functions) that are Lagrange interpolation functions on a uniform grid. It has the advantage that the basis is naturally periodic, and thus is well suited for the treatment of materials and other condensed phase systems. Moreover, a modification using a truncated Coulomb interaction can enable the treatment of isolated systems. Two key downsides of this approach are that it inherently reflects a uniform discretization in space of the problem, lacking the ability to adaptively refine sharp features such as the electron-nuclear cusp, and that it has long tails responsible for maintaining the orthogonality of the basis. This first property is reflected in a large overhead for representing atomic systems to a level of accuracy similar to Gaussians, and the second makes the method difficult to use with geometric entanglement based approaches such as tensor networks.

When integrated with smooth functions, Gausslets behave like δ functions, which results in the diagonal property. The Gausslets are obtained as linear combinations of arrays of Gaussians using wavelet transforms. Specific moment properties of the wavelet transforms make the Gausslets integrate polynomials up to a certain order like a δ function. The δ function property survives under smooth coordinate transformations, so unlike plane wave dual bases, Gausslets can have variable resolutions, with more degrees of freedom near the nuclei to represent the electron-nuclear cusp. Also in contrast to the plane wave dual basis functions, Gausslets have strong localization characteristics, avoiding long tails. The lack of tails means these basis sets are naturally suited for tensor network methods and, relatedly, a variational quantum ansatz may have more expressive power at shorter circuit depths. The ability to more naturally represent inhomogeneous features means the representational overhead is expected to be modest relative to plane wave representations.

We note, however, that both methods can benefit from the introduction of pseudopotentials, and that the representational power of all single particle basis sets is expected to be limited by the same asymptotic scaling in the limit of a very large number of basis functions. This means that for very large basis set sizes approaching the complete basis set limit, the representational overheads are expected to be negligible, and the determining factors are the other properties of the basis sets. Despite this, however, there is considerable interest in treating systems before reaching this limit, where the representational overhead for basis sets can differ considerably.

In order to meet the demands of compactness one could start from a more generic basis set and use the expressions in Eq. (3) to determine the Hamiltonian. However, an approach that will prove fruitful here is to use the fact that it is equivalent to start from a complete primitive basis set, such as those above, and project into a compact "active space" Hamiltonian. Gaussian, molecular orbitals, or other active space constructions can be seen as a specific case of the active space we refer to here.

Since the number of primitive basis functions is typically large compared to the number of electrons, quantum chemistry calculations, especially at the correlated level, are often performed using a smaller basis set (e.g. Gaussian basis functions, atomic orbitals, or molecular orbitals). Let {φ_p(r)}_{p=1}^{N_a} be a set of orthonormal single-particle functions; we refer to them as active space orbitals (for instance, canonical Hartree–Fock orbitals, or natural orbitals), which can be expanded using a primitive basis set as

\varphi_p(r) = \sum_\mu \chi_\mu(r)\, \Phi_{\mu p}.    (4)

Here Φ ∈ C^{N_p×N_a} is a matrix with orthonormal columns. While this projection step represents an approximation depending on the nature of the primitive and active bases, it is one that can be systematically controlled and understood by increasing the size of the underlying primitive basis without significantly increasing the size of the resulting block basis we will construct here. For flexibility and accuracy, we will allow quite general definitions of the active space basis set in this work. It will pertain to traditional Hartree-Fock canonical orbitals as well as Gaussian basis sets such as the Dunning cc-pVDZ basis set [26], which allows us to use Gaussians with strictly block diagonal properties by going through the primitive basis set using point sampling through the approximate delta function properties of the primitive basis set. It has been shown previously that for the Gausslet primitive basis set we use, point sampling is extremely accurate due to the delta function property of the basis [34]. We will also make use of this construction to build hybrid active spaces, where we use a weighted density matrix from multiple basis sets to define Φ through its most important natural orbitals. In particular, we will combine the expressive power of a large primitive basis, such as Gausslets, to capture static correlations through an unrestricted Hartree-Fock defined active space, while including a Gaussian basis empirically refined to express dynamic correlation, e.g. cc-pVDZ, through weighting. In this work we make use of a joined set of density matrices built from a UHF solution D_{UHF} and Gaussian orbitals αD_{Gaussian}, to form D = D_{UHF} + αD_{Gaussian} in the primitive basis, and use a natural orbital truncation of D to define Φ. Empirically we use a value of α ≈ 0.01 later in this work when we combine these basis sets, which appears to give an excellent improvement in accuracy. A more detailed description of this procedure and refinement of the value α is left to a future work. We term this the hybrid active space approach. As we only use this hybrid approach in conjunction with the discontinuous Galerkin blocking procedure, we offload concerns about orthogonality in the projected basis to the singular value decomposition (SVD) used in the DG procedure.
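The weighted-density-matrix construction above can be summarized schematically as follows. This is only an illustrative NumPy sketch, not the authors' implementation; the function name, occupation threshold, and truncation rule are our assumptions.

```python
import numpy as np

def hybrid_active_space(D_uhf, D_gaussian, alpha=0.01, occ_tol=1e-6):
    """Sketch of the hybrid active space: diagonalize the weighted density
    matrix D = D_uhf + alpha * D_gaussian (both given in the primitive basis)
    and keep its leading natural orbitals as the columns of Phi.
    The occupation threshold occ_tol is an illustrative choice."""
    D = D_uhf + alpha * D_gaussian
    occ, orbs = np.linalg.eigh(D)             # natural occupations and orbitals
    order = np.argsort(occ)[::-1]             # sort by decreasing occupation
    occ, orbs = occ[order], orbs[:, order]
    n_keep = int(np.sum(occ > occ_tol))       # truncate to significant occupations
    return orbs[:, :n_keep]                   # Phi, with orthonormal columns

# Phi = hybrid_active_space(D_uhf, D_gaussian, alpha=0.01)
```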

Taking as granted the construction of the matrix Φ, we define a rotated set of creation and annihilation operators in the active space as

a^\dagger_p = \sum_{\mu=1}^{N_p} b^\dagger_\mu \Phi_{\mu p}, \qquad a_p = \sum_{\mu=1}^{N_p} b_\mu \overline{\Phi}_{\mu p},    (5)

where \overline{\Phi}_{\mu p} denotes the complex conjugate, and we may project the Hamiltonian as

H^{(a)} = \sum_{p,q=1}^{N_a} h^{(a)}_{pq}\, a^\dagger_p a_q + \frac{1}{2} \sum_{p,q,r,s=1}^{N_a} v^{(a)}_{pqrs}\, a^\dagger_p a^\dagger_q a_r a_s,    (6)

which we refer to as the active space Hamiltonian. Generally we see that our primitive basis sets have favorable scaling in the number of terms in the Hamiltonian (O(N_p^2) vs O(N_a^4)), which often corresponds to better scaling algorithms. While N_p and N_a have the same asymptotic scaling, for modest sized calculations it is often observed that N_p ≫ N_a in order to achieve comparable accuracy. Here we will seek a way to split the difference between these two regimes by forming a more compact basis that partially retains the diagonal properties, i.e., the resulting Hamiltonian is block-diagonal.
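For concreteness, a minimal NumPy sketch of this projection is given below, assuming real orbitals and a diagonal primitive two-body tensor; the mapping from the resulting integral tensor to the operator ordering of Eq. (6) is convention dependent and deliberately left out.

```python
import numpy as np

def project_to_active_space(h_p, v_p, Phi):
    """Project diagonal primitive-basis integrals into the active space of Eq. (4).

    h_p : (Np, Np) one-body integrals h^(p).
    v_p : (Np, Np) diagonal two-body coefficients v^(p) of Eq. (2).
    Phi : (Np, Na) coefficient matrix with orthonormal columns.

    Returns the active-space one-body matrix and a four-index tensor in the
    integral convention of Eq. (3).
    """
    h_a = Phi.conj().T @ h_p @ Phi
    # v_{mu sigma gamma nu} = v_p[mu, nu] * delta_{mu sigma} * delta_{gamma nu},
    # contracted with Phi on every index (real orbitals assumed).
    v_a = np.einsum('mp,mq,mn,nr,ns->pqrs', Phi, Phi, v_p, Phi, Phi, optimize=True)
    return h_a, v_a
```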

III. DISCONTINUOUS GALERKIN DISCRETIZATION

At a high level, we construct the block-diagonal basis by fitting spatially connected blocks of the primitive basis set to the active basis set, while preserving the properties of the primitive basis set. We thereby interpolate between the primitive basis set and the active basis set. We will refer to the general class of basis sets that achieve both completeness in some limit and have the diagonal property as primitive basis sets.

FIG. 3. Three DG functions in the X-Z plane, represented in real space. The axes are in units of Bohr and the color intensity represents the amplitude |φ_{κ,j}|. Each of the functions is localized to its block, where the block divisions are shown in dotted grey lines. The DG functions are represented by linear combinations of plane wave dual basis functions. Each DG function is strictly still a continuous function, but its nodal values (defined according to the center of each plane wave dual function) are supported within only one block.

Our goal is to systematically compress the active basis set {φ_p(r)}_{p=1}^{N_a} into a set of orthonormal basis functions partitioned into elements (groups), so that basis functions associated with different elements have mutually disjoint support. Assume that the index set Ω = {1, . . . , N_p} can be partitioned into N_b non-overlapping index sets

\mathcal{K} = \{\kappa_1, \kappa_2, \cdots, \kappa_{N_b}\},    (7)

so that ∪_{κ∈K} κ = Ω. Then the matrix Φ can be partitioned into N_b blocks Φ_κ := [Φ_{μp}]_{μ∈κ} for κ ∈ K. Performing the singular value decomposition for Φ_κ,

\Phi_\kappa \approx U_\kappa S_\kappa V_\kappa^\dagger,    (8)

where U_κ is a matrix with orthonormal columns corresponding to the leading n_κ singular values up to some truncation tolerance τ, we obtain our compressed basis

\phi_{\kappa,j}(r) = \sum_{\mu \in \kappa} \chi_\mu(r)\, (U_\kappa)_{\mu,j}.    (9)

The basis set is adaptively compressed with respect to the given set of basis functions, and the resulting functions are locally supported (in a discrete sense) only on a single index set κ. In the absence of SVD truncation, we clearly have span{φ_p} ⊆ span{φ_{κ,j}}. We refer to this basis set {φ_{κ,j}} as the DG basis set. Note that each DG basis function φ_{κ,j} is a linear combination of primitive basis functions which are themselves continuous, so φ_{κ,j} is also technically continuous in real space. In fact, φ_{κ,j} might not be locally supported in real space if each primitive basis function χ_μ is delocalized. When the primitive basis functions are localized, φ_{κ,j} can be very close to a discontinuous function. (See Fig. 3 for an example.) When computing the projected Hamiltonian, we do not need to evaluate the surface terms in the DG formalism. If we form a block diagonal matrix

U = \mathrm{diag}[U_1, \ldots, U_{N_b}],    (10)

the total number of basis functions is thus N_d := Σ_{κ∈K} n_κ. We remark that the number of basis functions n_κ can be different across different elements. To facilitate the complexity count below we may, without loss of generality, assume that n_κ is a constant and that N_d = N_b n_κ. Then we have defined a new set of creation and annihilation operators

c^\dagger_{\kappa,j} = \sum_\mu b^\dagger_\mu (U_\kappa)_{\mu j}, \qquad c_{\kappa,j} = \sum_\mu b_\mu (U_\kappa)_{\mu j},    (11)

with κ = 1, . . . , N_b and j = 1, . . . , n_κ that correspond to the DG basis set.

Unlike Eq. (5), the basis set rotation in Eq. (11) is restricted to each element κ. We readily obtain the projected Hamiltonian in the DG basis as

H^{(d)} = \sum_{\kappa,\kappa';\, j,j'} h^{(d)}_{\kappa,\kappa';j,j'}\, c^\dagger_{\kappa,j} c_{\kappa',j'} + \frac{1}{2} \sum_{\kappa,\kappa';\, i,i',j,j'} v^{(d)}_{\kappa,\kappa';i,i',j,j'}\, c^\dagger_{\kappa,i} c^\dagger_{\kappa',i'} c_{\kappa',j'} c_{\kappa,j}.    (12)

The matrix elements are

h^{(d)}_{\kappa,\kappa';j,j'} = \sum_{\mu\nu} (U_\kappa)_{\mu j}\, h^{(p)}_{\mu\nu}\, (U_{\kappa'})_{\nu j'},    (13)

and

v^{(d)}_{\kappa,\kappa';i,i',j,j'} = \sum_{\mu\nu} (U_\kappa)_{\mu i} (U_{\kappa'})_{\nu i'}\, v^{(p)}_{\mu\nu}\, (U_\kappa)_{\mu j} (U_{\kappa'})_{\nu j'}.    (14)

In general, the one-body matrix h^{(d)} can be a full dense matrix, but the two-body tensor v^{(d)} always takes a "block diagonal" form in the following sense (it has a specific sparsity pattern). In principle, the two-body interaction in the DG basis set should take the form

\frac{1}{2} \sum_{\kappa,\kappa',\lambda,\lambda';\, i,i',j,j'} v_{\kappa,i;\kappa',i';\lambda,j;\lambda',j'}\, c^\dagger_{\kappa,i} c^\dagger_{\kappa',i'} c_{\lambda',j'} c_{\lambda,j}.    (15)

Compared to Eq. (12), we find that

v_{\kappa,i;\kappa',i';\lambda,j;\lambda',j'} = v^{(d)}_{\kappa,\kappa';i,i',j,j'}\, \delta_{\kappa\lambda} \delta_{\kappa'\lambda'}.    (16)

In other words, v can be viewed as a block diagonal matrix with respect to the grouped indices (κκ', λλ').

FIG. 4. A schematic illustration of the compression process of delocalized active space basis functions into DG basis functions. Beginning with a matrix representing the projection of a primitive basis onto a chosen active basis (left), with the primitive basis grouped into blocks represented by rows here. Those blocks are then reduced by a singular value decomposition (center), which finally leads to the DG basis that has a block diagonal two-electron integral representation (right).

We remark that the convergence of the DG basis set is independent of the choice of the primitive basis set so long as the primitive basis has sufficient degrees of freedom to form a good approximation to the active space functions of interest. At the end of this adaptive procedure, we expect the number of elements in the Hamiltonian to scale as O(N_b^2 n_κ^4). However, as we expect the number of basis functions required to reach a fixed accuracy within a block (i.e., n_κ) to be bounded by a constant as system size grows, the scaling with system size becomes O(N_d^2). We substantiate the rapid asymptotic convergence of n_κ for real systems later in this work; however, simple arguments from spatial locality and basis set completeness lead to the same conclusion.
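A minimal NumPy sketch of the blocking procedure of Eqs. (7)-(9) and of the block two-body elements of Eq. (14) is given below; the partition, the truncation rule, and the helper names are illustrative assumptions rather than the authors' code.

```python
import numpy as np

def dg_basis(Phi, blocks, tau=1e-2):
    """Build the DG basis of Eqs. (7)-(9): split the rows of Phi into elements
    and keep the leading left singular vectors of each block.

    Phi    : (Np, Na) active-space coefficients in the primitive basis.
    blocks : list of integer index arrays partitioning range(Np).
    tau    : relative singular-value truncation tolerance.
    """
    U_blocks = []
    for kappa in blocks:
        U, s, _ = np.linalg.svd(Phi[kappa, :], full_matrices=False)
        n_kappa = max(1, int(np.sum(s > tau * s[0])))   # retained functions in this element
        U_blocks.append(U[:, :n_kappa])
    return U_blocks

def dg_two_body_block(U_blocks, blocks, v_p, k1, k2):
    """Two-body block v^(d)_{k1,k2; i,i',j,j'} of Eq. (14) for a diagonal v^(p)."""
    U1, U2 = U_blocks[k1], U_blocks[k2]
    v_sub = v_p[np.ix_(blocks[k1], blocks[k2])]
    # out[i, i', j, j'] = sum_{mu, nu} U1[mu, i] U2[nu, i'] v_p[mu, nu] U1[mu, j] U2[nu, j']
    return np.einsum('mi,nj,mn,mk,nl->ijkl', U1, U2, v_sub, U1, U2, optimize=True)

# N_d = sum(U.shape[1] for U in U_blocks); only O(N_b^2) two-body blocks are nonzero.
```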

IV. QUANTUM SIMULATION WITH A DG BASIS

Here we introduce the methods used to exploit the properties of the DG basis on a quantum computer for evolution under the Hamiltonian. We first describe a method that implements evolution using a Suzuki-Trotter decomposition through swap networks, taking advantage of recent advances in quantum swap networks. An alternative Trotter approach based on low-rank decompositions is detailed in Appendix A. We then describe a method for time evolution based on the linear combinations of unitaries approach, which will provide the required background for determining the degree of advantage for using DG basis sets for fault-tolerant quantum computations of chemistry.

A. Swap networks for block diagonal Hamiltonians

In the first work using a strictly diagonal basis in quantum computing for chemistry [36], the ability for quantum computers to perform fast Fourier transforms on quantum wavefunctions was exploited to capitalize on the representational advantages of being in either the plane wave basis or its Fourier-transformed dual. That method was originally restricted to Hamiltonians with that particular structure in the coefficients, similar to split-operator Fourier transform methods used in classical simulation of quantum systems. However, it was soon realized that the structure of any diagonal Hamiltonian could be similarly exploited. This generalization used a linear, fermionic swap network to achieve perfect parallelization of a Trotter step with depth that scales linearly in the number of orbitals [57], even when gates are restricted to act on nearest neighbors of a line of qubits.

FIG. 5. Acquaintance strategy for a block-diagonal Hamiltonian with N_b = 4 and n_κ = 10. Qubits are labeled by block index and spin, (κ, ↑) or (κ, ↓). "K^4_n" indicates a 4-complete swap network on n qubits, i.e., one that acquaints the \binom{n}{4} subsets of 4 qubits with each other. The other gates are double bipartite swap networks, explained in Figures 19 and 20.

Fermionic swap networks are analogous to sorting networks from traditional computer science, except built upon the primitive of the fermionic swap operation,

f^{pq}_{\mathrm{swap}} = 1 + a^\dagger_p a_q + a^\dagger_q a_p - a^\dagger_p a_p - a^\dagger_q a_q,    (17)

f^{pq}_{\mathrm{swap}}\, a^\dagger_p\, (f^{pq}_{\mathrm{swap}})^\dagger = a^\dagger_q,    (18)

where f^{pq}_{\mathrm{swap}} is the fermionic swap that swaps the labeling of modes. The fermionic swap operation was introduced in [58] and also studied in the context of tensor networks. The difference from a traditional swap is that, by swapping fermionic modes instead of assignments to qubits, the non-local parity strings used to enforce the fermionic anti-commutation relations can be avoided.
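The following small NumPy check (ours, not from the paper) builds Eq. (17) for two adjacent modes under the Jordan–Wigner mapping and verifies the relabeling property of Eq. (18):

```python
import numpy as np

# Two adjacent modes p (left) and q (right) under the Jordan-Wigner mapping;
# |0> = unoccupied, |1> = occupied.
I2 = np.eye(2)
a  = np.array([[0., 1.], [0., 0.]])        # single-mode annihilation operator
Z  = np.diag([1., -1.])

a_p = np.kron(a, I2)
a_q = np.kron(Z, a)
n_p = a_p.conj().T @ a_p
n_q = a_q.conj().T @ a_q

# Eq. (17): fswap = 1 + a_p^dag a_q + a_q^dag a_p - n_p - n_q
fswap = np.eye(4) + a_p.conj().T @ a_q + a_q.conj().T @ a_p - n_p - n_q

# Eq. (18): conjugation by fswap relabels mode p as mode q.
assert np.allclose(fswap @ a_p.conj().T @ fswap.conj().T, a_q.conj().T)

# On qubits this is the usual swap gate with an extra -1 phase on |11>.
print(np.round(fswap.real, 0))
```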

The basic idea of the linear swap network is to apply fermionic swaps to all neighboring qubits, interact them with their current neighbors, and repeat until all qubits have interacted with each other. Since the introduction and use of these linear fermionic swap networks in quantum algorithms, they have been generalized for use in non-diagonal Hamiltonians with some overhead. For example, the quantum chemistry Hamiltonian can be decomposed into a sum of diagonal Hamiltonians (each in a rotated basis) using techniques similar to Cholesky or density fitting methods, where each diagonal Hamiltonian can then be implemented in sequence [39]. We show how to use this method in the DG representation in Appendix A.

For the general Hamiltonian in quantum chemistry, which is non-diagonal, a generalized swap network that works directly with such Hamiltonians was developed [59]. This network implements time steps for generic O(N_a^4) Hamiltonians in a time that scales as O(N_a^3), and we take advantage of it here with specializations for the block diagonal structure. To implement a Trotter step of the Hamiltonian, the swap network dynamically updates the mapping from qubits to orbitals so that for each term in the Hamiltonian, the involved orbitals are mapped to adjacent qubits. We say that a swap network "acquaints" a set of orbitals when it brings them together at some point in this way, and represent that point by an empty box in the circuit diagrams, which acts as a placeholder for the logical gate to be executed there. Prior work utilizing swap networks has applied them to two extremal regimes with respect to the structure of the two-electron terms in the Hamiltonian: the strictly diagonal case, which can be implemented with O(N_p) depth [57]; and the fully general case, which can be implemented in O(N_a^3) [59]. Here we show how to interpolate between these to achieve O(N_b n_κ^3) = O(N_d n_κ^2) depth for block-diagonal Hamiltonians. (For simplicity, in this section we will assume that all blocks have the same size n_κ, but the techniques generalize in a straightforward way to non-uniform block sizes.)

We focus on how to implement the quartic terms in the Hamiltonian (i.e., two-electron terms involving four distinct spin orbitals). The lower-order terms can be addressed with negligible additional resources by incorporating them into the quartic terms. The quartic terms in the block-diagonal Hamiltonian satisfy the following properties:

1. Two orbitals are from one block κ and two orbitals are from another block κ' (or all four from the same block when κ = κ').

2. The orbital spins have even parity (i.e., all up, all down, or two and two).

We will exploit both of these properties in constructing our swap network, which uses primitives originally designed for implementing unitary coupled cluster [59].
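To make these two properties concrete, the short enumeration below (illustrative only; the representation of spin orbitals is our own choice) lists the 4-spin-orbital sets that can carry a nonzero quartic term in a block-diagonal Hamiltonian:

```python
from itertools import combinations

def quartic_interaction_sets(n_blocks, n_kappa):
    """Enumerate the 4-spin-orbital sets that can carry a quartic term in a
    block-diagonal Hamiltonian: two orbitals from block kappa and two from
    block kappa' (possibly kappa == kappa'), with even total spin parity."""
    spin_orbitals = [(kappa, j, s)
                     for kappa in range(n_blocks)
                     for j in range(n_kappa)
                     for s in ('up', 'down')]
    allowed = []
    for quad in combinations(spin_orbitals, 4):
        block_ids = sorted(orb[0] for orb in quad)
        ups = sum(orb[2] == 'up' for orb in quad)
        two_plus_two = block_ids[0] == block_ids[1] and block_ids[2] == block_ids[3]
        if two_plus_two and ups % 2 == 0:
            allowed.append(quad)
    return allowed

# For N_b blocks of size n_kappa this count grows as O(N_b^2 n_kappa^4),
# versus O((N_b n_kappa)^4) quartic terms for a fully general Hamiltonian.
print(len(quartic_interaction_sets(n_blocks=2, n_kappa=2)))
```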

Figure 5 shows the overall swap network. Initially, the orbitals are arranged on the line in lexicographical ordering; only the block index κ and spin are indicated for concision. The logic of the strategy is as follows:


1. The first layer acquaints all sets of four spin orbitals within each block in which all four orbitals have the same spin. This is achieved by a "4-complete" swap network on each half-block of orbitals, denoted by K^4_{n_κ/2} because the sets of orbitals it acquaints correspond to the edges of a complete 4-uniform hypergraph; it has depth O(n_κ^3). Note that the edges of the complete k-uniform hypergraph K^k_n on n vertices are the \binom{n}{k} sets of k vertices. The "uniform" qualifier indicates that all of the hyperedges have the same number of vertices. See [59] for details.

2. The second layer acquaints all sets of four spin orbitals within each block in which two orbitals have spin up and the other two have spin down. This is achieved by a "double bipartite" swap network on each block in depth O(n_κ^3); see Figure 19.

3. The third layer permutes, in O(n_κ) depth, the orbitals within each block in preparation for the inter-block acquaintances to follow.

4. The rest of the strategy consists of N_b alternating layers that acquaint pairs of parts. In each layer, each block of qubits is paired up with an adjacent one and a "balanced double bipartite" swap network is executed on the pair of blocks; see Figure 20. Each balanced double bipartite swap network acquaints the sets of four orbitals containing two from each block and with even ("balanced") spin parity. This also has the effect of swapping the blocks, so overall a balanced double bipartite swap network is applied to every pair of blocks. Each double bipartite swap network has depth O(n_κ^3).

Overall, the depth is O(N_b n_κ^3), dominated by the latter swap networks that effect the inter-block interactions. The components of this approach are explained in more detail in Appendix D and an alternative approach that may have asymptotic advantages in some regimes is discussed in Appendix A.

B. LCU approaches for simulation

The quantum simulation algorithms discussed in the previous section and in Appendix A are useful for implementing Trotter steps of the chemistry Hamiltonian. Such Trotter steps can be repeated to perform time-evolution for modeling dynamics or for preparing eigenstates via the phase estimation algorithm [60, 61], but they can also serve as an ansatz for composing quantum variational algorithms [15, 62]. This in conjunction with these Trotter steps requiring only minimal (linear) connectivity makes them attractive algorithms for near-term quantum computing. However, within cost models appropriate for a fault-tolerant quantum computer, Trotter steps are not the most competitive technique for chemistry simulation. In such a cost model, the key resource to minimize is the number of T gates required by the algorithm. This is because in most practical error-correcting codes (e.g., the surface code), T gates require many physical qubits for their distillation and are orders of magnitude slower to implement than Clifford gates.

Currently, the lowest T complexity quantum algorithms for simulating chemistry are all based on linear combinations of unitaries (LCU) methods [63]. This family includes Taylor series methods [64] (applied to chemistry in [14, 42]), qubitization [65] (applied to chemistry in [38, 65, 66]) and Hamiltonian simulation in the interaction picture [41] (applied to chemistry in [40, 41]). What all LCU methods have in common is that they involve simulating the Hamiltonian from a representation where it can be accessed as a linear combination of unitaries,

H = \sum_{\ell=1}^{L} \omega_\ell\, U_\ell, \qquad \lambda = \sum_{\ell=1}^{L} |\omega_\ell|,    (19)

where U_ℓ are unitary operators, ω_ℓ are scalars, and λ is a parameter that determines the complexity of these methods. The second quantized Hamiltonians discussed in this paper satisfy this requirement once mapped to qubits (e.g., under the Jordan–Wigner transformation) since strings of Pauli operators are unitary.

LCU methods perform quantum simulation in terms of queries to two oracle circuits defined as

\mathrm{select}\; |\ell\rangle |\psi\rangle \mapsto |\ell\rangle\, U_\ell |\psi\rangle,    (20)

\mathrm{prepare}\; |0\rangle^{\otimes \log L} \mapsto \sum_{\ell=1}^{L} \sqrt{\frac{\omega_\ell}{\lambda}}\, |\ell\rangle,    (21)

where |ψ⟩ is the system register and |ℓ⟩ is an ancilla register which usually indexes the terms in Eq. (19) in binary and thus contains log L ancillae. Up to log factors in precision and other system parameters, LCU methods can perform time-evolution with gate complexity scaling as

O\big((C_S + C_P)\, \lambda\, t\big),    (22)

where C_S and C_P are the gate complexities of select and prepare respectively, and t is time.

To implement the LCU oracles one must be able to coherently translate the index ℓ into the associated U_ℓ and ω_ℓ. In the quantum chemistry context the U_ℓ are related to the second quantized fermion operators (e.g., a†_p a†_q a_r a_s) and the ω_ℓ are related to the molecular integrals. While the U_ℓ have a structure that is straightforward to unpack in a quantum circuit (see [38, 66] for explicit implementations), the ω_ℓ are typically challenging to compute directly from this index (unless one pursues the highly impractical strategy of computing the integrals on-the-fly, as in [14]). As a consequence, with state-of-the-art LCU methods for simulating chemistry the primary bottleneck has been implementation of prepare rather than select [38, 66].

In the most performant LCU approaches for chemistry (see [66] for plane waves and [38] for arbitrary basis sets), prepare is implemented (and bottlenecked) by using a data-lookup routine referred to as QROM (quantum read-only memory) to load the ω_ℓ values into superposition. Using this approach, the cost of prepare is a function of the number of unique coefficients ω_ℓ in the Hamiltonian. Using the QROM of [67] (which improves on the original concept from [66]), one can (up to log factors) implement prepare with T complexity scaling as O(L/g + g) where L is the number of molecular integrals and g is a free parameter so long as at least g dirty ancilla are available during the implementation of prepare. As prepare acts only on the ancilla register, there are typically at least N_d dirty ancilla available (from the |ψ⟩ register). Thus (assuming L > N_d^2) the scaling is O(L/N_d) without ancilla or O(√L) with O(√L) ancilla (often a reasonable tradeoff within error-correction).
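A back-of-the-envelope illustration of the O(L/g + g) tradeoff (constants and log factors dropped; the numbers are purely illustrative):

```python
import math

def qrom_cost(L, g):
    """Leading-order QROM lookup cost, O(L/g + g); constants and log factors dropped."""
    return L / g + g

L = 10**8                                # e.g., number of unique coefficients
print(qrom_cost(L, 1))                   # no dirty ancilla available: ~L
print(qrom_cost(L, 10**3))               # g = N_d dirty ancilla: ~L / N_d
print(qrom_cost(L, math.isqrt(L)))       # g = sqrt(L): ~2 sqrt(L), the minimum
```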

When simulating in a molecular orbital basis with L = O(N_a^4), one can (up to log factors) evolve the Hamiltonian for some time t and achieve T complexity O(N_a^2 λt) [38]. If a straightforward extension of the approach in [38] is applied to the DG basis Hamiltonian with L = n_κ^4 N_b^2 then this scaling would be reduced to O(n_κ^2 N_b λt). However, if one can index the ω_ℓ in a way that exploits symmetries in the coefficients, then one can further reduce the effective value of "L" in both of these expressions. For instance, in [66] it is recognized that while there are N_p^2 coefficients ω_ℓ in the Hamiltonian of Eq. (2) (one for each value of ν and μ), there are only N_p unique coefficients due to the translational invariance of the Coulomb operator (essentially, one for each unique value of the index μ − ν), and the cost of the algorithm is reduced accordingly. As a rough rule of thumb, the scaling becomes O(J/g + g) where J can be thought of as the number of unique scalars needed to completely describe the Hamiltonian without recomputing the molecular integrals. In [38], it is shown that because only J = O(N_a^3) numbers are needed to describe the low rank factorized Coulomb operator in an arbitrary basis [39], one can improve the scaling to O(N_a^{3/2} λt). By tailoring such techniques to symmetries such as periodic crystalline symmetry or other redundancies in the DG Hamiltonian (e.g., those exploited in the low rank representation of Appendix A) one can also further improve the O(n_κ^2 N_b λt) scaling.

Finally, we note that the λ value associated with the molecular orbital basis Hamiltonian is likely larger than the λ value associated with the DG Hamiltonian, and we quantify the extent to which that is the case later in this paper. This is yet another way in which the DG representation should lead to even more efficient implementation of LCU methods for chemistry simulation. Note also that the value of λ is important for the scaling of quantum simulation in a near-term cost model as well. In particular, for quantum variational algorithms the number of measurements (corresponding to the number of circuit repetitions) required to estimate the energy of a Hamiltonian to within error ε scales as O(λ^2/ε^2) [68].
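As a rough way to compare λ between representations, one can evaluate a simple proxy built directly from Eq. (19); note that the λ factor actually used later in the paper is defined in Appendix B (Eq. (B1)) and is not reproduced here, so the sketch below is only indicative.

```python
import numpy as np

def lambda_proxy(h, v):
    """Crude proxy for the LCU lambda of Eq. (19): the 1-norm of the
    second-quantized coefficients.  This ignores the fermion-to-qubit
    mapping and the precise definition of Appendix B (Eq. (B1)), so it is
    only meant for qualitative comparisons between representations."""
    return np.sum(np.abs(h)) + 0.5 * np.sum(np.abs(v))

# lambda_proxy(h_mo, v_mo) vs lambda_proxy(h_dg, v_dg) for the same molecule.
```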

V. NUMERICAL RESULTS

To understand the performance of different discretization schemes, we examine the costs and associated accuracy for each of the methods, using hydrogen chains of increasing lengths as test systems. We show that the crossover point occurs around 15 to 20 atoms, above which the DG representation with the block-diagonal Hamiltonian structure has significantly lower costs.

The subsequent calculations are twofold. We begin with numerical simulations designed to exhibit the expected crossover behavior. We investigate hydrogen chains of increasing length (up to H30) in a Gaussian cc-pVDZ basis [26] (the active space to be fit), where we choose the primitive diagonal basis set to be plane wave dual functions with refinement built to match the accuracy of the Gaussian basis set to a specified tolerance at the level of density functional theory with the Perdew–Burke–Ernzerhof (PBE) exchange-correlation functionals [69]. We are not aware of any electronic structure software package using a plane wave basis set for practical all-electron DFT calculations. Hence we use the pseudopotential formulation based on the ONCV pseudopotential [70]. As we fit to the span of the basis set itself, rather than a density matrix generated with a fixed level of theory, these results should be representative of performance across different levels of theory including, but not limited to, correlated methods. This is done to observe the crossover without extrapolation, as by the size at which the crossover occurs, the more accurate correlated calculations become prohibitively expensive using traditional classical methods.

In the second set of calculations, we instead fit to a natural orbital active space using a Gausslet basis set as the primitive diagonal basis set [34, 35] to demonstrate that this representation maintains accuracy in correlated calculations. This is possible as the properties of a Gausslet basis allow accurate DMRG calculations for comparison on these systems. Similar DMRG calculations are expected to be more expensive in basis sets with heavy tails, such as the plane wave dual basis, which makes the one-particle Hamiltonian a dense matrix, and result in excessive bond dimension requirements for accurate descriptions. These calculations exhibit the versatility of this method and its ability to achieve accuracy, even in a correlated active space built by fitting to a highly non-local active space. Furthermore, we demonstrate that by fitting the DG basis simultaneously to two active basis sets (molecular orbitals at the UHF level and the cc-pVDZ basis functions), the DG basis set can obtain accurate results when compared to quantum Monte Carlo calculations at the complete basis set limit, while achieving a 1-2 orders of magnitude reduction of computational cost over the primitive Gausslet basis or the Gaussian basis set.


FIG. 6. Potential energy surfaces for H20 in the DG basis with different tolerances and the full plane wave basis (energy in a.u. versus internuclear distance in a.u.; curves: primitive basis, Gaussian, and DG with ε = 10^{-1} (⟨n_κ⟩ = 14), 10^{-2} (⟨n_κ⟩ = 17), 10^{-3} (⟨n_κ⟩ = 23)). At the equilibrium position, the average number of DG basis functions per atom is given by ⟨n_κ⟩ for each fit tolerance specified by DG ε, where ε is the SVD cutoff. For comparison the number of primitive functions per atom here is approximately 3000. The primitive basis set is more expressive by design than the active space Gaussian cc-pVDZ basis against which the DG fit is performed. This allows even fairly loose DG fits to match the accuracy of the active space basis.

A. Scaling crossover in a DG plane wave dual basis

In order to demonstrate the performance of the DG approach for a long hydrogen chain, we use the DFT-based MATLAB toolbox KSSOLV [71]. KSSOLV uses a plane wave discretization in combination with pseudopotentials to make larger systems computationally tractable. Fig. 7 shows that chemical accuracy is achieved when the kinetic energy cutoff is set to around 20 hartree for a H10 system using the ONCV pseudopotential. The kinetic energy cutoff needs to be much larger for all-electron calculations using the plane wave basis set. With this kinetic energy cutoff, we can perform calculations on hydrogen chains up to H30. For H30 the box size is 10 × 10 × 118 a_0^3, and the grid size is 20 × 20 × 238, or 95,200 plane wave dual basis functions (around 3,000 basis functions per atom).

We start by constructing an active basis set using thecc-pVDZ basis with the given molecular geometry, andsample it with an underlying grid of plane wave dualfunctions. This yields a real space discretization of thecc-pVDZ basis, which is then transformed into a block-diagonal form by means of a DG blocking procedure, alsoimplemented in KSSOLV. As a technical note, the parti-tioning boundaries in the DG approach are important.The hydrogen chains that are the subject of this nu-merical investigation are quasi 1D-problems. This andthe fact that we use a real space discretization suggests

FIG. 7. Convergence of the total energy per atom for a H10 system with respect to the kinetic energy cutoff using the plane wave basis set and the ONCV pseudopotential. Chemical accuracy (black dashed line) is achieved around Ecut = 20 hartree, which is the value used for all examples in this section.

[Figure 8 plots the mean number of DG basis functions per atom against the number of hydrogens for SVD cutoffs of 10−1, 10−2, and 10−3.]

FIG. 8. Convergence of the block size nκ as a function of system size. Shown is the average number of DG basis functions per atom at equilibrium with SVD tolerances of 10−1, 10−2 and 10−3. For a fixed accuracy, it is observed that the average number of DG functions per block converges as a function of system size.

that the ideal partitioning of the basis in terms of a DG scheme (see Section III) corresponds to a non-uniform partitioning strategy, so that each hydrogen atom is approximately located at the center of each element. We remark that by construction the DG blocking procedure is able to produce accurate results even if the partition is non-ideal, e.g. when a hydrogen atom is located near the boundary of an element. However, this can require a larger number of basis functions per atom compared to the ideal partitioning strategy.
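As a rough illustration of this blocking step, the sketch below (hypothetical names and shapes; not the KSSOLV routine used for the calculations here) restricts a set of active-space orbitals sampled on a real-space grid to the grid points of each element and keeps, per block, the left singular vectors whose singular values exceed a relative tolerance ε.

```python
import numpy as np

def dg_blocking(phi_active, element_of_point, n_elements, eps):
    """Toy sketch of a DG blocking procedure.

    phi_active : (n_grid, n_active) active-space orbitals sampled on a
                 real-space (e.g. plane wave dual) grid.
    element_of_point : (n_grid,) integer array assigning each grid
                 point to a DG element (block).
    eps        : SVD truncation tolerance, relative to the largest
                 singular value within the block.

    Returns a list of (point_indices, block_basis) pairs; each block's
    DG functions are supported only on its own grid points, which is
    what makes the two-electron tensor block diagonal.
    """
    blocks = []
    for kappa in range(n_elements):
        pts = np.where(element_of_point == kappa)[0]
        # Restrict the active orbitals to this element and compress.
        u, s, _ = np.linalg.svd(phi_active[pts, :], full_matrices=False)
        keep = s >= eps * s[0]
        blocks.append((pts, u[:, keep]))       # n_kappa = keep.sum()
    return blocks

# Hypothetical usage: 8 active orbitals on a 1D grid of 400 points
# partitioned into 4 equal elements (random stand-in data).
rng = np.random.default_rng(0)
phi = rng.standard_normal((400, 8))
elem = np.repeat(np.arange(4), 100)
dg = dg_blocking(phi, elem, 4, eps=1e-2)
print([b.shape[1] for _, b in dg])             # DG functions kept per block
```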

We first verify the accuracy of our DG-basis in Figure 6


by comparing potential energy surfaces (PES) in DG discretizations of different tolerances with the corresponding PES in the primitive basis. The primitive basis has been chosen to be much more expressive than the Gaussian active space basis, showing a lower absolute energy. This allows even a coarse DG fit to reliably match the Gaussian active space accuracy, while being far more compact than the original primitive basis set. In fact, the energies obtained from the DG basis are slightly lower than those from the Gaussian basis set. This is because as nκ increases, the span of the Gaussian basis set becomes approximately a subspace of the span of the DG basis set, and hence the DG basis can possibly yield lower energies due to the variational principle. Also due to the variational principle, the energies obtained from the DG basis are noticeably higher than those from the primitive plane wave basis set, and the main limiting factor is the cc-pVDZ basis set to which DG is fitted. The average number of functions per atom ⟨nκ⟩ is shown for each tolerance and is approximately 20, as compared to the primitive basis, which is built from roughly 3,000 functions per atom. The results suggest that the overall energetic accuracy is relatively insensitive even to rather aggressive singular value cutoffs in the DG blocking procedure.

We then plot the average number of DG basis functions per atom in Figure 8 for fixed SVD thresholds in the DG blocking procedure. The data shows that as system size grows, ⟨nκ⟩ approaches a constant for a fixed tolerance, as reasoned earlier. The combination of the energetic insensitivity to the cutoff and the approach to a constant ⟨nκ⟩ strongly supports the existence of a crossover regime where DG is more cost effective than other representations. We confirm the crossover directly by examining the quantities most relevant for fault-tolerant cost models of chemistry on a quantum computer, in particular the number of non-zero two-electron integrals in each representation as well as the λ factor of Eq. (B1), in Figures 9 and 10, respectively (for λ factors at different bond lengths see Appendix B, Figure 13). A cutoff of 10−6 is used to count an individual integral as zero when calculating these quantities empirically on the systems of interest. We choose to illustrate the cost crossover for a bond length of 1.7a0, since this is where we detected the largest deviation with respect to the truncation tolerance.
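For concreteness, the counting used for these two quantities can be sketched as follows: a minimal numpy illustration, assuming a dense two-electron tensor v[p, q, r, s] is available in the representation of interest. It is not the code used to generate Figures 9 and 10, and it ignores the diagonal-term exclusion of Eq. (B1).

```python
import numpy as np

def integral_statistics(v, cutoff=1e-6):
    """Count the two-electron integrals treated as non-zero and the
    corresponding lambda factor (sum of absolute values) for a dense
    tensor v[p, q, r, s].  The 1e-6 cutoff mirrors the threshold used
    when counting integrals in this section."""
    nonzero = np.abs(v) > cutoff
    n_nonzero = int(nonzero.sum())
    lam = float(np.abs(v[nonzero]).sum())
    return n_nonzero, lam

# Hypothetical example with a small random tensor.
rng = np.random.default_rng(1)
v = rng.standard_normal((6, 6, 6, 6)) * 1e-3
L, lam = integral_statistics(v)
print(L, lam)
```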

Considering the cost model in the fault-tolerant setting outlined in Section IV B, one can already observe a scaling advantage for the DG representation over simple Gaussian based active space representations. Recall from that section that the cost of using an LCU method to evolve for some time t is roughly O(√L λ t), where L is the number of non-zero terms in the Hamiltonian. We consider here only the two-electron integrals, as they represent the dominant cost contribution in most cases. For molecular orbital representations, this is generally O(N_a^2 λ t), which was subsequently improved to O(N_a^{3/2} λ t) using low-rank structure in the problem. For the hydrogen chains examined here in the Gaussian basis set, we see empirically

[Figure 9 plots, on a log-log scale, the number of non-zero two-electron integrals against the number of hydrogens (2 to 32), with fitted exponents α: Primitive Basis (α = 1.54), Gaussian (α = 4.14), DG 10−1 (α = 2.03), DG 10−2 (α = 2.18), DG 10−3 (α = 2.34).]

FIG. 9. The number of non-zero two-electron integrals in different representations, for equilibrium and dissociation bond lengths with SVD truncation tolerances of 10−1, 10−2 and 10−3, plotted on a log-log scale. We fit a trendline, plotted with black dots, from the second point onward to extract the scaling as a function of system size as N^α + c for some constant c, and list the exponent α beside each representation in the legend. As predicted, for these system sizes the number of two-electron integrals lies between the primitive and active space representations, tending closer to the O(N^2) scaling of the primitive representation while requiring fewer functions. Note that for the Gaussian basis set, certain elements of the two-electron integrals can vanish due to the symmetry of the atomic configuration of the hydrogen chain. This has a larger impact for small systems than for large systems, and therefore the scaling is observed to be slightly larger than O(N_a^4).

over the system sizes considered L ∝ N_h^4 and λ ∝ N_h^{2.5}, where N_h is the number of hydrogen atoms in the chain, leading to an expected cost scaling of O(N_h^{4.5} t) when not exploiting further low-rank structure. In the DG representation for the same problem, we see that L ∝ N_h^{2.25} and λ ∝ N_h^{1.5}, which suggests an empirical scaling for this physical system of O(N_h^{2.6} t). The same calculation for the primitive basis suggests a scaling of O(N_h^{1.8}), which is the lowest seen; however, the simulation cost and qubit counts required are many orders of magnitude higher here, as seen in the figures. In both quantities we observe a constant factor crossover, where the DG representation has strictly fewer unique two-electron integrals and a lower λ factor than the Gaussian active space basis, at around 15 to 20 hydrogen atoms. This suggests that the DG representation is not only advantageous in a scaling sense, but also for modest finite system sizes for fault-tolerant implementations.


[Figure 10 plots λ against the number of hydrogens (2 to 32) on a log-log scale, with fitted exponents α: Primitive Basis (α = 1.07), Gaussian (α = 2.52), DG 10−1 (α = 1.42), DG 10−2 (α = 1.47), DG 10−3 (α = 1.58).]

FIG. 10. λ value for the Gaussian and DG bases. A core quantity in determining the cost of quantum algorithms, λ, is plotted as a function of system size for different representations. The notation DG ε indicates an SVD cutoff of ε in the blocking procedure. We observe an advantageous crossover in the actual value of λ before or around H20 in all cases. We fit a trendline, plotted with black dots, from the second point onward to extract the scaling as a function of system size as N^α + c for some constant c, and list the exponent α beside each representation in the figure legend. We see that the scaling for the DG basis is significantly better than for the active space basis in all cases as well.

B. Correlated calculations in a DG Gausslet basis

Here we demonstrate the performance of the DG basis set in a vastly different regime, which uses a natural orbital active space from an exact DMRG calculation fit to Gausslet primitive functions in a correlated calculation. The Gausslet basis set is a recent approach to improve the discretization of quantum chemical problems [34, 35]. It has a special focus on sparsity, spatial locality, and orthonormality, to fulfill the needs of strong-correlation methods like DMRG, while keeping the number of basis functions lower than other grid-based bases. Our calculations illustrate the accuracy of the CCSD method in a DG basis with an underlying primitive Gausslet basis set with respect to the original active space. The block-diagonal form of the Hamiltonian in a DG basis yields an asymptotic improvement in the number of non-zero two-electron integrals, which is directly related to the circuit depth and size. The calculations presented here are limited to short hydrogen chains due to the size of the Gausslet basis set and the expense of correlated calculations. The goal of these calculations is to demonstrate that the DG approach maintains accuracy for correlated calculations due to its construction as a fit to the span of the active basis set, rather than to a density matrix generated with a fixed level of theory.

[Figure 11 plots energy (a.u.) against internuclear distance (a.u.) for RSCF, DMRG, DG CCSD, and AS CCSD.]

FIG. 11. Correlated potential energy surfaces for H8. We plot the potential energy surface computed using the Gausslet basis as the primitive basis and a subset of the exact DMRG natural orbitals as an active space (AS). The CCSD energy is calculated in the active space as well as in the DG fit to that active space, showing excellent agreement with the active space energetics in a correlated calculation. The exact DMRG energy in the more expressive, full primitive basis (Gausslets) is shown for reference.

Specifically, we compare CCSD energy results in an active space basis set with calculations in a DG basis fit to the same active space basis. This serves to quantify the overhead of restricting the basis to have block locality versus the totally delocalized natural orbital active space. The calculations are performed for H2, H4, H6 and H8 with varying symmetrically stretched bond lengths. Figure 11 shows the calculation for H8 with the optimal DG partitioning of the Gausslet basis set; the respective figures for H2, H4, and H6 are presented in Appendix C (Figures 14, 15, 16). From these plots one may see that enforcing a block-diagonal structure of the Hamiltonian does not reduce the accuracy beyond the active space approximation.

The Gausslet basis for the computations resulting in Figure 11 consists of 1,336 Gausslets, corresponding to an average of 167 functions per atom. This is far lower than that in the plane wave dual basis, reflecting the variable resolution available with the Gausslet basis. Averaging over all values along the PES, we find that the full two-electron integral tensor for this basis has ∼ 2,310,318 non-zero elements. Using a DG basis it is possible to reduce this number to ∼ 511,449 non-zero two-electron integrals without losing accuracy compared to the respective active space approximation. This suggests cost reductions for correlated calculations that are similar to the observations in Section V A. Note, however, that H8 is a small system compared to the systems considered in Section V A.


Consequently, the number of non-zero two-electron integrals in the DG basis is still larger than in the active space approximation (∼ 52,346). This aligns with the results from the previous section, which showed that the improvement in the non-zero two-electron integral count for the DG basis becomes observable at 15 to 20 atoms, depending on the imposed truncation tolerance. A naive extrapolation of the non-zero two-electron integral count in Table III (see Appendix C, Figure 17) suggests a crossover around 25 atoms for the coupled-cluster calculations, which is again in agreement with the computations in Section V A at a low truncation tolerance (see Appendix B, Table I). We conclude that the trial calculations for small systems performed here, together with the results from Section V A, suggest that the DG approach maintains the accuracy of active space approximations for correlation methods while providing a more efficient representation on quantum devices once a certain system size is reached.

C. DMRG calculations in a DG basis

Here we examine the power of the DG technique to compactly and cheaply represent problems for classical, correlated DMRG calculations by constructing an active space that takes static correlation from a UHF calculation done in a flexible primitive basis set such as Gausslets, combined with contributions from a basis set that has been empirically refined to capture dynamic correlation. These calculations also serve to demonstrate the power of the DG approach in highly correlated calculations more generally, including calculations on a quantum computer. Here we demonstrate a hybrid active space approach for cheaply finding a balanced active space. Specifically, after performing a UHF calculation in a Gausslet basis, we project both the cc-pVDZ basis set and the UHF orbitals onto the primitive Gausslet basis. The UHF calculation can be performed cheaply. By including the UHF orbitals in our active space, we ensure that there are no HF-level errors in the basis; the only lack of completeness is associated with correlation beyond HF. To (partially) capture correlation in a compact way, we include contributions from the empirical Gaussian basis with a slightly smaller weighting factor, α = 0.01 here, then perform the DG blocking procedure to develop the basis as before. The resulting basis maintains the flexibility of the HF solution in the Gausslet basis for the low lying orbitals, while now including the refined features of the Gaussian orbitals without losing the block diagonal structure. Although it is difficult to compare the efficiency of the primitive and DG bases precisely, we find roughly that this approach can achieve a 1-2 orders of magnitude reduction of computational cost over either the primitive Gausslet basis or the Gaussian basis set, while achieving excellent accuracy with respect to the complete basis set limit.
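One plausible reading of this construction, sketched below with hypothetical names and random stand-in data, is to stack the UHF orbitals with the α-weighted Gaussian functions (both sampled on the primitive grid) and then apply the same per-block SVD, keeping a fixed number of functions per block. This is an illustration of the idea under those assumptions, not the implementation used for Fig. 12.

```python
import numpy as np

def hybrid_dg_basis(c_uhf, c_gauss, element_of_point, n_blocks,
                    alpha=0.01, n_keep=15):
    """Sketch of a hybrid DG construction.

    c_uhf   : (n_grid, n_occ) UHF orbitals sampled on the primitive
              (Gausslet) grid.
    c_gauss : (n_grid, n_gauss) Gaussian (cc-pVDZ) functions sampled
              on the same grid.
    alpha   : weight applied to the Gaussian contribution before the
              per-block SVD (alpha = 0.01 in the text).
    n_keep  : DG functions kept per block (15 in the text).
    """
    target = np.hstack([c_uhf, alpha * c_gauss])
    basis = []
    for kappa in range(n_blocks):
        pts = np.where(element_of_point == kappa)[0]
        u, _, _ = np.linalg.svd(target[pts, :], full_matrices=False)
        basis.append((pts, u[:, :n_keep]))
    return basis

# Hypothetical usage with random stand-in data on a 500-point grid.
rng = np.random.default_rng(0)
c_uhf = rng.standard_normal((500, 10))     # stand-in for UHF orbitals
c_gauss = rng.standard_normal((500, 50))   # stand-in for cc-pVDZ on the grid
elem = np.repeat(np.arange(10), 50)
dg = hybrid_dg_basis(c_uhf, c_gauss, elem, n_blocks=10)
print([b.shape for _, b in dg])            # 10 blocks of 15 functions each
```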

Numerical calculations for the H10 system are shown in Fig. 12. For comparison, we perform an unrestricted Hartree-Fock calculation in both a traditional Gaussian basis set, cc-pVDZ, and a multi-sliced Gausslet basis set. The Gaussian basis set contains 5 spatial orbitals per atom, totaling 50 spatial orbitals with the associated non-diagonal two-electron integrals, as one would expect. The Gausslet basis is formed adaptively according to predetermined cutoffs, and the number of functions ranges from 7,000 to 10,000 spatial orbitals for these calculations, while retaining the diagonal property, making calculations at the UHF level relatively straightforward even with such a formidable number of basis functions. This large primitive basis gives UHF results near the complete basis set limit, well beyond the accuracy of this Gaussian basis. This high accuracy in the HF comes at little cost; the UHF is still fast compared to the correlated calculations, and the large number of primitive functions does not strongly affect the size of the DG basis. For larger systems the accuracy could be reduced to keep the HF manageable.

To construct the DG basis, we construct 10 spatial blocks, one around each atom. We make use of the UHF density matrix and keep 7 total orbitals per block, yielding 70 total spatial orbitals, a number almost identical to the number of Gaussians in the cc-pVDZ basis, but maintaining the block diagonal property of the two-electron integrals. By construction, UHF in this basis can accurately match the UHF results of the Gausslet basis set. The introduction of correlation through DMRG in this basis shows improvement as expected, but a relative offset from the exact answer due to UHF's focus on static correlation. By using the weighting procedure to include in the DG construction some of the cc-pVDZ weight for dynamic correlation, at a weighting factor of α = 0.01, we find excellent agreement with calculations done in the exact basis set limit. The number of functions kept here is 15 per block, yielding 150 functions total, or about 3 times that of the cc-pVDZ basis. However, the structure of the interactions and the spatially local construction of the DG functions allow the DMRG calculation to be done with a 1-2 order of magnitude reduction in computational cost as compared to using the Gausslet or Gaussian basis sets alone for the current implementation. This suggests the hybrid active space approach with the DG blocking procedure is a powerful technique for recovering both static and dynamic correlation in a cost effective manner.

To elaborate on the scaling of DMRG, let m be the bond dimension of the state required for the desired accuracy. Then the computational cost in the Gaussian basis set is expected to be O(N_a^3 m^3). In contrast, for Gausslets and for DG representations based on Gausslets, using matrix product operator (MPO) compression with MPO bond dimension D, one expects the asymptotic cost to be O(N_p D m^3) and O(N_d D m^3), respectively. Due to the localization properties, D is expected to depend weakly on the chain length, and to be comparable for both the Gausslet and DG block representations. One can see this by considering the MPO decomposition in the Gausslet representation


[Figure 12 plots energy (a.u.) against internuclear distance (a.u.) for HF in the cc-pVDZ basis (HF vDZ), HF in the Gausslet basis (HF Gausslet), DMRG in the DG basis (DMRG (DG)), DMRG in the hybrid DG basis (DMRG (DG+)), and the exact complete basis set result (Exact (CBS)).]

FIG. 12. Potential energy surfaces for H10 constructed with the hybrid approach. We plot the potential energy surface for unrestricted Hartree–Fock (HF) calculations in both a cc-pVDZ (vDZ) Gaussian basis and a Gausslet basis, from which the DG basis sets are derived to perform a DMRG calculation. We then contrast this with a calculation done in the DG basis using the first 7 virtuals from the HF-DG basis (DG), and finally showcase the power of the hybrid approach to cost-effectively re-introduce dynamic correlation without the cost of the original Gaussian basis by adding a point sampling of the cc-pVDZ basis into the DG basis (DG+). This hybrid basis attains nearly the exact solution with respect to the complete basis set (CBS) limit at a fraction of the cost of using either the Gausslet or Gaussian basis set directly.

and then transformed to the DG representation. While some expansion of the bond dimension could happen within a block, it is bounded by the Gausslet dimension of the block from the properties of a Schmidt decomposition, and no inter-block mixing occurs. As a result, the bond dimension will be comparable. Hence, it is the massive reduction in the number of basis functions, from 10,000 to 150, that reduces the cost by several orders of magnitude while maintaining excellent accuracy. The Gaussian basis set, while advantageous in number of functions, suffers from a lack of spatial locality, but the cost of this non-locality in terms of the required bond dimension for equivalent accuracy has not been studied and likely depends significantly on the completeness of the basis. It is easier to compare against the scaling of Gaussian DMRG in terms of the two-electron integrals: the lack of diagonality results in a much larger set of integrals. In this case, the block diagonality of the DG basis typically results in approximately linear scaling in the number of atoms, versus cubic dependence for Gaussian DMRG. Hence in both cases, the DG approach offers a significant reduction in computational cost for an accuracy that nearly approaches the complete basis set limit, which was obtained through accurate quantum Monte Carlo calculations [35, 72].
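A minimal helper comparing these leading-order cost estimates is given below; prefactors and any dependence of m or D on the basis are ignored, and the specific numbers are hypothetical stand-ins for the H10 example.

```python
def dmrg_cost_estimates(n_a, n_d, m, D):
    """Leading-order DMRG cost estimates quoted in the text:
    Gaussian basis ~ N_a**3 * m**3, and Gausslet/DG representations
    with MPO compression ~ N * D * m**3 (N the number of functions in
    that representation).  Prefactors are ignored."""
    return {
        "gaussian": n_a ** 3 * m ** 3,
        "dg":       n_d * D * m ** 3,
    }

# Hypothetical numbers in the spirit of the H10 example: 50 Gaussian
# orbitals versus 150 DG functions, with modest bond dimensions.
print(dmrg_cost_estimates(n_a=50, n_d=150, m=500, D=60))
```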

VI. CONCLUSION

The discretization problem is a crucial aspect determining the cost and effectiveness of quantum chemistry methods, both classical and quantum. The most popular basis sets for correlated calculations, Gaussian and molecular orbital representations, are notably more compact than alternatives and have a well developed set of tools for their use. Unfortunately, in cost models for quantum computation, the overhead of a quartic number of terms in the Hamiltonian leads to poor scaling with system size. On the other side of the spectrum, representations that achieve a strictly quadratic number of terms and a diagonal representation, such as Gausslets or plane wave dual functions, exhibit excellent scaling with system size but have overheads that make them undesirable for modest size implementations.

Here we introduced a systematic method for interpolating between the two regimes through the use of a blocking procedure motivated by the discontinuous Galerkin (DG) method. This method is able to use any primitive basis with the diagonal property to represent a delocalized active space basis, while maintaining the diagonal property between blocks. By choosing a plane wave dual primitive basis and a Gaussian active space, we were able to show how one can adaptively interpolate between these two regimes to attain both a scaling and a constant factor advantage over the target active space.

When these empirical results are put into the context of known costs for exact quantum algorithms for chemistry, we observed a scaling improvement over Gaussian basis sets from O(N_h^{4.5}) to O(N_h^{2.6}), with a constant factor crossover around 15 to 20 hydrogen atoms. This suggests that for modest sized systems, such as those just beyond the classically tractable regime, this representation will be the optimal choice for quantum algorithms. Moreover, we showed that for high accuracy DMRG calculations, one may take advantage of this representation to achieve a high accuracy calculation with a cost reduction that is over an order of magnitude with respect to traditional representations. In all cases, one may use this methodology to scale between a compact representation and one with superior integral scaling, depending on the requirements of a particular method.

Acknowledgments

This work was partially supported by the Department of Energy under Grant No. DE-SC0017867, the Quantum Algorithm Teams Program under Grant No. DE-AC02-05CH11231, the Google Quantum Research Award (L.L.), the Research Council of Norway under CoE Grant No. 262695, the Peder Sather Grant Program (F.M.F.), the NASA Space Technology Research Fellowship (B.O.), and the Ning fellowship (Q.Z.).


[1] T. Helgaker, P. Jorgensen, and J. Olsen, Molecular electronic-structure theory (John Wiley & Sons, 2014).
[2] S. R. White, "Density matrix formulation for quantum renormalization groups," Phys. Rev. Lett. 69, 2863 (1992).
[3] G. K.-L. Chan and S. Sharma, "The density matrix renormalization group in quantum chemistry," Annu. Rev. Phys. Chem. 62, 465 (2011).
[4] N. Nakatani and G. K.-L. Chan, "Efficient tree tensor network states (TTNS) for quantum chemistry: Generalizations of the density matrix renormalization group algorithm," J. Chem. Phys. 138, 134113 (2013).
[5] S. Szalay, M. Pfeffer, V. Murg, G. Barcza, F. Verstraete, R. Schneider, and O. Legeza, "Tensor product methods and entanglement optimization for ab initio quantum chemistry," Int. J. Quantum Chem. 115, 1342 (2015).
[6] D. S. Abrams and S. Lloyd, "Simulation of Many-Body Fermi Systems on a Universal Quantum Computer," Phys. Rev. Lett. 79, 4 (1997).
[7] G. Ortiz, J. Gubernatis, E. Knill, and R. Laflamme, "Quantum algorithms for fermionic simulations," Phys. Rev. A 64, 022319 (2001).
[8] A. Aspuru-Guzik, A. D. Dutoi, P. J. Love, and M. Head-Gordon, "Simulated Quantum Computation of Molecular Energies," Science 309, 1704 (2005).
[9] D. Wecker, B. Bauer, B. K. Clark, M. B. Hastings, and M. Troyer, "Gate-count estimates for performing quantum chemistry on small quantum computers," Phys. Rev. A 90, 1 (2014).
[10] M. B. Hastings, D. Wecker, B. Bauer, and M. Troyer, "Improving Quantum Algorithms for Quantum Chemistry," Quantum Information & Computation 15, 1 (2015).
[11] R. Babbush, J. McClean, D. Wecker, A. Aspuru-Guzik, and N. Wiebe, "Chemical Basis of Trotter-Suzuki Errors in Chemistry Simulation," Phys. Rev. A 91, 022311 (2015).
[12] J. R. McClean, R. Babbush, P. J. Love, and A. Aspuru-Guzik, "Exploiting locality in quantum computation for quantum chemistry," J. Phys. Chem. Lett. 5, 4368 (2014).
[13] J. R. McClean, M. E. Schwartz, J. Carter, and W. A. de Jong, "Hybrid Quantum-Classical Hierarchy for Mitigation of Decoherence and Determination of Excited States," Phys. Rev. A 95, 042308 (2017).
[14] R. Babbush, D. W. Berry, I. D. Kivlichan, A. Y. Wei, P. J. Love, and A. Aspuru-Guzik, "Exponentially More Precise Quantum Simulation of Fermions in Second Quantization," New J. Phys. 18, 033032 (2016).
[15] J. R. McClean, J. Romero, R. Babbush, and A. Aspuru-Guzik, "The theory of variational hybrid quantum-classical algorithms," New J. Phys. 18, 23023 (2016).
[16] B. P. Lanyon, J. D. Whitfield, G. G. Gillett, M. E. Goggin, M. P. Almeida, I. Kassal, J. D. Biamonte, M. Mohseni, B. J. Powell, M. Barbieri, A. Aspuru-Guzik, and A. G. White, "Towards quantum chemistry on a quantum computer," Nat. Chem. 2, 106 (2010).
[17] J. Du, N. Xu, X. Peng, P. Wang, S. Wu, and D. Lu, "NMR implementation of a molecular hydrogen quantum simulation with adiabatic state preparation," Phys. Rev. Lett. 104, 030502 (2010).
[18] A. Peruzzo, J. McClean, P. Shadbolt, M.-H. Yung, X.-Q. Zhou, P. J. Love, A. Aspuru-Guzik, and J. L. O'Brien, "A variational eigenvalue solver on a photonic quantum processor," Nat. Commun. 5, 1 (2014).
[19] Y. Shen, X. Zhang, S. Zhang, J.-N. Zhang, M.-H. Yung, and K. Kim, "Quantum Implementation of Unitary Coupled Cluster for Simulating Molecular Electronic Structure," e-print arXiv:1506.00443 (2015).
[20] R. Santagati, J. Wang, A. Gentile, S. Paesani, N. Wiebe, J. McClean, S. Short, P. Shadbolt, D. Bonneau, J. Silverstone, D. Tew, X. Zhou, J. O'Brien, and M. Thompson, "Quantum simulation of Hamiltonian spectra on a silicon chip," e-print arXiv:1611.03511 (2016).
[21] P. J. J. O'Malley, R. Babbush, I. D. Kivlichan, J. Romero, J. R. McClean, R. Barends, J. Kelly, P. Roushan, A. Tranter, N. Ding, B. Campbell, Y. Chen, Z. Chen, B. Chiaro, A. Dunsworth, A. G. Fowler, E. Jeffrey, A. Megrant, J. Y. Mutus, C. Neill, C. Quintana, D. Sank, A. Vainsencher, J. Wenner, T. C. White, P. V. Coveney, P. J. Love, H. Neven, A. Aspuru-Guzik, and J. M. Martinis, "Scalable Quantum Simulation of Molecular Energies," Phys. Rev. X 6, 031007 (2016).
[22] J. Colless, V. Ramasesh, D. Dahlen, M. Blok, M. Kimchi-Schwartz, J. McClean, J. Carter, W. De Jong, and I. Siddiqi, "Computation of molecular spectra on a quantum processor with an error-resilient algorithm," Phys. Rev. X 8, 011021 (2018).
[23] A. Kandala, A. Mezzacapo, K. Temme, M. Takita, M. Brink, J. M. Chow, and J. M. Gambetta, "Hardware-efficient variational quantum eigensolver for small molecules and quantum magnets," Nature 549, 242 (2017).
[24] E. F. Dumitrescu, A. J. McCaskey, G. Hagen, G. R. Jansen, T. D. Morris, T. Papenbrock, R. C. Pooser, D. J. Dean, and P. Lougovski, "Cloud quantum computing of an atomic nucleus," Phys. Rev. Lett. 120, 210501 (2018).
[25] C. Hempel, C. Maier, J. Romero, J. McClean, T. Monz, H. Shen, P. Jurcevic, B. Lanyon, P. Love, R. Babbush, A. Aspuru-Guzik, R. Blatt, and C. Roos, "Quantum Chemistry Calculations on a Trapped-Ion Quantum Simulator," Phys. Rev. X 8, 031022 (2018).
[26] T. H. Dunning Jr, "Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogen," J. Chem. Phys. 90, 1007 (1989).
[27] R. A. Kendall, T. H. Dunning Jr, and R. J. Harrison, "Electron affinities of the first-row atoms revisited. Systematic basis sets and wave functions," J. Chem. Phys. 96, 6796 (1992).
[28] D. E. Woon and T. H. Dunning Jr, "Gaussian basis sets for use in correlated molecular calculations. III. The atoms aluminum through argon," J. Chem. Phys. 98, 1358 (1993).
[29] S. Goedecker, "Linear scaling electronic structure methods," Rev. Mod. Phys. 71, 1085 (1999).
[30] R. Ahlrichs, H. Lischka, V. Staemmler, and W. Kutzelnigg, "PNO–CI (pair natural orbital configuration interaction) and CEPA–PNO (coupled electron pair approximation with pair natural orbitals) calculations of molecular systems. I. Outline of the method for closed-shell states," J. Chem. Phys. 62, 1225 (1975).


[31] C. Riplinger, B. Sandhoefer, A. Hansen, and F. Neese, "Natural triple excitations in local coupled cluster calculations with pair natural orbitals," J. Chem. Phys. 139, 134101 (2013).
[32] C. Riplinger and F. Neese, "An efficient and near linear scaling pair natural orbital based local coupled cluster method," J. Chem. Phys. 138, 034106 (2013).
[33] E. M. Stoudenmire and S. R. White, "Sliced basis density matrix renormalization group for electronic structure," Phys. Rev. Lett. 119, 046401 (2017).
[34] S. R. White, "Hybrid grid/basis set discretizations of the Schrödinger equation," J. Chem. Phys. 147, 244102 (2017).
[35] S. R. White and E. M. Stoudenmire, "Multisliced Gausslet basis sets for electronic structure," Phys. Rev. B 99, 081110 (2019).
[36] R. Babbush, N. Wiebe, J. McClean, J. McClain, H. Neven, and G. K.-L. Chan, "Low-depth quantum simulation of materials," Phys. Rev. X 8, 011044 (2018).
[37] R. Babbush, C. Gidney, D. W. Berry, N. Wiebe, J. McClean, A. Paler, A. Fowler, and H. Neven, "Encoding electronic spectra in quantum circuits with linear T complexity," Phys. Rev. X 8, 041015 (2018).
[38] D. Berry, C. Gidney, M. Motta, J. McClean, and R. Babbush, "Qubitization of Arbitrary Basis Quantum Chemistry by Low Rank Factorization," arXiv:1902.02134 (2019).
[39] M. Motta, E. Ye, J. R. McClean, Z. Li, A. J. Minnich, R. Babbush, and G. K.-L. Chan, "Low Rank Representations for Quantum Simulation of Electronic Structure," arXiv:1808.02625 (2018).
[40] R. Babbush, D. W. Berry, J. R. McClean, and H. Neven, "Quantum Simulation of Chemistry with Sublinear Scaling to the Continuum," arXiv:1807.09802 (2018).
[41] G. H. Low and N. Wiebe, "Hamiltonian Simulation in the Interaction Picture," arXiv:1805.00675 (2018).
[42] R. Babbush, D. W. Berry, Y. R. Sanders, I. D. Kivlichan, A. Scherer, A. Y. Wei, P. J. Love, and A. Aspuru-Guzik, "Exponentially More Precise Quantum Simulation of Fermions in the Configuration Interaction Representation," Quantum Sci. Technol. 3, 015006 (2018).
[43] L. Lin, J. Lu, L. Ying, and W. E, "Adaptive local basis set for Kohn-Sham density functional theory in a discontinuous Galerkin framework I: Total energy calculation," J. Comput. Phys. 231, 2140 (2012).
[44] W. Hu, L. Lin, and C. Yang, "DGDFT: A massively parallel method for large scale density functional theory calculations," J. Chem. Phys. 143, 124110 (2015).
[45] A. S. Banerjee, L. Lin, W. Hu, C. Yang, and J. E. Pask, "Chebyshev polynomial filtered subspace iteration in the discontinuous Galerkin method for large-scale electronic structure calculations," J. Chem. Phys. 145, 154101 (2016).
[46] A. S. Banerjee, L. Lin, P. Suryanarayana, C. Yang, and J. E. Pask, "Two-level Chebyshev filter based complementary subspace method for pushing the envelope of large-scale electronic structure calculations," J. Chem. Theory Comput. 14, 2930 (2018).
[47] Y. Li and L. Lin, "Globally constructed adaptive local basis set for spectral projectors of second order differential operators," Multiscale Model. Simul. 17, 92 (2019).
[48] D. N. Arnold, "An interior penalty finite element method with discontinuous elements," SIAM J. Numer. Anal. 19, 742 (1982).
[49] B. Cockburn, G. E. Karniadakis, and C.-W. Shu, "The development of discontinuous Galerkin methods," in Discontinuous Galerkin Methods (Springer, 2000) pp. 3–50.
[50] W. Hu, L. Lin, and C. Yang, "Edge reconstruction in armchair phosphorene nanoribbons revealed by discontinuous Galerkin density functional theory," Phys. Chem. Chem. Phys. 17, 31397 (2015).
[51] G. Zhang, L. Lin, W. Hu, C. Yang, and J. E. Pask, "Adaptive local basis set for Kohn–Sham density functional theory in a discontinuous Galerkin framework II: Force, vibration, and molecular dynamics calculations," J. Comput. Phys. 335, 426 (2017).
[52] Q. Xu, P. Suryanarayana, and J. E. Pask, "Discrete discontinuous basis projection method for large-scale electronic structure calculations," J. Chem. Phys. 149, 094104 (2018).
[53] J. E. Pask and P. A. Sterne, "Finite element methods in ab initio electronic structure calculations," Modelling Simul. Mater. Sci. Eng. 13, R71 (2005).
[54] H. Chen, X. Dai, X. Gong, L. He, and A. Zhou, "Adaptive finite element approximations for Kohn–Sham models," Multiscale Model. Simul. 12, 1828 (2014).
[55] B. Kanungo and V. Gavini, "Large-scale all-electron density functional theory calculations using an enriched finite-element basis," Phys. Rev. B 95, 035112 (2017).
[56] S. Lehtola, "A review on non-relativistic, fully numerical electronic structure calculations on atoms and diatomic molecules," Int. J. Quantum Chem. 0, e25968 (2019).
[57] I. D. Kivlichan, J. McClean, N. Wiebe, C. Gidney, A. Aspuru-Guzik, G. K.-L. Chan, and R. Babbush, "Quantum simulation of electronic structure with linear depth and connectivity," Phys. Rev. Lett. 120, 110501 (2018).
[58] S. Bravyi and A. Kitaev, "Fermionic quantum computation," Ann. Phys. 298, 210 (2002).
[59] B. O'Gorman, W. J. Huggins, E. G. Rieffel, and K. B. Whaley, "Generalized swap networks for near-term quantum computing," arXiv:1905.05118 (2019).
[60] A. Y. Kitaev, "Quantum measurements and the Abelian Stabilizer Problem," arXiv:quant-ph/9511026 (1995).
[61] D. S. Abrams and S. Lloyd, "Quantum Algorithm Providing Exponential Speed Increase for Finding Eigenvalues and Eigenvectors," Phys. Rev. Lett. 83, 5162 (1999).
[62] D. Wecker, M. B. Hastings, and M. Troyer, "Progress Towards Practical Quantum Variational Algorithms," Phys. Rev. A 92, 42303 (2015).
[63] A. M. Childs and N. Wiebe, "Hamiltonian simulation using linear combinations of unitary operations," Quantum Information & Computation 12, 901 (2012).
[64] D. W. Berry, A. M. Childs, R. Cleve, R. Kothari, and R. D. Somma, "Simulating Hamiltonian Dynamics with a Truncated Taylor Series," Phys. Rev. Lett. 114, 90502 (2015).
[65] G. H. Low and I. L. Chuang, "Hamiltonian Simulation by Qubitization," arXiv:1610.06546 (2016).
[66] R. Babbush, C. Gidney, D. Berry, N. Wiebe, J. McClean, A. Paler, A. Fowler, and H. Neven, "Encoding Electronic Spectra in Quantum Circuits with Linear T Complexity," Phys. Rev. X 8, 041015 (2018).
[67] G. H. Low, V. Kliuchnikov, and L. Schaeffer, "Trading T-gates for dirty qubits in state preparation and unitary synthesis," arXiv:1812.00954 (2018).


[68] N. Rubin, R. Babbush, and J. McClean, "Application of Fermionic Marginal Constraints to Hybrid Quantum Algorithms," New J. Phys. 20, 053020 (2018).
[69] J. P. Perdew, K. Burke, and M. Ernzerhof, "Generalized gradient approximation made simple," Phys. Rev. Lett. 77, 3865 (1996).
[70] D. Hamann, "Optimized norm-conserving Vanderbilt pseudopotentials," Phys. Rev. B 88, 085117 (2013).
[71] C. Yang, J. C. Meza, B. Lee, and L.-W. Wang, "KSSOLV—a MATLAB toolbox for solving the Kohn-Sham equations," ACM Trans. Math. Software 36, 10 (2009).
[72] M. Motta, D. M. Ceperley, G. K.-L. Chan, J. A. Gomez, E. Gull, S. Guo, C. A. Jimenez-Hoyos, T. N. Lan, J. Li, F. Ma, A. J. Millis, N. V. Prokof'ev, U. Ray, G. E. Scuseria, S. Sorella, E. M. Stoudenmire, Q. Sun, I. S. Tupitsyn, S. R. White, D. Zgid, and S. Zhang (Simons Collaboration on the Many-Electron Problem), "Towards the solution of the many-electron problem in real materials: Equation of state of the hydrogen chain with state-of-the-art many-body methods," Phys. Rev. X 7, 031059 (2017).
[73] F. Gygi and A. Baldereschi, "Self-consistent Hartree-Fock and screened-exchange calculations in solids: Application to silicon," Phys. Rev. B 34, 4405 (1986).
[74] E. J. Bylaska, K. Tsemekhman, S. B. Baden, J. H. Weare, and H. Jonsson, "Parallel implementation of Γ-point pseudopotential plane-wave DFT with exact exchange," J. Comput. Chem. 32, 54 (2011).

Appendix A: Trotter step by low-rank factorization

As an alternative to the fixed swap networks used in the main text to evolve under the two-electron integral term in the DG basis, one may use the low-rank factorization strategy of [39], applied to the block-diagonal matricized tensor v^{(d)}_{κ,κ′,λ,λ′;i,i′,j,j′}. For such a matrix, the maximum number of Cholesky factors is given by the dimension of the matrix, N_d^2; however, because of the block structure, each of the Cholesky factors need only have non-trivial support within a κ, κ′ block. If nκ bounds the larger of nκ, nκ′, then the dimension of one of these blocks is O(nκ^2). It is easy to see that the total matricized tensor has dimension O(nκ^2 N_b^2), and a number of non-zero entries scaling as O(nκ^4 N_b^2). Hence it matches the primitive limit as nκ → 1 and N_b → N_d, and the active space limit as nκ → N_d and N_b → 1.

To execute a single Trotter step of the two-electron part of the Hamiltonian, one may start from a factorization that is a product over κ, κ′ blocks as

exp( -i Δt Σ_{κ,κ′; i,i′,j,j′} v^{(d)}_{κ,κ′;i,i′,j,j′} c†_{κ,i} c†_{κ′,i′} c_{κ′,j′} c_{κ,j} )
  ≈ Π_{κ,κ′} exp( -i Δt Σ_{i,i′,j,j′} v^{(d)}_{κ,κ′;i,i′,j,j′} c†_{κ,i} c†_{κ′,i′} c_{κ′,j′} c_{κ,j} )
  ≡ Π_{κκ′} R_{κκ′}.    (A1)

Within a non-trivial block (v^{(d)}_{κ,κ′,λ,λ′;i,i′,j,j′} = v^{(d)}_{κ,κ′;i,i′,j,j′}) we expect the following decomposition

v^{(d)}_{κ,κ′;i,i′,j,j′} ≈ Σ_{μ,ν,ℓ} U^{(ℓ,κ,κ′)}_{iμ} U^{(ℓ,κ,κ′)}_{i′μ} U^{(ℓ,κ,κ′)}_{jν} U^{(ℓ,κ,κ′)}_{j′ν} λ^{(ℓ,κ,κ′)}_{μ} λ^{(ℓ,κ,κ′)}_{ν}.    (A2)

Here ℓ comes from the Cholesky factorization, and μ, ν come from a second eigenvalue decomposition. As nκ for a single block κ, κ′ is assumed independent of the system size, so are the index ranges of μ, ν, ℓ. For each ℓ, κ, κ′, U^{(ℓ,κ,κ′)} is a matrix with orthogonal columns. This decomposition allows us to apply R_{κκ′} using the low-rank decomposition technique. The maximum rank of these factors is L, which for Gaussian basis sets we empirically expect to scale as O(nκ), but due to the lack of empirical data for large DG basis sets we assume the worst case, O(nκ^2). For each of these factors, the second eigenvalue decomposition will have maximal rank ρ_ℓ = O(nκ). The depth of such circuits using a fermionic swap network scales as Σ_ℓ ρ_ℓ < L nκ, which in empirical studies on molecular orbital basis sets scaled at least as Ω(nκ log nκ) and in the worst case O(nκ^3). In any case, for a given error tolerance, from locality we know that nκ converges to a constant as a function of system size, and hence each R_{κκ′} may be executed in constant depth for a large system using the low-rank Trotter step, which simultaneously executes a fermionic swap network.
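A small numpy sketch of a per-block factorization in the spirit of Eq. (A2) is given below. It uses an eigendecomposition of the matricized block in place of a pivoted Cholesky factorization, assumes nκ = nκ′ = n and symmetric factors for simplicity, and is purely illustrative rather than the procedure used for the estimates above.

```python
import numpy as np

def block_double_factorization(v_block, tol=1e-10):
    """Illustrative per-block factorization in the spirit of Eq. (A2).

    v_block[i, i2, j, j2] is the two-electron tensor of one
    (kappa, kappa') block, with all indices running over the same
    range n.  The tensor is matricized over the (i, i') and (j, j')
    index pairs and factored with an eigendecomposition (standing in
    for a pivoted Cholesky factorization); each factor is then
    diagonalized, giving U and lambda as in Eq. (A2)."""
    n = v_block.shape[0]
    M = v_block.reshape(n * n, n * n)
    M = 0.5 * (M + M.T)                       # symmetrize for the sketch
    w, vecs = np.linalg.eigh(M)
    factors = []
    for wl, col in zip(w[::-1], vecs[:, ::-1].T):
        if abs(wl) < tol:
            continue                          # discard numerically null factors
        g = 0.5 * (col.reshape(n, n) + col.reshape(n, n).T)
        lam, U = np.linalg.eigh(g)            # second eigenvalue decomposition
        factors.append((wl, U, lam))
    return factors

# Hypothetical check: build a block tensor from two random symmetric
# factors and confirm the factorization reconstructs it.
rng = np.random.default_rng(2)
n = 4
gs = [0.5 * (a + a.T) for a in rng.standard_normal((2, n, n))]
v = sum(np.einsum('ab,cd->abcd', g, g) for g in gs)
v_rec = sum(wl * np.einsum('am,m,bm,cp,p,dp->abcd', U, lam, U, U, lam, U)
            for wl, U, lam in block_double_factorization(v))
print(np.max(np.abs(v - v_rec)))              # ~1e-14
```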

Following this idea further, if one constructs a system mapped to qubits in 1D, where the mapping to orbitals is grouped by κ, then the κκ′ block may be executed for κ′ = κ + 1, and at the same time the two blocks may be swapped with this technique. Moreover, as this adjacent κκ′ block is disjoint from the qubits outside of κκ′, this operation may be parallelized across pairs of adjacent blocks. This corresponds exactly to lifting the original fermionic swap network for time evolution to the level of blocks, where the individual fermionic simulation operations (as they were referred to in the original work) are performed via the low-rank method. From this, we see that the total required depth scales as O(N_b) as desired. Although we have assumed here that nκ approaches a constant as system size grows, if we account for this in the worst case, we expect a depth of O(N_b nκ^3), which matches the fixed swap networks in the main body of the text. Hence, determining the optimal choice of implementation comes down to constant factors and requires further empirical study.

Appendix B: Scaling crossover in a DG plane wave dual basis

Besides the total number of two-electron integrals, the cost of certain quantum algorithms such as the LCU method also depends on λ, the sum of the absolute values


FIG. 13. The number of non-zero two-electron integrals for bond lengths from 1.0 to 3.6 atomic units and tolerances from 10−1 to 10−3, plotted on a log-log scale. The crossover appears before or around H22 in all cases.

of the two-electron integrals. Here λ is computed as

λ = Σ_{κ,κ′; p,q,r,s : (p,q) ≠ (r,s)} |v_{κ,κ′;p,q,r,s}|.    (B1)

Notice that the terms with p = r, q = s nominally diverge in a plane wave basis set with periodic boundary conditions. These values can be computed by including

[Figure 14 plots energy (a.u.) against internuclear distance (a.u.) for RSCF, DMRG, DG CCSD, and AS CCSD.]

FIG. 14. Potential energy surfaces for H2 in a Gausslet basis computed by restricted SCF and (DG-)CCSD, benchmarked against DMRG results. The averaged number of non-zero two-electron integrals per atom (DG element κ) is 4,144. The averaged number of non-zero two-electron integrals of the CAS calculations is 205.

correction terms to the periodic boundary condition [73, 74]. However, the number of such terms in the two-body interaction scales the same as that in the one-body interaction in the Hamiltonian, so we omit such low-order terms directly for simplicity. In Figure 10, we observe that the crossover point occurs around H8 when the tolerance is set to 0.1, and around H18 when the tolerance is 0.01.
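A direct transcription of Eq. (B1) into numpy, with the p = r, q = s terms excluded as described above, might look as follows; the block indices are folded into p, q, r, s, and the tensor is a hypothetical dense array.

```python
import numpy as np

def lambda_factor(v, cutoff=0.0):
    """Compute lambda as in Eq. (B1): the sum of |v[p, q, r, s]| over
    all two-electron integrals except the (nominally divergent) terms
    with p = r and q = s, which are omitted as described in the
    text."""
    n = v.shape[0]
    mask = np.ones(v.shape, dtype=bool)
    p, q = np.meshgrid(np.arange(n), np.arange(n), indexing='ij')
    mask[p, q, p, q] = False                 # exclude p = r, q = s
    vals = np.abs(v[mask])
    return float(vals[vals > cutoff].sum())

# Hypothetical usage with a small random tensor.
v = np.random.default_rng(3).standard_normal((4, 4, 4, 4))
print(lambda_factor(v))
```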

Tolerance   10−3      10−2      10−1
b = 1.0     H16-H18   H14-H16   H8-H10
b = 1.2     H18-H20   H14-H16   H6-H8
b = 1.4     H20-H22   H14-H16   H6-H8
b = 1.6     H20-H22   H12-H14   H6-H8
b = 1.7     H20-H22   H10-H12   H6-H8
b = 1.8     H18-H20   H8-H10    H8-H10
b = 2.0     H16-H18   H8-H10    H8-H10
b = 2.4     H10-H12   H8-H10    H8-H10
b = 2.8     H8-H10    H8-H10    H8-H10
b = 3.0     H8-H10    H8-H10    H8-H10
b = 3.2     H8-H10    H8-H10    H6-H8
b = 3.6     H8-H10    H8-H10    H4-H6

TABLE I. Crossover regions for different bond lengths b (a.u.) and different truncation tolerances.

Appendix C: Correlated calculations in a DG Gausslet basis

Figures 14, 15, 16 show the DG calculations for H2, H4 and H6 using Gausslets as the primitive basis set. Table II shows information on the real space discretization


[Figure 15 plots energy (a.u.) against internuclear distance (a.u.) for RSCF, DMRG, DG CCSD, and AS CCSD.]

FIG. 15. Potential energy surfaces for H4 in a Gausslet basis computed by restricted SCF and (DG-)CCSD, benchmarked against DMRG results. The averaged number of non-zero two-electron integrals per atom (DG element κ) is 16,580. The averaged number of non-zero two-electron integrals of the CAS calculations is 3,269.

[Figure 16 plots energy (a.u.) against internuclear distance (a.u.) for RSCF, DMRG, DG CCSD, and AS CCSD.]

FIG. 16. Potential energy surfaces for H6 in a Gausslet basis computed by restricted SCF and (DG-)CCSD, benchmarked against DMRG results. The averaged number of non-zero two-electron integrals per atom (DG element κ) is 287,762. The averaged number of non-zero two-electron integrals of the CAS calculations is 16,551.

for the Gausslet basis set used. Table III compares the non-zero two-electron integral count for the DG-Gausslet basis and the active space basis. Figure 17 shows a naive extrapolation of the non-zero two-electron integral count presented in Table III.

Atoms                              H2          H4          H6          H8
Total no. of Gausslets             529         801         1,066       1,336
Gausslets along z-axis             11-13       15-21       17-27       21-33
Gausslets along x- resp. y-axis    6.33-7.40   6.92-7.53   7.31-7.78   7.27-8.01

TABLE II. Averaged numbers of Gausslets used to discretize the molecular systems.

Atoms               H2         H4         H6           H8
No. of Gausslets    529        801        1,066        1,336
nnz-tei Gausslet    351,084    816,232    1,459,901    2,310,318
DG per element      6          6          10           10
nnz-tei DG          4,144      16,580     287,762      511,449
CAS                 4          8          12           16
nnz-tei CAS         205        3,269      16,551       52,346

TABLE III. Averaged numerical values of non-zero two-electron integrals for H2, H4, H6 and H8.

[Figure 17 plots extrapolated non-zero two-electron integral counts (×10^6) against the number of hydrogens for the DG basis (DG-nnz-tei) and the active space basis (AS-nnz-tei).]

FIG. 17. Extrapolation of non-zero two-electron integrals with 10 basis functions per DG element.

Appendix D: Swap network sub-circuits

In this section, we give some more detail about the components of the swap network described in Section IV A. Recall the structure of the overall swap network:

1. A 4-complete swap network within each half-block. This acquaints all sets of 4 orbitals with the same spin and within each block.

2. A double bipartite swap network on each block. This acquaints all sets of 4 orbitals with no net spin and within each block. Details of the construction are given in Figure 19.

3. A permutation within each block. This changes


FIG. 18. Notation and decomposition for a P-swap network with partition sizes (1, 2, 1, 2, 2). A P-swap network for a partition P = (P1, P2, . . . , P|P|) of the qubits ⋃i Pi acquaints every union of a pair of parts, i.e., {P ∪ P′ | P, P′ ∈ P}. At a high level, the structure is similar to that of the simple linear swap network, except that instead of single qubits being swapped, groups are (i.e., the parts of the partition P). There are |P| layers of generalized swap gates, each of which swaps sets of qubits. For more details, see [59].

FIG. 19. Construction of the double bipartite swap network, with parts of size 4. The top half of the top circuit contains the same swap gates as a linear swap network but with additional acquaintance opportunities. In the bottom half of the top circuit are 4 linear swap networks in a row, one for each acquaintance layer of the linear swap network in the top half, which is copied for each acquaintance layer of the bottom half. Overall, for every set of four orbitals consisting of two from the top part and two from the bottom part, there is a layer in the circuit in which both pairs are simultaneously acquainted. The bottom circuit, depicting the double bipartite swap network, is formed by replacing each such acquaintance layer in the top circuit with a P-swap network, where a pair of qubits acquainted in the top circuit corresponds to a part of the partition P. The P-swap network acquaints the union of each pair of pairs; see Fig. 18. The final gate ensures that the overall effect is to shift the parts.

the orbital to qubit mapping in preparation for the next stage.

4. Alternating layers of balanced double bipartite swap networks. Each balanced double bipartite swap network acquaints, for some pair of blocks, all sets of 4 orbitals with an even number of each spin and with two orbitals from each block. The N_b alternating layers ensure that every pair of blocks is involved together in some balanced double bipartite swap network. Details of the construction of a balanced double bipartite swap network are given in Figure 20. A minimal sketch of the underlying linear swap network idea follows this list.


FIG. 20. Construction of the balanced double bipartite swap network. Similar to the double bipartite swap network, except that pairs of orbitals from each part are only acquainted when their spins have the same parity. The spins of the orbitals in the initial mapping of qubits to orbitals (↑↓↓↑↑↓↓↑) are indicated in the top left.