density functional theory and nuclear quantum effects
Post on 24-Jan-2022
3 Views
Preview:
TRANSCRIPT
Grassmann manifold, gauge, and quantum chemistry
Lin Lin
Department of Mathematics, UC Berkeley; Lawrence Berkeley National Laboratory
Flatiron Institute, April 2019
1
Stiefel manifoldโข โ๐๐ร๐๐ ๐๐ > ๐๐
โข Stiefel manifold (1935): ๐๐ orthogonal vectors in โ๐๐
๐๐๐ก๐ก๐๐,๐๐ = ๐๐ โ โ๐๐ร๐๐|๐๐โ๐๐ = ๐ผ๐ผ๐๐
2
๐๐
๐๐
Grassmann manifoldโข Grassmann manifold (1848): set of ๐๐ dimensional
subspace in โ๐๐
๐บ๐บ๐๐๐๐,๐๐ = ๐๐๐ก๐ก๐๐,๐๐/๐ฐ๐ฐ๐๐
๐ฐ๐ฐ๐๐: ๐๐ dimensional unitary group (i.e. the set of unitary matrices in โ๐๐ร๐๐)
โข Any point in ๐๐๐ก๐ก๐๐,๐๐ can be viewed as a representation of a point in ๐บ๐บ๐๐๐๐,๐๐
3
Example (in optimization)min๐๐
๐น๐น(๐๐)๐ ๐ . ๐ก๐ก.๐๐โ๐๐ = ๐ผ๐ผ๐๐
โข ๐น๐น ๐๐ = ๐น๐น ๐๐๐๐ ,๐๐ โ ๐ฐ๐ฐ๐๐. invariant to the choice of basis โOptimization on a Grassmann manifold.
โข The representation ๐๐ is called the gauge in quantum physics / chemistry. Gauge invariant.
4
Example (in optimization)min๐๐
๐น๐น(๐๐)๐ ๐ . ๐ก๐ก.๐๐โ๐๐ = ๐ผ๐ผ๐๐
โข Otherwise, if ๐น๐น(๐๐) is gauge dependent โ Optimization on a Stiefel manifold.
โข [Edelman, Arias, Smith, SIMAX 1998] many works following
5
This talk: Quantum chemistryโข Time-dependent density functional theory (TDDFT)
โข Kohn-Sham density functional theory (KSDFT)
โข Localization.
โข Gauge choice is the key
โข Used in community software packages: Quantum ESPRESSO, Wannier90, Octopus, PWDFT etc
6
7
Time-dependent density functional theory
Parallel transport gauge
Time-dependent density functional theory[Runge-Gross, PRL 1984] (spin omitted, ๐๐๐๐: number of electrons)
๐๐๐๐๐ก๐ก๐๐๐๐ ๐ฅ๐ฅ, ๐ก๐ก = ๐ป๐ป ๐ก๐ก,๐๐ ๐ก๐ก ๐๐๐๐ ๐ฅ๐ฅ, ๐ก๐ก , ๐๐ = 1, โฆ ,๐๐๐๐
๐๐ ๐ฅ๐ฅ, ๐ฅ๐ฅโฒ, ๐ก๐ก = ๏ฟฝ๐๐=1
๐๐๐๐
๐๐๐๐(๐ฅ๐ฅ, ๐ก๐ก)๐๐๐๐โ(๐ฅ๐ฅโฒ, ๐ก๐ก)
Hamiltonian
๐ป๐ป ๐ก๐ก,๐๐ ๐ก๐ก = โ12ฮ + ๐๐๐๐๐๐๐ก๐ก(๐ก๐ก) + ๐๐๐ป๐ป๐๐๐ป๐ป[๐๐ ๐ก๐ก ]
8
Density matrixโข Key quantity throughout the talkโข ฮจ(t) = ๐๐1 ๐ก๐ก , โฆ ,๐๐๐๐๐๐ ๐ก๐ก โ โ๐๐๐๐ร๐๐๐๐. ๐๐๐๐: number of discretized grid points to represent each ๐๐๐๐
๐๐ ๐ก๐ก = ฮจ ๐ก๐ก ฮจ ๐ก๐ก โ
9
=๐๐๐๐
๐๐๐๐ ๐๐๐๐๐๐๐๐
Gauge invarianceโข Unitary rotation
ฮฆ = ฮจ๐๐, ๐๐โ๐๐ = ๐ผ๐ผ
๐๐ = ฮฆฮฆโ = ฮจ(๐๐๐๐โ)ฮจโ = ฮจฮจโ
โข Propagation of the density matrix, von Neumann equation (quantum Liouville equation)
๐๐๐๐๐ก๐ก๐๐ ๐ก๐ก = ๐ป๐ป,๐๐ ๐ก๐ก = ๐ป๐ป ๐ก๐ก,๐๐ ๐ก๐ก ๐๐ ๐ก๐ก โ ๐๐(๐ก๐ก)๐ป๐ป ๐ก๐ก,๐๐ ๐ก๐ก
10
Gauge choice affects the time stepExtreme case (boring Hamiltonian)
๐ป๐ป ๐ก๐ก โก ๐ป๐ป0
Initial condition๐ป๐ป0๐๐๐๐ 0 = ๐๐๐๐ 0 ๐๐๐๐
Exact solution๐๐๐๐ ๐ก๐ก = ๐๐๐๐(0)๐๐โ๐๐๐๐๐๐๐ก๐ก
๐๐ ๐ก๐ก = ๐๐(0)
11
Small time step
(Formally) arbitrarily largetime step
P v.s. ฮจโข Propagation of ๐๐(๐ก๐ก) requires storing ๐๐๐๐ ร ๐๐๐๐ matrix. Often
impractical for large basis set (๐๐๐๐ = 103 โผ 107)
โข Propagation of ฮจ(๐ก๐ก) requires storing ๐๐๐๐ ร ๐๐๐๐ matrix, but is non-optimal due to gauge choice (๐๐๐๐ = 10 โผ 104)
โข ๐๐ is only a rank ๐๐๐๐ matrix. Not enough information to justify storing as dense matrix.
โข Common theme in quantum chemistry.
12
Propagation on the Grassmann manifoldโข The Schrรถdinger gauge can be seen as
ฮฆ ๐ก๐ก = ฮจ ๐ก๐ก ๐๐ ๐ก๐ก , ๐๐ ๐ก๐ก โก ๐ผ๐ผ
โข Optimal choice of ๐๐(๐ก๐ก) at each time step, to yield slowest possible dynamics โOptimization on the Grassmann manifold
โข Related work: dynamical low rank decomposition [Koch-Lubich 2007; Lubich-Oseledets, 2014]
13
Optimization of gauge choiceโข Gauge transformed wavefunctions
ฮฆ ๐ก๐ก = ฮจ ๐ก๐ก ๐๐ ๐ก๐ก , ๐๐โ ๐ก๐ก ๐๐ ๐ก๐ก = ๐ผ๐ผ
โข Goal:
min๐๐(๐ก๐ก)
๐๐๐ก๐กฮฆ(๐ก๐ก) ๐น๐น2
โข ๐๐ ๐ก๐ก ฮฆ t = ฮฆ(๐ก๐ก), optimization at each time step
14
Optimization of gauge choiceโข Orthogonal decomposition
ฮฆฬ ๐น๐น2 = ๐๐ฮฆฬ ๐น๐น
2 + (๐ผ๐ผ โ ๐๐)ฮฆฬ ๐น๐น2
โข After some derivation(๐ผ๐ผ โ ๐๐)ฮฆฬ ๐น๐น
2 = Tr ๏ฟฝฬ๏ฟฝ๐2๐๐
โข Optimizer (parallel transport condition)๐๐ฮฆฬ = 0
โข Also induce a parallel transport propagator๐๐๐๐๐ก๐ก๐ฏ๐ฏ = ๐๐๐๐๐ก๐ก๐๐,๐๐ ๐ฏ๐ฏ, ๐ฏ๐ฏ 0 = ๐ผ๐ผ, ฮฆ ๐ก๐ก = ๐ฏ๐ฏ ๐ก๐ก ฮจ0
15
Gauge invariant!
Parallel transport (PT) dynamicsโข Schrรถdinger dynamics
๐๐๐๐๐ก๐กฮจ = ๐ป๐ปฮจ, ฮจ 0 = ฮจ0โข Parallel transport dynamics
๐๐๐๐๐ก๐กฮฆ = ๐ป๐ปฮฆโฮฆ ฮฆโ๐ป๐ปฮฆ , ฮฆ 0 = ฮจ0
Gauge implicitly defined by the dynamics Only one additional mixing term needed, simple to implement Optimal gauge among all possible choices Mixing term ๐๐ ๐๐3 cost
16
[An, L., arXiv:1804.02095 ][Jia, An, Wang, L., J. Chem. Theory Comput., 2018]
How PT dynamics worksโข Boring Hamiltonian: ๐ป๐ป ๐ก๐ก โก ๐ป๐ป0, ๐ป๐ป0ฮจ0 = ฮจ0ฮ0, ฮฆ0 = ฮจ0โข ๐ป๐ปฮฆโฮฆ ฮฆโ๐ป๐ปฮฆ = 0 โ ๐๐๐๐๐ก๐กฮฆ = 0
Same behavior as density matrix
โข 1D time-dependent Schrรถdinger:
17
Implicit time integratorโข PT dynamics can be discretized with any integrator
โข Smooth dynamics โ Larger time
โข Implicit integrator: Allow ๐ป๐ปฮ๐ก๐ก โซ 1
18
Implicit time integratorโข Example: PT-CN (Crank-Nicolson scheme, a.k.a.
trapezoidal rule)
โข ฮฆ๐๐+1 needs to be solved self-consistently. Preconditioner. Mixing.
โข Natural to combine with Ehrenfest dynamics
โข Many others are possible
19
Analysisโข Near-adiabatic regime, singularly perturbed (๐๐ โช 1)
๐๐๐๐๐๐๐ก๐ก๐๐ = ๐ป๐ป(๐ก๐ก)๐๐๐๐๐๐๐๐๐ก๐ก๐๐ = ๐ป๐ป ๐ก๐ก ๐๐ โ ๐๐(๐๐โ๐ป๐ป ๐ก๐ก ๐๐)
Theorem [An-L., 2018] symplectic integrator of order ๐๐, up to ๐๐ โผ ๐๐(1),
error โผ ๏ฟฝฮ๐ก๐ก๐๐
๐๐๐๐+1, Schrรถdinger
ฮ๐ก๐ก๐๐
๐๐๐๐, parallel transport
20
[An, L., arXiv:1804.02095 ]
Proof sketchKey: Refined adiabatic theorem
Lemma There exists decomposition
๐๐ ๐ก๐ก = ๐๐๐ด๐ด ๐ก๐ก + ๐๐๐๐๐ ๐ (๐ก๐ก)
up to ๐๐ โผ ๐๐(1). ๐๐๐ด๐ด ๐ก๐ก is eigenstate of ๐ป๐ป(๐ก๐ก) and is ๐๐-independent (including phase factor). ๐๐๐ ๐ (๐ก๐ก) 2 is bounded ๐๐-independent.
21
Implementation in electronic structure software packages
โข DGDFT / PWDFT
โข Octopus (most widely used package for TDDFT, in progress)
22
Absorption spectrumโข Anthracene (C14H10)โข Compare RT-TDDFT (PWDFT) and linear response
TDDFT (Quantum ESPRESSO). โข PT-CN 8 times faster than S-RK4
23
[Jia, An, Wang, L., J. Chem. Theory Comput., 2018]
Ultrafast dynamicsโข Benzeneโข Slow laser (800 nm wavelength)โข PT-CN 32 times faster than S-RK4
24
[Jia, An, Wang, L., J. Chem. Theory Comput., 2018]
Ultrafast dynamicsโข Fast laser (250 nm wavelength)โข PT-CN 10 times faster than S-RK4
25
[Jia, An, Wang, L., J. Chem. Theory Comput., 2018]
Efficiencyโข Silicon from 32 to 1000 atoms, ~10 times fasterโข Parallel implementation up to 2000 cores.
26
Ion collisionโข Clโ ion through graphene nanoflake (113 atoms)
27
Ion collision
28
S-RK4 ฮ๐ก๐ก: 0.5 as, 78 hours, 228 coresPT-CN ฮ๐ก๐ก: 50 as, 5.2 hours, 228 cores
Hybrid functional RT-TDDFTโข For the first time, practical hybrid functional RT-TDDFT
calculations with a large basis set
29
[Jia, L., Comput. Phys. Commun. 2019]
GPU acceleration on Summitโข No. 1 supercomputer in the Top500 list in November, 2018
โข Silicon with 1536 atoms
โข Scale to 768 GPUs
โข 34 x times faster than CPU with 3072 cores
โข 1.5 hr per femtosecond (fs)
30
[Jia, Wang, L., submitted]
Conclusionโข Parallel transport dynamics: optimal gauge choice
โข Cost โผ Schrรถdinger dynamics (wavefunction)Time step โผ von Neumann dynamics (density matrix)
โข Implicit time integrators demonstrated to be effective for TDDFT for the first time (c.f. Pueyo et al JCTC 2018). Significantly improved efficiency.
31
32
Electron localization
Selected columns of density matrix
Localization problemโข ฮจ: Kohn-Sham orbitals
dense, unitary matrix of size ๐๐๐๐ ร ๐๐๐๐ ๐๐๐๐ โซ ๐๐๐๐
โข ๐๐-sparse representationฮฆโฮจU ๐น๐น โค ๐๐
Each column of ฮฆ is sparse. ๐๐ is an ๐๐๐๐ ร ๐๐๐๐ unitary matrix.
33
ฮฆฮจ
Wannier functionsโข Maximally localized Wannier function (MLWF) [Marzari-
Vanderbilt, Phys. Rev. B 1997].
โข Reason for the existence of MLWF for insulating systems [Kohn, PR 1959] [Nenciu, CMP 1983] [Panati, AHP 2007], [Brouder et al, PRL 2007] [Benzi-Boito-Razouk, SIAM Rev. 2013] etc
34
Silicon Graphene
Application of Wannier functionsโข Analysis of chemical bondingโข Band-structure interpolationโข Basis functions for DFT calculations (representing
occupied orbitals ๐๐๐๐)โข Basis functions for excited state calculations (representing
Hadamard product of orbitals ๐๐๐๐ โ ๐๐๐๐)โข Strongly correlated systems (DFT+U)โข Phonon calculationsโข etc
[Marzari et al. Rev. Mod. Phys. 2012]
35
Maximally localized Wannier functionsโข Geometric intuition: Minimization of โspreadโ or second
moment.
minฮฆ=ฮจ๐๐,๐๐โ๐๐=๐ผ๐ผ
ฮฉ ฮฆ
ฮฉ ฮฆ = ๏ฟฝ๐๐=1
๐๐๐๐
๏ฟฝ ๐๐๐๐ ๐ฅ๐ฅ2๐ฅ๐ฅ2 ๐๐๐ฅ๐ฅ โ ๏ฟฝ ๐๐๐๐ ๐ฅ๐ฅ
2๐ฅ๐ฅ ๐๐๐ฅ๐ฅ2
โข ๐๐: gauge degrees of freedom
36
Maximally localized Wannier functionsRobustness
โข Initialization: Nonlinear optimization and possible to get stuck at local minima.
โข Entangled band: Localization in the absence of band gap.
Both need to be addressed for high throughput computation.
37
Density matrix perspectiveฮจ is unitary, then
๐๐ = ฮจฮจโ
is a projection operator, and is gauge invariant.
๐๐ = ฮจฮจโ = ฮฆ(๐๐โ๐๐)ฮฆโ = ฮฆฮฆโ
is close to a sparse matrix.
โข Can one construct sparse representation directly from the density matrix?
38
Algorithm: Selected columns of the density matrix (SCDM)Code (MATLAB. Psi: matrix of size ๐๐๐๐ ร ๐๐๐๐)
[U,R,perm] = qr(Psi', 0);
Phi = Psi * U;
โข Very easy to code and to parallelize!โข Deterministic, no initial guess.โข perm encodes selected columns of the density matrix
39
Pivoted QR
GEMM
[Damle-L.-Ying, JCTC, 2015]
k-pointโข Strategy: find columns using one โanchorโ k-point (such
as ฮ), and then apply to all k-points
40
SCDM: A flexible frameworkโข Molecular or periodic systems
โข Isolated or entangled bands
โข Variational formulation
[Damle-L., MMS 2018][Damle-L.-Levitt, MMS 2019]
41
SCDM in software packagesโข MATLAB/Julia code
https://github.com/asdamle/SCDMhttps://github.com/antoine-levitt/wannier
โข Quantum ESPRESSO [I. Carnimeo, S. Baroni, P. Giannozzi, arXiv: 1801.09263]
โข Wannier90 [V. Vitale et al] (starting from v3.0)http://www.wannier.org/
42
Entangled bands
43
Entangled case 1 (metal, valence + conduction):
Entangled case 2 (near Fermi energy):
Examples of SCDM orbitals (ฮ-point)
44
Silicon Water
Examples of SCDM orbitals (k-point)
45
Cr2O3. k-point grid 6 ร 6 ร 6Initial spread from SCDM: 17.22 ร 2
MLWF converged spread: 16.98 ร 2
Example: band interpolation
46
Si Cu
Band structure interpolation: Al
47
SCDM spread: 18.38 ร 2
10x10x10 k-points, 6 bands โ 4 bands (no disentanglement)
Wannier: optimized spread: 12.42 ร 2
Smaller spread Better interpolation
Implementation in Wannier 90
Credit: Dr. Valerio Vitale (APS March meeting 2019)
48
Implementation in Wannier 90
Credit: Dr. Valerio Vitale (APS March meeting 2019)
49
Conclusionโข Extract information from the gauge-invariant density
matrix
โข Selected columns of the density matrix (SCDM) is a simple, robust and deterministic method to find localized orbitals. Alternative method to the Maximally Localized Wannier functions.
โข Wannier localization can be robustly initialized with SCDM (already in Wannier90). High-throughput materials simulation
50
51
Kohn-Sham density functional theory
Projected commutator DIIS
Kohn-Sham density functional theory
52
๐ป๐ป ๐๐ ๐๐๐๐ ๐ฅ๐ฅ = โ12ฮ + ๐๐๐๐๐๐๐ก๐ก ๐ฅ๐ฅ + ๐๐๐ป๐ป๐๐๐ป๐ป ๐๐ (๐ฅ๐ฅ) ๐๐๐๐ ๐ฅ๐ฅ = ๐๐๐๐๐๐๐๐ ๐ฅ๐ฅ
๐๐ ๐ฅ๐ฅ, ๐ฅ๐ฅโฒ, ๐ก๐ก = ๏ฟฝ๐๐=1
๐๐๐๐
๐๐๐๐(๐ฅ๐ฅ, ๐ก๐ก)๐๐๐๐โ(๐ฅ๐ฅโฒ, ๐ก๐ก) ,
๏ฟฝ๐๐๐ฅ๐ฅ ๐๐๐๐โ ๐ฅ๐ฅ ๐๐๐๐ ๐ฅ๐ฅ = ๐ฟ๐ฟ๐๐๐๐ , ๐๐1 โค ๐๐2 โค โฏ โค ๐๐๐๐๐๐
โข In principle exact many body ground state energy.
โข Most widely used electronic structure theory for condensed matter systems and molecules
โข [Hohenberg-Kohn,1964], [Kohn-Sham, 1965], Nobel Prize in Chemistry, 1998
Self consistent field / Fixed point problem
๐ฅ๐ฅ = ๐๐(๐ฅ๐ฅ)
Kohn-Sham density functional theory
๐๐ = โฑ[๐๐]
More precisely
๐ป๐ป P ๐๐๐๐ = ๐๐๐๐๐๐๐๐ , ๐๐ = ๏ฟฝ๐๐
๐๐๐๐๐๐๐๐โ
53
SCF: Fixed point problemโข Fixed point iteration ๐ฅ๐ฅ๐๐+1 = ๐๐(๐ฅ๐ฅ๐๐). Diverges quickly.
โข Simple mixing. ๐ฅ๐ฅ๐๐+1 = ๐ผ๐ผ๐๐ ๐ฅ๐ฅ๐๐ + 1 โ ๐ผ๐ผ ๐ฅ๐ฅ๐๐. Often works but can be very slow
โข Broyden-type methods: method of choice for practical calculations.
โข Direct Inversion of the iterative subspace (DIIS) [Pulay1980; 1982] (another equivalent version is Anderson acceleration [Anderson 1965])
54
SCF: Quantum chemistโs viewpointโข commutator-DIIS (c-DIIS). Most widely used method in
quantum chemistry software packages
โข Residual๐ ๐ ๐๐ = ๐ป๐ป ๐๐ ,๐๐ = ๐ป๐ป ๐๐ ๐๐ โ ๐๐๐ป๐ป[๐๐]
โข Necessary condition for SCF convergence
๐ ๐ ๐๐ = 0
55
[Pulay CPL 1982; Kudin-Scuseria-Cances JCP 2002; Kudin-Scuseria M2AN 2007][T. Rohwedder, R. Schneider, JMC 2011]
C-DIISโข Extrapolate the Fock matrix
โข Obtain coefficients by minimizing the residual
โข Equivalent to a version of Broydenโs method
56
Convergence of C-DIIS
57
Comparison of SCF convergence patterns for (CH3)2CHOH with the Huckelguess at the RHF/6-31G(d) level of theory. [Kudin-Scuseria M2AN 2007]
C-DIIS is not suitable for large basis setโข Large basis: planewaves, finite elements, wavelets etc
๐๐ = ฮจฮจโ โ โ๐๐๐๐ร๐๐๐๐ , ฮจ โ โ๐๐๐๐ร๐๐๐๐
โข ๐๐๐๐ โผ 1000๐๐๐๐. Cannot store ๐ป๐ป,๐ ๐ ,๐๐
58
Projected commutator DIISโข Project to a set of gauge fixing orbitals ฮฆ๐๐๐๐๐๐ โ โ๐๐๐๐ร๐๐๐๐
with full column rankฮฆ = ๐๐ฮฆ๐๐๐๐๐๐
โข Use ฮฆ as the mixing variable (gauge-invariant)
59
[Hu-L.-Yang, JCTC, 2017 ]
Projectionโข ฮฆโบ ๐๐, if ฮฆ has full column rank
๐๐ = ฮฆ ฮฆโฮฆ โ1ฮฆโ
โข Use projected commutator๐ ๐ ฮฆ = ๐ป๐ป ๐๐ ๐๐ฮฆ๐๐๐๐๐๐ โ ๐๐๐ป๐ป ๐๐ ฮฆ๐๐๐๐๐๐
โข Enables C-DIIS for large basis set for the first time
60
[Hu-L.-Yang, JCTC, 2017 ]
PC-DIISโข Minimize
minโ๐๐ ๐ผ๐ผ๐๐=1
๏ฟฝ๐๐=๐๐โ๐๐
๐๐
๐ผ๐ผ๐๐๐ ๐ ฮฆ๐๐
๐น๐น
โข Update
๏ฟฝฮฆ = ๏ฟฝ๐๐=๐๐โ๐๐
๐๐
๐ผ๐ผ๐๐ฮฆ๐๐
โข Compute
ฮฆ๐๐+1 = ๏ฟฝฮฆ ๏ฟฝฮฆโ๏ฟฝฮฆ โ1 ๏ฟฝฮฆโฮฆ๐๐๐๐๐๐
61
PC-DIISProjected commutator
๐ ๐ ฮฆ = ๐ป๐ป ๐๐ ๐๐ฮฆ๐๐๐๐๐๐ โ ๐๐๐ป๐ป ๐๐ ฮฆ๐๐๐๐๐๐
โข If ฮฆ๐๐๐๐๐๐ = ฮจ๐ ๐ ฮฆ = ๐ป๐ป ๐๐ ฮจ โฮจฮ
โ Residual of eigenvalue problem.
โข All operations are limited to matrices of size ๐๐๐๐ ร ๐๐๐๐
62
Accuracy
63
Efficiency
64
Si 1000 atom on 2000 cores
Ab initio molecular dynamicsโข Wavefunction extrapolation with fixed gauge
65
Conclusionโข Projected C-DIIS enables C-DIIS for large basis set by a
gauge fixing procedure.
โข Recent usage in quantum embedding theory (joint work with Garnet Chan)
โข Remark: Convergence in the nonlinear setup for DIIS is very much missing
66
Acknowledgementโข Dong An (Berkeley)โข Anil Damle (NSF postdoc, now Assistant Professor at Cornell)โข Wei Hu (LBNL, now Professor at USTC)โข Weile Jia (Berkeley)
โข Lin-Wang Wang (LBNL)โข Chao Yang (LBNL)โข Lexing Ying (Stanford)
DOE Base Math, CAMERA, SciDAC, Early Career NSF CAREER. NERSC and BRC for Computing Resources
67
Thank you for your attention!
top related