Maximally localized Wannier functions: Theory and applications
Maximally localized Wannier functions: Theory and applications
Maximally localized Wannier functions: Theory and applications
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
11<br />
To proceed further, we write<br />
dΩ = 2 tr [X ′ dX + Y ′ dY + Z ′ dZ] , (40)<br />
exploiting the fact that tr [X ′ X D ] = 0, <strong>and</strong> similarly<br />
for Y <strong>and</strong> Z. We then consider an infinitesimal unitary<br />
transformation |w i ⟩ → |w i ⟩ + ∑ j dW ji|w j ⟩ (where<br />
dW is antihermitian), from which dX = [X, dW ], <strong>and</strong><br />
similarly for Y , Z. Inserting in Eq. (40) <strong>and</strong> using<br />
tr [A[B, C]] = tr [C[A, B]] <strong>and</strong> [X ′ , X] = [X ′ , X D ], we<br />
obtain dΩ = tr [dW G] where the gradient of the spread<br />
functional G = dΩ/dW is given by<br />
{<br />
}<br />
G = 2 [X ′ , X D ] + [Y ′ , Y D ] + [Z ′ , Z D ] dW . (41)<br />
The minimization can then be carried out using a procedure<br />
similar to that outlined above for periodic boundary<br />
conditions. Last, we note that minimizing ˜Ω is equivalent<br />
to maximizing tr [X 2 D + Y 2 D + Z2 D ], since tr [X′ X D ] =<br />
tr [Y ′ Y D ] = tr [Y ′ Y D ] = 0.<br />
2. Γ-point formulation for large supercells<br />
A similar formulation applies in reciprocal space when<br />
dealing with isolated or very large systems in periodic<br />
boundary conditions, i.e., whenever it becomes appropriate<br />
to sample the wave<strong>functions</strong> only at the Γ-point<br />
of the Brillouin zone. For simplicity, we start with the<br />
case of a simple cubic lattice of spacing L, <strong>and</strong> define the<br />
matrices X , Y, <strong>and</strong> Z as<br />
X mn = ⟨w m |e −2πix/L |w n ⟩ (42)<br />
<strong>and</strong> its periodic permutations (the extension to supercells<br />
of arbitrary symmetry has been derived by Silvestrelli<br />
(1999)). 6 The coordinate x n of the n-th WF center<br />
(WFC) can then be obtained from<br />
x n = − L 2π Im ln⟨w n|e −i 2π L x |w n ⟩ = − L 2π Im ln X nn ,<br />
(43)<br />
<strong>and</strong> similarly for y n <strong>and</strong> z n . Equation (43) has been<br />
shown by Resta (1998) to be the correct definition of<br />
the expectation value of the position operator for a system<br />
with periodic boundary conditions, <strong>and</strong> had been<br />
introduced several years ago to deal with the problem<br />
of determining the average position of a single electronic<br />
orbital in a periodic supercell (Selloni et al., 1987) (the<br />
6 We point out that the definition of the X , Y, Z matrices for extended<br />
systems, now common in the literature, is different from<br />
the one used in the previous subsection for the X, Y, Z matrices<br />
for isolated system. We preserved these two notations for<br />
the sake of consistency with published work, at the cost of making<br />
less evident the close similarities that exist between the two<br />
minimization algorithms.<br />
above definition becomes self-evident in the limit where<br />
w n tends to a Dirac delta function).<br />
Following the derivation of the previous subsection,<br />
or of Silvestrelli et al. (1998), it can be shown that the<br />
maximum-localization criterion is equivalent to maximizing<br />
the functional<br />
Ξ =<br />
J∑ (<br />
|Xnn | 2 + |Y nn | 2 + |Z nn | 2) . (44)<br />
n=1<br />
The first term of the gradient dΞ/dA mn is given by<br />
[X nm (Xnn ∗ − Xmm) ∗ − Xmn(X ∗ mm − X nn )], <strong>and</strong> similarly<br />
for the second <strong>and</strong> third terms. Again, once the gradient<br />
is determined, minimization can be performed using<br />
a steepest-descent or conjugate-gradients algorithm; as<br />
always, the computational cost of the localization procedure<br />
is small, given that it involves only small matrices<br />
of dimension J × J, once the scalar products needed to<br />
construct the initial X (0) , Y (0) <strong>and</strong> Z (0) have been calculated,<br />
which takes an effort of order J 2 ∗ N basis . We<br />
note that in the limit of a single k point the distinction<br />
between Bloch orbitals <strong>and</strong> WFs becomes irrelevant,<br />
since no Fourier transform from k to R is involved in the<br />
transformation Eq. (10); rather, we want to find the optimal<br />
unitary matrix that rotates the ground-state selfconsistent<br />
orbitals into their maximally <strong>localized</strong> representation,<br />
making this problem exactly equivalent to the<br />
one of isolated systems. Interestingly, it should also be<br />
mentioned that the local minima alluded to in the previous<br />
subsection are typically not found when using Γ<br />
sampling in large supercells.<br />
Before concluding, we note that care should be taken<br />
when comparing the spreads of MLWFs calculated in supercells<br />
of different sizes. The <strong>Wannier</strong> centers <strong>and</strong> the<br />
general shape of the MLWFs often converge rapidly as<br />
the cell size is increased, but even for the ideal case of an<br />
isolated molecule, the total spread Ω often displays much<br />
slower convergence. This behavior derives from the finitedifference<br />
representation of the invariant part Ω I of the<br />
localization functional (essentially, a second derivative);<br />
while Ω I does not enter into the localization procedure, it<br />
does contribute to the spread, <strong>and</strong> in fact usually represents<br />
the largest term. This slow convergence was noted<br />
by Marzari <strong>and</strong> V<strong>and</strong>erbilt (1997) when commenting on<br />
the convergence properties of Ω with respect to the spacing<br />
of the Monkhorst-Pack mesh, <strong>and</strong> has been studied in<br />
detail by others (Stengel <strong>and</strong> Spaldin, 2006a; Umari <strong>and</strong><br />
Pasquarello, 2003). For isolated systems in a supercell,<br />
this problem can be avoided simply by using a very large<br />
L in Eq. (43), since in practice the integrals only need to<br />
be calculated in the small region where the orbitals are<br />
non-zero (Wu, 2004). For extended bulk systems, this<br />
convergence problem can be ameliorated significantly by<br />
calculating the position operator using real-space integrals<br />
(Lee, 2006; Lee et al., 2005; Stengel <strong>and</strong> Spaldin,<br />
2006a).