26.11.2014 Views

Maximally localized Wannier functions: Theory and applications

Maximally localized Wannier functions: Theory and applications

Maximally localized Wannier functions: Theory and applications

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

11<br />

To proceed further, we write<br />

dΩ = 2 tr [X ′ dX + Y ′ dY + Z ′ dZ] , (40)<br />

exploiting the fact that tr [X ′ X D ] = 0, <strong>and</strong> similarly<br />

for Y <strong>and</strong> Z. We then consider an infinitesimal unitary<br />

transformation |w i ⟩ → |w i ⟩ + ∑ j dW ji|w j ⟩ (where<br />

dW is antihermitian), from which dX = [X, dW ], <strong>and</strong><br />

similarly for Y , Z. Inserting in Eq. (40) <strong>and</strong> using<br />

tr [A[B, C]] = tr [C[A, B]] <strong>and</strong> [X ′ , X] = [X ′ , X D ], we<br />

obtain dΩ = tr [dW G] where the gradient of the spread<br />

functional G = dΩ/dW is given by<br />

{<br />

}<br />

G = 2 [X ′ , X D ] + [Y ′ , Y D ] + [Z ′ , Z D ] dW . (41)<br />

The minimization can then be carried out using a procedure<br />

similar to that outlined above for periodic boundary<br />

conditions. Last, we note that minimizing ˜Ω is equivalent<br />

to maximizing tr [X 2 D + Y 2 D + Z2 D ], since tr [X′ X D ] =<br />

tr [Y ′ Y D ] = tr [Y ′ Y D ] = 0.<br />

2. Γ-point formulation for large supercells<br />

A similar formulation applies in reciprocal space when<br />

dealing with isolated or very large systems in periodic<br />

boundary conditions, i.e., whenever it becomes appropriate<br />

to sample the wave<strong>functions</strong> only at the Γ-point<br />

of the Brillouin zone. For simplicity, we start with the<br />

case of a simple cubic lattice of spacing L, <strong>and</strong> define the<br />

matrices X , Y, <strong>and</strong> Z as<br />

X mn = ⟨w m |e −2πix/L |w n ⟩ (42)<br />

<strong>and</strong> its periodic permutations (the extension to supercells<br />

of arbitrary symmetry has been derived by Silvestrelli<br />

(1999)). 6 The coordinate x n of the n-th WF center<br />

(WFC) can then be obtained from<br />

x n = − L 2π Im ln⟨w n|e −i 2π L x |w n ⟩ = − L 2π Im ln X nn ,<br />

(43)<br />

<strong>and</strong> similarly for y n <strong>and</strong> z n . Equation (43) has been<br />

shown by Resta (1998) to be the correct definition of<br />

the expectation value of the position operator for a system<br />

with periodic boundary conditions, <strong>and</strong> had been<br />

introduced several years ago to deal with the problem<br />

of determining the average position of a single electronic<br />

orbital in a periodic supercell (Selloni et al., 1987) (the<br />

6 We point out that the definition of the X , Y, Z matrices for extended<br />

systems, now common in the literature, is different from<br />

the one used in the previous subsection for the X, Y, Z matrices<br />

for isolated system. We preserved these two notations for<br />

the sake of consistency with published work, at the cost of making<br />

less evident the close similarities that exist between the two<br />

minimization algorithms.<br />

above definition becomes self-evident in the limit where<br />

w n tends to a Dirac delta function).<br />

Following the derivation of the previous subsection,<br />

or of Silvestrelli et al. (1998), it can be shown that the<br />

maximum-localization criterion is equivalent to maximizing<br />

the functional<br />

Ξ =<br />

J∑ (<br />

|Xnn | 2 + |Y nn | 2 + |Z nn | 2) . (44)<br />

n=1<br />

The first term of the gradient dΞ/dA mn is given by<br />

[X nm (Xnn ∗ − Xmm) ∗ − Xmn(X ∗ mm − X nn )], <strong>and</strong> similarly<br />

for the second <strong>and</strong> third terms. Again, once the gradient<br />

is determined, minimization can be performed using<br />

a steepest-descent or conjugate-gradients algorithm; as<br />

always, the computational cost of the localization procedure<br />

is small, given that it involves only small matrices<br />

of dimension J × J, once the scalar products needed to<br />

construct the initial X (0) , Y (0) <strong>and</strong> Z (0) have been calculated,<br />

which takes an effort of order J 2 ∗ N basis . We<br />

note that in the limit of a single k point the distinction<br />

between Bloch orbitals <strong>and</strong> WFs becomes irrelevant,<br />

since no Fourier transform from k to R is involved in the<br />

transformation Eq. (10); rather, we want to find the optimal<br />

unitary matrix that rotates the ground-state selfconsistent<br />

orbitals into their maximally <strong>localized</strong> representation,<br />

making this problem exactly equivalent to the<br />

one of isolated systems. Interestingly, it should also be<br />

mentioned that the local minima alluded to in the previous<br />

subsection are typically not found when using Γ<br />

sampling in large supercells.<br />

Before concluding, we note that care should be taken<br />

when comparing the spreads of MLWFs calculated in supercells<br />

of different sizes. The <strong>Wannier</strong> centers <strong>and</strong> the<br />

general shape of the MLWFs often converge rapidly as<br />

the cell size is increased, but even for the ideal case of an<br />

isolated molecule, the total spread Ω often displays much<br />

slower convergence. This behavior derives from the finitedifference<br />

representation of the invariant part Ω I of the<br />

localization functional (essentially, a second derivative);<br />

while Ω I does not enter into the localization procedure, it<br />

does contribute to the spread, <strong>and</strong> in fact usually represents<br />

the largest term. This slow convergence was noted<br />

by Marzari <strong>and</strong> V<strong>and</strong>erbilt (1997) when commenting on<br />

the convergence properties of Ω with respect to the spacing<br />

of the Monkhorst-Pack mesh, <strong>and</strong> has been studied in<br />

detail by others (Stengel <strong>and</strong> Spaldin, 2006a; Umari <strong>and</strong><br />

Pasquarello, 2003). For isolated systems in a supercell,<br />

this problem can be avoided simply by using a very large<br />

L in Eq. (43), since in practice the integrals only need to<br />

be calculated in the small region where the orbitals are<br />

non-zero (Wu, 2004). For extended bulk systems, this<br />

convergence problem can be ameliorated significantly by<br />

calculating the position operator using real-space integrals<br />

(Lee, 2006; Lee et al., 2005; Stengel <strong>and</strong> Spaldin,<br />

2006a).

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!