Physics Letters B 803 (2020) 135281
Contents lists available at ScienceDirect
Physics Letters B
www.elsevier.com/locate/physletb
Deflated GMRES with multigrid for lattice QCD
Travis Whyte
a
, Walter Wilcox
a,∗
, Ronald B. Morgan
b
a
Department of Physics, Baylor University, Waco, TX 76798-7316, United States of America
b
Department of Mathematics, Baylor University, Waco, TX 76798-7316, United States of America
a r t i c l e i n f o a b s t r a c t
Article history:
Received 5 December 2019
Accepted 4 February 2020
Available online 12 February 2020
Editor: B. Grinstein
Keywords:
Lattice QCD
Matrix deflation
Multigrid
Lattice QCD solvers encounter critical slowing down for fine lattice spacings and small quark mass.
Traditional matrix eigenvalue deflation is one approach to mitigating this problem. However, to improve
scaling we study the effects of deflating on the coarse grid in a hierarchy of three grids for adaptive
multigrid applications of the two dimensional Schwinger model. We compare deflation at the fine
and coarse levels with other non deflated methods. We find the inclusion of a partial solve on the
intermediate grid allows for a low tolerance deflated solve on the coarse grid. We find very good scaling
in lattice size near critical mass when we deflate at the coarse level using the GMRES-DR and GMRES-Proj
algorithms.
© 2020 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license
(http://creativecommons.org/licenses/by/4.0/). Funded by SCOAP
3
.
1. Introduction
The problem of critical slowing down and strong scaling is one
of the foremost problems facing modern Lattice Quantum Chro-
modynamics
(LQCD) simulations. Current simulations require ex-
tremely
fine lattice spacings, necessitating the need for larger lat-
tice
volumes. This in turn creates larger Dirac operators needed for
linear systems calculations such as stochastic trace estimators and
fermionic forces in Hybrid Monte Carlo. Moreover, as the fermion
mass approaches physically relevant values, the Dirac operator be-
comes
extremely ill conditioned. This ill conditioning leads to ex-
ceptional
eigenvalues, which drastically slow convergence of linear
equations. Adaptive multigrid (MG) [1]is one method that deals
with both the strong scaling and critical slowing down at the
same time, and has been used successfully for the Wilson, overlap
and staggered fermion discretizations [2–4]. Adaptive MG creates a
hierarchy of coarsened operators from the original fine Dirac oper-
ator
by exploiting its near null kernel. This shifts critical slowing
down to the coarsest level, where the components of the error at-
tributed
to the exceptional eigenvalues can be more easily dealt
with. However, the cost of the coarse grid solve can be very large
when cast in terms of fine grid equivalence.
Deflation has long been used as a method of dealing with ex-
ceptional
eigenvalues in many fields, but is not yet heavily used
in modern LQCD simulations, partly because of eigenvector stor-
age
costs for large systems. Adaptive MG allows for deflation to
*
Corresponding author.
E-mail addresses: travis_whyte@baylor.edu (T. Whyte), walter_wilcox@baylor.edu
(W. Wilcox),
ronald_morgan@baylor.edu (R.B. Morgan).
be employed on the coarsest level, where storage requirements of
deflation are much smaller [5–7]. The preferred method of MG in
LQCD is to use it as a preconditioner for an outer Krylov solver [8].
Because every iteration of the outer Krylov solver represents a new
right hand side for the MG preconditioner, deflation with projec-
tion
methods [9,10]can be efficiently employed on the coarsest
level. We demonstrate the effect that deflation on the coarsest
level has by comparing to MG without coarse grid deflation, and
the effect that this deflation has for multiple right hand sides. We
observe that a partial solve on the intermediate grid in conjunction
with deflation and projection methods on the coarse grid allows
for a partial coarse grid solve. This partial solve on the interme-
diate
grid reduces the number of outer iterations for convergence,
and we observe no sign of critical slowing down resurgence on
the higher grid levels with the use of a deflated partial coarse grid
solve.
2. Methods
We work with the Wilson-Dirac operator in the two-dimensional
lattice Schwinger Model [11], which shares many physical charac-
teristics
with 4D LQCD, and as such is a good algorithmic test-
ing
ground. We created 10 gauge configurations within QCDLAB
1.0 [12]for lattices of size 64
2
, 128
2
and 256
2
at β = 6.0. All
values are averaged over separate solves for each configuration.
The method of coarsening follows that of reference [4]. A hier-
archy
of three grids was created by solving the residual system
DD
†
e =−DD
†
x, where x is a random vector, for 12 near null vec-
tors
on the fine grids. This system was solved to a tolerance of
10
−4
, and the near null vectors were constructed using ψ = x + e.
https://doi.org/10.1016/j.physletb.2020.135281
0370-2693/© 2020 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). Funded by
SCOAP
3
.