Machine Learning and Variational Algorithms for Lattice Field Theory

Discretizing fields on a spacetime lattice is the only known general and non-perturbative regulator for quantum field theory. The lattice formulation has, for example, played an important role in predicting properties of QCD in the strongly coupled regime, where perturbative methods break down. To r...


Bibliographic Details
Main Author:	Kanwar, Gurtej
Other Authors:	Detmold, William
Format:	Thesis
Published:	Massachusetts Institute of Technology 2022
Online Access:	https://hdl.handle.net/1721.1/142680 https://orcid.org/0000-0002-4340-4983

author	Kanwar, Gurtej
author2	Detmold, William
collection	MIT
description	Discretizing fields on a spacetime lattice is the only known general and non-perturbative regulator for quantum field theory. The lattice formulation has, for example, played an important role in predicting properties of QCD in the strongly coupled regime, where perturbative methods break down. To recover information about continuum physics, parameters defining the lattice theory must be tuned toward criticality. However, Markov chain Monte Carlo (MCMC) methods commonly used to evaluate the lattice-regularized path integral suffer from critical slowing down in this limit, restricting the precision of continuum extrapolations. Further difficulties arise when computing the energies and interactions of physical states by measuring correlation functions of operators widely separated in spacetime: for most correlation functions, an exponentially severe signal-to-noise problem is encountered as the operators are taken to be widely separated, limiting the precision of calculations. This dissertation introduces two new techniques to address these issues. First, we define a novel MCMC algorithm based on generative flow-based models. Such models utilize machine learning methods to describe efficient approximate samplers for distributions of interest. Independently drawn flow-based samples are then used as proposals in an asymptotically exact Metropolis-Hastings Markov chain. We also construct models that flexibly parameterize families of distributions while capturing symmetries of interest, including translational and gauge symmetries. By variationally optimizing the distribution selected from these families, one can maximize the efficiency of flow-based MCMC. Second, we introduce an approach to 'deform' Monte Carlo estimators based on contour deformations applied to the domain of the path integral. The deformed estimators associated with an observable give equivalent unbiased measurements of that observable, but generically have different variances. We define families of deformed manifolds for lattice gauge theories and introduce methods to efficiently optimize the choice of manifold (the 'observifold') so that the variance of the associated deformed observable is minimized. Finally, we demonstrate that flow-based MCMC can mitigate critical slowing down and observifolds can exponentially reduce variance in proof-of-principle applications to scalar phi^4 theory and U(1) and SU(N) lattice gauge theories.
first_indexed	2024-09-23T13:15:06Z
format	Thesis
id	mit-1721.1/142680
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T13:15:06Z
publishDate	2022
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/142680 2022-05-25T03:20:55Z Machine Learning and Variational Algorithms for Lattice Field Theory Kanwar, Gurtej Detmold, William Massachusetts Institute of Technology. Department of Physics Ph.D. 2022-05-24T19:18:26Z 2022-05-24T19:18:26Z 2021-06 2022-05-19T23:48:23.652Z Thesis https://hdl.handle.net/1721.1/142680 https://orcid.org/0000-0002-4340-4983 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
title	Machine Learning and Variational Algorithms for Lattice Field Theory
url	https://hdl.handle.net/1721.1/142680 https://orcid.org/0000-0002-4340-4983
work_keys_str_mv	AT kanwargurtej machinelearningandvariationalalgorithmsforlatticefieldtheory
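
The description above states that independently drawn flow-based samples are used as proposals in an asymptotically exact Metropolis-Hastings Markov chain. The following is a minimal sketch of that accept/reject step only, not the thesis's implementation. Everything named here is an assumption made for illustration: a fixed Gaussian stands in for a trained flow model, the target is a toy one-variable phi^4-like action, and the function names (`action`, `proposal_sample`, `proposal_log_prob`, `independence_metropolis`) are hypothetical.

```python
import math
import random


def action(x, lam=0.1):
    """Toy single-variable phi^4-like action S(x); target density is exp(-S)."""
    return 0.5 * x * x + lam * x ** 4


def proposal_sample(rng, sigma):
    """Draw an independent proposal (a Gaussian stands in for a flow model)."""
    return rng.gauss(0.0, sigma)


def proposal_log_prob(x, sigma):
    """Exact log-density of the Gaussian proposal at x."""
    return -0.5 * (x / sigma) ** 2 - math.log(sigma * math.sqrt(2.0 * math.pi))


def independence_metropolis(n_steps, sigma=0.8, seed=0):
    """Metropolis-Hastings chain whose proposals do not depend on the current state.

    Returns the list of chain states and the fraction of accepted proposals.
    """
    rng = random.Random(seed)
    x = proposal_sample(rng, sigma)
    # log of the importance weight p(x)/q(x), up to the unknown normalization of p
    log_w = -action(x) - proposal_log_prob(x, sigma)
    chain = []
    n_accept = 0
    for _ in range(n_steps):
        x_new = proposal_sample(rng, sigma)
        log_w_new = -action(x_new) - proposal_log_prob(x_new, sigma)
        # Accept with probability min(1, [p(x') q(x)] / [p(x) q(x')])
        accept_prob = math.exp(min(0.0, log_w_new - log_w))
        if rng.random() < accept_prob:
            x, log_w = x_new, log_w_new
            n_accept += 1
        # On rejection the previous state is repeated in the chain
        chain.append(x)
    return chain, n_accept / n_steps


if __name__ == "__main__":
    samples, acceptance = independence_metropolis(20000)
    mean_x2 = sum(s * s for s in samples) / len(samples)
    print(f"acceptance rate: {acceptance:.3f}, <x^2>: {mean_x2:.3f}")
```

In this sketch the acceptance rate measures how well the proposal matches the target, which is the quantity the description's variational optimization is said to improve. The accept/reject step keeps the chain asymptotically exact even when the proposal is only approximate.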


Similar Items