Machine Learning and Variational Algorithms for Lattice Field Theory

Bibliographic Details
Main Author: Kanwar, Gurtej
Other Authors: Detmold, William
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access: https://hdl.handle.net/1721.1/142680
https://orcid.org/0000-0002-4340-4983
collection MIT
description Discretizing fields on a spacetime lattice is the only known general and non-perturbative regulator for quantum field theory. The lattice formulation has, for example, played an important role in predicting properties of QCD in the strongly coupled regime, where perturbative methods break down. To recover information about continuum physics, parameters defining the lattice theory must be tuned toward criticality. However, Markov chain Monte Carlo (MCMC) methods commonly used to evaluate the lattice-regularized path integral suffer from critical slowing down in this limit, restricting the precision of continuum extrapolations. Further difficulties arise when computing the energies and interactions of physical states by measuring correlation functions of operators widely separated in spacetime: for most correlation functions, an exponentially severe signal-to-noise problem is encountered as the separation grows, limiting the precision of calculations. This dissertation introduces two new techniques to address these issues. First, we define a novel MCMC algorithm based on generative flow-based models. Such models utilize machine learning methods to describe efficient approximate samplers for distributions of interest. Independently drawn flow-based samples are then used as proposals in an asymptotically exact Metropolis-Hastings Markov chain. We also construct models that flexibly parameterize families of distributions while capturing symmetries of interest, including translational and gauge symmetries. By variationally optimizing the distribution selected from these families, one can maximize the efficiency of flow-based MCMC. Second, we introduce an approach to 'deform' Monte Carlo estimators based on contour deformations applied to the domain of the path integral. The deformed estimators associated with an observable give equivalent unbiased measurements of that observable, but generically have different variances.
We define families of deformed manifolds for lattice gauge theories and introduce methods to efficiently optimize the choice of manifold (the 'observifold') so that the variance of the associated deformed observable is minimized. Finally, we demonstrate that flow-based MCMC can mitigate critical slowing down and observifolds can exponentially reduce variance in proof-of-principle applications to scalar phi^4 theory and U(1) and SU(N) lattice gauge theories.
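The step in the abstract where independently drawn samples serve as proposals in a Metropolis-Hastings chain can be illustrated with a minimal sketch. This is not the thesis code: the one-dimensional toy action, its coefficients, and the function names are hypothetical, and a fixed Gaussian stands in for a trained flow-based model. The only requirement the sketch relies on is that the sampler returns each sample together with its log-density.

```python
import numpy as np

def log_target(x):
    # Unnormalized log-density exp(-S(x)) for a toy 1D action
    # S(x) = x^2/2 + 0.1 x^4 (hypothetical coefficients).
    return -(0.5 * x**2 + 0.1 * x**4)

def sample_proposal(rng, n, scale=1.0):
    # Stand-in for a trained flow (assumption): draws independent samples
    # and returns them with their exact log-density under the proposal.
    x = rng.normal(0.0, scale, size=n)
    log_q = -0.5 * (x / scale) ** 2 - np.log(scale * np.sqrt(2.0 * np.pi))
    return x, log_q

def independence_metropolis(n_steps, seed=0):
    rng = np.random.default_rng(seed)
    xs, log_qs = sample_proposal(rng, n_steps + 1)
    # Log importance weights log p(x) - log q(x), up to a constant.
    log_ws = log_target(xs) - log_qs
    chain = np.empty(n_steps)
    cur, cur_log_w = xs[0], log_ws[0]
    n_accept = 0
    for i in range(n_steps):
        prop, prop_log_w = xs[i + 1], log_ws[i + 1]
        # Accept with probability min(1, w(proposal) / w(current)).
        if np.log(rng.uniform()) < prop_log_w - cur_log_w:
            cur, cur_log_w = prop, prop_log_w
            n_accept += 1
        chain[i] = cur
    return chain, n_accept / n_steps

if __name__ == "__main__":
    chain, acceptance = independence_metropolis(20000)
    print("acceptance rate:", acceptance)
    print("estimate of <x^2>:", np.mean(chain**2))
```

Because each proposal is independent of the current state, the acceptance rate directly measures how closely the proposal distribution matches the target; a better-matched sampler accepts more often and yields a less correlated chain.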
first_indexed 2024-09-23T13:15:06Z
format Thesis
id mit-1721.1/142680
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T13:15:06Z
publishDate 2022
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/142680 2022-05-25T03:20:55Z Massachusetts Institute of Technology. Department of Physics Ph.D. 2022-05-24T19:18:26Z 2022-05-24T19:18:26Z 2021-06 2022-05-19T23:48:23.652Z Thesis https://hdl.handle.net/1721.1/142680 https://orcid.org/0000-0002-4340-4983 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf