Machine Learning and Variational Algorithms for Lattice Field Theory
Discretizing fields on a spacetime lattice is the only known general and non-perturbative regulator for quantum field theory. The lattice formulation has, for example, played an important role in predicting properties of QCD in the strongly coupled regime, where perturbative methods break down. To r...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2022
|
Online Access: | https://hdl.handle.net/1721.1/142680 https://orcid.org/0000-0002-4340-4983 |
_version_ | 1826205542471696384 |
---|---|
author | Kanwar, Gurtej |
author2 | Detmold, William |
author_facet | Detmold, William Kanwar, Gurtej |
author_sort | Kanwar, Gurtej |
collection | MIT |
description | Discretizing fields on a spacetime lattice is the only known general and non-perturbative regulator for quantum field theory. The lattice formulation has, for example, played an important role in predicting properties of QCD in the strongly coupled regime, where perturbative methods break down. To recover information about continuum physics, parameters defining the lattice theory must be tuned toward criticality. However, Markov chain Monte Carlo (MCMC) methods commonly used to evaluate the lattice-regularized path integral suffer from critical slowing down in this limit, restricting the precision of continuum extrapolations. Further difficulties arise when computing the energies and interactions of physical states by measuring correlation functions of operators widely separated in spacetime: for most correlation functions, an exponentially severe signal-to-noise problem is encountered as the operators are taken to be widely separated, limiting the precision of calculations.
This dissertation introduces two new techniques to address these issues. First, we define a novel MCMC algorithm based on generative flow-based models. Such models utilize machine learning methods to describe efficient approximate samplers for distributions of interest. Independently drawn flow-based samples are then used as proposals in an asymptotically exact Metropolis-Hastings Markov chain. We also construct models that flexibly parameterize families of distributions while capturing symmetries of interest, including translational and gauge symmetries. By variationally optimizing the distribution selected from these families, one can maximize the efficiency of flow-based MCMC. We secondly introduce an approach to 'deform' Monte Carlo estimators based on contour deformations applied to the domain of the path integral. The deformed estimators associated with an observable give equivalent unbiased measurements of that observable, but generically have different variances. We define families of deformed manifolds for lattice gauge theories and introduce methods to efficiently optimize the choice of manifold (the 'observifold') so that the variance of the associated deformed observable is minimized. Finally, we demonstrate that flow-based MCMC can mitigate critical slowing down and observifolds can exponentially reduce variance in proof-of-principle applications to scalar phi^4 theory and U(1) and SU(N) lattice gauge theories. |
first_indexed | 2024-09-23T13:15:06Z |
format | Thesis |
id | mit-1721.1/142680 |
institution | Massachusetts Institute of Technology |
last_indexed | 2024-09-23T13:15:06Z |
publishDate | 2022 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1426802022-05-25T03:20:55Z Machine Learning and Variational Algorithms for Lattice Field Theory Kanwar, Gurtej Detmold, William Massachusetts Institute of Technology. Department of Physics Discretizing fields on a spacetime lattice is the only known general and non-perturbative regulator for quantum field theory. The lattice formulation has, for example, played an important role in predicting properties of QCD in the strongly coupled regime, where perturbative methods break down. To recover information about continuum physics, parameters defining the lattice theory must be tuned toward criticality. However, Markov chain Monte Carlo (MCMC) methods commonly used to evaluate the lattice-regularized path integral suffer from critical slowing down in this limit, restricting the precision of continuum extrapolations. Further difficulties arise when computing the energies and interactions of physical states by measuring correlation functions of operators widely separated in spacetime: for most correlation functions, an exponentially severe signal-to-noise problem is encountered as the operators are taken to be widely separated, limiting the precision of calculations. This dissertation introduces two new techniques to address these issues. First, we define a novel MCMC algorithm based on generative flow-based models. Such models utilize machine learning methods to describe efficient approximate samplers for distributions of interest. Independently drawn flow-based samples are then used as proposals in an asymptotically exact Metropolis-Hastings Markov chain. We also construct models that flexibly parameterize families of distributions while capturing symmetries of interest, including translational and gauge symmetries. By variationally optimizing the distribution selected from these families, one can maximize the efficiency of flow-based MCMC. We secondly introduce an approach to 'deform' Monte Carlo estimators based on contour deformations applied to the domain of the path integral. The deformed estimators associated with an observable give equivalent unbiased measurements of that observable, but generically have different variances. We define families of deformed manifolds for lattice gauge theories and introduce methods to efficiently optimize the choice of manifold (the 'observifold') so that the variance of the associated deformed observable is minimized. Finally, we demonstrate that flow-based MCMC can mitigate critical slowing down and observifolds can exponentially reduce variance in proof-of-principle applications to scalar phi^4 theory and U(1) and SU(N) lattice gauge theories. Ph.D. 2022-05-24T19:18:26Z 2022-05-24T19:18:26Z 2021-06 2022-05-19T23:48:23.652Z Thesis https://hdl.handle.net/1721.1/142680 https://orcid.org/0000-0002-4340-4983 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology |
spellingShingle | Kanwar, Gurtej Machine Learning and Variational Algorithms for Lattice Field Theory |
title | Machine Learning and Variational Algorithms for Lattice Field Theory |
title_full | Machine Learning and Variational Algorithms for Lattice Field Theory |
title_fullStr | Machine Learning and Variational Algorithms for Lattice Field Theory |
title_full_unstemmed | Machine Learning and Variational Algorithms for Lattice Field Theory |
title_short | Machine Learning and Variational Algorithms for Lattice Field Theory |
title_sort | machine learning and variational algorithms for lattice field theory |
url | https://hdl.handle.net/1721.1/142680 https://orcid.org/0000-0002-4340-4983 |
work_keys_str_mv | AT kanwargurtej machinelearningandvariationalalgorithmsforlatticefieldtheory |