Learning a restricted Boltzmann machine using biased Monte Carlo sampling

Restricted Boltzmann Machines are simple and powerful generative models that can encode any complex dataset. Despite all their advantages, in practice the trainings are often unstable and it is difficult to assess their quality because the dynamics are affected by extremely slow time dependencies. T...

Full description

Bibliographic Details
Main Author:	Nicolas Béreux, Aurélien Decelle, Cyril Furtlehner, Beatriz Seoane
Format:	Article
Language:	English
Published:	SciPost 2023-03-01
Series:	SciPost Physics
Online Access:	https://scipost.org/SciPostPhys.14.3.032

_version_	1797871058299125760
author	Nicolas Béreux, Aurélien Decelle, Cyril Furtlehner, Beatriz Seoane
author_facet	Nicolas Béreux, Aurélien Decelle, Cyril Furtlehner, Beatriz Seoane
author_sort	Nicolas Béreux, Aurélien Decelle, Cyril Furtlehner, Beatriz Seoane
collection	DOAJ
description	Restricted Boltzmann Machines are simple and powerful generative models that can encode any complex dataset. Despite all their advantages, in practice the trainings are often unstable and it is difficult to assess their quality because the dynamics are affected by extremely slow time dependencies. This situation becomes critical when dealing with low-dimensional clustered datasets, where the time required to sample ergodically the trained models becomes computationally prohibitive. In this work, we show that this divergence of Monte Carlo mixing times is related to a phenomenon of phase coexistence, similar to that which occurs in physics near a first-order phase transition. We show that sampling the equilibrium distribution using the Markov chain Monte Carlo method can be dramatically accelerated when using biased sampling techniques, in particular the Tethered Monte Carlo (TMC) method. This sampling technique efficiently solves the problem of evaluating the quality of a given trained model and generating new samples in a reasonable amount of time. Moreover, we show that this sampling technique can also be used to improve the computation of the log-likelihood gradient during training, leading to dramatic improvements in training RBMs with artificial clustered datasets. On real low-dimensional datasets, this new training method fits RBM models with significantly faster relaxation dynamics than those obtained with standard PCD recipes. We also show that TMC sampling can be used to recover the free-energy profile of the RBM. This proves to be extremely useful to compute the probability distribution of a given model and to improve the generation of new decorrelated samples in slow PCD-trained models. The main limitations of this method are, first, the restriction to effective low-dimensional datasets and, second, the fact that the Tethered MC method breaks the possibility of performing parallel alternative Monte Carlo updates, which limits the size of the systems we can consider in practice.
first_indexed	2024-04-10T00:37:11Z
format	Article
id	doaj.art-31c12b96057d4153a622f5186039d1e8
institution	Directory Open Access Journal
issn	2542-4653
language	English
last_indexed	2024-04-10T00:37:11Z
publishDate	2023-03-01
publisher	SciPost
record_format	Article
series	SciPost Physics
spelling	doaj.art-31c12b96057d4153a622f5186039d1e82023-03-14T10:24:07ZengSciPostSciPost Physics2542-46532023-03-0114303210.21468/SciPostPhys.14.3.032Learning a restricted Boltzmann machine using biased Monte Carlo samplingNicolas Béreux, Aurélien Decelle, Cyril Furtlehner, Beatriz SeoaneRestricted Boltzmann Machines are simple and powerful generative models that can encode any complex dataset. Despite all their advantages, in practice the trainings are often unstable and it is difficult to assess their quality because the dynamics are affected by extremely slow time dependencies. This situation becomes critical when dealing with low-dimensional clustered datasets, where the time required to sample ergodically the trained models becomes computationally prohibitive. In this work, we show that this divergence of Monte Carlo mixing times is related to a phenomenon of phase coexistence, similar to that which occurs in physics near a first-order phase transition. We show that sampling the equilibrium distribution using the Markov chain Monte Carlo method can be dramatically accelerated when using biased sampling techniques, in particular the Tethered Monte Carlo (TMC) method. This sampling technique efficiently solves the problem of evaluating the quality of a given trained model and generating new samples in a reasonable amount of time. Moreover, we show that this sampling technique can also be used to improve the computation of the log-likelihood gradient during training, leading to dramatic improvements in training RBMs with artificial clustered datasets. On real low-dimensional datasets, this new training method fits RBM models with significantly faster relaxation dynamics than those obtained with standard PCD recipes. We also show that TMC sampling can be used to recover the free-energy profile of the RBM. This proves to be extremely useful to compute the probability distribution of a given model and to improve the generation of new decorrelated samples in slow PCD-trained models. The main limitations of this method are, first, the restriction to effective low-dimensional datasets and, second, the fact that the Tethered MC method breaks the possibility of performing parallel alternative Monte Carlo updates, which limits the size of the systems we can consider in practice.https://scipost.org/SciPostPhys.14.3.032
spellingShingle	Nicolas Béreux, Aurélien Decelle, Cyril Furtlehner, Beatriz Seoane Learning a restricted Boltzmann machine using biased Monte Carlo sampling SciPost Physics
title	Learning a restricted Boltzmann machine using biased Monte Carlo sampling
title_full	Learning a restricted Boltzmann machine using biased Monte Carlo sampling
title_fullStr	Learning a restricted Boltzmann machine using biased Monte Carlo sampling
title_full_unstemmed	Learning a restricted Boltzmann machine using biased Monte Carlo sampling
title_short	Learning a restricted Boltzmann machine using biased Monte Carlo sampling
title_sort	learning a restricted boltzmann machine using biased monte carlo sampling
url	https://scipost.org/SciPostPhys.14.3.032
work_keys_str_mv	AT nicolasbereuxaureliendecellecyrilfurtlehnerbeatrizseoane learningarestrictedboltzmannmachineusingbiasedmontecarlosampling

Learning a restricted Boltzmann machine using biased Monte Carlo sampling

Similar Items