Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters

Bayesian optimisation has gained great popularity as a tool for optimising the parameters of machine learning algorithms and models. Somewhat ironically, setting up the hyper-parameters of Bayesian optimisation methods is notoriously hard. While reasonable practical solutions have been advanced, the...

Full description

Bibliographic Details
Main Authors:	Wang, Z, de Freitas, N
Format:	Report
Published:	University of Oxford 2014

_version_	1797061038283161600
author	Wang, Z de Freitas, N
author_facet	Wang, Z de Freitas, N
author_sort	Wang, Z
collection	OXFORD
description	Bayesian optimisation has gained great popularity as a tool for optimising the parameters of machine learning algorithms and models. Somewhat ironically, setting up the hyper-parameters of Bayesian optimisation methods is notoriously hard. While reasonable practical solutions have been advanced, they can often fail to find the best optima. Surprisingly, there is little theoretical analysis of this crucial problem in the literature. To address this, we derive a cumulative regret bound for Bayesian optimisation with Gaussian processes and unknown kernel hyper-parameters in the stochastic setting. The bound, which applies to the expected improvement acquisition function and sub-Gaussian observation noise, provides us with guidelines on how to design hyper-parameter estimation methods. A simple simulation demonstrates the importance of following these guidelines.
first_indexed	2024-03-06T20:25:23Z
format	Report
id	oxford-uuid:2f35eb0d-13dd-47ae-8bc1-6d0b64f97e91
institution	University of Oxford
last_indexed	2024-03-06T20:25:23Z
publishDate	2014
publisher	University of Oxford
record_format	dspace
spelling	oxford-uuid:2f35eb0d-13dd-47ae-8bc1-6d0b64f97e912022-03-26T12:54:02ZTheoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−ParametersReporthttp://purl.org/coar/resource_type/c_93fcuuid:2f35eb0d-13dd-47ae-8bc1-6d0b64f97e91Department of Computer ScienceUniversity of Oxford2014Wang, Zde Freitas, NBayesian optimisation has gained great popularity as a tool for optimising the parameters of machine learning algorithms and models. Somewhat ironically, setting up the hyper-parameters of Bayesian optimisation methods is notoriously hard. While reasonable practical solutions have been advanced, they can often fail to find the best optima. Surprisingly, there is little theoretical analysis of this crucial problem in the literature. To address this, we derive a cumulative regret bound for Bayesian optimisation with Gaussian processes and unknown kernel hyper-parameters in the stochastic setting. The bound, which applies to the expected improvement acquisition function and sub-Gaussian observation noise, provides us with guidelines on how to design hyper-parameter estimation methods. A simple simulation demonstrates the importance of following these guidelines.
spellingShingle	Wang, Z de Freitas, N Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title	Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_full	Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_fullStr	Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_full_unstemmed	Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_short	Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_sort	theoretical analysis of bayesian optimisation with unknown gaussian process hyper parameters
work_keys_str_mv	AT wangz theoreticalanalysisofbayesianoptimisationwithunknowngaussianprocesshyperparameters AT defreitasn theoreticalanalysisofbayesianoptimisationwithunknowngaussianprocesshyperparameters

Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters

Similar Items