Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters

Bayesian optimisation has gained great popularity as a tool for optimising the parameters of machine learning algorithms and models. Somewhat ironically, setting up the hyper-parameters of Bayesian optimisation methods is notoriously hard. While reasonable practical solutions have been advanced, the...

Full description

Bibliographic Details
Main Authors: Wang, Z, de Freitas, N
Format: Report
Published: University of Oxford 2014
_version_ 1797061038283161600
author Wang, Z
de Freitas, N
author_facet Wang, Z
de Freitas, N
author_sort Wang, Z
collection OXFORD
description Bayesian optimisation has gained great popularity as a tool for optimising the parameters of machine learning algorithms and models. Somewhat ironically, setting up the hyper-parameters of Bayesian optimisation methods is notoriously hard. While reasonable practical solutions have been advanced, they can often fail to find the best optima. Surprisingly, there is little theoretical analysis of this crucial problem in the literature. To address this, we derive a cumulative regret bound for Bayesian optimisation with Gaussian processes and unknown kernel hyper-parameters in the stochastic setting. The bound, which applies to the expected improvement acquisition function and sub-Gaussian observation noise, provides us with guidelines on how to design hyper-parameter estimation methods. A simple simulation demonstrates the importance of following these guidelines.
first_indexed 2024-03-06T20:25:23Z
format Report
id oxford-uuid:2f35eb0d-13dd-47ae-8bc1-6d0b64f97e91
institution University of Oxford
last_indexed 2024-03-06T20:25:23Z
publishDate 2014
publisher University of Oxford
record_format dspace
spelling oxford-uuid:2f35eb0d-13dd-47ae-8bc1-6d0b64f97e912022-03-26T12:54:02ZTheoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−ParametersReporthttp://purl.org/coar/resource_type/c_93fcuuid:2f35eb0d-13dd-47ae-8bc1-6d0b64f97e91Department of Computer ScienceUniversity of Oxford2014Wang, Zde Freitas, NBayesian optimisation has gained great popularity as a tool for optimising the parameters of machine learning algorithms and models. Somewhat ironically, setting up the hyper-parameters of Bayesian optimisation methods is notoriously hard. While reasonable practical solutions have been advanced, they can often fail to find the best optima. Surprisingly, there is little theoretical analysis of this crucial problem in the literature. To address this, we derive a cumulative regret bound for Bayesian optimisation with Gaussian processes and unknown kernel hyper-parameters in the stochastic setting. The bound, which applies to the expected improvement acquisition function and sub-Gaussian observation noise, provides us with guidelines on how to design hyper-parameter estimation methods. A simple simulation demonstrates the importance of following these guidelines.
spellingShingle Wang, Z
de Freitas, N
Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_full Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_fullStr Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_full_unstemmed Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_short Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper−Parameters
title_sort theoretical analysis of bayesian optimisation with unknown gaussian process hyper parameters
work_keys_str_mv AT wangz theoreticalanalysisofbayesianoptimisationwithunknowngaussianprocesshyperparameters
AT defreitasn theoreticalanalysisofbayesianoptimisationwithunknowngaussianprocesshyperparameters