Output-weighted sampling for multi-armed bandits with extreme payoffs

Output-weighted sampling for multi-armed bandits with extreme payoffs

We present a new type of acquisition function for online decision-making in multi-armed and contextual bandit problems with extreme payoffs. Specifically, we model the payoff function as a Gaussian process and formulate a novel type of upper confidence bound acquisition function that guides explorat...

Full description

Bibliographic Details
Main Authors:	Yang, Yibo, Blanchard, Antoine, Sapsis, Themistoklis, Perdikaris, Paris
Format:	Article
Language:	English
Published:	The Royal Society 2024
Subjects:	General Physics and Astronomy General Engineering General Mathematics
Online Access:	https://hdl.handle.net/1721.1/154219

Similar Items

Optimal criteria and their asymptotic form for data selection in data-driven reduced-order modelling with Gaussian process regression
by: Sapsis, Themistoklis P., et al.
Published: (2024)

Bayesian optimization with output-weighted optimal sampling
by: Blanchard, Antoine, et al.
Published: (2022)

Output-Weighted Optimal Sampling for Bayesian Experimental Design and Uncertainty Quantification
by: Blanchard, Antoine, et al.
Published: (2022)

Representation theoretic interpretation and interpolation properties of inhomogeneous spin q-Whittaker polynomials
by: Korotkikh, Sergei
Published: (2024)

Bayesian optimization with output-weighted optimal sampling
by: Blanchard, Antoine Bertrand Emile, et al.
Published: (2022)

Weber’s Law of perception is a consequence of resolving the intensity of natural scintillating light and sound with the least possible error
by: Pednekar, Shourav, et al.
Published: (2024)

Seek change, not payoffs
by: Md. , Moniruzzaman
Published: (2010)

Nonlinear wave evolution with data-driven breaking
by: Eeltink, D., et al.
Published: (2024)

Quantum Algorithm for Petz Recovery Channels and Pretty Good Measurements
by: Gilyén, András, et al.
Published: (2024)

Observation of a Prethermal U(1) Discrete Time Crystal
by: Stasiuk, Andrew, et al.
Published: (2024)

When fizzy water levitates
by: Bourrianne, Philippe, et al.
Published: (2024)

Analog Quantum Variational Embedding Classifier
by: Yang, Rui, et al.
Published: (2024)

Geometric Event-Based Quantum Mechanics
by: Giovannetti, Vittorio, et al.
Published: (2024)

Precision spectroscopy and laser-cooling scheme of a radium-containing molecule
by: Udrescu, S. M., et al.
Published: (2024)

The hidden hierarchical nature of soft particulate gels
by: Bantawa, Minaspi, et al.
Published: (2024)

Precision spectroscopy of fast, hot, exotic isotopes using machine-learning-assisted event-by-event Doppler correction
by: Udrescu, S. M., et al.
Published: (2024)

Tuning the shear thickening of suspensions through surface roughness and physico-chemical interactions
by: Bourrianne, Philippe, et al.
Published: (2024)

Spectroscopy: the key to the stars : [electronic book] reading the lines in Stellar Spectra /
by: 378492 Robinson, Keith, et al.
Published: (2007)

Toast sliding off a table
by: Mahajan, Sanjoy
Published: (2023)

Quantification of ionic-liquid ion source beam composition from time-of-flight data
by: Jia-Richards, Oliver
Published: (2023)

On the bifurcation behavior of a folded notebook page
by: Zhang, Chenguang
Published: (2023)

Dimer states of Rydberg atoms on the Kagome lattice as resources for universal measurement-based quantum computation
by: Crépel, Valentin
Published: (2023)

Nonlinear Dynamics of Preheating after Multifield Inflation with Nonminimal Couplings
by: Kaiser, David I.
Published: (2020)

Approximate insightful ODE solutions
by: Mahajan, Sanjoy
Published: (2023)

Non-Markovian Collective Emission from Macroscopically Separated Emitters
by: Sinha, Kanupriya, et al.
Published: (2020)

Entropic effects in cell lineage tree packings
by: Imran Alsous, Jasmin, et al.
Published: (2019)

Search for Scalar Diphoton Resonances in the Mass Range 65–600 GeV with the ATLAS Detector inppCollision Data ats=8 TeV
by: Taylor, Frank E., et al.
Published: (2021)

Defect-Level Switching for Highly Nonlinear and Hysteretic Electronic Devices
by: Yin, Han, et al.
Published: (2022)

Multifidelity deep neural operators for efficient learning of partial differential equations with application to fast inverse design of nanoscale heat transport
by: Lu, Lu, et al.
Published: (2022)

Entanglement with negative Wigner function of three thousand atoms heralded by one photon
by: McConnell, Robert, et al.
Published: (2021)

Characterizing Temperature and Strain Variations with Qubit Ensembles for Their Robust Coherence Protection
by: Wang, Guoqing, et al.
Published: (2023)

Percolative Scale-Free Behavior in the Boiling Crisis
by: Zhang, Limiao, et al.
Published: (2020)

Perturbation Independent Decay of the Loschmidt Echo in a Many-Body System
by: Wei, K. X., et al.
Published: (2020)

Hamiltonian engineering with constrained optimization for quantum sensing and control
by: O’Keeffe, Michael F, et al.
Published: (2020)

Lowering the CUORE energy threshold
by: Copello, S., et al.
Published: (2021)

Analytical Criteria for Designing Multiresonance Filters in Scattering Systems, with Application to Microwave Metasurfaces
by: Benzaouia, Mohammed, et al.
Published: (2022)

Two-Photon Interface of Nuclear Spins Based on the Optonuclear Quadrupolar Effect
by: Xu, Haowei, et al.
Published: (2023)

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples
by: Sapsis, Themistoklis Panagiotis
Published: (2020)

Matching Triangles and Basing Hardness on an Extremely Popular Conjecture
by: Abboud, Amir, et al.
Published: (2021)

Design and development of single-axis solar tracking system and water level control for application of line focus concentrator for solar desalination process
by: Muhammad Adam, Zahari
Published: (2017)