Solving multi-armed bandit problems using a chaotic microresonator comb
The Multi-Armed Bandit (MAB) problem, foundational to reinforcement learning-based decision-making, addresses the challenge of maximizing rewards amid multiple uncertain choices. While algorithmic solutions are effective, their computational efficiency diminishes with increasing problem complexity....
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
AIP Publishing LLC
2024-03-01
|
Series: | APL Photonics |
Online Access: | http://dx.doi.org/10.1063/5.0173287 |