Anfonwch hwn fel neges destun: Stochastic control approach to the multi-armed bandit problems