Online Learning of Time-Varying Unbalanced Networks in Non-Convex Environments: A Multi-Armed Bandit Approach

This study discusses how agents in a time-varying distributed network can converge to the global minimizer of a time-varying graph network. Each agent knows only the local loss of its observation and must cooperate constructively with other agents to find the global minimizer of the network. Unlike...

Full description

Bibliographic Details
Main Author:	Olusola T. Odeyomi
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Access
Subjects:	Online learning multi-armed bandit Lipschitz regret strongly connected graph
Online Access:	https://ieeexplore.ieee.org/document/10041910/

Internet

https://ieeexplore.ieee.org/document/10041910/

Online Learning of Time-Varying Unbalanced Networks in Non-Convex Environments: A Multi-Armed Bandit Approach

Internet

Similar Items