Sample-efficiency in multi-batch reinforcement learning: the need for dimension-dependent adaptivity

We theoretically explore the relationship between sample-efficiency and adaptivity in reinforcement learning. An algorithm is sample-efficient if it uses a number of queries n to the environment that is polynomial in the dimension d of the problem. Adaptivity refers to the frequency at which queries...

Celý popis

Podrobná bibliografie
Hlavní autoři:	Johnson, E, Pike-Burke, C, Rebeschini, P
Médium:	Conference item
Jazyk:	English
Vydáno:	OpenReview 2024

Sample-efficiency in multi-batch reinforcement learning: the need for dimension-dependent adaptivity

Podobné jednotky