Policy search with rare significant events: Choosing the right partner to cooperate with

This paper focuses on a class of reinforcement learning problems where significant events are rare and limited to a single positive reward per episode. A typical example is that of an agent who has to choose a partner to cooperate with, while a large number of partners are simply not interested in c...

Full description

Bibliographic Details
Main Authors:	Paul Ecoffet, Nicolas Fontbonne, Jean-Baptiste André, Nicolas Bredeche
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2022-01-01
Series:	PLoS ONE
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041856/?tool=EBI

Internet

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041856/?tool=EBI

Policy search with rare significant events: Choosing the right partner to cooperate with

Internet

Similar Items