Distributed Lazy Q-Learning for Cooperative Mobile Robots

Compared to single robot learning, cooperative learning adds the challenge of a much larger search space (combined individual search spaces), awareness of other team members, and also the synthesis of the individual behaviors with respect to the task given to the group. Over the years, reinforcement...

Full description

Bibliographic Details
Main Author:	Claude F. Touzet
Format:	Article
Language:	English
Published:	SAGE Publishing 2004-03-01
Series:	International Journal of Advanced Robotic Systems
Online Access:	https://doi.org/10.5772/5614

_version_	1819051046797836288
author	Claude F. Touzet
author_facet	Claude F. Touzet
author_sort	Claude F. Touzet
collection	DOAJ
description	Compared to single robot learning, cooperative learning adds the challenge of a much larger search space (combined individual search spaces), awareness of other team members, and also the synthesis of the individual behaviors with respect to the task given to the group. Over the years, reinforcement learning has emerged as the main learning approach in autonomous robotics, and lazy learning has become the leading bias, allowing the reduction of the time required by an experiment to the time needed to test the learned behavior performance. These two approaches have been combined together in what is now called lazy Q-learning, a very efficient single robot learning paradigm. We propose a derivation of this learning to team of robots : the «pessimistic» algorithm able to compute for each team member a lower bound of the utility of executing an action in a given situation. We use the cooperative multi-robot observation of multiple moving targets (CMOMMT) application as an illustrative example, and study the efficiency of the Pessimistic Algorithm in its task of inducing learning of cooperation.
first_indexed	2024-12-21T11:57:43Z
format	Article
id	doaj.art-afc680bcc059461fb58263ffc6803e90
institution	Directory Open Access Journal
issn	1729-8814
language	English
last_indexed	2024-12-21T11:57:43Z
publishDate	2004-03-01
publisher	SAGE Publishing
record_format	Article
series	International Journal of Advanced Robotic Systems
spelling	doaj.art-afc680bcc059461fb58263ffc6803e902022-12-21T19:04:53ZengSAGE PublishingInternational Journal of Advanced Robotic Systems1729-88142004-03-01110.5772/561410.5772_5614Distributed Lazy Q-Learning for Cooperative Mobile RobotsClaude F. TouzetCompared to single robot learning, cooperative learning adds the challenge of a much larger search space (combined individual search spaces), awareness of other team members, and also the synthesis of the individual behaviors with respect to the task given to the group. Over the years, reinforcement learning has emerged as the main learning approach in autonomous robotics, and lazy learning has become the leading bias, allowing the reduction of the time required by an experiment to the time needed to test the learned behavior performance. These two approaches have been combined together in what is now called lazy Q-learning, a very efficient single robot learning paradigm. We propose a derivation of this learning to team of robots : the «pessimistic» algorithm able to compute for each team member a lower bound of the utility of executing an action in a given situation. We use the cooperative multi-robot observation of multiple moving targets (CMOMMT) application as an illustrative example, and study the efficiency of the Pessimistic Algorithm in its task of inducing learning of cooperation.https://doi.org/10.5772/5614
spellingShingle	Claude F. Touzet Distributed Lazy Q-Learning for Cooperative Mobile Robots International Journal of Advanced Robotic Systems
title	Distributed Lazy Q-Learning for Cooperative Mobile Robots
title_full	Distributed Lazy Q-Learning for Cooperative Mobile Robots
title_fullStr	Distributed Lazy Q-Learning for Cooperative Mobile Robots
title_full_unstemmed	Distributed Lazy Q-Learning for Cooperative Mobile Robots
title_short	Distributed Lazy Q-Learning for Cooperative Mobile Robots
title_sort	distributed lazy q learning for cooperative mobile robots
url	https://doi.org/10.5772/5614
work_keys_str_mv	AT claudeftouzet distributedlazyqlearningforcooperativemobilerobots

Distributed Lazy Q-Learning for Cooperative Mobile Robots

Similar Items