A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMES

Many real-world applications can be described as large-scale games of imperfect information. This kind of games is particularly harder than the deterministic one as the search space is even more sizeable. In this paper, I want to explore the power of reinforcement learning in such an environment; t...

Full description

Bibliographic Details
Main Author:	Tidor-Vlad PRICOPE
Format:	Article
Language:	English
Published:	Babes-Bolyai University, Cluj-Napoca 2020-10-01
Series:	Studia Universitatis Babes-Bolyai: Series Informatica
Subjects:	Artificial Intelligence, Computer Poker, Adaptive Learning, Fictitious Play, Deep Reinforcement Learning, Neural Networks.
Online Access:	http://193.231.18.162/index.php/subbinformatica/article/view/3888

_version_	1827356526823604224
author	Tidor-Vlad PRICOPE
author_facet	Tidor-Vlad PRICOPE
author_sort	Tidor-Vlad PRICOPE
collection	DOAJ
description	Many real-world applications can be described as large-scale games of imperfect information. This kind of games is particularly harder than the deterministic one as the search space is even more sizeable. In this paper, I want to explore the power of reinforcement learning in such an environment; that is why I take a look at one of the most popular game of such type, no limit Texas Hold’em Poker, yet unsolved, developing multiple agents with different learning paradigms and techniques and then comparing their respective performances. When applied to no-limit Hold’em Poker, deep reinforcement learning agents clearly outperform agents with a more traditional approach. Moreover, if these last agents rival a human beginner level of play, the ones based on reinforcement learning compare to an amateur human player. The main algorithm uses Fictitious Play in combination with ANNs and some handcrafted metrics. We also applied the main algorithm to another game of imperfect information, less complex than Poker, in order to show the scalability of this solution and the increase in performance when put neck in neck with established classical approaches from the reinforcement learning literature.
first_indexed	2024-03-08T05:11:01Z
format	Article
id	doaj.art-4b97a325d39940c78f139b1a0617e43d
institution	Directory Open Access Journal
issn	2065-9601
language	English
last_indexed	2024-03-08T05:11:01Z
publishDate	2020-10-01
publisher	Babes-Bolyai University, Cluj-Napoca
record_format	Article
series	Studia Universitatis Babes-Bolyai: Series Informatica
spelling	doaj.art-4b97a325d39940c78f139b1a0617e43d2024-02-07T10:03:36ZengBabes-Bolyai University, Cluj-NapocaStudia Universitatis Babes-Bolyai: Series Informatica2065-96012020-10-0165210.24193/subbi.2020.2.03A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMESTidor-Vlad PRICOPE0University of Edinburgh. Email address: T.V.Pricope@sms.ed.ac.uk Many real-world applications can be described as large-scale games of imperfect information. This kind of games is particularly harder than the deterministic one as the search space is even more sizeable. In this paper, I want to explore the power of reinforcement learning in such an environment; that is why I take a look at one of the most popular game of such type, no limit Texas Hold’em Poker, yet unsolved, developing multiple agents with different learning paradigms and techniques and then comparing their respective performances. When applied to no-limit Hold’em Poker, deep reinforcement learning agents clearly outperform agents with a more traditional approach. Moreover, if these last agents rival a human beginner level of play, the ones based on reinforcement learning compare to an amateur human player. The main algorithm uses Fictitious Play in combination with ANNs and some handcrafted metrics. We also applied the main algorithm to another game of imperfect information, less complex than Poker, in order to show the scalability of this solution and the increase in performance when put neck in neck with established classical approaches from the reinforcement learning literature. http://193.231.18.162/index.php/subbinformatica/article/view/3888Artificial Intelligence, Computer Poker, Adaptive Learning, Fictitious Play, Deep Reinforcement Learning, Neural Networks.
spellingShingle	Tidor-Vlad PRICOPE A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMES Studia Universitatis Babes-Bolyai: Series Informatica Artificial Intelligence, Computer Poker, Adaptive Learning, Fictitious Play, Deep Reinforcement Learning, Neural Networks.
title	A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMES
title_full	A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMES
title_fullStr	A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMES
title_full_unstemmed	A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMES
title_short	A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMES
title_sort	view on deep reinforcement learning in imperfect information games
topic	Artificial Intelligence, Computer Poker, Adaptive Learning, Fictitious Play, Deep Reinforcement Learning, Neural Networks.
url	http://193.231.18.162/index.php/subbinformatica/article/view/3888
work_keys_str_mv	AT tidorvladpricope aviewondeepreinforcementlearninginimperfectinformationgames AT tidorvladpricope viewondeepreinforcementlearninginimperfectinformationgames

A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMES

Similar Items