A Reinforcement Learning Model Based on Temporal Difference Algorithm

In some sense, computer game can be used as a test bed of artificial intelligence to develop intelligent algorithms. The paper proposed a kind of intelligent method: a reinforcement learning model based on temporal difference (TD) algorithm. And then the method is used to improve the playing power o...

Full description

Bibliographic Details
Main Authors: Xiali Li, Zhengyu Lv, Song Wang, Zhi Wei, Licheng Wu
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8819952/