Deep Reinforcement Learning Object Tracking Based on Actor-Double Critic Network

Aiming at the problem of poor tracking robustness caused by severe occlusion, deformation, and object rotation of deep learning object tracking algorithm in complex scenes, an improved deep reinforcement learning object tracking algorithm based on actor-double critic network is proposed. In offline...

Full description

Bibliographic Details
Main Authors:	Jing Xin, Jianglei Zhou, Xinhong Hei, Pengyu Yue, Jia Zhao
Format:	Article
Language:	English
Published:	Tsinghua University Press 2023-12-01
Series:	CAAI Artificial Intelligence Research
Subjects:	object tracking deep reinforcement learning actor-double critic network
Online Access:	https://www.sciopen.com/article/10.26599/AIR.2023.9150013

_version_	1797202593306378240
author	Jing Xin Jianglei Zhou Xinhong Hei Pengyu Yue Jia Zhao
author_facet	Jing Xin Jianglei Zhou Xinhong Hei Pengyu Yue Jia Zhao
author_sort	Jing Xin
collection	DOAJ
description	Aiming at the problem of poor tracking robustness caused by severe occlusion, deformation, and object rotation of deep learning object tracking algorithm in complex scenes, an improved deep reinforcement learning object tracking algorithm based on actor-double critic network is proposed. In offline training phase, the actor network moves the rectangular box representing the object location according to the input sequence image to obtain the action value, that is, the horizontal, vertical, and scale transformation of the object. Then, the designed double critic network is used to evaluate the action value, and the output double Q value is averaged to guide the actor network to optimize the tracking strategy. The design of double critic network effectively improves the stability and convergence, especially in challenging scenes such as object occlusion, and the tracking performance is significantly improved. In online tracking phase, the well-trained actor network is used to infer the changing action of the bounding box, directly causing the tracker to move the box to the object position in the current frame. Several comparative tracking experiments were conducted on the OTB100 visual tracker benchmark and the experimental results show that more intensive reward settings significantly increase the actor network’s output probability of positive actions. This makes the tracking algorithm proposed in this paper outperforms the mainstream deep reinforcement learning tracking algorithms and deep learning tracking algorithms under the challenging attributes such as occlusion, deformation, and rotation.
first_indexed	2024-04-24T08:05:54Z
format	Article
id	doaj.art-455b8c9ecbd34549833127a71f3cdde7
institution	Directory Open Access Journal
issn	2097-194X
language	English
last_indexed	2024-04-24T08:05:54Z
publishDate	2023-12-01
publisher	Tsinghua University Press
record_format	Article
series	CAAI Artificial Intelligence Research
spelling	doaj.art-455b8c9ecbd34549833127a71f3cdde72024-04-17T10:29:52ZengTsinghua University PressCAAI Artificial Intelligence Research2097-194X2023-12-012915001310.26599/AIR.2023.9150013Deep Reinforcement Learning Object Tracking Based on Actor-Double Critic NetworkJing Xin0Jianglei Zhou1Xinhong Hei2Pengyu Yue3Jia Zhao4School of Automation and Information Engineering, Xi’an University of Technology, Xi’an 710048, ChinaSchool of Automation and Information Engineering, Xi’an University of Technology, Xi’an 710048, ChinaSchool of Computer Science and Engineering, Xi’an University of Technology, Xi’an 710048, ChinaSchool of Automation and Information Engineering, Xi’an University of Technology, Xi’an 710048, ChinaSchool of Automation and Information Engineering, Xi’an University of Technology, Xi’an 710048, ChinaAiming at the problem of poor tracking robustness caused by severe occlusion, deformation, and object rotation of deep learning object tracking algorithm in complex scenes, an improved deep reinforcement learning object tracking algorithm based on actor-double critic network is proposed. In offline training phase, the actor network moves the rectangular box representing the object location according to the input sequence image to obtain the action value, that is, the horizontal, vertical, and scale transformation of the object. Then, the designed double critic network is used to evaluate the action value, and the output double Q value is averaged to guide the actor network to optimize the tracking strategy. The design of double critic network effectively improves the stability and convergence, especially in challenging scenes such as object occlusion, and the tracking performance is significantly improved. In online tracking phase, the well-trained actor network is used to infer the changing action of the bounding box, directly causing the tracker to move the box to the object position in the current frame. Several comparative tracking experiments were conducted on the OTB100 visual tracker benchmark and the experimental results show that more intensive reward settings significantly increase the actor network’s output probability of positive actions. This makes the tracking algorithm proposed in this paper outperforms the mainstream deep reinforcement learning tracking algorithms and deep learning tracking algorithms under the challenging attributes such as occlusion, deformation, and rotation.https://www.sciopen.com/article/10.26599/AIR.2023.9150013object trackingdeep reinforcement learningactor-double critic network
spellingShingle	Jing Xin Jianglei Zhou Xinhong Hei Pengyu Yue Jia Zhao Deep Reinforcement Learning Object Tracking Based on Actor-Double Critic Network CAAI Artificial Intelligence Research object tracking deep reinforcement learning actor-double critic network
title	Deep Reinforcement Learning Object Tracking Based on Actor-Double Critic Network
title_full	Deep Reinforcement Learning Object Tracking Based on Actor-Double Critic Network
title_fullStr	Deep Reinforcement Learning Object Tracking Based on Actor-Double Critic Network
title_full_unstemmed	Deep Reinforcement Learning Object Tracking Based on Actor-Double Critic Network
title_short	Deep Reinforcement Learning Object Tracking Based on Actor-Double Critic Network
title_sort	deep reinforcement learning object tracking based on actor double critic network
topic	object tracking deep reinforcement learning actor-double critic network
url	https://www.sciopen.com/article/10.26599/AIR.2023.9150013
work_keys_str_mv	AT jingxin deepreinforcementlearningobjecttrackingbasedonactordoublecriticnetwork AT jiangleizhou deepreinforcementlearningobjecttrackingbasedonactordoublecriticnetwork AT xinhonghei deepreinforcementlearningobjecttrackingbasedonactordoublecriticnetwork AT pengyuyue deepreinforcementlearningobjecttrackingbasedonactordoublecriticnetwork AT jiazhao deepreinforcementlearningobjecttrackingbasedonactordoublecriticnetwork

Deep Reinforcement Learning Object Tracking Based on Actor-Double Critic Network

Similar Items