UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking

The success of robotic, such as UGV systems, largely benefits from the fundamental capability of autonomously finding collision-free path(s) to commit mobile tasks in routinely rough and complicated environments. Optimization of navigation under such circumstance has long been an open problem: 1) to...

Full description

Bibliographic Details
Main Authors:	Minggao Wei, Song Wang, Jinfan Zheng, Dan Chen
Format:	Article
Language:	English
Published:	IEEE 2018-01-01
Series:	IEEE Access
Subjects:	UGV navigation reinforcement learning deterministic policy gradient path tracking
Online Access:	https://ieeexplore.ieee.org/document/8476521/

_version_	1828889985467547648
author	Minggao Wei Song Wang Jinfan Zheng Dan Chen
author_facet	Minggao Wei Song Wang Jinfan Zheng Dan Chen
author_sort	Minggao Wei
collection	DOAJ
description	The success of robotic, such as UGV systems, largely benefits from the fundamental capability of autonomously finding collision-free path(s) to commit mobile tasks in routinely rough and complicated environments. Optimization of navigation under such circumstance has long been an open problem: 1) to meet the critical requirements of this task typically including the shortest distance and smoothness and 2) more challengingly, to enable a general solution to track the optimal path in real-time outdoor applications. Aiming at the problem, this study develops a two-tier approach to navigation optimization in terms of path planning and tracking. First, a “rope”model has been designed to mimic the deformation of a path in axial direction under external force and the fixedness of the radial plane to contain a UGV in a collision-free space. Second, a deterministic policy gradient (DPG) algorithm has been trained efficiently on abstracted structures of an arbitrarily derived “rope”to model the controller for tracking the optimal path. The learned policy can be generalized to a variety of scenarios. Experiments have been performed over complicated environments of different types. The results indicate that: 1) the rope model helps in minimizing distance and enhancing smoothness of the path, while guarantees the clearance; 2) the DPG can be modeled quickly (in a couple of minutes on an office desktop) and the model can apply to environments of increasing complexity under the circumstance of external disturbances without the need for tuning parameters; and 3) the DPG-based controller can autonomously adjust the UGV to follow the correct path free of risks by itself.
first_indexed	2024-12-13T12:50:40Z
format	Article
id	doaj.art-9393e6310a3c43ea91b28528e93b3efc
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-12-13T12:50:40Z
publishDate	2018-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-9393e6310a3c43ea91b28528e93b3efc2022-12-21T23:45:20ZengIEEEIEEE Access2169-35362018-01-016578145782510.1109/ACCESS.2018.28727518476521UGV Navigation Optimization Aided by Reinforcement Learning-Based Path TrackingMinggao Wei0Song Wang1Jinfan Zheng2Dan Chen3https://orcid.org/0000-0002-7055-141XNational Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Wuhan, ChinaNational Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Wuhan, ChinaNational Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Wuhan, ChinaNational Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Wuhan, ChinaThe success of robotic, such as UGV systems, largely benefits from the fundamental capability of autonomously finding collision-free path(s) to commit mobile tasks in routinely rough and complicated environments. Optimization of navigation under such circumstance has long been an open problem: 1) to meet the critical requirements of this task typically including the shortest distance and smoothness and 2) more challengingly, to enable a general solution to track the optimal path in real-time outdoor applications. Aiming at the problem, this study develops a two-tier approach to navigation optimization in terms of path planning and tracking. First, a “rope”model has been designed to mimic the deformation of a path in axial direction under external force and the fixedness of the radial plane to contain a UGV in a collision-free space. Second, a deterministic policy gradient (DPG) algorithm has been trained efficiently on abstracted structures of an arbitrarily derived “rope”to model the controller for tracking the optimal path. The learned policy can be generalized to a variety of scenarios. Experiments have been performed over complicated environments of different types. The results indicate that: 1) the rope model helps in minimizing distance and enhancing smoothness of the path, while guarantees the clearance; 2) the DPG can be modeled quickly (in a couple of minutes on an office desktop) and the model can apply to environments of increasing complexity under the circumstance of external disturbances without the need for tuning parameters; and 3) the DPG-based controller can autonomously adjust the UGV to follow the correct path free of risks by itself.https://ieeexplore.ieee.org/document/8476521/UGV navigationreinforcement learningdeterministic policy gradientpath tracking
spellingShingle	Minggao Wei Song Wang Jinfan Zheng Dan Chen UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking IEEE Access UGV navigation reinforcement learning deterministic policy gradient path tracking
title	UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking
title_full	UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking
title_fullStr	UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking
title_full_unstemmed	UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking
title_short	UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking
title_sort	ugv navigation optimization aided by reinforcement learning based path tracking
topic	UGV navigation reinforcement learning deterministic policy gradient path tracking
url	https://ieeexplore.ieee.org/document/8476521/
work_keys_str_mv	AT minggaowei ugvnavigationoptimizationaidedbyreinforcementlearningbasedpathtracking AT songwang ugvnavigationoptimizationaidedbyreinforcementlearningbasedpathtracking AT jinfanzheng ugvnavigationoptimizationaidedbyreinforcementlearningbasedpathtracking AT danchen ugvnavigationoptimizationaidedbyreinforcementlearningbasedpathtracking

UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking

Similar Items