Deep Reinforcement Learning-Based Failure-Safe Motion Planning for a 4-Wheeled 2-Steering Lunar Rover

The growing trend of onboard computational autonomy has increased the need for self-reliant rovers (SRRs) with high efficiency for unmanned rover activities. Mobility is directly associated with a successful execution mission, thus fault response for actuator failures is highly crucial for planetary...

Full description

Bibliographic Details
Main Authors: Beom-Joon Park, Hyun-Joon Chung
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:Aerospace
Subjects:
Online Access:https://www.mdpi.com/2226-4310/10/3/219
_version_ 1797614157977092096
author Beom-Joon Park
Hyun-Joon Chung
author_facet Beom-Joon Park
Hyun-Joon Chung
author_sort Beom-Joon Park
collection DOAJ
description The growing trend of onboard computational autonomy has increased the need for self-reliant rovers (SRRs) with high efficiency for unmanned rover activities. Mobility is directly associated with a successful execution mission, thus fault response for actuator failures is highly crucial for planetary exploration rovers in such a trend. However, most of the existing mobility health management systems for rovers have focused on fault diagnosis and protection sequences that are determined by human operators through ground-in-the-loop solutions. This paper presents a special four-wheeled two-steering lunar rover with a modified explicit steering mechanism, where each left and right wheel is controlled by only two actuators. Under these constraints, a new motion planning method that combines reinforcement learning with the rover’s kinematic model without the need for dynamics modeling is devised. A failure-safe algorithm is proposed to address the critical loss of mobility in the case of steering motor failure, by expanding the devised motion planning method, which is designed to ensure mobility for mission execution in a four-wheeled rover. The algorithm’s performance and applicability are validated through simulations on high-slip terrain scenarios caused by steering motor failure and compared with a conventional control method in terms of reliability. This simulation validation serves as a preliminary study toward future works on deformable terrain such as rough or soft areas and optimization of the deep neural network’s weight factor for fine-tuning in real experiments. The failure-safe motion planning provides valuable insights as a first-step approach toward developing autonomous recovery strategies for rover mobility.
first_indexed 2024-03-11T07:05:51Z
format Article
id doaj.art-a7d6b17cafbb4151aae7bb8f702b9368
institution Directory Open Access Journal
issn 2226-4310
language English
last_indexed 2024-03-11T07:05:51Z
publishDate 2023-02-01
publisher MDPI AG
record_format Article
series Aerospace
spelling doaj.art-a7d6b17cafbb4151aae7bb8f702b93682023-11-17T08:58:00ZengMDPI AGAerospace2226-43102023-02-0110321910.3390/aerospace10030219Deep Reinforcement Learning-Based Failure-Safe Motion Planning for a 4-Wheeled 2-Steering Lunar RoverBeom-Joon Park0Hyun-Joon Chung1Korea Institute of Robotics and Technology Convergence, Seoul 06372, Republic of KoreaKorea Institute of Robotics and Technology Convergence, Pohang 37666, Republic of KoreaThe growing trend of onboard computational autonomy has increased the need for self-reliant rovers (SRRs) with high efficiency for unmanned rover activities. Mobility is directly associated with a successful execution mission, thus fault response for actuator failures is highly crucial for planetary exploration rovers in such a trend. However, most of the existing mobility health management systems for rovers have focused on fault diagnosis and protection sequences that are determined by human operators through ground-in-the-loop solutions. This paper presents a special four-wheeled two-steering lunar rover with a modified explicit steering mechanism, where each left and right wheel is controlled by only two actuators. Under these constraints, a new motion planning method that combines reinforcement learning with the rover’s kinematic model without the need for dynamics modeling is devised. A failure-safe algorithm is proposed to address the critical loss of mobility in the case of steering motor failure, by expanding the devised motion planning method, which is designed to ensure mobility for mission execution in a four-wheeled rover. The algorithm’s performance and applicability are validated through simulations on high-slip terrain scenarios caused by steering motor failure and compared with a conventional control method in terms of reliability. This simulation validation serves as a preliminary study toward future works on deformable terrain such as rough or soft areas and optimization of the deep neural network’s weight factor for fine-tuning in real experiments. The failure-safe motion planning provides valuable insights as a first-step approach toward developing autonomous recovery strategies for rover mobility.https://www.mdpi.com/2226-4310/10/3/219planetary rovermoon explorationfailure-safe algorithmmotion planningreinforcement learning
spellingShingle Beom-Joon Park
Hyun-Joon Chung
Deep Reinforcement Learning-Based Failure-Safe Motion Planning for a 4-Wheeled 2-Steering Lunar Rover
Aerospace
planetary rover
moon exploration
failure-safe algorithm
motion planning
reinforcement learning
title Deep Reinforcement Learning-Based Failure-Safe Motion Planning for a 4-Wheeled 2-Steering Lunar Rover
title_full Deep Reinforcement Learning-Based Failure-Safe Motion Planning for a 4-Wheeled 2-Steering Lunar Rover
title_fullStr Deep Reinforcement Learning-Based Failure-Safe Motion Planning for a 4-Wheeled 2-Steering Lunar Rover
title_full_unstemmed Deep Reinforcement Learning-Based Failure-Safe Motion Planning for a 4-Wheeled 2-Steering Lunar Rover
title_short Deep Reinforcement Learning-Based Failure-Safe Motion Planning for a 4-Wheeled 2-Steering Lunar Rover
title_sort deep reinforcement learning based failure safe motion planning for a 4 wheeled 2 steering lunar rover
topic planetary rover
moon exploration
failure-safe algorithm
motion planning
reinforcement learning
url https://www.mdpi.com/2226-4310/10/3/219
work_keys_str_mv AT beomjoonpark deepreinforcementlearningbasedfailuresafemotionplanningfora4wheeled2steeringlunarrover
AT hyunjoonchung deepreinforcementlearningbasedfailuresafemotionplanningfora4wheeled2steeringlunarrover