A Task-Risk Consistency Object Detection Framework Based on Deep Reinforcement Learning

A discernible gap has materialized between the expectations for object detection tasks in optical remote sensing images and the increasingly sophisticated design methods. The flexibility of deep learning object detection algorithms allows the selection and combination of multiple basic structures an...

Full description

Bibliographic Details
Main Authors: Jiazheng Wen, Huanyu Liu, Junbao Li
Format: Article
Language:English
Published: MDPI AG 2023-10-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/15/20/5031
_version_ 1797572424324087808
author Jiazheng Wen
Huanyu Liu
Junbao Li
author_facet Jiazheng Wen
Huanyu Liu
Junbao Li
author_sort Jiazheng Wen
collection DOAJ
description A discernible gap has materialized between the expectations for object detection tasks in optical remote sensing images and the increasingly sophisticated design methods. The flexibility of deep learning object detection algorithms allows the selection and combination of multiple basic structures and model sizes, but this selection process relies heavily on human experience and lacks reliability when faced with special scenarios or extreme data distribution. To address these inherent challenges, this study proposes an approach that leverages deep reinforcement learning within the framework of vision tasks. This study introduces a Task-Risk Consistent Intelligent Detection Framework (TRC-ODF) for object detection in optical remote sensing images. The proposed framework designs a model optimization strategy based on deep reinforcement learning that systematically integrates the available information from images and vision processes. The core of the reinforcement learning agent is the proposed task-risk consistency reward mechanism, which is the driving force behind the optimal prediction allocation in the decision-making process. To verify the effectiveness of the proposed framework, multiple sets of empirical evaluations are conducted on representative optical remote sensing image datasets: RSOD, NWPU VHR-10, and DIOR. When applying the proposed framework to representative advanced detection models, the mean average precision (mAP@0.5 and mAP@0.5:0.95) is improved by 0.8–5.4 and 0.4–2.7, respectively. The obtained results showcase the considerable promise and potential of the TRC-ODF framework to address the challenges associated with object detection in optical remote sensing images.
first_indexed 2024-03-10T20:56:05Z
format Article
id doaj.art-d5f4d656e056450d8716271f0e0ac024
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-10T20:56:05Z
publishDate 2023-10-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-d5f4d656e056450d8716271f0e0ac0242023-11-19T17:59:52ZengMDPI AGRemote Sensing2072-42922023-10-011520503110.3390/rs15205031A Task-Risk Consistency Object Detection Framework Based on Deep Reinforcement LearningJiazheng Wen0Huanyu Liu1Junbao Li2Faculty of Computing, Harbin Institute of Technology, Harbin 150080, ChinaFaculty of Computing, Harbin Institute of Technology, Harbin 150080, ChinaFaculty of Computing, Harbin Institute of Technology, Harbin 150080, ChinaA discernible gap has materialized between the expectations for object detection tasks in optical remote sensing images and the increasingly sophisticated design methods. The flexibility of deep learning object detection algorithms allows the selection and combination of multiple basic structures and model sizes, but this selection process relies heavily on human experience and lacks reliability when faced with special scenarios or extreme data distribution. To address these inherent challenges, this study proposes an approach that leverages deep reinforcement learning within the framework of vision tasks. This study introduces a Task-Risk Consistent Intelligent Detection Framework (TRC-ODF) for object detection in optical remote sensing images. The proposed framework designs a model optimization strategy based on deep reinforcement learning that systematically integrates the available information from images and vision processes. The core of the reinforcement learning agent is the proposed task-risk consistency reward mechanism, which is the driving force behind the optimal prediction allocation in the decision-making process. To verify the effectiveness of the proposed framework, multiple sets of empirical evaluations are conducted on representative optical remote sensing image datasets: RSOD, NWPU VHR-10, and DIOR. When applying the proposed framework to representative advanced detection models, the mean average precision (mAP@0.5 and mAP@0.5:0.95) is improved by 0.8–5.4 and 0.4–2.7, respectively. The obtained results showcase the considerable promise and potential of the TRC-ODF framework to address the challenges associated with object detection in optical remote sensing images.https://www.mdpi.com/2072-4292/15/20/5031object detectionreinforcement learningoptical remote sensing image
spellingShingle Jiazheng Wen
Huanyu Liu
Junbao Li
A Task-Risk Consistency Object Detection Framework Based on Deep Reinforcement Learning
Remote Sensing
object detection
reinforcement learning
optical remote sensing image
title A Task-Risk Consistency Object Detection Framework Based on Deep Reinforcement Learning
title_full A Task-Risk Consistency Object Detection Framework Based on Deep Reinforcement Learning
title_fullStr A Task-Risk Consistency Object Detection Framework Based on Deep Reinforcement Learning
title_full_unstemmed A Task-Risk Consistency Object Detection Framework Based on Deep Reinforcement Learning
title_short A Task-Risk Consistency Object Detection Framework Based on Deep Reinforcement Learning
title_sort task risk consistency object detection framework based on deep reinforcement learning
topic object detection
reinforcement learning
optical remote sensing image
url https://www.mdpi.com/2072-4292/15/20/5031
work_keys_str_mv AT jiazhengwen ataskriskconsistencyobjectdetectionframeworkbasedondeepreinforcementlearning
AT huanyuliu ataskriskconsistencyobjectdetectionframeworkbasedondeepreinforcementlearning
AT junbaoli ataskriskconsistencyobjectdetectionframeworkbasedondeepreinforcementlearning
AT jiazhengwen taskriskconsistencyobjectdetectionframeworkbasedondeepreinforcementlearning
AT huanyuliu taskriskconsistencyobjectdetectionframeworkbasedondeepreinforcementlearning
AT junbaoli taskriskconsistencyobjectdetectionframeworkbasedondeepreinforcementlearning