Constraint Loss for Rotated Object Detection in Remote Sensing Images

Rotated object detection is an extension of object detection that uses an oriented bounding box instead of a general horizontal bounding box to define the object position. It is widely used in remote sensing images, scene text, and license plate recognition. The existing rotated object detection met...

Full description

Bibliographic Details
Main Authors: Luyang Zhang, Haitao Wang, Lingfeng Wang, Chunhong Pan, Qiang Liu, Xinyao Wang
Format: Article
Language:English
Published: MDPI AG 2021-10-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/13/21/4291
_version_ 1797511873635024896
author Luyang Zhang
Haitao Wang
Lingfeng Wang
Chunhong Pan
Qiang Liu
Xinyao Wang
author_facet Luyang Zhang
Haitao Wang
Lingfeng Wang
Chunhong Pan
Qiang Liu
Xinyao Wang
author_sort Luyang Zhang
collection DOAJ
description Rotated object detection is an extension of object detection that uses an oriented bounding box instead of a general horizontal bounding box to define the object position. It is widely used in remote sensing images, scene text, and license plate recognition. The existing rotated object detection methods usually add an angle prediction channel in the bounding box prediction branch, and smooth <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>L</mi><mn>1</mn></msub></semantics></math></inline-formula> loss is used as the regression loss function. However, we argue that smooth <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>L</mi><mn>1</mn></msub></semantics></math></inline-formula> loss causes a sudden change in loss and slow convergence due to the angle solving mechanism of open CV (the angle between the horizontal line and the first side of the bounding box in the counter-clockwise direction is defined as the rotation angle), and this problem exists in most existing regression loss functions. To solve the above problems, we propose a decoupling modulation mechanism to overcome the problem of sudden changes in loss. On this basis, we also proposed a constraint mechanism, the purpose of which is to accelerate the convergence of the network and ensure optimization toward the ideal direction. In addition, the proposed decoupling modulation mechanism and constraint mechanism can be integrated into the popular regression loss function individually or together, which further improves the performance of the model and makes the model converge faster. The experimental results show that our method achieves 75.2% performance on the aerial image dataset DOTA (OBB task), and saves more than 30% of computing resources. The method also achieves a state-of-the-art performance in HRSC2016, and saved more than 40% of computing resources, which confirms the applicability of the approach.
first_indexed 2024-03-10T05:54:06Z
format Article
id doaj.art-acd2c41bbb26464b8615e0d8c1436ce8
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-10T05:54:06Z
publishDate 2021-10-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-acd2c41bbb26464b8615e0d8c1436ce82023-11-22T21:31:17ZengMDPI AGRemote Sensing2072-42922021-10-011321429110.3390/rs13214291Constraint Loss for Rotated Object Detection in Remote Sensing ImagesLuyang Zhang0Haitao Wang1Lingfeng Wang2Chunhong Pan3Qiang Liu4Xinyao Wang5College of Automation Engineering, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, ChinaCollege of Automation Engineering, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, ChinaCollege of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, ChinaNational Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, ChinaCollege of Automation Engineering, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, ChinaCollege of Automation Engineering, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, ChinaRotated object detection is an extension of object detection that uses an oriented bounding box instead of a general horizontal bounding box to define the object position. It is widely used in remote sensing images, scene text, and license plate recognition. The existing rotated object detection methods usually add an angle prediction channel in the bounding box prediction branch, and smooth <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>L</mi><mn>1</mn></msub></semantics></math></inline-formula> loss is used as the regression loss function. However, we argue that smooth <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>L</mi><mn>1</mn></msub></semantics></math></inline-formula> loss causes a sudden change in loss and slow convergence due to the angle solving mechanism of open CV (the angle between the horizontal line and the first side of the bounding box in the counter-clockwise direction is defined as the rotation angle), and this problem exists in most existing regression loss functions. To solve the above problems, we propose a decoupling modulation mechanism to overcome the problem of sudden changes in loss. On this basis, we also proposed a constraint mechanism, the purpose of which is to accelerate the convergence of the network and ensure optimization toward the ideal direction. In addition, the proposed decoupling modulation mechanism and constraint mechanism can be integrated into the popular regression loss function individually or together, which further improves the performance of the model and makes the model converge faster. The experimental results show that our method achieves 75.2% performance on the aerial image dataset DOTA (OBB task), and saves more than 30% of computing resources. The method also achieves a state-of-the-art performance in HRSC2016, and saved more than 40% of computing resources, which confirms the applicability of the approach.https://www.mdpi.com/2072-4292/13/21/4291rotated object detectionremote sensing imageloss functionsfast convergence
spellingShingle Luyang Zhang
Haitao Wang
Lingfeng Wang
Chunhong Pan
Qiang Liu
Xinyao Wang
Constraint Loss for Rotated Object Detection in Remote Sensing Images
Remote Sensing
rotated object detection
remote sensing image
loss functions
fast convergence
title Constraint Loss for Rotated Object Detection in Remote Sensing Images
title_full Constraint Loss for Rotated Object Detection in Remote Sensing Images
title_fullStr Constraint Loss for Rotated Object Detection in Remote Sensing Images
title_full_unstemmed Constraint Loss for Rotated Object Detection in Remote Sensing Images
title_short Constraint Loss for Rotated Object Detection in Remote Sensing Images
title_sort constraint loss for rotated object detection in remote sensing images
topic rotated object detection
remote sensing image
loss functions
fast convergence
url https://www.mdpi.com/2072-4292/13/21/4291
work_keys_str_mv AT luyangzhang constraintlossforrotatedobjectdetectioninremotesensingimages
AT haitaowang constraintlossforrotatedobjectdetectioninremotesensingimages
AT lingfengwang constraintlossforrotatedobjectdetectioninremotesensingimages
AT chunhongpan constraintlossforrotatedobjectdetectioninremotesensingimages
AT qiangliu constraintlossforrotatedobjectdetectioninremotesensingimages
AT xinyaowang constraintlossforrotatedobjectdetectioninremotesensingimages