Hardware-Aware Design of Speed-Up Algorithms for Synthetic Aperture Radar Ship Target Detection Networks
Recently, synthetic aperture radar (SAR) target detection algorithms based on Convolutional Neural Networks (CNN) have received increasing attention. However, the large amount of computation required burdens the real-time detection of SAR ship targets on resource-limited and power-constrained satell...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-10-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/15/20/4995 |
_version_ | 1797572426104569856 |
---|---|
author | Yue Zhang Shuai Jiang Yue Cao Jiarong Xiao Chengkun Li Xuan Zhou Zhongjun Yu |
author_facet | Yue Zhang Shuai Jiang Yue Cao Jiarong Xiao Chengkun Li Xuan Zhou Zhongjun Yu |
author_sort | Yue Zhang |
collection | DOAJ |
description | Recently, synthetic aperture radar (SAR) target detection algorithms based on Convolutional Neural Networks (CNN) have received increasing attention. However, the large amount of computation required burdens the real-time detection of SAR ship targets on resource-limited and power-constrained satellite-based platforms. In this paper, we propose a hardware-aware model speed-up method for single-stage SAR ship targets detection tasks, oriented towards the most widely used hardware for neural network computing—Graphic Processing Unit (GPU). We first analyze the process by which the task of detection is executed on GPUs and propose two strategies according to this process. Firstly, in order to speed up the execution of the model on a GPU, we propose SAR-aware model quantification to allow the original model to be stored and computed in a low-precision format. Next, to ensure the loss of accuracy is negligible after the acceleration and compression process, precision-aware scheduling is used to filter out layers that are not suitable for quantification and store and execute them in a high-precision mode. Trained on the dataset HRSID, the effectiveness of this model speed-up algorithm was demonstrated by compressing four different sizes of models (yolov5n, yolov5s, yolov5m, yolov5l). The experimental results show that the detection speeds of yolov5n, yolov5s, yolov5m, and yolov5l can reach 234.7785 fps, 212.8341 fps, 165.6523 fps, and 139.8758 fps on the NVIDIA AGX Xavier development board with negligible loss of accuracy, which is 1.230 times, 1.469 times, 1.955 times, and 2.448 times faster than the original before the use of this method, respectively. |
first_indexed | 2024-03-10T20:56:06Z |
format | Article |
id | doaj.art-df6f62bb4a184db4aef5105aa34b89b8 |
institution | Directory Open Access Journal |
issn | 2072-4292 |
language | English |
last_indexed | 2024-03-10T20:56:06Z |
publishDate | 2023-10-01 |
publisher | MDPI AG |
record_format | Article |
series | Remote Sensing |
spelling | doaj.art-df6f62bb4a184db4aef5105aa34b89b82023-11-19T17:59:19ZengMDPI AGRemote Sensing2072-42922023-10-011520499510.3390/rs15204995Hardware-Aware Design of Speed-Up Algorithms for Synthetic Aperture Radar Ship Target Detection NetworksYue Zhang0Shuai Jiang1Yue Cao2Jiarong Xiao3Chengkun Li4Xuan Zhou5Zhongjun Yu6Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaRecently, synthetic aperture radar (SAR) target detection algorithms based on Convolutional Neural Networks (CNN) have received increasing attention. However, the large amount of computation required burdens the real-time detection of SAR ship targets on resource-limited and power-constrained satellite-based platforms. In this paper, we propose a hardware-aware model speed-up method for single-stage SAR ship targets detection tasks, oriented towards the most widely used hardware for neural network computing—Graphic Processing Unit (GPU). We first analyze the process by which the task of detection is executed on GPUs and propose two strategies according to this process. Firstly, in order to speed up the execution of the model on a GPU, we propose SAR-aware model quantification to allow the original model to be stored and computed in a low-precision format. Next, to ensure the loss of accuracy is negligible after the acceleration and compression process, precision-aware scheduling is used to filter out layers that are not suitable for quantification and store and execute them in a high-precision mode. Trained on the dataset HRSID, the effectiveness of this model speed-up algorithm was demonstrated by compressing four different sizes of models (yolov5n, yolov5s, yolov5m, yolov5l). The experimental results show that the detection speeds of yolov5n, yolov5s, yolov5m, and yolov5l can reach 234.7785 fps, 212.8341 fps, 165.6523 fps, and 139.8758 fps on the NVIDIA AGX Xavier development board with negligible loss of accuracy, which is 1.230 times, 1.469 times, 1.955 times, and 2.448 times faster than the original before the use of this method, respectively.https://www.mdpi.com/2072-4292/15/20/4995synthetic aperture radar (SAR)target detectionconvolutional neural networks (CNN)model speed-up algorithmsgraphic processing unit (GPU) |
spellingShingle | Yue Zhang Shuai Jiang Yue Cao Jiarong Xiao Chengkun Li Xuan Zhou Zhongjun Yu Hardware-Aware Design of Speed-Up Algorithms for Synthetic Aperture Radar Ship Target Detection Networks Remote Sensing synthetic aperture radar (SAR) target detection convolutional neural networks (CNN) model speed-up algorithms graphic processing unit (GPU) |
title | Hardware-Aware Design of Speed-Up Algorithms for Synthetic Aperture Radar Ship Target Detection Networks |
title_full | Hardware-Aware Design of Speed-Up Algorithms for Synthetic Aperture Radar Ship Target Detection Networks |
title_fullStr | Hardware-Aware Design of Speed-Up Algorithms for Synthetic Aperture Radar Ship Target Detection Networks |
title_full_unstemmed | Hardware-Aware Design of Speed-Up Algorithms for Synthetic Aperture Radar Ship Target Detection Networks |
title_short | Hardware-Aware Design of Speed-Up Algorithms for Synthetic Aperture Radar Ship Target Detection Networks |
title_sort | hardware aware design of speed up algorithms for synthetic aperture radar ship target detection networks |
topic | synthetic aperture radar (SAR) target detection convolutional neural networks (CNN) model speed-up algorithms graphic processing unit (GPU) |
url | https://www.mdpi.com/2072-4292/15/20/4995 |
work_keys_str_mv | AT yuezhang hardwareawaredesignofspeedupalgorithmsforsyntheticapertureradarshiptargetdetectionnetworks AT shuaijiang hardwareawaredesignofspeedupalgorithmsforsyntheticapertureradarshiptargetdetectionnetworks AT yuecao hardwareawaredesignofspeedupalgorithmsforsyntheticapertureradarshiptargetdetectionnetworks AT jiarongxiao hardwareawaredesignofspeedupalgorithmsforsyntheticapertureradarshiptargetdetectionnetworks AT chengkunli hardwareawaredesignofspeedupalgorithmsforsyntheticapertureradarshiptargetdetectionnetworks AT xuanzhou hardwareawaredesignofspeedupalgorithmsforsyntheticapertureradarshiptargetdetectionnetworks AT zhongjunyu hardwareawaredesignofspeedupalgorithmsforsyntheticapertureradarshiptargetdetectionnetworks |