FPGA-oriented lightweight multi-modal free-space detection network

For autonomous vehicles, free-space detection is an essential part of visual perception. With the development of multi-modal convolutional neural networks (CNNs) in recent years, the performance of driving scene semantic segmentation algorithms has been dramatically improved. Therefore most free-spa...

Full description

Bibliographic Details
Main Authors: Feiyi Fang, Junzhu Mao, Wei Yu, Jianfeng Lu
Format: Article
Language:English
Published: Taylor & Francis Group 2023-12-01
Series:Connection Science
Subjects:
Online Access:http://dx.doi.org/10.1080/09540091.2022.2159333
_version_ 1797683980044075008
author Feiyi Fang
Junzhu Mao
Wei Yu
Jianfeng Lu
author_facet Feiyi Fang
Junzhu Mao
Wei Yu
Jianfeng Lu
author_sort Feiyi Fang
collection DOAJ
description For autonomous vehicles, free-space detection is an essential part of visual perception. With the development of multi-modal convolutional neural networks (CNNs) in recent years, the performance of driving scene semantic segmentation algorithms has been dramatically improved. Therefore most free-space detection algorithms are developed based on multiple sensors. However, multi-modal CNNs have high data throughput and contain a large number of computationally intensive convolution calculations, limiting their feasibility for real-time applications. Field Programmable Gate Arrays (FPGAs) provide a unique combination of flexibility, performance, and low power for these problems to accommodate multi-modal data and the computational acceleration of different compression algorithms. Network lightweight methods offer great assurance for facilitating the deployment of CNNs on such resource-constrained devices. In this paper, we propose a network lightweight method for a multi-modal free-space detection algorithm. We first propose an FPGA-friendly multi-modal free-space detection lightweight network. It comprises operators that FPGA prefers and achieves a $ 95.54\% $ MaxF score on the test set of KITTI-Road free-space detection tasks and 81 ms runtime when running on 700 W GPU devices. Then we present a pruning approach for this network to reduce the number of parameters in case the complete model exceeds the FPGA chip memory. The pruning is in two parts. For the feature extractors, we propose a data-dependent filter pruner according to the principle that the low-rank feature map contains less information. To not compromise the integrity of the multi-modal information, the pruner is independent for each modality. For the segmentation decoder, we apply a channel pruning approach to remove redundant parameters. Finally, we implement our designs on an FPGA board using 8-bit quantisation, and the accelerator achieves outstanding performance. A real-time application of scene segmentation on KITTI-Road is used to evaluate our algorithm, and the model achieves a $ 94.39\% $ MaxF score and minimum 14 ms runtime on 20W FPGA devices.
first_indexed 2024-03-12T00:23:44Z
format Article
id doaj.art-0823dd7eb5a44f45894615960debdd68
institution Directory Open Access Journal
issn 0954-0091
1360-0494
language English
last_indexed 2024-03-12T00:23:44Z
publishDate 2023-12-01
publisher Taylor & Francis Group
record_format Article
series Connection Science
spelling doaj.art-0823dd7eb5a44f45894615960debdd682023-09-15T10:48:01ZengTaylor & Francis GroupConnection Science0954-00911360-04942023-12-0135110.1080/09540091.2022.21593332159333FPGA-oriented lightweight multi-modal free-space detection networkFeiyi Fang0Junzhu Mao1Wei Yu2Jianfeng Lu3Nanjing University of Science and TechnologyNanjing University of Science and TechnologyBeijing RICH AI information Technology Co., LtdNanjing University of Science and TechnologyFor autonomous vehicles, free-space detection is an essential part of visual perception. With the development of multi-modal convolutional neural networks (CNNs) in recent years, the performance of driving scene semantic segmentation algorithms has been dramatically improved. Therefore most free-space detection algorithms are developed based on multiple sensors. However, multi-modal CNNs have high data throughput and contain a large number of computationally intensive convolution calculations, limiting their feasibility for real-time applications. Field Programmable Gate Arrays (FPGAs) provide a unique combination of flexibility, performance, and low power for these problems to accommodate multi-modal data and the computational acceleration of different compression algorithms. Network lightweight methods offer great assurance for facilitating the deployment of CNNs on such resource-constrained devices. In this paper, we propose a network lightweight method for a multi-modal free-space detection algorithm. We first propose an FPGA-friendly multi-modal free-space detection lightweight network. It comprises operators that FPGA prefers and achieves a $ 95.54\% $ MaxF score on the test set of KITTI-Road free-space detection tasks and 81 ms runtime when running on 700 W GPU devices. Then we present a pruning approach for this network to reduce the number of parameters in case the complete model exceeds the FPGA chip memory. The pruning is in two parts. For the feature extractors, we propose a data-dependent filter pruner according to the principle that the low-rank feature map contains less information. To not compromise the integrity of the multi-modal information, the pruner is independent for each modality. For the segmentation decoder, we apply a channel pruning approach to remove redundant parameters. Finally, we implement our designs on an FPGA board using 8-bit quantisation, and the accelerator achieves outstanding performance. A real-time application of scene segmentation on KITTI-Road is used to evaluate our algorithm, and the model achieves a $ 94.39\% $ MaxF score and minimum 14 ms runtime on 20W FPGA devices.http://dx.doi.org/10.1080/09540091.2022.2159333pruningfree-space detectionlightweight networkmulti-modal learningfpga
spellingShingle Feiyi Fang
Junzhu Mao
Wei Yu
Jianfeng Lu
FPGA-oriented lightweight multi-modal free-space detection network
Connection Science
pruning
free-space detection
lightweight network
multi-modal learning
fpga
title FPGA-oriented lightweight multi-modal free-space detection network
title_full FPGA-oriented lightweight multi-modal free-space detection network
title_fullStr FPGA-oriented lightweight multi-modal free-space detection network
title_full_unstemmed FPGA-oriented lightweight multi-modal free-space detection network
title_short FPGA-oriented lightweight multi-modal free-space detection network
title_sort fpga oriented lightweight multi modal free space detection network
topic pruning
free-space detection
lightweight network
multi-modal learning
fpga
url http://dx.doi.org/10.1080/09540091.2022.2159333
work_keys_str_mv AT feiyifang fpgaorientedlightweightmultimodalfreespacedetectionnetwork
AT junzhumao fpgaorientedlightweightmultimodalfreespacedetectionnetwork
AT weiyu fpgaorientedlightweightmultimodalfreespacedetectionnetwork
AT jianfenglu fpgaorientedlightweightmultimodalfreespacedetectionnetwork