FPGA-oriented lightweight multi-modal free-space detection network

For autonomous vehicles, free-space detection is an essential part of visual perception. With the development of multi-modal convolutional neural networks (CNNs) in recent years, the performance of driving scene semantic segmentation algorithms has been dramatically improved. Therefore most free-spa...

Full description

Bibliographic Details
Main Authors:	Feiyi Fang, Junzhu Mao, Wei Yu, Jianfeng Lu
Format:	Article
Language:	English
Published:	Taylor & Francis Group 2023-12-01
Series:	Connection Science
Subjects:	pruning free-space detection lightweight network multi-modal learning fpga
Online Access:	http://dx.doi.org/10.1080/09540091.2022.2159333

_version_	1797683980044075008
author	Feiyi Fang Junzhu Mao Wei Yu Jianfeng Lu
author_facet	Feiyi Fang Junzhu Mao Wei Yu Jianfeng Lu
author_sort	Feiyi Fang
collection	DOAJ
description	For autonomous vehicles, free-space detection is an essential part of visual perception. With the development of multi-modal convolutional neural networks (CNNs) in recent years, the performance of driving scene semantic segmentation algorithms has been dramatically improved. Therefore most free-space detection algorithms are developed based on multiple sensors. However, multi-modal CNNs have high data throughput and contain a large number of computationally intensive convolution calculations, limiting their feasibility for real-time applications. Field Programmable Gate Arrays (FPGAs) provide a unique combination of flexibility, performance, and low power for these problems to accommodate multi-modal data and the computational acceleration of different compression algorithms. Network lightweight methods offer great assurance for facilitating the deployment of CNNs on such resource-constrained devices. In this paper, we propose a network lightweight method for a multi-modal free-space detection algorithm. We first propose an FPGA-friendly multi-modal free-space detection lightweight network. It comprises operators that FPGA prefers and achieves a $ 95.54\% $ MaxF score on the test set of KITTI-Road free-space detection tasks and 81 ms runtime when running on 700 W GPU devices. Then we present a pruning approach for this network to reduce the number of parameters in case the complete model exceeds the FPGA chip memory. The pruning is in two parts. For the feature extractors, we propose a data-dependent filter pruner according to the principle that the low-rank feature map contains less information. To not compromise the integrity of the multi-modal information, the pruner is independent for each modality. For the segmentation decoder, we apply a channel pruning approach to remove redundant parameters. Finally, we implement our designs on an FPGA board using 8-bit quantisation, and the accelerator achieves outstanding performance. A real-time application of scene segmentation on KITTI-Road is used to evaluate our algorithm, and the model achieves a $ 94.39\% $ MaxF score and minimum 14 ms runtime on 20W FPGA devices.
first_indexed	2024-03-12T00:23:44Z
format	Article
id	doaj.art-0823dd7eb5a44f45894615960debdd68
institution	Directory Open Access Journal
issn	0954-0091 1360-0494
language	English
last_indexed	2024-03-12T00:23:44Z
publishDate	2023-12-01
publisher	Taylor & Francis Group
record_format	Article
series	Connection Science
spelling	doaj.art-0823dd7eb5a44f45894615960debdd682023-09-15T10:48:01ZengTaylor & Francis GroupConnection Science0954-00911360-04942023-12-0135110.1080/09540091.2022.21593332159333FPGA-oriented lightweight multi-modal free-space detection networkFeiyi Fang0Junzhu Mao1Wei Yu2Jianfeng Lu3Nanjing University of Science and TechnologyNanjing University of Science and TechnologyBeijing RICH AI information Technology Co., LtdNanjing University of Science and TechnologyFor autonomous vehicles, free-space detection is an essential part of visual perception. With the development of multi-modal convolutional neural networks (CNNs) in recent years, the performance of driving scene semantic segmentation algorithms has been dramatically improved. Therefore most free-space detection algorithms are developed based on multiple sensors. However, multi-modal CNNs have high data throughput and contain a large number of computationally intensive convolution calculations, limiting their feasibility for real-time applications. Field Programmable Gate Arrays (FPGAs) provide a unique combination of flexibility, performance, and low power for these problems to accommodate multi-modal data and the computational acceleration of different compression algorithms. Network lightweight methods offer great assurance for facilitating the deployment of CNNs on such resource-constrained devices. In this paper, we propose a network lightweight method for a multi-modal free-space detection algorithm. We first propose an FPGA-friendly multi-modal free-space detection lightweight network. It comprises operators that FPGA prefers and achieves a $ 95.54\% $ MaxF score on the test set of KITTI-Road free-space detection tasks and 81 ms runtime when running on 700 W GPU devices. Then we present a pruning approach for this network to reduce the number of parameters in case the complete model exceeds the FPGA chip memory. The pruning is in two parts. For the feature extractors, we propose a data-dependent filter pruner according to the principle that the low-rank feature map contains less information. To not compromise the integrity of the multi-modal information, the pruner is independent for each modality. For the segmentation decoder, we apply a channel pruning approach to remove redundant parameters. Finally, we implement our designs on an FPGA board using 8-bit quantisation, and the accelerator achieves outstanding performance. A real-time application of scene segmentation on KITTI-Road is used to evaluate our algorithm, and the model achieves a $ 94.39\% $ MaxF score and minimum 14 ms runtime on 20W FPGA devices.http://dx.doi.org/10.1080/09540091.2022.2159333pruningfree-space detectionlightweight networkmulti-modal learningfpga
spellingShingle	Feiyi Fang Junzhu Mao Wei Yu Jianfeng Lu FPGA-oriented lightweight multi-modal free-space detection network Connection Science pruning free-space detection lightweight network multi-modal learning fpga
title	FPGA-oriented lightweight multi-modal free-space detection network
title_full	FPGA-oriented lightweight multi-modal free-space detection network
title_fullStr	FPGA-oriented lightweight multi-modal free-space detection network
title_full_unstemmed	FPGA-oriented lightweight multi-modal free-space detection network
title_short	FPGA-oriented lightweight multi-modal free-space detection network
title_sort	fpga oriented lightweight multi modal free space detection network
topic	pruning free-space detection lightweight network multi-modal learning fpga
url	http://dx.doi.org/10.1080/09540091.2022.2159333
work_keys_str_mv	AT feiyifang fpgaorientedlightweightmultimodalfreespacedetectionnetwork AT junzhumao fpgaorientedlightweightmultimodalfreespacedetectionnetwork AT weiyu fpgaorientedlightweightmultimodalfreespacedetectionnetwork AT jianfenglu fpgaorientedlightweightmultimodalfreespacedetectionnetwork

FPGA-oriented lightweight multi-modal free-space detection network

Similar Items