Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis

Abstract Background Detecting and counting wheat spikes is essential for predicting and measuring wheat yield. However, current wheat spike detection researches often directly apply the new network structure. There are few studies that can combine the prior knowledge of wheat spike size characterist...

Full description

Bibliographic Details
Main Authors:	Jiawei Yan, Jianqing Zhao, Yucheng Cai, Suwan Wang, Xiaolei Qiu, Xia Yao, Yongchao Tian, Yan Zhu, Weixing Cao, Xiaohu Zhang
Format:	Article
Language:	English
Published:	BMC 2023-05-01
Series:	Plant Methods
Subjects:	Wheat spike detection Deep learning network Attention score Interpretive analysis
Online Access:	https://doi.org/10.1186/s13007-023-01020-2

_version_	1797827536279830528
author	Jiawei Yan Jianqing Zhao Yucheng Cai Suwan Wang Xiaolei Qiu Xia Yao Yongchao Tian Yan Zhu Weixing Cao Xiaohu Zhang
author_facet	Jiawei Yan Jianqing Zhao Yucheng Cai Suwan Wang Xiaolei Qiu Xia Yao Yongchao Tian Yan Zhu Weixing Cao Xiaohu Zhang
author_sort	Jiawei Yan
collection	DOAJ
description	Abstract Background Detecting and counting wheat spikes is essential for predicting and measuring wheat yield. However, current wheat spike detection researches often directly apply the new network structure. There are few studies that can combine the prior knowledge of wheat spike size characteristics to design a suitable wheat spike detection model. It remains unclear whether the complex detection layers of the network play their intended role. Results This study proposes an interpretive analysis method for quantitatively evaluating the role of three-scale detection layers in a deep learning-based wheat spike detection model. The attention scores in each detection layer of the YOLOv5 network are calculated using the Gradient-weighted Class Activation Mapping (Grad-CAM) algorithm, which compares the prior labeled wheat spike bounding boxes with the attention areas of the network. By refining the multi-scale detection layers using the attention scores, a better wheat spike detection network is obtained. The experiments on the Global Wheat Head Detection (GWHD) dataset show that the large-scale detection layer performs poorly, while the medium-scale detection layer performs best among the three-scale detection layers. Consequently, the large-scale detection layer is removed, a micro-scale detection layer is added, and the feature extraction ability in the medium-scale detection layer is enhanced. The refined model increases the detection accuracy and reduces the network complexity by decreasing the network parameters. Conclusion The proposed interpretive analysis method to evaluate the contribution of different detection layers in the wheat spike detection network and provide a correct network improvement scheme. The findings of this study will offer a useful reference for future applications of deep network refinement in this field.
first_indexed	2024-04-09T12:49:53Z
format	Article
id	doaj.art-d33294198d7a43cba4a3bfecdf6141c3
institution	Directory Open Access Journal
issn	1746-4811
language	English
last_indexed	2024-04-09T12:49:53Z
publishDate	2023-05-01
publisher	BMC
record_format	Article
series	Plant Methods
spelling	doaj.art-d33294198d7a43cba4a3bfecdf6141c32023-05-14T11:18:07ZengBMCPlant Methods1746-48112023-05-0119111310.1186/s13007-023-01020-2Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysisJiawei Yan0Jianqing Zhao1Yucheng Cai2Suwan Wang3Xiaolei Qiu4Xia Yao5Yongchao Tian6Yan Zhu7Weixing Cao8Xiaohu Zhang9National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityAbstract Background Detecting and counting wheat spikes is essential for predicting and measuring wheat yield. However, current wheat spike detection researches often directly apply the new network structure. There are few studies that can combine the prior knowledge of wheat spike size characteristics to design a suitable wheat spike detection model. It remains unclear whether the complex detection layers of the network play their intended role. Results This study proposes an interpretive analysis method for quantitatively evaluating the role of three-scale detection layers in a deep learning-based wheat spike detection model. The attention scores in each detection layer of the YOLOv5 network are calculated using the Gradient-weighted Class Activation Mapping (Grad-CAM) algorithm, which compares the prior labeled wheat spike bounding boxes with the attention areas of the network. By refining the multi-scale detection layers using the attention scores, a better wheat spike detection network is obtained. The experiments on the Global Wheat Head Detection (GWHD) dataset show that the large-scale detection layer performs poorly, while the medium-scale detection layer performs best among the three-scale detection layers. Consequently, the large-scale detection layer is removed, a micro-scale detection layer is added, and the feature extraction ability in the medium-scale detection layer is enhanced. The refined model increases the detection accuracy and reduces the network complexity by decreasing the network parameters. Conclusion The proposed interpretive analysis method to evaluate the contribution of different detection layers in the wheat spike detection network and provide a correct network improvement scheme. The findings of this study will offer a useful reference for future applications of deep network refinement in this field.https://doi.org/10.1186/s13007-023-01020-2Wheat spike detectionDeep learning networkAttention scoreInterpretive analysis
spellingShingle	Jiawei Yan Jianqing Zhao Yucheng Cai Suwan Wang Xiaolei Qiu Xia Yao Yongchao Tian Yan Zhu Weixing Cao Xiaohu Zhang Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis Plant Methods Wheat spike detection Deep learning network Attention score Interpretive analysis
title	Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_full	Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_fullStr	Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_full_unstemmed	Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_short	Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_sort	improving multi scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
topic	Wheat spike detection Deep learning network Attention score Interpretive analysis
url	https://doi.org/10.1186/s13007-023-01020-2
work_keys_str_mv	AT jiaweiyan improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis AT jianqingzhao improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis AT yuchengcai improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis AT suwanwang improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis AT xiaoleiqiu improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis AT xiayao improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis AT yongchaotian improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis AT yanzhu improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis AT weixingcao improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis AT xiaohuzhang improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis

Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis

Similar Items