Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis

Abstract Background Detecting and counting wheat spikes is essential for predicting and measuring wheat yield. However, current wheat spike detection researches often directly apply the new network structure. There are few studies that can combine the prior knowledge of wheat spike size characterist...

Full description

Bibliographic Details
Main Authors: Jiawei Yan, Jianqing Zhao, Yucheng Cai, Suwan Wang, Xiaolei Qiu, Xia Yao, Yongchao Tian, Yan Zhu, Weixing Cao, Xiaohu Zhang
Format: Article
Language:English
Published: BMC 2023-05-01
Series:Plant Methods
Subjects:
Online Access:https://doi.org/10.1186/s13007-023-01020-2
_version_ 1797827536279830528
author Jiawei Yan
Jianqing Zhao
Yucheng Cai
Suwan Wang
Xiaolei Qiu
Xia Yao
Yongchao Tian
Yan Zhu
Weixing Cao
Xiaohu Zhang
author_facet Jiawei Yan
Jianqing Zhao
Yucheng Cai
Suwan Wang
Xiaolei Qiu
Xia Yao
Yongchao Tian
Yan Zhu
Weixing Cao
Xiaohu Zhang
author_sort Jiawei Yan
collection DOAJ
description Abstract Background Detecting and counting wheat spikes is essential for predicting and measuring wheat yield. However, current wheat spike detection researches often directly apply the new network structure. There are few studies that can combine the prior knowledge of wheat spike size characteristics to design a suitable wheat spike detection model. It remains unclear whether the complex detection layers of the network play their intended role. Results This study proposes an interpretive analysis method for quantitatively evaluating the role of three-scale detection layers in a deep learning-based wheat spike detection model. The attention scores in each detection layer of the YOLOv5 network are calculated using the Gradient-weighted Class Activation Mapping (Grad-CAM) algorithm, which compares the prior labeled wheat spike bounding boxes with the attention areas of the network. By refining the multi-scale detection layers using the attention scores, a better wheat spike detection network is obtained. The experiments on the Global Wheat Head Detection (GWHD) dataset show that the large-scale detection layer performs poorly, while the medium-scale detection layer performs best among the three-scale detection layers. Consequently, the large-scale detection layer is removed, a micro-scale detection layer is added, and the feature extraction ability in the medium-scale detection layer is enhanced. The refined model increases the detection accuracy and reduces the network complexity by decreasing the network parameters. Conclusion The proposed interpretive analysis method to evaluate the contribution of different detection layers in the wheat spike detection network and provide a correct network improvement scheme. The findings of this study will offer a useful reference for future applications of deep network refinement in this field.
first_indexed 2024-04-09T12:49:53Z
format Article
id doaj.art-d33294198d7a43cba4a3bfecdf6141c3
institution Directory Open Access Journal
issn 1746-4811
language English
last_indexed 2024-04-09T12:49:53Z
publishDate 2023-05-01
publisher BMC
record_format Article
series Plant Methods
spelling doaj.art-d33294198d7a43cba4a3bfecdf6141c32023-05-14T11:18:07ZengBMCPlant Methods1746-48112023-05-0119111310.1186/s13007-023-01020-2Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysisJiawei Yan0Jianqing Zhao1Yucheng Cai2Suwan Wang3Xiaolei Qiu4Xia Yao5Yongchao Tian6Yan Zhu7Weixing Cao8Xiaohu Zhang9National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityNational Engineering and Technology Center for Information Agriculture, Nanjing Agricultural UniversityAbstract Background Detecting and counting wheat spikes is essential for predicting and measuring wheat yield. However, current wheat spike detection researches often directly apply the new network structure. There are few studies that can combine the prior knowledge of wheat spike size characteristics to design a suitable wheat spike detection model. It remains unclear whether the complex detection layers of the network play their intended role. Results This study proposes an interpretive analysis method for quantitatively evaluating the role of three-scale detection layers in a deep learning-based wheat spike detection model. The attention scores in each detection layer of the YOLOv5 network are calculated using the Gradient-weighted Class Activation Mapping (Grad-CAM) algorithm, which compares the prior labeled wheat spike bounding boxes with the attention areas of the network. By refining the multi-scale detection layers using the attention scores, a better wheat spike detection network is obtained. The experiments on the Global Wheat Head Detection (GWHD) dataset show that the large-scale detection layer performs poorly, while the medium-scale detection layer performs best among the three-scale detection layers. Consequently, the large-scale detection layer is removed, a micro-scale detection layer is added, and the feature extraction ability in the medium-scale detection layer is enhanced. The refined model increases the detection accuracy and reduces the network complexity by decreasing the network parameters. Conclusion The proposed interpretive analysis method to evaluate the contribution of different detection layers in the wheat spike detection network and provide a correct network improvement scheme. The findings of this study will offer a useful reference for future applications of deep network refinement in this field.https://doi.org/10.1186/s13007-023-01020-2Wheat spike detectionDeep learning networkAttention scoreInterpretive analysis
spellingShingle Jiawei Yan
Jianqing Zhao
Yucheng Cai
Suwan Wang
Xiaolei Qiu
Xia Yao
Yongchao Tian
Yan Zhu
Weixing Cao
Xiaohu Zhang
Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
Plant Methods
Wheat spike detection
Deep learning network
Attention score
Interpretive analysis
title Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_full Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_fullStr Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_full_unstemmed Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_short Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
title_sort improving multi scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
topic Wheat spike detection
Deep learning network
Attention score
Interpretive analysis
url https://doi.org/10.1186/s13007-023-01020-2
work_keys_str_mv AT jiaweiyan improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis
AT jianqingzhao improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis
AT yuchengcai improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis
AT suwanwang improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis
AT xiaoleiqiu improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis
AT xiayao improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis
AT yongchaotian improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis
AT yanzhu improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis
AT weixingcao improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis
AT xiaohuzhang improvingmultiscaledetectionlayersinthedeeplearningnetworkforwheatspikedetectionbasedoninterpretiveanalysis