End-to-End Pavement Crack Detection Method Based on Transformer
Aiming at the problem of low detection accuracy caused by irregular crack shape and complex background in pavement crack detection scene, an end-to-end pavement crack detection method based on transformer, CrackFormerNet, was proposed. First, in the feature extraction stage, a multi-scale feature fu...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Editorial Office of Journal of Taiyuan University of Technology
2022-11-01
|
Series: | Taiyuan Ligong Daxue xuebao |
Subjects: | |
Online Access: | https://tyutjournal.tyut.edu.cn/englishpaper/show-1999.html |
_version_ | 1797208322671116288 |
---|---|
author | Jun LIU Huimin WANG Xingzhong ZHANG Ting ZHANG Meiqing GUO |
author_facet | Jun LIU Huimin WANG Xingzhong ZHANG Ting ZHANG Meiqing GUO |
author_sort | Jun LIU |
collection | DOAJ |
description | Aiming at the problem of low detection accuracy caused by irregular crack shape and complex background in pavement crack detection scene, an end-to-end pavement crack detection method based on transformer, CrackFormerNet, was proposed. First, in the feature extraction stage, a multi-scale feature fusion mechanism is introduced, and a Multi-Scale Transformer feature extraction network is designed to fuse the feature maps of different downsampling magnifications in the Swin Transformer process to extract crack texture features with rich details. Second, a joint regression loss function based on CIoU Loss and L1 Loss is proposed to measure the distance between the predicted box and the label, and more accurately evaluate the detection effect of the predicted box. At the same time, in order to deal with the problem of slow convergence of the transformer model, the model convergence is accelerated by using the Pre-LN Transformer structure in the encoder-decoder stage and using layer normalization inside the residual block. The experimental results show that the MAP of the method in this paper reaches 84.2%. Compared with the mainstream benchmark methods, the proposed method gives significantly improved detection accuracy in the pavement crack detection task. Compared with DETR detection method, the is compressed the proposed method in model convergence round by 18.4%, while is improved in detection accuracy by 3.6%, which proves the effectiveness of this method. |
first_indexed | 2024-04-24T09:36:58Z |
format | Article |
id | doaj.art-de9252cffe004e13bdb131c8dde103db |
institution | Directory Open Access Journal |
issn | 1007-9432 |
language | English |
last_indexed | 2024-04-24T09:36:58Z |
publishDate | 2022-11-01 |
publisher | Editorial Office of Journal of Taiyuan University of Technology |
record_format | Article |
series | Taiyuan Ligong Daxue xuebao |
spelling | doaj.art-de9252cffe004e13bdb131c8dde103db2024-04-15T09:16:10ZengEditorial Office of Journal of Taiyuan University of TechnologyTaiyuan Ligong Daxue xuebao1007-94322022-11-015361143115110.16355/j.cnki.issn1007-9432tyut.2022.06.0211007-9432(2022)06-1143-09End-to-End Pavement Crack Detection Method Based on TransformerJun LIU0Huimin WANG1Xingzhong ZHANG2Ting ZHANG3Meiqing GUO4College of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaCollege of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaCollege of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaCollege of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaCollege of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaAiming at the problem of low detection accuracy caused by irregular crack shape and complex background in pavement crack detection scene, an end-to-end pavement crack detection method based on transformer, CrackFormerNet, was proposed. First, in the feature extraction stage, a multi-scale feature fusion mechanism is introduced, and a Multi-Scale Transformer feature extraction network is designed to fuse the feature maps of different downsampling magnifications in the Swin Transformer process to extract crack texture features with rich details. Second, a joint regression loss function based on CIoU Loss and L1 Loss is proposed to measure the distance between the predicted box and the label, and more accurately evaluate the detection effect of the predicted box. At the same time, in order to deal with the problem of slow convergence of the transformer model, the model convergence is accelerated by using the Pre-LN Transformer structure in the encoder-decoder stage and using layer normalization inside the residual block. The experimental results show that the MAP of the method in this paper reaches 84.2%. Compared with the mainstream benchmark methods, the proposed method gives significantly improved detection accuracy in the pavement crack detection task. Compared with DETR detection method, the is compressed the proposed method in model convergence round by 18.4%, while is improved in detection accuracy by 3.6%, which proves the effectiveness of this method.https://tyutjournal.tyut.edu.cn/englishpaper/show-1999.htmlpavement crack detectionmulti-scale feature fusionpre-ln transformer networkjoint regression lossend-to-end |
spellingShingle | Jun LIU Huimin WANG Xingzhong ZHANG Ting ZHANG Meiqing GUO End-to-End Pavement Crack Detection Method Based on Transformer Taiyuan Ligong Daxue xuebao pavement crack detection multi-scale feature fusion pre-ln transformer network joint regression loss end-to-end |
title | End-to-End Pavement Crack Detection Method Based on Transformer |
title_full | End-to-End Pavement Crack Detection Method Based on Transformer |
title_fullStr | End-to-End Pavement Crack Detection Method Based on Transformer |
title_full_unstemmed | End-to-End Pavement Crack Detection Method Based on Transformer |
title_short | End-to-End Pavement Crack Detection Method Based on Transformer |
title_sort | end to end pavement crack detection method based on transformer |
topic | pavement crack detection multi-scale feature fusion pre-ln transformer network joint regression loss end-to-end |
url | https://tyutjournal.tyut.edu.cn/englishpaper/show-1999.html |
work_keys_str_mv | AT junliu endtoendpavementcrackdetectionmethodbasedontransformer AT huiminwang endtoendpavementcrackdetectionmethodbasedontransformer AT xingzhongzhang endtoendpavementcrackdetectionmethodbasedontransformer AT tingzhang endtoendpavementcrackdetectionmethodbasedontransformer AT meiqingguo endtoendpavementcrackdetectionmethodbasedontransformer |