End-to-End Pavement Crack Detection Method Based on Transformer

Aiming at the problem of low detection accuracy caused by irregular crack shape and complex background in pavement crack detection scene, an end-to-end pavement crack detection method based on transformer, CrackFormerNet, was proposed. First, in the feature extraction stage, a multi-scale feature fu...

Full description

Bibliographic Details
Main Authors: Jun LIU, Huimin WANG, Xingzhong ZHANG, Ting ZHANG, Meiqing GUO
Format: Article
Language:English
Published: Editorial Office of Journal of Taiyuan University of Technology 2022-11-01
Series:Taiyuan Ligong Daxue xuebao
Subjects:
Online Access:https://tyutjournal.tyut.edu.cn/englishpaper/show-1999.html
_version_ 1797208322671116288
author Jun LIU
Huimin WANG
Xingzhong ZHANG
Ting ZHANG
Meiqing GUO
author_facet Jun LIU
Huimin WANG
Xingzhong ZHANG
Ting ZHANG
Meiqing GUO
author_sort Jun LIU
collection DOAJ
description Aiming at the problem of low detection accuracy caused by irregular crack shape and complex background in pavement crack detection scene, an end-to-end pavement crack detection method based on transformer, CrackFormerNet, was proposed. First, in the feature extraction stage, a multi-scale feature fusion mechanism is introduced, and a Multi-Scale Transformer feature extraction network is designed to fuse the feature maps of different downsampling magnifications in the Swin Transformer process to extract crack texture features with rich details. Second, a joint regression loss function based on CIoU Loss and L1 Loss is proposed to measure the distance between the predicted box and the label, and more accurately evaluate the detection effect of the predicted box. At the same time, in order to deal with the problem of slow convergence of the transformer model, the model convergence is accelerated by using the Pre-LN Transformer structure in the encoder-decoder stage and using layer normalization inside the residual block. The experimental results show that the MAP of the method in this paper reaches 84.2%. Compared with the mainstream benchmark methods, the proposed method gives significantly improved detection accuracy in the pavement crack detection task. Compared with DETR detection method, the is compressed the proposed method in model convergence round by 18.4%, while is improved in detection accuracy by 3.6%, which proves the effectiveness of this method.
first_indexed 2024-04-24T09:36:58Z
format Article
id doaj.art-de9252cffe004e13bdb131c8dde103db
institution Directory Open Access Journal
issn 1007-9432
language English
last_indexed 2024-04-24T09:36:58Z
publishDate 2022-11-01
publisher Editorial Office of Journal of Taiyuan University of Technology
record_format Article
series Taiyuan Ligong Daxue xuebao
spelling doaj.art-de9252cffe004e13bdb131c8dde103db2024-04-15T09:16:10ZengEditorial Office of Journal of Taiyuan University of TechnologyTaiyuan Ligong Daxue xuebao1007-94322022-11-015361143115110.16355/j.cnki.issn1007-9432tyut.2022.06.0211007-9432(2022)06-1143-09End-to-End Pavement Crack Detection Method Based on TransformerJun LIU0Huimin WANG1Xingzhong ZHANG2Ting ZHANG3Meiqing GUO4College of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaCollege of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaCollege of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaCollege of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaCollege of Software, Taiyuan University of Technology, Taiyuan 030024, ChinaAiming at the problem of low detection accuracy caused by irregular crack shape and complex background in pavement crack detection scene, an end-to-end pavement crack detection method based on transformer, CrackFormerNet, was proposed. First, in the feature extraction stage, a multi-scale feature fusion mechanism is introduced, and a Multi-Scale Transformer feature extraction network is designed to fuse the feature maps of different downsampling magnifications in the Swin Transformer process to extract crack texture features with rich details. Second, a joint regression loss function based on CIoU Loss and L1 Loss is proposed to measure the distance between the predicted box and the label, and more accurately evaluate the detection effect of the predicted box. At the same time, in order to deal with the problem of slow convergence of the transformer model, the model convergence is accelerated by using the Pre-LN Transformer structure in the encoder-decoder stage and using layer normalization inside the residual block. The experimental results show that the MAP of the method in this paper reaches 84.2%. Compared with the mainstream benchmark methods, the proposed method gives significantly improved detection accuracy in the pavement crack detection task. Compared with DETR detection method, the is compressed the proposed method in model convergence round by 18.4%, while is improved in detection accuracy by 3.6%, which proves the effectiveness of this method.https://tyutjournal.tyut.edu.cn/englishpaper/show-1999.htmlpavement crack detectionmulti-scale feature fusionpre-ln transformer networkjoint regression lossend-to-end
spellingShingle Jun LIU
Huimin WANG
Xingzhong ZHANG
Ting ZHANG
Meiqing GUO
End-to-End Pavement Crack Detection Method Based on Transformer
Taiyuan Ligong Daxue xuebao
pavement crack detection
multi-scale feature fusion
pre-ln transformer network
joint regression loss
end-to-end
title End-to-End Pavement Crack Detection Method Based on Transformer
title_full End-to-End Pavement Crack Detection Method Based on Transformer
title_fullStr End-to-End Pavement Crack Detection Method Based on Transformer
title_full_unstemmed End-to-End Pavement Crack Detection Method Based on Transformer
title_short End-to-End Pavement Crack Detection Method Based on Transformer
title_sort end to end pavement crack detection method based on transformer
topic pavement crack detection
multi-scale feature fusion
pre-ln transformer network
joint regression loss
end-to-end
url https://tyutjournal.tyut.edu.cn/englishpaper/show-1999.html
work_keys_str_mv AT junliu endtoendpavementcrackdetectionmethodbasedontransformer
AT huiminwang endtoendpavementcrackdetectionmethodbasedontransformer
AT xingzhongzhang endtoendpavementcrackdetectionmethodbasedontransformer
AT tingzhang endtoendpavementcrackdetectionmethodbasedontransformer
AT meiqingguo endtoendpavementcrackdetectionmethodbasedontransformer