Research on Small Acceptance Domain Text Detection Algorithm Based on Attention Mechanism and Hybrid Feature Pyramid

In the traditional text detection process, the text area of the small receptive field in the video image is easily ignored, the features that can be extracted are few, and the calculation is large. These problems are not conducive to the recognition of text information. In this paper, a lightweight...

Full description

Bibliographic Details
Main Authors: Mingzhu Liu, Ben Li, Wei Zhang
Format: Article
Language:English
Published: MDPI AG 2022-10-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/11/21/3559
_version_ 1797468537936150528
author Mingzhu Liu
Ben Li
Wei Zhang
author_facet Mingzhu Liu
Ben Li
Wei Zhang
author_sort Mingzhu Liu
collection DOAJ
description In the traditional text detection process, the text area of the small receptive field in the video image is easily ignored, the features that can be extracted are few, and the calculation is large. These problems are not conducive to the recognition of text information. In this paper, a lightweight network structure on the basis of the EAST algorithm, the Convolution Block Attention Module (CBAM), is proposed. It is suitable for the spatial and channel hybrid attention module of text feature extraction of the natural scene video images. The improved structure proposed in this paper can obtain deep network features of text and reduce the computation of text feature extraction. Additionally, a hybrid feature pyramid + BLSTM network is designed to improve the attention to the small acceptance domain text regions and the text sequence features of the region. The test results on the ICDAR2015 demonstrate that the improved construction can effectively boost the attention of small acceptance domain text regions and improve the sequence feature detection accuracy of small acceptance domain of long text regions without significantly increasing computation. At the same time, the proposed network constructions are superior to the traditional EAST algorithm and other improved algorithms in accuracy rate P, recall rate R, and F-value.
first_indexed 2024-03-09T19:07:51Z
format Article
id doaj.art-dc9f791b9c844d73895b2464bb2cfdb0
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-09T19:07:51Z
publishDate 2022-10-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-dc9f791b9c844d73895b2464bb2cfdb02023-11-24T04:25:53ZengMDPI AGElectronics2079-92922022-10-011121355910.3390/electronics11213559Research on Small Acceptance Domain Text Detection Algorithm Based on Attention Mechanism and Hybrid Feature PyramidMingzhu Liu0Ben Li1Wei Zhang2The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin 150080, ChinaThe Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin 150080, ChinaThe Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin 150080, ChinaIn the traditional text detection process, the text area of the small receptive field in the video image is easily ignored, the features that can be extracted are few, and the calculation is large. These problems are not conducive to the recognition of text information. In this paper, a lightweight network structure on the basis of the EAST algorithm, the Convolution Block Attention Module (CBAM), is proposed. It is suitable for the spatial and channel hybrid attention module of text feature extraction of the natural scene video images. The improved structure proposed in this paper can obtain deep network features of text and reduce the computation of text feature extraction. Additionally, a hybrid feature pyramid + BLSTM network is designed to improve the attention to the small acceptance domain text regions and the text sequence features of the region. The test results on the ICDAR2015 demonstrate that the improved construction can effectively boost the attention of small acceptance domain text regions and improve the sequence feature detection accuracy of small acceptance domain of long text regions without significantly increasing computation. At the same time, the proposed network constructions are superior to the traditional EAST algorithm and other improved algorithms in accuracy rate P, recall rate R, and F-value.https://www.mdpi.com/2079-9292/11/21/3559lightweight network structurehybrid feature pyramidBLSTM networkacceptance domainattention mechanism
spellingShingle Mingzhu Liu
Ben Li
Wei Zhang
Research on Small Acceptance Domain Text Detection Algorithm Based on Attention Mechanism and Hybrid Feature Pyramid
Electronics
lightweight network structure
hybrid feature pyramid
BLSTM network
acceptance domain
attention mechanism
title Research on Small Acceptance Domain Text Detection Algorithm Based on Attention Mechanism and Hybrid Feature Pyramid
title_full Research on Small Acceptance Domain Text Detection Algorithm Based on Attention Mechanism and Hybrid Feature Pyramid
title_fullStr Research on Small Acceptance Domain Text Detection Algorithm Based on Attention Mechanism and Hybrid Feature Pyramid
title_full_unstemmed Research on Small Acceptance Domain Text Detection Algorithm Based on Attention Mechanism and Hybrid Feature Pyramid
title_short Research on Small Acceptance Domain Text Detection Algorithm Based on Attention Mechanism and Hybrid Feature Pyramid
title_sort research on small acceptance domain text detection algorithm based on attention mechanism and hybrid feature pyramid
topic lightweight network structure
hybrid feature pyramid
BLSTM network
acceptance domain
attention mechanism
url https://www.mdpi.com/2079-9292/11/21/3559
work_keys_str_mv AT mingzhuliu researchonsmallacceptancedomaintextdetectionalgorithmbasedonattentionmechanismandhybridfeaturepyramid
AT benli researchonsmallacceptancedomaintextdetectionalgorithmbasedonattentionmechanismandhybridfeaturepyramid
AT weizhang researchonsmallacceptancedomaintextdetectionalgorithmbasedonattentionmechanismandhybridfeaturepyramid