A Fast Scene Text Detector Using Knowledge Distillation

Incidental scene text detection is a challenging problem because of arbitrary orientation, low resolution, perspective distortion, and variant aspect ratios of text in natural images. In this paper, we present an end-to-end trainable deep model, which can effectively and efficiently locate multi-ori...

Full description

Bibliographic Details
Main Authors: Peng Yang, Fanlong Zhang, Guowei Yang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8626192/
_version_ 1828967018202660864
author Peng Yang
Fanlong Zhang
Guowei Yang
author_facet Peng Yang
Fanlong Zhang
Guowei Yang
author_sort Peng Yang
collection DOAJ
description Incidental scene text detection is a challenging problem because of arbitrary orientation, low resolution, perspective distortion, and variant aspect ratios of text in natural images. In this paper, we present an end-to-end trainable deep model, which can effectively and efficiently locate multi-oriented scene text. Our detector includes a student network and a teacher network, and they inherit complex VGGNet and lightweight PVANet architecture, respectively. While deploying for text detection, the teacher network is used to guide the training process of a student via knowledge distilling so as to maintain the tradeoff between accuracy and efficiency. We have evaluated the proposed detector on three popular benchmarks, and it achieves F-measures of 83.7%, 57.27%, and 90% on ICDAR2015 Incidental Scene Text, COCO-Text, and ICDAR2013, respectively, which outperforms the most state-of-the-art methods.
first_indexed 2024-12-14T11:42:59Z
format Article
id doaj.art-9c5c5fc306684054b245ab16cf0b9e9d
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-14T11:42:59Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-9c5c5fc306684054b245ab16cf0b9e9d2022-12-21T23:02:44ZengIEEEIEEE Access2169-35362019-01-017225882259810.1109/ACCESS.2019.28953308626192A Fast Scene Text Detector Using Knowledge DistillationPeng Yang0https://orcid.org/0000-0002-1505-7857Fanlong Zhang1https://orcid.org/0000-0001-8865-9683Guowei Yang2School of Information Engineering, Nanjing Audit University, Jiangshu, ChinaSchool of Information Engineering, Nanjing Audit University, Jiangshu, ChinaSchool of Information Engineering, Nanjing Audit University, Jiangshu, ChinaIncidental scene text detection is a challenging problem because of arbitrary orientation, low resolution, perspective distortion, and variant aspect ratios of text in natural images. In this paper, we present an end-to-end trainable deep model, which can effectively and efficiently locate multi-oriented scene text. Our detector includes a student network and a teacher network, and they inherit complex VGGNet and lightweight PVANet architecture, respectively. While deploying for text detection, the teacher network is used to guide the training process of a student via knowledge distilling so as to maintain the tradeoff between accuracy and efficiency. We have evaluated the proposed detector on three popular benchmarks, and it achieves F-measures of 83.7%, 57.27%, and 90% on ICDAR2015 Incidental Scene Text, COCO-Text, and ICDAR2013, respectively, which outperforms the most state-of-the-art methods.https://ieeexplore.ieee.org/document/8626192/Scene text detectionmulti-oriented textdeep neural networkknowledge distilling
spellingShingle Peng Yang
Fanlong Zhang
Guowei Yang
A Fast Scene Text Detector Using Knowledge Distillation
IEEE Access
Scene text detection
multi-oriented text
deep neural network
knowledge distilling
title A Fast Scene Text Detector Using Knowledge Distillation
title_full A Fast Scene Text Detector Using Knowledge Distillation
title_fullStr A Fast Scene Text Detector Using Knowledge Distillation
title_full_unstemmed A Fast Scene Text Detector Using Knowledge Distillation
title_short A Fast Scene Text Detector Using Knowledge Distillation
title_sort fast scene text detector using knowledge distillation
topic Scene text detection
multi-oriented text
deep neural network
knowledge distilling
url https://ieeexplore.ieee.org/document/8626192/
work_keys_str_mv AT pengyang afastscenetextdetectorusingknowledgedistillation
AT fanlongzhang afastscenetextdetectorusingknowledgedistillation
AT guoweiyang afastscenetextdetectorusingknowledgedistillation
AT pengyang fastscenetextdetectorusingknowledgedistillation
AT fanlongzhang fastscenetextdetectorusingknowledgedistillation
AT guoweiyang fastscenetextdetectorusingknowledgedistillation