Development of Vertical Text Interpreter for Natural Scene Images

Automatic text recognition in natural scene images is essential for accessing information and understanding our surroundings. Scene text orientations include horizontal scene texts, arbitrarily oriented scene texts, curved scene texts, and vertically oriented scene texts. While attention has been gi...

Full description

Bibliographic Details
Main Authors: Ong Yi Ling, Lau Bee Theng, Almon Chai Weiyen, Christopher Mccarthy
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9580909/
_version_ 1818837136978214912
author Ong Yi Ling
Lau Bee Theng
Almon Chai Weiyen
Christopher Mccarthy
author_facet Ong Yi Ling
Lau Bee Theng
Almon Chai Weiyen
Christopher Mccarthy
author_sort Ong Yi Ling
collection DOAJ
description Automatic text recognition in natural scene images is essential for accessing information and understanding our surroundings. Scene text orientations include horizontal scene texts, arbitrarily oriented scene texts, curved scene texts, and vertically oriented scene texts. While attention has been given to horizontal, arbitrarily oriented, and curved text, limited research has been carried out on vertically oriented scene text recognition. To this end, we propose Vertical Text Interpreter, an autonomous vertically oriented scene text recognizer model. Vertical Text Interpreter detects and recognizes vertically oriented scene texts in natural scenes, including vertically-stacked texts, bottom-to-top vertical texts, and top-to-bottom vertical texts. It consists of a shared convolutional neural network, a Vertical Text Spotter, and a Vertical Text Reader. We developed a dataset, namely Vertically Oriented Scene Text 1250 Dataset, created as part of this research, addressing the need for a dataset for this category of scene texts. The performance of the Vertical Text Interpreter is evaluated using benchmark datasets and the VOST-1250 dataset. Results show that Vertical Text Interpreter can detect and recognize different types of vertically oriented scene texts simultaneously. For future work, we can explore Vertical Text Interpreter for the contexts such as reading assistance and visual navigation systems.
first_indexed 2024-12-19T03:17:43Z
format Article
id doaj.art-35cbd862fb13404c9f4c024f90235616
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-19T03:17:43Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-35cbd862fb13404c9f4c024f902356162022-12-21T20:37:51ZengIEEEIEEE Access2169-35362021-01-01914434114435110.1109/ACCESS.2021.31216089580909Development of Vertical Text Interpreter for Natural Scene ImagesOng Yi Ling0https://orcid.org/0000-0003-3728-1187Lau Bee Theng1Almon Chai Weiyen2Christopher Mccarthy3https://orcid.org/0000-0003-3848-1631Faculty of Engineering, Computing, and Science, Swinburne University of Technology Sarawak Campus, Kuching, Sarawak, MalaysiaFaculty of Engineering, Computing, and Science, Swinburne University of Technology Sarawak Campus, Kuching, Sarawak, MalaysiaFaculty of Engineering, Computing, and Science, Swinburne University of Technology Sarawak Campus, Kuching, Sarawak, MalaysiaFaculty of Science, Engineering and Technology, Swinburne University of Technology, Melbourne, VIC, AustraliaAutomatic text recognition in natural scene images is essential for accessing information and understanding our surroundings. Scene text orientations include horizontal scene texts, arbitrarily oriented scene texts, curved scene texts, and vertically oriented scene texts. While attention has been given to horizontal, arbitrarily oriented, and curved text, limited research has been carried out on vertically oriented scene text recognition. To this end, we propose Vertical Text Interpreter, an autonomous vertically oriented scene text recognizer model. Vertical Text Interpreter detects and recognizes vertically oriented scene texts in natural scenes, including vertically-stacked texts, bottom-to-top vertical texts, and top-to-bottom vertical texts. It consists of a shared convolutional neural network, a Vertical Text Spotter, and a Vertical Text Reader. We developed a dataset, namely Vertically Oriented Scene Text 1250 Dataset, created as part of this research, addressing the need for a dataset for this category of scene texts. The performance of the Vertical Text Interpreter is evaluated using benchmark datasets and the VOST-1250 dataset. Results show that Vertical Text Interpreter can detect and recognize different types of vertically oriented scene texts simultaneously. For future work, we can explore Vertical Text Interpreter for the contexts such as reading assistance and visual navigation systems.https://ieeexplore.ieee.org/document/9580909/Deep learningscene text detectionscene text recognitionshared convolutionvertical text interpreter
spellingShingle Ong Yi Ling
Lau Bee Theng
Almon Chai Weiyen
Christopher Mccarthy
Development of Vertical Text Interpreter for Natural Scene Images
IEEE Access
Deep learning
scene text detection
scene text recognition
shared convolution
vertical text interpreter
title Development of Vertical Text Interpreter for Natural Scene Images
title_full Development of Vertical Text Interpreter for Natural Scene Images
title_fullStr Development of Vertical Text Interpreter for Natural Scene Images
title_full_unstemmed Development of Vertical Text Interpreter for Natural Scene Images
title_short Development of Vertical Text Interpreter for Natural Scene Images
title_sort development of vertical text interpreter for natural scene images
topic Deep learning
scene text detection
scene text recognition
shared convolution
vertical text interpreter
url https://ieeexplore.ieee.org/document/9580909/
work_keys_str_mv AT ongyiling developmentofverticaltextinterpreterfornaturalsceneimages
AT laubeetheng developmentofverticaltextinterpreterfornaturalsceneimages
AT almonchaiweiyen developmentofverticaltextinterpreterfornaturalsceneimages
AT christophermccarthy developmentofverticaltextinterpreterfornaturalsceneimages