Image-to-character-to-word transformers for accurate scene text recognition

Leveraging the advances of natural language processing, most recent scene text recognizers adopt an encoder-decoder architecture where text images are first converted to representative features and then a sequence of characters via 'sequential decoding'. However, scene text images suffer f...

Full description

Bibliographic Details
Main Authors: Xue, Chuhui, Huang, Jiaxing, Zhang, Wenqing, Lu, Shijian, Wang, Changhu, Bai, Song
Other Authors: School of Computer Science and Engineering
Format: Journal Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/172173