Image-to-character-to-word transformers for accurate scene text recognition

Leveraging the advances of natural language processing, most recent scene text recognizers adopt an encoder-decoder architecture where text images are first converted to representative features and then a sequence of characters via 'sequential decoding'. However, scene text images suffer f...

Ful tanımlama

Detaylı Bibliyografya
Asıl Yazarlar: Xue, Chuhui, Huang, Jiaxing, Zhang, Wenqing, Lu, Shijian, Wang, Changhu, Bai, Song
Diğer Yazarlar: School of Computer Science and Engineering
Materyal Türü: Journal Article
Dil:English
Baskı/Yayın Bilgisi: 2023
Konular:
Online Erişim:https://hdl.handle.net/10356/172173