Language matters: a weakly supervised vision-language pre-training approach for scene text detection and spotting

Recently, Vision-Language Pre-training (VLP) techniques have greatly benefited various vision-language tasks by jointly learning visual and textual representations, which intuitively helps in Optical Character Recognition (OCR) tasks due to the rich visual and textual information in scene text image...

Bibliographic Details
Main Authors: Xue, C, Hao, Y, Lu, S, Torr, P, Bai, S
Format: Conference item
Language: English
Published: Springer 2022