Language matters: a weakly SupervisedVision-Language pre-training approach for scene text detection and spotting
Recently, Vision-Language Pre-training (VLP) techniques have greatly benefited various vision-language tasks by jointly learning visual and textual representations, which intuitively helps in Optical Character Recognition (OCR) tasks due to the rich visual and textual information in scene text image...
Main Authors: | , , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
Springer
2022
|