Chinese text-line detection from web videos with fully convolutional networks

Abstract Background: In recent years, video has become the dominant source of information on the Web, and the text within video usually carries significant semantic information. Video text extraction and recognition play an essential role in web multimedia understanding and retrieval for big visual...


Bibliographic Details
Main Authors: Chun Yang, Wei-Yi Pei, Long-Huang Wu, Xu-Cheng Yin
Format: Article
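Language: English
Published: BMC 2018-01-01
Series: Big Data Analytics
Subjects: Video text detection; Text segmentation; Fully convolutional networks; Embedded captions; Web videos
Online Access: http://link.springer.com/article/10.1186/s41044-017-0028-2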
_version_ 1819098816532447232
author Chun Yang
Wei-Yi Pei
Long-Huang Wu
Xu-Cheng Yin
author_sort Chun Yang
collection DOAJ
description Abstract Background: In recent years, video has become the dominant source of information on the Web, and the text within video usually carries significant semantic information. Video text extraction and recognition play an essential role in web multimedia understanding and retrieval for big visual data analytics and applications. To deal with challenging backgrounds and embedding noise, most conventional approaches design sophisticated pre-processing and post-processing steps before and after text detection. In this paper, we present a simple yet powerful pipeline that directly and uniformly detects Chinese text lines for embedded captions in web videos. Results: In this Chinese text-line detection system, a fully convolutional network with local context is adopted to localize captions through end-to-end learning. The resulting caption predictions are at the word level and can be fed directly into a character classifier. Text-line construction is then performed by heuristic strategies. A variety of experiments conducted on several real-world web video datasets demonstrate the effectiveness and efficiency of the proposed method. Conclusion: The proposed system can directly detect English words and Chinese characters in caption text lines without word or character segmentation, and it achieves high performance on real-world web video datasets.
first_indexed 2024-12-22T00:37:00Z
format Article
id doaj.art-ecb674af0ee14d18b89bf77df41f7d40
institution Directory Open Access Journal
issn 2058-6345
language English
last_indexed 2024-12-22T00:37:00Z
publishDate 2018-01-01
publisher BMC
record_format Article
series Big Data Analytics
spelling doaj.art-ecb674af0ee14d18b89bf77df41f7d40 2022-12-21T18:44:48Z eng BMC Big Data Analytics 2058-6345 2018-01-01 31111 10.1186/s41044-017-0028-2
Chinese text-line detection from web videos with fully convolutional networks
Chun Yang 0; Wei-Yi Pei 1; Long-Huang Wu 2; Xu-Cheng Yin 3
Department of Computer Science and Technology, University of Science and Technology Beijing (affiliation listed for each of the four authors)
http://link.springer.com/article/10.1186/s41044-017-0028-2
Video text detection; Text segmentation; Fully convolutional networks; Embedded captions; Web videos
title Chinese text-line detection from web videos with fully convolutional networks
topic Video text detection
Text segmentation
Fully convolutional networks
Embedded captions
Web videos
url http://link.springer.com/article/10.1186/s41044-017-0028-2