Chinese text-line detection from web videos with fully convolutional networks
Abstract Background In recent years, video becomes the dominant resource of information on the Web, where the text within video usually carries significant semantic information. Video text extraction and recognition plays an essential role in web multimedia understanding and retrieval for big visual...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2018-01-01
|
Series: | Big Data Analytics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s41044-017-0028-2 |
_version_ | 1819098816532447232 |
---|---|
author | Chun Yang Wei-Yi Pei Long-Huang Wu Xu-Cheng Yin |
author_facet | Chun Yang Wei-Yi Pei Long-Huang Wu Xu-Cheng Yin |
author_sort | Chun Yang |
collection | DOAJ |
description | Abstract Background In recent years, video becomes the dominant resource of information on the Web, where the text within video usually carries significant semantic information. Video text extraction and recognition plays an essential role in web multimedia understanding and retrieval for big visual data analytics and applications. To deal with challenging backgrounds and embedding noises, most conventional approaches usually tend to design sophisticated pre-processing and post-progressing steps before and after text detection. In this paper, we present a simple yet powerful pipeline that directly and uniformly detects Chinese text lines for embedded captions from web videos. Results In this Chinese text-line detection system, a fully convolutional network with local context is adopted to localize via an end-to-end learning way. The produced caption predictions are with the word level that could be directly fed into the character classifier. Text-line construction is then performed by heuristic strategies. A variety of experiments are conducted on several real-world web video datasets and demonstrated the effectiveness and efficiency of our proposed method. Conclusion The proposed system can directly detect the English word and Chinese characters in the caption text-lines without word or character segmentation with the high performance on real-world web video datasets. |
first_indexed | 2024-12-22T00:37:00Z |
format | Article |
id | doaj.art-ecb674af0ee14d18b89bf77df41f7d40 |
institution | Directory Open Access Journal |
issn | 2058-6345 |
language | English |
last_indexed | 2024-12-22T00:37:00Z |
publishDate | 2018-01-01 |
publisher | BMC |
record_format | Article |
series | Big Data Analytics |
spelling | doaj.art-ecb674af0ee14d18b89bf77df41f7d402022-12-21T18:44:48ZengBMCBig Data Analytics2058-63452018-01-013111110.1186/s41044-017-0028-2Chinese text-line detection from web videos with fully convolutional networksChun Yang0Wei-Yi Pei1Long-Huang Wu2Xu-Cheng Yin3Department of Computer Science and Technology, University of Science and Technology BeijingDepartment of Computer Science and Technology, University of Science and Technology BeijingDepartment of Computer Science and Technology, University of Science and Technology BeijingDepartment of Computer Science and Technology, University of Science and Technology BeijingAbstract Background In recent years, video becomes the dominant resource of information on the Web, where the text within video usually carries significant semantic information. Video text extraction and recognition plays an essential role in web multimedia understanding and retrieval for big visual data analytics and applications. To deal with challenging backgrounds and embedding noises, most conventional approaches usually tend to design sophisticated pre-processing and post-progressing steps before and after text detection. In this paper, we present a simple yet powerful pipeline that directly and uniformly detects Chinese text lines for embedded captions from web videos. Results In this Chinese text-line detection system, a fully convolutional network with local context is adopted to localize via an end-to-end learning way. The produced caption predictions are with the word level that could be directly fed into the character classifier. Text-line construction is then performed by heuristic strategies. A variety of experiments are conducted on several real-world web video datasets and demonstrated the effectiveness and efficiency of our proposed method. Conclusion The proposed system can directly detect the English word and Chinese characters in the caption text-lines without word or character segmentation with the high performance on real-world web video datasets.http://link.springer.com/article/10.1186/s41044-017-0028-2Video text detectionText segmentationFully convolutional networksEmbedded captionsWeb videos |
spellingShingle | Chun Yang Wei-Yi Pei Long-Huang Wu Xu-Cheng Yin Chinese text-line detection from web videos with fully convolutional networks Big Data Analytics Video text detection Text segmentation Fully convolutional networks Embedded captions Web videos |
title | Chinese text-line detection from web videos with fully convolutional networks |
title_full | Chinese text-line detection from web videos with fully convolutional networks |
title_fullStr | Chinese text-line detection from web videos with fully convolutional networks |
title_full_unstemmed | Chinese text-line detection from web videos with fully convolutional networks |
title_short | Chinese text-line detection from web videos with fully convolutional networks |
title_sort | chinese text line detection from web videos with fully convolutional networks |
topic | Video text detection Text segmentation Fully convolutional networks Embedded captions Web videos |
url | http://link.springer.com/article/10.1186/s41044-017-0028-2 |
work_keys_str_mv | AT chunyang chinesetextlinedetectionfromwebvideoswithfullyconvolutionalnetworks AT weiyipei chinesetextlinedetectionfromwebvideoswithfullyconvolutionalnetworks AT longhuangwu chinesetextlinedetectionfromwebvideoswithfullyconvolutionalnetworks AT xuchengyin chinesetextlinedetectionfromwebvideoswithfullyconvolutionalnetworks |