Automatic video genre classification with visual words

Automated content analysis has been growing popular in the research field given the vast and increasing amount of digital content. Content analysis is applicable in many areas including content management, searching and browsing. It is going to transcend the need to manually process digital content....

Full description

Bibliographic Details
Main Author: Vu, Minh Khue
Other Authors: Teoh Eam Khwang
Format: Final Year Project (FYP)
Language:English
Published: 2011
Subjects:
Online Access:http://hdl.handle.net/10356/44274
_version_ 1826122594307276800
author Vu, Minh Khue
author2 Teoh Eam Khwang
author_facet Teoh Eam Khwang
Vu, Minh Khue
author_sort Vu, Minh Khue
collection NTU
description Automated content analysis has been growing popular in the research field given the vast and increasing amount of digital content. Content analysis is applicable in many areas including content management, searching and browsing. It is going to transcend the need to manually process digital content. One of the promising topics is automatic video content classification. Numerous research works have been done on this topic. The result, however, have not been very attractive. This project aims to develop a reliable framework to automatically classify content of a video stream. It proposes to apply bag-of-words, a well-known method in text processing literature to the problem of video content classification. Recently this method has received attention in some problem domain such as object retrieval. Bag-of-words characterizes a text document by occurrences of different words and their frequencies of occurrence. This project builds the visual analogy of word and represents visual documents based on this analogy. Text classification techniques are then applied. Two major visual features, Scale-Invariant Feature Transform (SIFT) and Gabor, are evaluated in implementing bag-of-words. The implementation with SIFT is found to be more robust. Bag-of-words’ performance is also empirically proven to be more effective than the alternative of using global Gabor method. An automatic video genre classification framework is developed based on these results. Its scope is limited to sport videos. Four genres are experimented with: football, basketball, golf and tennis. The classification result is very promising. The overall accuracy rate is 91 percent. The algorithm’s speed, however, still needs to be further improved.
first_indexed 2024-10-01T05:50:42Z
format Final Year Project (FYP)
id ntu-10356/44274
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:50:42Z
publishDate 2011
record_format dspace
spelling ntu-10356/442742023-07-07T17:24:50Z Automatic video genre classification with visual words Vu, Minh Khue Teoh Eam Khwang School of Electrical and Electronic Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Automated content analysis has been growing popular in the research field given the vast and increasing amount of digital content. Content analysis is applicable in many areas including content management, searching and browsing. It is going to transcend the need to manually process digital content. One of the promising topics is automatic video content classification. Numerous research works have been done on this topic. The result, however, have not been very attractive. This project aims to develop a reliable framework to automatically classify content of a video stream. It proposes to apply bag-of-words, a well-known method in text processing literature to the problem of video content classification. Recently this method has received attention in some problem domain such as object retrieval. Bag-of-words characterizes a text document by occurrences of different words and their frequencies of occurrence. This project builds the visual analogy of word and represents visual documents based on this analogy. Text classification techniques are then applied. Two major visual features, Scale-Invariant Feature Transform (SIFT) and Gabor, are evaluated in implementing bag-of-words. The implementation with SIFT is found to be more robust. Bag-of-words’ performance is also empirically proven to be more effective than the alternative of using global Gabor method. An automatic video genre classification framework is developed based on these results. Its scope is limited to sport videos. Four genres are experimented with: football, basketball, golf and tennis. The classification result is very promising. The overall accuracy rate is 91 percent. The algorithm’s speed, however, still needs to be further improved. Bachelor of Engineering 2011-05-31T07:58:39Z 2011-05-31T07:58:39Z 2011 2011 Final Year Project (FYP) http://hdl.handle.net/10356/44274 en Nanyang Technological University 93 p. application/pdf
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Vu, Minh Khue
Automatic video genre classification with visual words
title Automatic video genre classification with visual words
title_full Automatic video genre classification with visual words
title_fullStr Automatic video genre classification with visual words
title_full_unstemmed Automatic video genre classification with visual words
title_short Automatic video genre classification with visual words
title_sort automatic video genre classification with visual words
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
url http://hdl.handle.net/10356/44274
work_keys_str_mv AT vuminhkhue automaticvideogenreclassificationwithvisualwords