Community tagging for mobile media

The main objective of this project is to develop an automatic image annotation system as manual tagging of images is a cumbersome process. This report propose to annotate images based on text information available in the image since text are useful for describing the content of an image and is a pow...

Full description

Bibliographic Details
Main Author: Chun, Gary Wei Qiang.
Other Authors: Yap Kim Hui
Format: Final Year Project (FYP)
Language:English
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/10356/17920
_version_ 1826125543004700672
author Chun, Gary Wei Qiang.
author2 Yap Kim Hui
author_facet Yap Kim Hui
Chun, Gary Wei Qiang.
author_sort Chun, Gary Wei Qiang.
collection NTU
description The main objective of this project is to develop an automatic image annotation system as manual tagging of images is a cumbersome process. This report propose to annotate images based on text information available in the image since text are useful for describing the content of an image and is a powerful source of high-level semantics. The most direct way of extracting text from an image is to use a commercial OCR. However OCR is found to perform well only on simple background images where the contrast of background to text is high. The OCR is unable to handle images of complicated background. As such, preprocessing of images is needed prior to feeding it to OCR for text recognition. Such preprocessing includes text segmentation and binarization. Text segmentation is used to segment the text from the complex background and text binarization is used to enhance the contrast of background to text for optimal OCR performance. This report discuss the various approach to text segmentation and text binarization and concludes that text segmentation using edge and texture analysis and text binarization using joint entropy yields better performance. Finally, the text recognition output from the OCR will be further processed by a keyword extraction algorithm to extract suitable keywords for image annotation.
first_indexed 2024-10-01T06:38:03Z
format Final Year Project (FYP)
id ntu-10356/17920
institution Nanyang Technological University
language English
last_indexed 2024-10-01T06:38:03Z
publishDate 2009
record_format dspace
spelling ntu-10356/179202023-07-07T15:59:58Z Community tagging for mobile media Chun, Gary Wei Qiang. Yap Kim Hui School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering The main objective of this project is to develop an automatic image annotation system as manual tagging of images is a cumbersome process. This report propose to annotate images based on text information available in the image since text are useful for describing the content of an image and is a powerful source of high-level semantics. The most direct way of extracting text from an image is to use a commercial OCR. However OCR is found to perform well only on simple background images where the contrast of background to text is high. The OCR is unable to handle images of complicated background. As such, preprocessing of images is needed prior to feeding it to OCR for text recognition. Such preprocessing includes text segmentation and binarization. Text segmentation is used to segment the text from the complex background and text binarization is used to enhance the contrast of background to text for optimal OCR performance. This report discuss the various approach to text segmentation and text binarization and concludes that text segmentation using edge and texture analysis and text binarization using joint entropy yields better performance. Finally, the text recognition output from the OCR will be further processed by a keyword extraction algorithm to extract suitable keywords for image annotation. Bachelor of Engineering 2009-06-18T01:29:56Z 2009-06-18T01:29:56Z 2009 2009 Final Year Project (FYP) http://hdl.handle.net/10356/17920 en Nanyang Technological University 68 p. application/pdf
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Chun, Gary Wei Qiang.
Community tagging for mobile media
title Community tagging for mobile media
title_full Community tagging for mobile media
title_fullStr Community tagging for mobile media
title_full_unstemmed Community tagging for mobile media
title_short Community tagging for mobile media
title_sort community tagging for mobile media
topic DRNTU::Engineering::Electrical and electronic engineering
url http://hdl.handle.net/10356/17920
work_keys_str_mv AT chungaryweiqiang communitytaggingformobilemedia