Flickr tag segmentation and classification

With the increasing usage and popularity of the social media on the Internet such as Flickr and Facebook, there is also an increase in the amount of information within the internet. An example is that some of the information is stored in tags. Tags are keywords or terms assigned to a piece of inform...

Full description

Bibliographic Details
Main Author: Teo, Queenie Yiting
Other Authors: Sun Aixin
Format: Final Year Project (FYP)
Language:English
Published: 2014
Subjects:
Online Access:http://hdl.handle.net/10356/58956
_version_ 1811679630899281920
author Teo, Queenie Yiting
author2 Sun Aixin
author_facet Sun Aixin
Teo, Queenie Yiting
author_sort Teo, Queenie Yiting
collection NTU
description With the increasing usage and popularity of the social media on the Internet such as Flickr and Facebook, there is also an increase in the amount of information within the internet. An example is that some of the information is stored in tags. Tags are keywords or terms assigned to a piece of information. With the increasing amount of data, there is a need for the users to search and obtain the required information efficiently and hence, tags are important as they are widely used to filter the unwanted information and reduce the search space. The words in tags are joined together without spaces such as “thisphoto”. Hence the aim of the project is to implement a program to carry out tag segmentation and classification. During segmentation, the program will split the words (For example, “thisphoto” to “this photo”). After segmentation, the program will carry out classification which will classify the tags from the segmentation (For example, “New York” is classified as a “LOCATION”). The dataset of tags used in this project were obtained from Flickr, a photo sharing website. Hence, the tags used in this project are for photos and images only. The completed project was tested against a total of 1000 tags which were randomly extracted from the Flickr dataset. Based on the segmentation results, it was observed that the accuracy rate of the segmentation was at 96.5%. As for the classification results, it was observed that the accuracy rate was at 86.21%. Hence based on the results, the program is able to segment and classify the tags at an acceptable accuracy.
first_indexed 2024-10-01T03:12:13Z
format Final Year Project (FYP)
id ntu-10356/58956
institution Nanyang Technological University
language English
last_indexed 2024-10-01T03:12:13Z
publishDate 2014
record_format dspace
spelling ntu-10356/589562023-03-03T20:35:06Z Flickr tag segmentation and classification Teo, Queenie Yiting Sun Aixin School of Computer Engineering DRNTU::Engineering::Computer science and engineering With the increasing usage and popularity of the social media on the Internet such as Flickr and Facebook, there is also an increase in the amount of information within the internet. An example is that some of the information is stored in tags. Tags are keywords or terms assigned to a piece of information. With the increasing amount of data, there is a need for the users to search and obtain the required information efficiently and hence, tags are important as they are widely used to filter the unwanted information and reduce the search space. The words in tags are joined together without spaces such as “thisphoto”. Hence the aim of the project is to implement a program to carry out tag segmentation and classification. During segmentation, the program will split the words (For example, “thisphoto” to “this photo”). After segmentation, the program will carry out classification which will classify the tags from the segmentation (For example, “New York” is classified as a “LOCATION”). The dataset of tags used in this project were obtained from Flickr, a photo sharing website. Hence, the tags used in this project are for photos and images only. The completed project was tested against a total of 1000 tags which were randomly extracted from the Flickr dataset. Based on the segmentation results, it was observed that the accuracy rate of the segmentation was at 96.5%. As for the classification results, it was observed that the accuracy rate was at 86.21%. Hence based on the results, the program is able to segment and classify the tags at an acceptable accuracy. Bachelor of Engineering (Computer Engineering) 2014-04-17T05:13:31Z 2014-04-17T05:13:31Z 2014 2014 Final Year Project (FYP) http://hdl.handle.net/10356/58956 en Nanyang Technological University 49 p. application/pdf
spellingShingle DRNTU::Engineering::Computer science and engineering
Teo, Queenie Yiting
Flickr tag segmentation and classification
title Flickr tag segmentation and classification
title_full Flickr tag segmentation and classification
title_fullStr Flickr tag segmentation and classification
title_full_unstemmed Flickr tag segmentation and classification
title_short Flickr tag segmentation and classification
title_sort flickr tag segmentation and classification
topic DRNTU::Engineering::Computer science and engineering
url http://hdl.handle.net/10356/58956
work_keys_str_mv AT teoqueenieyiting flickrtagsegmentationandclassification