Visual product search in mobile business

Visual Search Technology allows users to retrieve information regarding visual objects. With the recent development of smartphones, this function can be performed on mobiles and is known as Mobile Visual Search (MVS). This report focuses on some of the image processing and pattern recognition techni...

Bibliographic Details
Main Author: Vijay, Dalmia Devanshu
Other Authors: Yap, Kim Hui
Format: Final Year Project (FYP)
Language: English
Published: 2014
Subjects: DRNTU::Engineering
Online Access: http://hdl.handle.net/10356/61006
_version_ 1826130207496470528
author Vijay, Dalmia Devanshu
author2 Yap, Kim Hui
author_facet Yap, Kim Hui
Vijay, Dalmia Devanshu
author_sort Vijay, Dalmia Devanshu
collection NTU
description Visual search technology allows users to retrieve information about visual objects. With the recent development of smartphones, this function can be performed on mobile devices, where it is known as Mobile Visual Search (MVS). This report focuses on some of the image processing and pattern recognition techniques used in Mobile Visual Search. A mobile visual search application has already been developed on the Android platform using a client-server architecture. It uses image processing techniques such as the Bag-of-Words model, the Scale-Invariant Feature Transform (SIFT) detector and descriptor, an inverted index, a vocabulary tree and geometric verification. One major part of this image recognition process is keypoint detection. The current application uses the SIFT (Difference of Gaussians, DoG) keypoint detector. This report evaluates other keypoint detection techniques such as Harris Affine and Hessian Affine. First, a preliminary analysis is performed on the Harris Affine, Hessian Affine and SIFT (DoG) detectors using the 48-image database provided by the Visual Geometry Group (VGG) at Oxford. The detectors are evaluated across five image transformations: viewpoint change, scale change, blur, lighting change and JPEG compression. Across all these transformations, Hessian Affine is found to be the best-performing detector according to the criteria explained in Chapter 4 of the report. Since the current MVS application is written in the C programming language, a C implementation of the Hessian Affine detector was obtained and integrated into the existing code and pipeline. The performance of the new MVS pipeline, which uses the Hessian Affine detector, is tested against the old pipeline, which uses the SIFT (DoG) detector, on the NTU Landmark Database.
The percentage of images successfully recognized is 84.36% with the SIFT (DoG) detector but only 81.60% with the Hessian Affine detector. Contrary to the preliminary analysis, the SIFT (DoG) detector outperforms the Hessian Affine detector by 2.76 percentage points.
first_indexed 2024-10-01T07:52:42Z
format Final Year Project (FYP)
id ntu-10356/61006
institution Nanyang Technological University
language English
last_indexed 2024-10-01T07:52:42Z
publishDate 2014
record_format dspace
spelling ntu-10356/61006 2023-07-07T17:09:06Z Visual product search in mobile business Vijay, Dalmia Devanshu Yap, Kim Hui School of Electrical and Electronic Engineering DRNTU::Engineering Bachelor of Engineering 2014-06-04T02:08:34Z 2014-06-04T02:08:34Z 2014 2014 Final Year Project (FYP) http://hdl.handle.net/10356/61006 en Nanyang Technological University 72 p. application/pdf
spellingShingle DRNTU::Engineering
Vijay, Dalmia Devanshu
Visual product search in mobile business
title Visual product search in mobile business
title_full Visual product search in mobile business
title_fullStr Visual product search in mobile business
title_full_unstemmed Visual product search in mobile business
title_short Visual product search in mobile business
title_sort visual product search in mobile business
topic DRNTU::Engineering
url http://hdl.handle.net/10356/61006
work_keys_str_mv AT vijaydalmiadevanshu visualproductsearchinmobilebusiness