Human cognition inspired technique for estimation of 2D localization and pose parameters of baggage in uncluttered scenes

Object localization and its pose estimation in a scene is an extremely important research area in computer vision. In fact, it forms an important first step for performing further manipulation tasks. Several techniques have been proposed in the literature that strive to capture this information. Sha...

Full description

Bibliographic Details
Main Author: Sridhar Sriram
Other Authors: Er Meng Joo
Format: Thesis
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/10356/69978
_version_ 1811689955636805632
author Sridhar Sriram
author2 Er Meng Joo
author_facet Er Meng Joo
Sridhar Sriram
author_sort Sridhar Sriram
collection NTU
description Object localization and its pose estimation in a scene is an extremely important research area in computer vision. In fact, it forms an important first step for performing further manipulation tasks. Several techniques have been proposed in the literature that strive to capture this information. Shape matching is one such technique, as even simple hand drawn contours without the aid of other cues such as color, texture etc. can be used to delineate, identify, localize and compute object pose. In this work, the power and the flexibility of object centered and viewpoint dependent representations of objects from cognitive science are combined to estimate the 2D parameters of baggage in a captured image. Object-centered theories allow us to construe the problem of object representation in terms of its parts. In particular, the recognition by component model represents objects in terms of part primitives referred to as geons. The objects are parsed at the points of negative minima of curvature in accordance with the principle of transversality. This concept of object divisibility into parts can be exploited to extract the most visually salient part(s) for pose estimation. In this research, considering that the baggage is a two-part structure comprising of the salient body and the handle, a trained single layer feedforward neural network based on Extreme Learning Machine (ELM) and Harris corner points is used to extract the cuboidal contour of the baggage at fixed scale. The extracted cuboidal contour is then matched with stored cuboidal templates of different poses and aspect ratios using the Chamfer matching technique. This allows for obtaining the best template and the corresponding translation parameters thereby pointing to the integrated approach of object-centered and viewpoint dependent representations for baggage localization and pose estimation. This approach also aligns well with the concept of part saliency, selective attention to and processing of parts from the field of cognitive science in which the top-down influence of the task to be performed (pose estimation) is also taken into consideration. In this research, baggage in uncluttered scenes are considered for the purpose of 2D pose estimation based on which an algorithm is proposed. The success of the algorithm is demonstrated through simulations in MATLAB. Future research in this direction will enable a potential robotics solution for automated baggage handling for which pose estimation is a must.
first_indexed 2024-10-01T05:56:19Z
format Thesis
id ntu-10356/69978
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:56:19Z
publishDate 2017
record_format dspace
spelling ntu-10356/699782023-07-04T17:14:51Z Human cognition inspired technique for estimation of 2D localization and pose parameters of baggage in uncluttered scenes Sridhar Sriram Er Meng Joo School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Object localization and its pose estimation in a scene is an extremely important research area in computer vision. In fact, it forms an important first step for performing further manipulation tasks. Several techniques have been proposed in the literature that strive to capture this information. Shape matching is one such technique, as even simple hand drawn contours without the aid of other cues such as color, texture etc. can be used to delineate, identify, localize and compute object pose. In this work, the power and the flexibility of object centered and viewpoint dependent representations of objects from cognitive science are combined to estimate the 2D parameters of baggage in a captured image. Object-centered theories allow us to construe the problem of object representation in terms of its parts. In particular, the recognition by component model represents objects in terms of part primitives referred to as geons. The objects are parsed at the points of negative minima of curvature in accordance with the principle of transversality. This concept of object divisibility into parts can be exploited to extract the most visually salient part(s) for pose estimation. In this research, considering that the baggage is a two-part structure comprising of the salient body and the handle, a trained single layer feedforward neural network based on Extreme Learning Machine (ELM) and Harris corner points is used to extract the cuboidal contour of the baggage at fixed scale. The extracted cuboidal contour is then matched with stored cuboidal templates of different poses and aspect ratios using the Chamfer matching technique. This allows for obtaining the best template and the corresponding translation parameters thereby pointing to the integrated approach of object-centered and viewpoint dependent representations for baggage localization and pose estimation. This approach also aligns well with the concept of part saliency, selective attention to and processing of parts from the field of cognitive science in which the top-down influence of the task to be performed (pose estimation) is also taken into consideration. In this research, baggage in uncluttered scenes are considered for the purpose of 2D pose estimation based on which an algorithm is proposed. The success of the algorithm is demonstrated through simulations in MATLAB. Future research in this direction will enable a potential robotics solution for automated baggage handling for which pose estimation is a must. Master of Engineering 2017-04-06T09:22:32Z 2017-04-06T09:22:32Z 2017 Thesis Sridhar Sriram. (2017). Human cognition inspired technique for estimation of 2D localization and pose parameters of baggage in uncluttered scenes. Master's thesis, Nanyang Technological University, Singapore. http://hdl.handle.net/10356/69978 10.32657/10356/69978 en 83 p. application/pdf
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Sridhar Sriram
Human cognition inspired technique for estimation of 2D localization and pose parameters of baggage in uncluttered scenes
title Human cognition inspired technique for estimation of 2D localization and pose parameters of baggage in uncluttered scenes
title_full Human cognition inspired technique for estimation of 2D localization and pose parameters of baggage in uncluttered scenes
title_fullStr Human cognition inspired technique for estimation of 2D localization and pose parameters of baggage in uncluttered scenes
title_full_unstemmed Human cognition inspired technique for estimation of 2D localization and pose parameters of baggage in uncluttered scenes
title_short Human cognition inspired technique for estimation of 2D localization and pose parameters of baggage in uncluttered scenes
title_sort human cognition inspired technique for estimation of 2d localization and pose parameters of baggage in uncluttered scenes
topic DRNTU::Engineering::Electrical and electronic engineering
url http://hdl.handle.net/10356/69978
work_keys_str_mv AT sridharsriram humancognitioninspiredtechniqueforestimationof2dlocalizationandposeparametersofbaggageinunclutteredscenes