Semantic image segmentation and evaluation

Though quite a few image segmentation benchmark datasets have been constructed, there is no suitable benchmark for semantic image segmentation. In this thesis, we first construct a benchmark for such a purpose, where the groundtruths are generated by leveraging the existing fine granular groundtruth...

Full description

Bibliographic Details
Main Author: Li, Hui
Other Authors: Zheng Jianmin
Format: Thesis
Language:English
Published: 2015
Subjects:
Online Access:https://hdl.handle.net/10356/62262
_version_ 1826119337312780288
author Li, Hui
author2 Zheng Jianmin
author_facet Zheng Jianmin
Li, Hui
author_sort Li, Hui
collection NTU
description Though quite a few image segmentation benchmark datasets have been constructed, there is no suitable benchmark for semantic image segmentation. In this thesis, we first construct a benchmark for such a purpose, where the groundtruths are generated by leveraging the existing fine granular groundtruths in the Berkeley Segmentation Dataset (BSD) as well as using an interactive segmentation tool for new images. We also propose a percept-tree-based region merging strategy for dynamically adapting the groundtruth for evaluating test segmentation. Moreover, we propose a new evaluation metric that is easy to understand and compute, and does not require boundary matching. Experimental results show that, compared with the BSD, the generated groundtruth dataset is more suitable for evaluating semantic image segmentation, and the conducted user study demonstrates that the proposed evaluation metric matches user ranking very well. In the second part of this thesis, we focus on segmentation application by utilizing prior information (i.e., depth in this thesis) to improve segmentation quality. To the best of our knowledge, little work has been attempted so far to achieve automatic image segmentation on RGB-D image. Users are usually asked to input scribbles to indicate the foreground and background or the framework needs to be trained on a database to obtain the bounding box for a specified target. All these methods require external information. We propose to utilize Kinect shadow information into state-of-the-art algorithms for automatic foreground segmentation and multiple object segmentation. Experimental results demonstrate that the proposed shadow-assisted segmentation methods can achieve fully automatic cutout with superior segmentation performance.
first_indexed 2024-10-01T04:58:19Z
format Thesis
id ntu-10356/62262
institution Nanyang Technological University
language English
last_indexed 2024-10-01T04:58:19Z
publishDate 2015
record_format dspace
spelling ntu-10356/622622023-03-04T00:49:43Z Semantic image segmentation and evaluation Li, Hui Zheng Jianmin Cai Jianfei School of Computer Engineering DRNTU::Engineering::Computer science and engineering Though quite a few image segmentation benchmark datasets have been constructed, there is no suitable benchmark for semantic image segmentation. In this thesis, we first construct a benchmark for such a purpose, where the groundtruths are generated by leveraging the existing fine granular groundtruths in the Berkeley Segmentation Dataset (BSD) as well as using an interactive segmentation tool for new images. We also propose a percept-tree-based region merging strategy for dynamically adapting the groundtruth for evaluating test segmentation. Moreover, we propose a new evaluation metric that is easy to understand and compute, and does not require boundary matching. Experimental results show that, compared with the BSD, the generated groundtruth dataset is more suitable for evaluating semantic image segmentation, and the conducted user study demonstrates that the proposed evaluation metric matches user ranking very well. In the second part of this thesis, we focus on segmentation application by utilizing prior information (i.e., depth in this thesis) to improve segmentation quality. To the best of our knowledge, little work has been attempted so far to achieve automatic image segmentation on RGB-D image. Users are usually asked to input scribbles to indicate the foreground and background or the framework needs to be trained on a database to obtain the bounding box for a specified target. All these methods require external information. We propose to utilize Kinect shadow information into state-of-the-art algorithms for automatic foreground segmentation and multiple object segmentation. Experimental results demonstrate that the proposed shadow-assisted segmentation methods can achieve fully automatic cutout with superior segmentation performance. MASTER OF ENGINEERING (SCE) 2015-03-12T08:27:56Z 2015-03-12T08:27:56Z 2015 2015 Thesis Li, H. (2015). Semantic image segmentation and evaluation. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/62262 10.32657/10356/62262 en 65 p. application/pdf
spellingShingle DRNTU::Engineering::Computer science and engineering
Li, Hui
Semantic image segmentation and evaluation
title Semantic image segmentation and evaluation
title_full Semantic image segmentation and evaluation
title_fullStr Semantic image segmentation and evaluation
title_full_unstemmed Semantic image segmentation and evaluation
title_short Semantic image segmentation and evaluation
title_sort semantic image segmentation and evaluation
topic DRNTU::Engineering::Computer science and engineering
url https://hdl.handle.net/10356/62262
work_keys_str_mv AT lihui semanticimagesegmentationandevaluation