On the Importance of Label Quality for Semantic Segmentation

Convolutional networks (ConvNets) have become the dominant approach to semantic image segmentation. Producing accurate, pixel-level labels required for this task is a tedious and time consuming process; however, producing approximate, coarse labels could take only a fraction of the time and effort....

Full description

Bibliographic Details
Main Authors:	Zlateski, Aleksandar, Jaroensri, Ronnachai, Sharma, Prafull, Durand, Frederic
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format:	Article
Language:	English
Published:	IEEE 2020
Online Access:	https://hdl.handle.net/1721.1/124403

_version_	1826216795190591488
author	Zlateski, Aleksandar Jaroensri, Ronnachai Sharma, Prafull Durand, Frederic
author2	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
author_facet	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Zlateski, Aleksandar Jaroensri, Ronnachai Sharma, Prafull Durand, Frederic
author_sort	Zlateski, Aleksandar
collection	MIT
description	Convolutional networks (ConvNets) have become the dominant approach to semantic image segmentation. Producing accurate, pixel-level labels required for this task is a tedious and time consuming process; however, producing approximate, coarse labels could take only a fraction of the time and effort. We investigate the relationship between the quality of labels and the performance of ConvNets for semantic segmentation. We create a very large synthetic dataset with perfectly labeled street view scenes. From these perfect labels, we synthetically coarsen labels with different qualities and estimate human-hours required for producing them. We perform a series of experiments by training ConvNets with a varying number of training images and label quality. We found that the performance of ConvNets mostly depends on the time spent creating the training labels. That is, a larger coarsely-annotated dataset can yield the same performance as a smaller finely-annotated one. Furthermore, fine-tuning coarsely pre-trained ConvNets with few finely-annotated labels can yield comparable or superior performance to training it with a large amount of finely-annotated labels alone, at a fraction of the labeling cost. We demonstrate that our result is also valid for different network architectures, and various object classes in an urban scene.
first_indexed	2024-09-23T16:53:14Z
format	Article
id	mit-1721.1/124403
institution	Massachusetts Institute of Technology
language	English
last_indexed	2024-09-23T16:53:14Z
publishDate	2020
publisher	IEEE
record_format	dspace
spelling	mit-1721.1/1244032022-10-03T08:58:07Z On the Importance of Label Quality for Semantic Segmentation Zlateski, Aleksandar Jaroensri, Ronnachai Sharma, Prafull Durand, Frederic Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Convolutional networks (ConvNets) have become the dominant approach to semantic image segmentation. Producing accurate, pixel-level labels required for this task is a tedious and time consuming process; however, producing approximate, coarse labels could take only a fraction of the time and effort. We investigate the relationship between the quality of labels and the performance of ConvNets for semantic segmentation. We create a very large synthetic dataset with perfectly labeled street view scenes. From these perfect labels, we synthetically coarsen labels with different qualities and estimate human-hours required for producing them. We perform a series of experiments by training ConvNets with a varying number of training images and label quality. We found that the performance of ConvNets mostly depends on the time spent creating the training labels. That is, a larger coarsely-annotated dataset can yield the same performance as a smaller finely-annotated one. Furthermore, fine-tuning coarsely pre-trained ConvNets with few finely-annotated labels can yield comparable or superior performance to training it with a large amount of finely-annotated labels alone, at a fraction of the labeling cost. We demonstrate that our result is also valid for different network architectures, and various object classes in an urban scene. 2020-03-30T13:45:05Z 2020-03-30T13:45:05Z 2018-12 2019-05-29T13:33:27Z Article http://purl.org/eprint/type/ConferencePaper 9781538664209 https://hdl.handle.net/1721.1/124403 Zlateski, Aleksandar et al. "On the Importance of Label Quality for Semantic Segmentation." 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, IEEE, December 2018. © 2018 IEEE. en http://dx.doi.org/10.1109/cvpr.2018.00160 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf IEEE Computer Vision Foundation
spellingShingle	Zlateski, Aleksandar Jaroensri, Ronnachai Sharma, Prafull Durand, Frederic On the Importance of Label Quality for Semantic Segmentation
title	On the Importance of Label Quality for Semantic Segmentation
title_full	On the Importance of Label Quality for Semantic Segmentation
title_fullStr	On the Importance of Label Quality for Semantic Segmentation
title_full_unstemmed	On the Importance of Label Quality for Semantic Segmentation
title_short	On the Importance of Label Quality for Semantic Segmentation
title_sort	on the importance of label quality for semantic segmentation
url	https://hdl.handle.net/1721.1/124403
work_keys_str_mv	AT zlateskialeksandar ontheimportanceoflabelqualityforsemanticsegmentation AT jaroensrironnachai ontheimportanceoflabelqualityforsemanticsegmentation AT sharmaprafull ontheimportanceoflabelqualityforsemanticsegmentation AT durandfrederic ontheimportanceoflabelqualityforsemanticsegmentation

On the Importance of Label Quality for Semantic Segmentation

Similar Items