On the Importance of Label Quality for Semantic Segmentation

Convolutional networks (ConvNets) have become the dominant approach to semantic image segmentation. Producing accurate, pixel-level labels required for this task is a tedious and time consuming process; however, producing approximate, coarse labels could take only a fraction of the time and effort....

Full description

Bibliographic Details
Main Authors: Zlateski, Aleksandar, Jaroensri, Ronnachai, Sharma, Prafull, Durand, Frederic
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language:English
Published: IEEE 2020
Online Access:https://hdl.handle.net/1721.1/124403
_version_ 1826216795190591488
author Zlateski, Aleksandar
Jaroensri, Ronnachai
Sharma, Prafull
Durand, Frederic
author2 Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
author_facet Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Zlateski, Aleksandar
Jaroensri, Ronnachai
Sharma, Prafull
Durand, Frederic
author_sort Zlateski, Aleksandar
collection MIT
description Convolutional networks (ConvNets) have become the dominant approach to semantic image segmentation. Producing accurate, pixel-level labels required for this task is a tedious and time consuming process; however, producing approximate, coarse labels could take only a fraction of the time and effort. We investigate the relationship between the quality of labels and the performance of ConvNets for semantic segmentation. We create a very large synthetic dataset with perfectly labeled street view scenes. From these perfect labels, we synthetically coarsen labels with different qualities and estimate human-hours required for producing them. We perform a series of experiments by training ConvNets with a varying number of training images and label quality. We found that the performance of ConvNets mostly depends on the time spent creating the training labels. That is, a larger coarsely-annotated dataset can yield the same performance as a smaller finely-annotated one. Furthermore, fine-tuning coarsely pre-trained ConvNets with few finely-annotated labels can yield comparable or superior performance to training it with a large amount of finely-annotated labels alone, at a fraction of the labeling cost. We demonstrate that our result is also valid for different network architectures, and various object classes in an urban scene.
first_indexed 2024-09-23T16:53:14Z
format Article
id mit-1721.1/124403
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T16:53:14Z
publishDate 2020
publisher IEEE
record_format dspace
spelling mit-1721.1/1244032022-10-03T08:58:07Z On the Importance of Label Quality for Semantic Segmentation Zlateski, Aleksandar Jaroensri, Ronnachai Sharma, Prafull Durand, Frederic Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Convolutional networks (ConvNets) have become the dominant approach to semantic image segmentation. Producing accurate, pixel-level labels required for this task is a tedious and time consuming process; however, producing approximate, coarse labels could take only a fraction of the time and effort. We investigate the relationship between the quality of labels and the performance of ConvNets for semantic segmentation. We create a very large synthetic dataset with perfectly labeled street view scenes. From these perfect labels, we synthetically coarsen labels with different qualities and estimate human-hours required for producing them. We perform a series of experiments by training ConvNets with a varying number of training images and label quality. We found that the performance of ConvNets mostly depends on the time spent creating the training labels. That is, a larger coarsely-annotated dataset can yield the same performance as a smaller finely-annotated one. Furthermore, fine-tuning coarsely pre-trained ConvNets with few finely-annotated labels can yield comparable or superior performance to training it with a large amount of finely-annotated labels alone, at a fraction of the labeling cost. We demonstrate that our result is also valid for different network architectures, and various object classes in an urban scene. 2020-03-30T13:45:05Z 2020-03-30T13:45:05Z 2018-12 2019-05-29T13:33:27Z Article http://purl.org/eprint/type/ConferencePaper 9781538664209 https://hdl.handle.net/1721.1/124403 Zlateski, Aleksandar et al. "On the Importance of Label Quality for Semantic Segmentation." 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, IEEE, December 2018. © 2018 IEEE. en http://dx.doi.org/10.1109/cvpr.2018.00160 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf IEEE Computer Vision Foundation
spellingShingle Zlateski, Aleksandar
Jaroensri, Ronnachai
Sharma, Prafull
Durand, Frederic
On the Importance of Label Quality for Semantic Segmentation
title On the Importance of Label Quality for Semantic Segmentation
title_full On the Importance of Label Quality for Semantic Segmentation
title_fullStr On the Importance of Label Quality for Semantic Segmentation
title_full_unstemmed On the Importance of Label Quality for Semantic Segmentation
title_short On the Importance of Label Quality for Semantic Segmentation
title_sort on the importance of label quality for semantic segmentation
url https://hdl.handle.net/1721.1/124403
work_keys_str_mv AT zlateskialeksandar ontheimportanceoflabelqualityforsemanticsegmentation
AT jaroensrironnachai ontheimportanceoflabelqualityforsemanticsegmentation
AT sharmaprafull ontheimportanceoflabelqualityforsemanticsegmentation
AT durandfrederic ontheimportanceoflabelqualityforsemanticsegmentation