Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer

With the development of deep learning, the performance of image semantic segmentation in remote sensing has been constantly improved. However, the performance usually degrades while testing on different datasets because of the domain gap. To achieve feasible performance, extensive pixel-wise annotat...

Full description

Bibliographic Details
Main Authors: Weitao Li, Hui Gao, Yi Su, Biffon Manyura Momanyi
Format: Article
Language:English
Published: MDPI AG 2022-10-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/14/19/4942
_version_ 1797477112819482624
author Weitao Li
Hui Gao
Yi Su
Biffon Manyura Momanyi
author_facet Weitao Li
Hui Gao
Yi Su
Biffon Manyura Momanyi
author_sort Weitao Li
collection DOAJ
description With the development of deep learning, the performance of image semantic segmentation in remote sensing has been constantly improved. However, the performance usually degrades while testing on different datasets because of the domain gap. To achieve feasible performance, extensive pixel-wise annotations are acquired in a new environment, which is time-consuming and labor-intensive. Therefore, unsupervised domain adaptation (UDA) has been proposed to alleviate the effort of labeling. However, most previous approaches are based on outdated network architectures that hinder the improvement of performance in UDA. Since the effects of recent architectures for UDA have been barely studied, we reveal the potential of Transformer in UDA for remote sensing with a self-training framework. Additionally, two training strategies have been proposed to enhance the performance of UDA: (1) Gradual Class Weights (GCW) to stabilize the model on the source domain by addressing the class-imbalance problem; (2) Local Dynamic Quality (LDQ) to improve the quality of the pseudo-labels via distinguishing the discrete and clustered pseudo-labels on the target domain. Overall, our proposed method improves the state-of-the-art performance by 8.23% mIoU on Potsdam→Vaihingen and 9.2% mIoU on Vaihingen→Potsdam and facilitates learning even for difficult classes such as clutter/background.
first_indexed 2024-03-09T21:13:05Z
format Article
id doaj.art-84374946256446489a140c4a895ec5e8
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-09T21:13:05Z
publishDate 2022-10-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-84374946256446489a140c4a895ec5e82023-11-23T21:41:15ZengMDPI AGRemote Sensing2072-42922022-10-011419494210.3390/rs14194942Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with TransformerWeitao Li0Hui Gao1Yi Su2Biffon Manyura Momanyi3School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611700, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611700, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611700, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611700, ChinaWith the development of deep learning, the performance of image semantic segmentation in remote sensing has been constantly improved. However, the performance usually degrades while testing on different datasets because of the domain gap. To achieve feasible performance, extensive pixel-wise annotations are acquired in a new environment, which is time-consuming and labor-intensive. Therefore, unsupervised domain adaptation (UDA) has been proposed to alleviate the effort of labeling. However, most previous approaches are based on outdated network architectures that hinder the improvement of performance in UDA. Since the effects of recent architectures for UDA have been barely studied, we reveal the potential of Transformer in UDA for remote sensing with a self-training framework. Additionally, two training strategies have been proposed to enhance the performance of UDA: (1) Gradual Class Weights (GCW) to stabilize the model on the source domain by addressing the class-imbalance problem; (2) Local Dynamic Quality (LDQ) to improve the quality of the pseudo-labels via distinguishing the discrete and clustered pseudo-labels on the target domain. Overall, our proposed method improves the state-of-the-art performance by 8.23% mIoU on Potsdam→Vaihingen and 9.2% mIoU on Vaihingen→Potsdam and facilitates learning even for difficult classes such as clutter/background.https://www.mdpi.com/2072-4292/14/19/4942unsupervised domain adaptationsemantic segmentationremote sensing imagetransformerself-training
spellingShingle Weitao Li
Hui Gao
Yi Su
Biffon Manyura Momanyi
Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
Remote Sensing
unsupervised domain adaptation
semantic segmentation
remote sensing image
transformer
self-training
title Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
title_full Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
title_fullStr Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
title_full_unstemmed Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
title_short Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
title_sort unsupervised domain adaptation for remote sensing semantic segmentation with transformer
topic unsupervised domain adaptation
semantic segmentation
remote sensing image
transformer
self-training
url https://www.mdpi.com/2072-4292/14/19/4942
work_keys_str_mv AT weitaoli unsuperviseddomainadaptationforremotesensingsemanticsegmentationwithtransformer
AT huigao unsuperviseddomainadaptationforremotesensingsemanticsegmentationwithtransformer
AT yisu unsuperviseddomainadaptationforremotesensingsemanticsegmentationwithtransformer
AT biffonmanyuramomanyi unsuperviseddomainadaptationforremotesensingsemanticsegmentationwithtransformer