Information Leakage in Deep Learning-Based Hyperspectral Image Classification: A Survey

In deep learning-based hyperspectral remote sensing image classification tasks, random sampling strategies are typically used to train model parameters for testing and evaluation. However, this approach leads to strong spatial autocorrelation between the training set samples and the surrounding test...

Full description

Bibliographic Details
Main Authors: Hao Feng, Yongcheng Wang, Zheng Li, Ning Zhang, Yuxi Zhang, Yunxiao Gao
Format: Article
Language:English
Published: MDPI AG 2023-07-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/15/15/3793
_version_ 1827730919503429632
author Hao Feng
Yongcheng Wang
Zheng Li
Ning Zhang
Yuxi Zhang
Yunxiao Gao
author_facet Hao Feng
Yongcheng Wang
Zheng Li
Ning Zhang
Yuxi Zhang
Yunxiao Gao
author_sort Hao Feng
collection DOAJ
description In deep learning-based hyperspectral remote sensing image classification tasks, random sampling strategies are typically used to train model parameters for testing and evaluation. However, this approach leads to strong spatial autocorrelation between the training set samples and the surrounding test set samples, and some unlabeled test set data directly participate in the training of the network. This leaked information makes the model overly optimistic. Models trained under these conditions tend to overfit to a single dataset, which limits the range of practical applications. This paper analyzes the causes and effects of information leakage and summarizes the methods from existing models to mitigate the effects of information leakage. Specifically, this paper states the main issues in this area, where the issue of information leakage is addressed in detail. Second, some algorithms and related models used to mitigate information leakage are categorized, including reducing the number of training samples, using spatially disjoint sampling strategies, few-shot learning, and unsupervised learning. These models and methods are classified according to the sample-related phase and the feature extraction phase. Finally, several representative hyperspectral image classification models experiments are conducted on the common datasets and their effectiveness in mitigating information leakage is analyzed.
first_indexed 2024-03-11T00:17:23Z
format Article
id doaj.art-50571a42fd6d48598c9bb34567907552
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-11T00:17:23Z
publishDate 2023-07-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-50571a42fd6d48598c9bb345679075522023-11-18T23:30:52ZengMDPI AGRemote Sensing2072-42922023-07-011515379310.3390/rs15153793Information Leakage in Deep Learning-Based Hyperspectral Image Classification: A SurveyHao Feng0Yongcheng Wang1Zheng Li2Ning Zhang3Yuxi Zhang4Yunxiao Gao5Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaDepartment of Electronic Engineering, Tsinghua University, Beijing 100084, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaIn deep learning-based hyperspectral remote sensing image classification tasks, random sampling strategies are typically used to train model parameters for testing and evaluation. However, this approach leads to strong spatial autocorrelation between the training set samples and the surrounding test set samples, and some unlabeled test set data directly participate in the training of the network. This leaked information makes the model overly optimistic. Models trained under these conditions tend to overfit to a single dataset, which limits the range of practical applications. This paper analyzes the causes and effects of information leakage and summarizes the methods from existing models to mitigate the effects of information leakage. Specifically, this paper states the main issues in this area, where the issue of information leakage is addressed in detail. Second, some algorithms and related models used to mitigate information leakage are categorized, including reducing the number of training samples, using spatially disjoint sampling strategies, few-shot learning, and unsupervised learning. These models and methods are classified according to the sample-related phase and the feature extraction phase. Finally, several representative hyperspectral image classification models experiments are conducted on the common datasets and their effectiveness in mitigating information leakage is analyzed.https://www.mdpi.com/2072-4292/15/15/3793hyperspectral image classificationspatial autocorrelationinformation leakagesampling strategy
spellingShingle Hao Feng
Yongcheng Wang
Zheng Li
Ning Zhang
Yuxi Zhang
Yunxiao Gao
Information Leakage in Deep Learning-Based Hyperspectral Image Classification: A Survey
Remote Sensing
hyperspectral image classification
spatial autocorrelation
information leakage
sampling strategy
title Information Leakage in Deep Learning-Based Hyperspectral Image Classification: A Survey
title_full Information Leakage in Deep Learning-Based Hyperspectral Image Classification: A Survey
title_fullStr Information Leakage in Deep Learning-Based Hyperspectral Image Classification: A Survey
title_full_unstemmed Information Leakage in Deep Learning-Based Hyperspectral Image Classification: A Survey
title_short Information Leakage in Deep Learning-Based Hyperspectral Image Classification: A Survey
title_sort information leakage in deep learning based hyperspectral image classification a survey
topic hyperspectral image classification
spatial autocorrelation
information leakage
sampling strategy
url https://www.mdpi.com/2072-4292/15/15/3793
work_keys_str_mv AT haofeng informationleakageindeeplearningbasedhyperspectralimageclassificationasurvey
AT yongchengwang informationleakageindeeplearningbasedhyperspectralimageclassificationasurvey
AT zhengli informationleakageindeeplearningbasedhyperspectralimageclassificationasurvey
AT ningzhang informationleakageindeeplearningbasedhyperspectralimageclassificationasurvey
AT yuxizhang informationleakageindeeplearningbasedhyperspectralimageclassificationasurvey
AT yunxiaogao informationleakageindeeplearningbasedhyperspectralimageclassificationasurvey