A spatial distance-based spatial clustering algorithm for sparse image data

By allocating each object to one of the predefined categories, image classification deeply understands the attributes and features of the data on each object in the scene, and further mines the potential features and internal connections of the data, supporting the subsequent application decision-ma...

Full description

Bibliographic Details
Main Authors: Tian-fan Zhang, Zhe Li, Qi Yuan, You-ning Wang
Format: Article
Language:English
Published: Elsevier 2022-12-01
Series:Alexandria Engineering Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1110016822004252
_version_ 1797978791632437248
author Tian-fan Zhang
Zhe Li
Qi Yuan
You-ning Wang
author_facet Tian-fan Zhang
Zhe Li
Qi Yuan
You-ning Wang
author_sort Tian-fan Zhang
collection DOAJ
description By allocating each object to one of the predefined categories, image classification deeply understands the attributes and features of the data on each object in the scene, and further mines the potential features and internal connections of the data, supporting the subsequent application decision-making with necessary structured data. One of the key challenges to image classification is how to accurately classify sparse data, when there is an imbalance between different categories of data, i.e., how to identify small objects in images. Recognizing a person in satellite images is such a challenging task. These objects are sparse either globally or in each recognizable local segment. Therefore, they are often overlooked by the classifier, or removed as noises. During deep learning, feature sparsity means the samples contain too much useless information, which suppresses the generalization and accuracy of the model. To solve the problem, this paper presents a spatial distance-based spatial clustering algorithm for sparse image data (SDBSCA-SID). Firstly, the imaging range of the image sensor constitutes a two-dimensional (2D) constraint space. Under the constraint, spatial clustering was carried out based on the features of each sample to aggregate dense data into primary categories, and aggregate sparse data and noises into secondary categories. Referring to the 2D constrained space, multiple spatial classification surfaces were constructed to aggregate the sparse data to the two sides of these surfaces as much as possible. If the error is minimized, then the sparse data belong to these classification surfaces. To shorten the convergence time of the clustering algorithm on imbalanced data, the original sample set was cut into slices, and assigned to several calculation units for separate clustering. Next, the same-class clusters were merged through reduction. Finally, the obtained class labels were compared with the preset class labels, wrapping up the semantic segmentation of images. The stability and accuracy of our algorithm were demonstrated through tests on image samples.
first_indexed 2024-04-11T05:28:39Z
format Article
id doaj.art-d33943d64b4647f59ab9eb86af8bdf32
institution Directory Open Access Journal
issn 1110-0168
language English
last_indexed 2024-04-11T05:28:39Z
publishDate 2022-12-01
publisher Elsevier
record_format Article
series Alexandria Engineering Journal
spelling doaj.art-d33943d64b4647f59ab9eb86af8bdf322022-12-23T04:39:43ZengElsevierAlexandria Engineering Journal1110-01682022-12-0161121260912622A spatial distance-based spatial clustering algorithm for sparse image dataTian-fan Zhang0Zhe Li1Qi Yuan2You-ning Wang3Institute of Economics and Management, Hubei Engineering University, Xiaogan 432000, ChinaInstitute of Economics and Management, Hubei Engineering University, Xiaogan 432000, China; Corresponding author.Mechanical and Electrical Engineering, Hubei Polytechnic Institute, Xiaogan 432000, ChinaMOE Key Laboratory of Tibetan Plateau Land Surface Processes and Ecological Conservation/College of Geographical Science, Qinghai Normal University, Xining 810008, China; College of Life Science and Technology, Hubei Engineering University, Xiaogan 432000, ChinaBy allocating each object to one of the predefined categories, image classification deeply understands the attributes and features of the data on each object in the scene, and further mines the potential features and internal connections of the data, supporting the subsequent application decision-making with necessary structured data. One of the key challenges to image classification is how to accurately classify sparse data, when there is an imbalance between different categories of data, i.e., how to identify small objects in images. Recognizing a person in satellite images is such a challenging task. These objects are sparse either globally or in each recognizable local segment. Therefore, they are often overlooked by the classifier, or removed as noises. During deep learning, feature sparsity means the samples contain too much useless information, which suppresses the generalization and accuracy of the model. To solve the problem, this paper presents a spatial distance-based spatial clustering algorithm for sparse image data (SDBSCA-SID). Firstly, the imaging range of the image sensor constitutes a two-dimensional (2D) constraint space. Under the constraint, spatial clustering was carried out based on the features of each sample to aggregate dense data into primary categories, and aggregate sparse data and noises into secondary categories. Referring to the 2D constrained space, multiple spatial classification surfaces were constructed to aggregate the sparse data to the two sides of these surfaces as much as possible. If the error is minimized, then the sparse data belong to these classification surfaces. To shorten the convergence time of the clustering algorithm on imbalanced data, the original sample set was cut into slices, and assigned to several calculation units for separate clustering. Next, the same-class clusters were merged through reduction. Finally, the obtained class labels were compared with the preset class labels, wrapping up the semantic segmentation of images. The stability and accuracy of our algorithm were demonstrated through tests on image samples.http://www.sciencedirect.com/science/article/pii/S1110016822004252Semantic image segmentationSparse dataImage clusteringSpace clusteringReductionSynthetic aperture radar (SAR) satellite images
spellingShingle Tian-fan Zhang
Zhe Li
Qi Yuan
You-ning Wang
A spatial distance-based spatial clustering algorithm for sparse image data
Alexandria Engineering Journal
Semantic image segmentation
Sparse data
Image clustering
Space clustering
Reduction
Synthetic aperture radar (SAR) satellite images
title A spatial distance-based spatial clustering algorithm for sparse image data
title_full A spatial distance-based spatial clustering algorithm for sparse image data
title_fullStr A spatial distance-based spatial clustering algorithm for sparse image data
title_full_unstemmed A spatial distance-based spatial clustering algorithm for sparse image data
title_short A spatial distance-based spatial clustering algorithm for sparse image data
title_sort spatial distance based spatial clustering algorithm for sparse image data
topic Semantic image segmentation
Sparse data
Image clustering
Space clustering
Reduction
Synthetic aperture radar (SAR) satellite images
url http://www.sciencedirect.com/science/article/pii/S1110016822004252
work_keys_str_mv AT tianfanzhang aspatialdistancebasedspatialclusteringalgorithmforsparseimagedata
AT zheli aspatialdistancebasedspatialclusteringalgorithmforsparseimagedata
AT qiyuan aspatialdistancebasedspatialclusteringalgorithmforsparseimagedata
AT youningwang aspatialdistancebasedspatialclusteringalgorithmforsparseimagedata
AT tianfanzhang spatialdistancebasedspatialclusteringalgorithmforsparseimagedata
AT zheli spatialdistancebasedspatialclusteringalgorithmforsparseimagedata
AT qiyuan spatialdistancebasedspatialclusteringalgorithmforsparseimagedata
AT youningwang spatialdistancebasedspatialclusteringalgorithmforsparseimagedata