Deep Embedded Clustering Framework for Mixed Data

Deep embedded clustering (DEC) is a representative clustering algorithm that leverages deep-learning frameworks. DEC jointly learns low-dimensional feature representations and optimizes the clustering goals but only works with numerical data. However, in practice, the real-world data to be clustered...

Full description

Bibliographic Details
Main Authors: Yonggu Lee, Chulwung Park, Shinjin Kang
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9999360/
_version_ 1797962608501850112
author Yonggu Lee
Chulwung Park
Shinjin Kang
author_facet Yonggu Lee
Chulwung Park
Shinjin Kang
author_sort Yonggu Lee
collection DOAJ
description Deep embedded clustering (DEC) is a representative clustering algorithm that leverages deep-learning frameworks. DEC jointly learns low-dimensional feature representations and optimizes the clustering goals but only works with numerical data. However, in practice, the real-world data to be clustered includes not only numerical features but also categorical features that DEC cannot handle. In addition, if the difference between the soft assignment and target values is large, DEC applications may suffer from convergence problems. In this study, to overcome these limitations, we propose a deep embedded clustering framework that can utilize mixed data to increase the convergence stability using soft-target updates; a concept that is borrowed from an improved deep Q learning algorithm used in reinforcement learning. To evaluate the performance of the framework, we utilized various benchmark datasets composed of mixed data and empirically demonstrated that our approach outperformed existing clustering algorithms in most standard metrics. To the best of our knowledge, we state that our work achieved state-of-the-art performance among its contemporaries in this field.
first_indexed 2024-04-11T01:16:56Z
format Article
id doaj.art-0ba8c7be26d1409c9382d0a80aec2e25
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T01:16:56Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-0ba8c7be26d1409c9382d0a80aec2e252023-01-04T00:00:08ZengIEEEIEEE Access2169-35362023-01-0111334010.1109/ACCESS.2022.32323729999360Deep Embedded Clustering Framework for Mixed DataYonggu Lee0Chulwung Park1https://orcid.org/0000-0003-1338-1474Shinjin Kang2https://orcid.org/0000-0002-3839-5947NCSOFT, Bundag-gu, Seongnam-si, Republic of KoreaNCSOFT, Bundag-gu, Seongnam-si, Republic of KoreaSchool of Games, Hongik University, Jochiwon, Sejong, South KoreaDeep embedded clustering (DEC) is a representative clustering algorithm that leverages deep-learning frameworks. DEC jointly learns low-dimensional feature representations and optimizes the clustering goals but only works with numerical data. However, in practice, the real-world data to be clustered includes not only numerical features but also categorical features that DEC cannot handle. In addition, if the difference between the soft assignment and target values is large, DEC applications may suffer from convergence problems. In this study, to overcome these limitations, we propose a deep embedded clustering framework that can utilize mixed data to increase the convergence stability using soft-target updates; a concept that is borrowed from an improved deep Q learning algorithm used in reinforcement learning. To evaluate the performance of the framework, we utilized various benchmark datasets composed of mixed data and empirically demonstrated that our approach outperformed existing clustering algorithms in most standard metrics. To the best of our knowledge, we state that our work achieved state-of-the-art performance among its contemporaries in this field.https://ieeexplore.ieee.org/document/9999360/Clustering algorithmmixed datadeep learning
spellingShingle Yonggu Lee
Chulwung Park
Shinjin Kang
Deep Embedded Clustering Framework for Mixed Data
IEEE Access
Clustering algorithm
mixed data
deep learning
title Deep Embedded Clustering Framework for Mixed Data
title_full Deep Embedded Clustering Framework for Mixed Data
title_fullStr Deep Embedded Clustering Framework for Mixed Data
title_full_unstemmed Deep Embedded Clustering Framework for Mixed Data
title_short Deep Embedded Clustering Framework for Mixed Data
title_sort deep embedded clustering framework for mixed data
topic Clustering algorithm
mixed data
deep learning
url https://ieeexplore.ieee.org/document/9999360/
work_keys_str_mv AT yonggulee deepembeddedclusteringframeworkformixeddata
AT chulwungpark deepembeddedclusteringframeworkformixeddata
AT shinjinkang deepembeddedclusteringframeworkformixeddata