Rate-Distortion Theory for Clustering in the Perceptual Space
How to extract relevant information from large data sets has become a main challenge in data visualization. Clustering techniques that classify data into groups according to similarity metrics are a suitable strategy to tackle this problem. Generally, these techniques are applied in the data space a...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2017-08-01
|
Series: | Entropy |
Subjects: | |
Online Access: | https://www.mdpi.com/1099-4300/19/9/438 |
_version_ | 1817993107904397312 |
---|---|
author | Anton Bardera Roger Bramon Marc Ruiz Imma Boada |
author_facet | Anton Bardera Roger Bramon Marc Ruiz Imma Boada |
author_sort | Anton Bardera |
collection | DOAJ |
description | How to extract relevant information from large data sets has become a main challenge in data visualization. Clustering techniques that classify data into groups according to similarity metrics are a suitable strategy to tackle this problem. Generally, these techniques are applied in the data space as an independent step previous to visualization. In this paper, we propose clustering on the perceptual space by maximizing the mutual information between the original data and the final visualization. With this purpose, we present a new information-theoretic framework based on the rate-distortion theory that allows us to achieve a maximally compressed data with a minimal signal distortion. Using this framework, we propose a methodology to design a visualization process that minimizes the information loss during the clustering process. Three application examples of the proposed methodology in different visualization techniques such as scatterplot, parallel coordinates, and summary trees are presented. |
first_indexed | 2024-04-14T01:35:22Z |
format | Article |
id | doaj.art-13919d56c0904a20babf15aad9c3792d |
institution | Directory Open Access Journal |
issn | 1099-4300 |
language | English |
last_indexed | 2024-04-14T01:35:22Z |
publishDate | 2017-08-01 |
publisher | MDPI AG |
record_format | Article |
series | Entropy |
spelling | doaj.art-13919d56c0904a20babf15aad9c3792d2022-12-22T02:19:57ZengMDPI AGEntropy1099-43002017-08-0119943810.3390/e19090438e19090438Rate-Distortion Theory for Clustering in the Perceptual SpaceAnton Bardera0Roger Bramon1Marc Ruiz2Imma Boada3Graphics and Imaging Laboratory, University of Girona, 17003 Girona, SpainGraphics and Imaging Laboratory, University of Girona, 17003 Girona, SpainGraphics and Imaging Laboratory, University of Girona, 17003 Girona, SpainGraphics and Imaging Laboratory, University of Girona, 17003 Girona, SpainHow to extract relevant information from large data sets has become a main challenge in data visualization. Clustering techniques that classify data into groups according to similarity metrics are a suitable strategy to tackle this problem. Generally, these techniques are applied in the data space as an independent step previous to visualization. In this paper, we propose clustering on the perceptual space by maximizing the mutual information between the original data and the final visualization. With this purpose, we present a new information-theoretic framework based on the rate-distortion theory that allows us to achieve a maximally compressed data with a minimal signal distortion. Using this framework, we propose a methodology to design a visualization process that minimizes the information loss during the clustering process. Three application examples of the proposed methodology in different visualization techniques such as scatterplot, parallel coordinates, and summary trees are presented.https://www.mdpi.com/1099-4300/19/9/438information visualizationrate-distortion theoryclusteringinformation theory |
spellingShingle | Anton Bardera Roger Bramon Marc Ruiz Imma Boada Rate-Distortion Theory for Clustering in the Perceptual Space Entropy information visualization rate-distortion theory clustering information theory |
title | Rate-Distortion Theory for Clustering in the Perceptual Space |
title_full | Rate-Distortion Theory for Clustering in the Perceptual Space |
title_fullStr | Rate-Distortion Theory for Clustering in the Perceptual Space |
title_full_unstemmed | Rate-Distortion Theory for Clustering in the Perceptual Space |
title_short | Rate-Distortion Theory for Clustering in the Perceptual Space |
title_sort | rate distortion theory for clustering in the perceptual space |
topic | information visualization rate-distortion theory clustering information theory |
url | https://www.mdpi.com/1099-4300/19/9/438 |
work_keys_str_mv | AT antonbardera ratedistortiontheoryforclusteringintheperceptualspace AT rogerbramon ratedistortiontheoryforclusteringintheperceptualspace AT marcruiz ratedistortiontheoryforclusteringintheperceptualspace AT immaboada ratedistortiontheoryforclusteringintheperceptualspace |