Rate-Distortion Theory for Clustering in the Perceptual Space

How to extract relevant information from large data sets has become a main challenge in data visualization. Clustering techniques that classify data into groups according to similarity metrics are a suitable strategy to tackle this problem. Generally, these techniques are applied in the data space a...

Full description

Bibliographic Details
Main Authors: Anton Bardera, Roger Bramon, Marc Ruiz, Imma Boada
Format: Article
Language:English
Published: MDPI AG 2017-08-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/19/9/438
_version_ 1817993107904397312
author Anton Bardera
Roger Bramon
Marc Ruiz
Imma Boada
author_facet Anton Bardera
Roger Bramon
Marc Ruiz
Imma Boada
author_sort Anton Bardera
collection DOAJ
description How to extract relevant information from large data sets has become a main challenge in data visualization. Clustering techniques that classify data into groups according to similarity metrics are a suitable strategy to tackle this problem. Generally, these techniques are applied in the data space as an independent step previous to visualization. In this paper, we propose clustering on the perceptual space by maximizing the mutual information between the original data and the final visualization. With this purpose, we present a new information-theoretic framework based on the rate-distortion theory that allows us to achieve a maximally compressed data with a minimal signal distortion. Using this framework, we propose a methodology to design a visualization process that minimizes the information loss during the clustering process. Three application examples of the proposed methodology in different visualization techniques such as scatterplot, parallel coordinates, and summary trees are presented.
first_indexed 2024-04-14T01:35:22Z
format Article
id doaj.art-13919d56c0904a20babf15aad9c3792d
institution Directory Open Access Journal
issn 1099-4300
language English
last_indexed 2024-04-14T01:35:22Z
publishDate 2017-08-01
publisher MDPI AG
record_format Article
series Entropy
spelling doaj.art-13919d56c0904a20babf15aad9c3792d2022-12-22T02:19:57ZengMDPI AGEntropy1099-43002017-08-0119943810.3390/e19090438e19090438Rate-Distortion Theory for Clustering in the Perceptual SpaceAnton Bardera0Roger Bramon1Marc Ruiz2Imma Boada3Graphics and Imaging Laboratory, University of Girona, 17003 Girona, SpainGraphics and Imaging Laboratory, University of Girona, 17003 Girona, SpainGraphics and Imaging Laboratory, University of Girona, 17003 Girona, SpainGraphics and Imaging Laboratory, University of Girona, 17003 Girona, SpainHow to extract relevant information from large data sets has become a main challenge in data visualization. Clustering techniques that classify data into groups according to similarity metrics are a suitable strategy to tackle this problem. Generally, these techniques are applied in the data space as an independent step previous to visualization. In this paper, we propose clustering on the perceptual space by maximizing the mutual information between the original data and the final visualization. With this purpose, we present a new information-theoretic framework based on the rate-distortion theory that allows us to achieve a maximally compressed data with a minimal signal distortion. Using this framework, we propose a methodology to design a visualization process that minimizes the information loss during the clustering process. Three application examples of the proposed methodology in different visualization techniques such as scatterplot, parallel coordinates, and summary trees are presented.https://www.mdpi.com/1099-4300/19/9/438information visualizationrate-distortion theoryclusteringinformation theory
spellingShingle Anton Bardera
Roger Bramon
Marc Ruiz
Imma Boada
Rate-Distortion Theory for Clustering in the Perceptual Space
Entropy
information visualization
rate-distortion theory
clustering
information theory
title Rate-Distortion Theory for Clustering in the Perceptual Space
title_full Rate-Distortion Theory for Clustering in the Perceptual Space
title_fullStr Rate-Distortion Theory for Clustering in the Perceptual Space
title_full_unstemmed Rate-Distortion Theory for Clustering in the Perceptual Space
title_short Rate-Distortion Theory for Clustering in the Perceptual Space
title_sort rate distortion theory for clustering in the perceptual space
topic information visualization
rate-distortion theory
clustering
information theory
url https://www.mdpi.com/1099-4300/19/9/438
work_keys_str_mv AT antonbardera ratedistortiontheoryforclusteringintheperceptualspace
AT rogerbramon ratedistortiontheoryforclusteringintheperceptualspace
AT marcruiz ratedistortiontheoryforclusteringintheperceptualspace
AT immaboada ratedistortiontheoryforclusteringintheperceptualspace