Rate-Distortion Theory for Clustering in the Perceptual Space

Anton Bardera,Roger Bramon,Marc Ruiz,Imma Boada

doi:10.3390/e19090438

Abstract

How to extract relevant information from large data sets has become a main challenge in data visualization. Clustering techniques that classify data into groups according to similarity metrics are a suitable strategy to tackle this problem. Generally, these techniques are applied in the data space as an independent step previous to visualization. In this paper, we propose clustering on the perceptual space by maximizing the mutual information between the original data and the final visualization. With this purpose, we present a new information-theoretic framework based on the rate-distortion theory that allows us to achieve a maximally compressed data with a minimal signal distortion. Using this framework, we propose a methodology to design a visualization process that minimizes the information loss during the clustering process. Three application examples of the proposed methodology in different visualization techniques such as scatterplot, parallel coordinates, and summary trees are presented.

Highlights

Technology advances allow for obtaining large amounts of data related to any process in any application field
We have presented a new methodology to deal with the visualization of clustered data that ensures an optimal information transfer between the original data and the final visualization
We have presented a new mathematical framework, based on information theory and rate-distortion theory, that models the visualization as an information channel between the source data and the final user

Summary

Introduction

Technology advances allow for obtaining large amounts of data related to any process in any application field. Examples include visual analysis of business data [1], scientific data [2], and images and videos [3], amongst others. Information visualization techniques have become a powerful tool to extract the valuable and useful information hidden in the data. A great variety of visualization techniques has been proposed [5], most of them lose their effectiveness when dealing with large data sets. Screen space limitations transform visualizations into cluttered images that are incomprehensible. To overcome these limitations data, clustering techniques can be applied

Objectives

Methods

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Entropy	Publication Date: Aug 23, 2017
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Rate-Distortion Theory for Clustering in the Perceptual Space

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Entropy

Lead the way for us

Similar Papers

43 - Visual Data-Mining Techniques
Daniel A Keim ... MIHAEL ANKERST
Visualization Handbook | VOL. -
Daniel A Keim, et. al.Daniel A Keim ... MIHAEL ANKERST
01 Jan 2004
Visualization Handbook | VOL. -

High-dimensional Data Visualization
Martin Theus
-
Martin TheusMartin Theus
01 Jan 2008
01 Jan 2008

High-dimensional data visualization by interactive construction of low-dimensional parallel coordinate plots
Takayuki Itoh ... Jinman Kim
Computer Languages, Systems & Structures | VOL. 43
Takayuki Itoh, et. al.Takayuki Itoh ... Jinman Kim
20 Apr 2017
Computer Languages, Systems & Structures | VOL. 43

Chapter 2 - Information Visualization: Scope, Techniques and Opportunities for Geovisualization
Daniel A Keim
Exploring Geovisualization | VOL. -
Daniel A KeimDaniel A Keim
01 Jan 2004
Exploring Geovisualization | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rate-Distortion Theory for Clustering in the Perceptual Space

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Entropy