A Constrained Randomization Approach to Interactive Visual Data Exploration with Subjective Feedback

Bo Kang, Kai Puolamäki, Tijl De Bie, Jefrey Lijffijt

doi:10.1109/tkde.2019.2907082


Open Access

https://doi.org/10.1109/tkde.2019.2907082


Abstract

Data visualization and iterative/interactive data mining are attracting rapidly growing attention, both in research and in industry. However, while there is a plethora of advanced data mining methods and a large body of work in the field of visualization, integrated methods that combine advanced visualization and/or interaction with data mining techniques in a principled way are rare. We present a framework based on constrained randomization which lets users explore high-dimensional data via ‘subjectively informative’ two-dimensional data visualizations. The user is presented with ‘interesting’ projections and can express their observations using visual interactions that update a background model representing the user's belief state. This background model is then considered by a projection-finding algorithm employing data randomization to compute a new ‘interesting’ projection. By providing users with information that contrasts with the background model, we maximize the chance that the user encounters striking new information present in the data. This process can be iterated until the user runs out of time or until the difference between the randomized and the real data is insignificant. We present two case studies, one controlled study on synthetic data and another on census data, using the proof-of-concept tool SIDE, which demonstrates the presented framework.
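To make the idea of constrained randomization more concrete, the following is a minimal sketch and not the method implemented in SIDE. It assumes, purely for illustration, that user feedback is encoded as groups of rows (for example, points the user has marked as a cluster) and that randomization shuffles each column only within a group. The function name `permute_within_groups` and the `groups` encoding are hypothetical.

```python
import random

def permute_within_groups(data, groups, seed=0):
    """Return a copy of `data` with each column shuffled inside each row group.

    data   : list of equal-length rows (lists of floats)
    groups : list of lists of row indices; values move only between rows of
             the same group (a hypothetical encoding of user feedback)
    Rows that appear in no group are left unchanged.
    """
    rng = random.Random(seed)
    randomized = [row[:] for row in data]
    n_cols = len(data[0])
    for rows in groups:
        for col in range(n_cols):
            # Shuffle this column's values among the rows of this group only.
            values = [data[r][col] for r in rows]
            rng.shuffle(values)
            for r, v in zip(rows, values):
                randomized[r][col] = v
    return randomized

data = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0], [4.0, 40.0]]

# No feedback yet: one group containing every row, so columns are shuffled freely.
no_feedback = permute_within_groups(data, [[0, 1, 2, 3]])

# After feedback: two marked groups, so each group keeps its own column values.
with_feedback = permute_within_groups(data, [[0, 1], [2, 3]])
```

In this toy setting, adding groups makes the randomized data resemble the real data more closely, which loosely mirrors the stopping condition in the abstract: exploration can end once the randomized and real data no longer differ significantly.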

Highlights

Data visualization and iterative/interactive data mining are both mature, actively researched topics of great practical importance.
A symbiosis of human analysts and well-designed computer systems promises to provide the most efficient way of navigating the complex information space hidden within high-dimensional data. This idea has long been advocated within the visual analytics field [1], [2], [3].
We evaluated the scalability on synthetic data with d ∈ {16, 32, 64, 128} dimensions and n ∈ {64, 128, 256, 512} data points scattered around k ∈ {2, 4, 8, 16} randomly drawn cluster centroids (Table 4).
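As an illustration of the synthetic setup described in the last highlight, the sketch below generates n points in d dimensions scattered around k randomly drawn centroids. The source does not state how centroids or noise are drawn, so the uniform centroid range, the Gaussian noise, the `spread` parameter, and the function name `make_clustered_data` are all assumptions.

```python
import random

def make_clustered_data(n, d, k, spread=0.1, seed=0):
    """Generate n points in d dimensions around k random centroids.

    Assumptions (not from the source): centroids are uniform in [-1, 1]^d
    and each point is its centroid plus isotropic Gaussian noise.
    """
    rng = random.Random(seed)
    centroids = [[rng.uniform(-1.0, 1.0) for _ in range(d)] for _ in range(k)]
    points, labels = [], []
    for _ in range(n):
        c = rng.randrange(k)  # pick a centroid uniformly at random
        points.append([rng.gauss(mu, spread) for mu in centroids[c]])
        labels.append(c)
    return points, labels

# Smallest configuration mentioned in the text: n = 64, d = 16, k = 2.
points, labels = make_clustered_data(n=64, d=16, k=2)
```

The same call can be repeated with the other values of n, d, and k listed above to produce inputs of increasing size.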

Summary

Introduction

Data visualization and iterative/interactive data mining are both mature, actively researched topics of great practical importance. Methods that combine state-of-the-art data mining with visualization and interaction are highly desirable, as they could exploit the strengths of both human data analysts and computer algorithms. A symbiosis of human analysts and well-designed computer systems promises to provide the most efficient way of navigating the complex information space hidden within high-dimensional data. This idea has long been advocated within the visual analytics field [1], [2], [3].

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Jan 1, 2019
Citations: 44	License type: CC BY 4.0
