Continuous Learning Graphical Knowledge Unit for Cluster Identification in High Density Data Sets

K.K.L.B Adikaram, Mathias Effenberger, Thomas Becker, Mohamed Hussein

doi:10.3390/sym8120152

Open Access

Abstract

Big data are visually cluttered by overlapping data points. Rather than removing, reducing or reformulating overlap, we propose a simple, effective and powerful technique for density cluster generation and visualization, in which the overlap of point markers (the graphical symbols of data points) is exploited additively to obtain bitmap data summaries where clusters can be identified visually, aided by automatically generated contour lines. In the proposed method, the plotting area is a bitmap and the marker is a shape of more than one pixel. As the markers overlap, the red, green and blue (RGB) colour values of pixels in the shared region are added. Thus, a pixel of a 24-bit RGB bitmap can code up to 2^24 (over 16 million) overlaps. A high number of overlaps at the same location gives that area the same colour, which can be identified by the naked eye. A bitmap is a matrix of colour values that can be represented as integers. The proposed method updates this matrix as new points are added. Thus, this matrix can be considered an up-to-date knowledge unit of the processed data. Results show the cluster generation, cluster identification, missing and out-of-range data visualization, and outlier detection capabilities of the proposed method.
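The additive overlap described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the 8x8 plotting area, the 3x3 square marker, the function names and the red-high/blue-low byte order used to split a count into RGB are all assumptions chosen for illustration.

```python
# Minimal sketch of additive marker overlap on a bitmap (assumptions:
# 8x8 plotting area, 3x3 square marker, red = high byte, blue = low byte).
WIDTH, HEIGHT = 8, 8          # hypothetical plotting area, in pixels
MARKER = 3                    # hypothetical marker edge length, in pixels
MAX_OVERLAPS = 2**24 - 1      # largest value a 24-bit pixel can hold

def new_bitmap(width=WIDTH, height=HEIGHT):
    """Return a height x width matrix of overlap counts, all zero."""
    return [[0] * width for _ in range(height)]

def add_point(bitmap, col, row, marker=MARKER):
    """Stamp a square marker centred on (col, row); add 1 to each covered pixel."""
    half = marker // 2
    height, width = len(bitmap), len(bitmap[0])
    for r in range(max(0, row - half), min(height, row + half + 1)):
        for c in range(max(0, col - half), min(width, col + half + 1)):
            # Saturate instead of wrapping around at the 24-bit limit.
            bitmap[r][c] = min(bitmap[r][c] + 1, MAX_OVERLAPS)

def to_rgb(count):
    """Split a 24-bit overlap count into (red, green, blue) bytes."""
    return (count >> 16) & 0xFF, (count >> 8) & 0xFF, count & 0xFF

def from_rgb(red, green, blue):
    """Recombine three colour bytes into the integer overlap count."""
    return (red << 16) | (green << 8) | blue

# The matrix is updated point by point, so it always summarises
# every point processed so far.
bitmap = new_bitmap()
for col, row in [(3, 3), (4, 3), (3, 4), (6, 6)]:
    add_point(bitmap, col, row)
```

In this sketch the pixel at row 3, column 3 is covered by three markers and therefore holds the value 3, while the isolated point near (6, 6) leaves a count of 1. Because the matrix itself is the running summary, no separate pass over the raw data is needed to read off dense regions.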

Highlights

Plotted data are visually cluttered by overlapping data points
Rather than removing, reducing or reformulating overlap, we propose a simple, effective and powerful technique for density cluster generation and visualization, where point marker overlap is exploited in an additive fashion in order to obtain bitmap data summaries in which clusters can be identified visually, aided by automatically generated contour lines
The graphical knowledge unit (GKU) can be seen as a combination of the quadrat sampling method with contour lines
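The comparison with quadrat sampling and contour lines can be made concrete with a small sketch. Quadrat sampling counts points per grid cell. The contour step shown here (marking cells at or above a level that border a cell below it) is only one hypothetical way to trace a contour and is not taken from the paper. The function names, cell size and level are assumptions.

```python
# Sketch: quadrat counts plus a simple contour mask (all names hypothetical).
def quadrat_counts(points, cell, n_cols, n_rows):
    """Count the points falling into each square grid cell of edge length `cell`."""
    counts = [[0] * n_cols for _ in range(n_rows)]
    for x, y in points:
        c, r = int(x // cell), int(y // cell)
        if 0 <= c < n_cols and 0 <= r < n_rows:   # ignore out-of-range points
            counts[r][c] += 1
    return counts

def contour_mask(counts, level):
    """Mark cells with count >= level that touch a lower cell or the grid border."""
    n_rows, n_cols = len(counts), len(counts[0])
    mask = [[False] * n_cols for _ in range(n_rows)]
    for r in range(n_rows):
        for c in range(n_cols):
            if counts[r][c] < level:
                continue
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                rr, cc = r + dr, c + dc
                outside = not (0 <= rr < n_rows and 0 <= cc < n_cols)
                if outside or counts[rr][cc] < level:
                    mask[r][c] = True
                    break
    return mask

points = [(0.5, 0.5), (0.6, 0.4), (1.5, 0.5), (2.5, 2.5), (-0.5, 1.0)]
counts = quadrat_counts(points, cell=1, n_cols=3, n_rows=3)
edge = contour_mask(counts, level=2)
```

Unlike plain quadrat counting, where each point contributes to exactly one cell, a multi-pixel marker in the GKU contributes to several neighbouring pixels at once.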

Summary

Introduction

Plotted data are visually cluttered by overlapping data points. Reducing, avoiding and reformulating (as a cluster) such overlap are the three major techniques recommended for clutter reduction in the data visualization field [1,2,3,4,5]. The method we introduce in this paper instead uses overlaps to generate density clusters, without reducing, avoiding or reformulating them. In contrast to general practice, the proposed method benefits from more overlaps, which give better cluster formation and better visualization. The proposed method can be considered an anytime cluster formation technique (without a separate cluster identification algorithm), which provides faster cluster generation than online methods [8].

Journal: Symmetry
Publication Date: Dec 14, 2016
Citations: 2
License type: CC BY 4.0
