Abstract

Visualization of large datasets is computationally expensive. For this reason, enveloping methods have been used to visualize such datasets. Using enveloping methods, we visualize summary statistics of the data in the space transformed visualization (STV) plots, such as the traditional parallel coordinate plot (TPCP), instead of the actual data records. Existing enveloping methods, however, are limited only to the TPCP and they can also be misleading. This is because the parallel coordinates are parameter transformations and the summary statistics computed for the original data records are not preserved throughout the transformation to the parallel coordinates space. We propose enveloping methods that avoid this drawback and that can be applied not only to the TPCP but also to a family of STV plots such as the smooth parallel coordinate plot (SPCP) and the Andrews plot. We apply the proposed methods to the min–max, the quartiles, and the concentration interval envelopes (CIEs). These enveloping methods allow us to visually describe the geometry of given classes without the need of visualizing each single data record. These methods are effective for visualizing large datasets, as illustrated for real datasets, because they mitigate the cluttering effect in visualizing large-sized classes in the STV plots. Supplemental materials, including R-code, are available online to enable readers to reproduce the graphs in this article and/or apply the proposed methods to their own data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call