Abstract

With the development of different kinds of techniques, especially the Internet of Things (IoT), a large amount of quantitative (either numeric or categorical) data have been generated, transmitted, and stored in the modern society. People hope to understand the interested phenomenon from the collected quantitative data by utilizing different data analysis methods. Exploring the structure of data (e.g., the cluster centers or prototypes) has always been a hot spot in the domain of data mining and knowledge discovery, yet it seems that the modeling and analyzing process still focus on a low-level abstraction of the data because normally, the structure found is only represented by some numeric data points. In this study, we highlight that a low-level abstraction may not be a user-friendly way for people to grasp the knowledge contained in the data. Instead, we explore the structure of the data from a perspective of symbolic analysis. Specifically, two modes of abstraction are proposed. In the vertical mode (i.e., values of each feature are abstracted), the numeric prototypes are characterized by the symbolic prototypes such that people could get rid of being stuck in minor details of each feature. In the horizontal mode (i.e., values of each prototype are abstracted), the linguistic summarization is used to describe all the features of each symbolic prototype such that people could immediately grasp the essential information conveyed in the symbolic prototype. We conduct comprehensive experimental studies on the publicly available data to illustrate the feasibility and validity of the proposed symbolic analysis process.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call