Abstract

Most of the literature on utility pattern mining (UPM) assumes that the particular patterns’ utility in known in advance. Concurrently, in frequent pattern mining (FPM) it is assumed that all patterns take the same value. In reality, the information about the utility of patterns is not or hardly available in most cases. Moreover, the utility and frequency of the particular pattern are not directly proportional. An algorithm for estimating a generic pattern utility has been recently proposed, but the numeric results might be difficult to interpret. In particular, in datasets with many independent instances or groups. In this paper, we present an approach to generating utility bitmaps that provide visual representation of the numeric data obtained using generic pattern utility algorithm. We demonstrate validity of this approach on two datasets: PAMAP2 Physical Activity Monitoring Data Set , an open dataset from the UCI Machine Learning Repository, and an ECG dataset collected using Biopac Student Lab during Ruffier’s test. For PAMAP2 dataset, utility bitmaps allow for immediate separation of various physical activities. Variation between participants are present, but do not overshadow differences between the activity types. For the ECG dataset, utility bitmaps immediately indicate age and fitness differences between the participants, even thought this information was not available to the algorithm. In both cases, partial similarity in bitmaps can be traced back to partial similarity in activities or participants generating the data. Based on these tests, the approach seems to be promising for exploratory analysis of large collections of long time series and possibly other sequential patterns such as distance series common in sports data analysis and depth series common in petroleum engineering.

Highlights

  • Most of the literature on utility patterns (UPM) assumes a priori knowledge about the utility of given patterns

  • SPECIFICATION we first provide an overview of an approximate method to calculating generic utility of sequential patterns, including some improvements we introduced with this work

  • WORK In this paper, we proposed an approach to visualization of generic utility of sequential patterns

Read more

Summary

INTRODUCTION

Most of the literature on utility patterns (UPM) assumes a priori knowledge about the utility of given patterns. On the other hand not much attention is related to cases where there is no a priori knowledge about the utility of the examined dataset but the higher utility can be assumed for some particular (sequential) patterns It can be considered as a frequent pattern mining problem (FMP) but in this approach all types of patterns take the same value [2]. In the survey by Fournier-Viger et al [5] developments in SPM are discussed They consider high-utility sequential pattern mining as a popular extension to SPM when frequency of occurrence is not sufficient and utility metric is required. They assume that there exists prior knowledge about the utility that is associated with particular elements of the dataset. The episode visualizations enable the identification of periodic occurrence-patterns and show the occurrences of patterns present in an event sequence

BACKGROUND
METHOD SPECIFICATION
GENERIC UTILITY OF SEQUENTIAL PATTERNS
UTILITY BITMAPS DEFINITION
Result
VALIDATION WITH ECG DATASET
CONCLUSION AND FUTURE WORK
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.