Abstract

Time series data are found in diverse fields including, science, business, medicine and engineering. In this paper, we consider sequential pattern mining for categorical time series data that contain multiple independent time-series. Frequent patterns are considered important in a variety of applications. However, it is common for data to contain noise, and/or for the source process to have considerable variability. Conventional sequential pattern mining methods that use exact matching address, some but not all of these difficulties. Two general approaches used in previous studies to mine sequential patterns in data with noise are distance-based clustering and hidden Markov models. While these approaches are useful in mining frequent sequential patterns in noisy data, we further propose a framework (MWASP: multiple-width approximate sequential pattern mining) that uncovers frequent approximate sequential patterns with various widths. A mined pattern in this framework is representative of a group of sequences that follow the pattern's event flow order. This gives insight into the occurrence of the pattern longitudinally, as well as across the population. The pattern can be recognised as a common pattern across the multiple time series, time, or both.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.