Abstract

This paper examines existing sequence mining algorithms. Sequence mining algorithms are used in many domains, including cyber-security, telecommunications, user behaviour, and air quality patterns. We draw the underlying principles of the representative sequence mining algorithms and introduce a comparative analysis methodology for them. To test the methodology, we provide a prototype testing framework. We conduct a comprehensive experimental study on publicly available data sets, real-life telecommunication data set and data sets generated by a data generator. We compare GSP, PrefixSpan and CMRules algorithms. Comparing these sequence mining algorithms, we conclude that the fastest among the targeted three algorithms may differ for different data sets. Furthermore, we search for situations where sequential pattern mining algorithms can be used instead of sequential rule mining algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.