Finding sequential patterns with TF-IDF metrics in health-care databases

Zsolt T Kardkovács,Gábor Kovács

doi:10.1515/ausi-2015-0008

Zsolt T Kardkovács, Gábor Kovács

Open Access

https://doi.org/10.1515/ausi-2015-0008

Copy DOI

Journal: Acta Universitatis Sapientiae, Informatica	Publication Date: Dec 1, 2014
Citations: 11	License type: CC BY-NC-ND 3.0

Affiliation: Dennis Gabor College

Abstract

Abstract Finding frequent sequential patterns has been defined as finding ordered list of items that occur more times in a database than a user defined threshold. For big and dense databases that contain really long sequences and large itemset such as medical case histories, algorithm proposed on this idea of counting the occurrences output enourmous number of highly redundant frequent sequences, and are therefore simply impractical. Therefore, there is a need for algorithm that perform frequent pattern search and prefiltering simultaneously. In this paper, we propose an algorithm that reinterprets the term support on text mining basis. Experiments show that our method not only eliminates redundancy among the output sequences, but it scales much better with huge input data sizes. We apply our algorithm for mining medical databases: what diagnoses are likely to lead to a certain future health condition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Finding sequential patterns with TF-IDF metrics in health-care databases

Abstract

Talk to us

Similar Papers

More From: Acta Universitatis Sapientiae, Informatica

Lead the way for us

Similar Papers

Weighted approximate sequential pattern mining within tolerance factors
Unil Yun ... Eunchul Yoon
Intelligent Data Analysis | VOL. 15
Unil Yun, et. al.Unil Yun ... Eunchul Yoon
23 Jun 2011
Intelligent Data Analysis | VOL. 15

An efficient model for information gain of sequential pattern from web logs based on dynamic weight constraint
Dhirendra Kumar Jha ... Archana Tomar
-
Dhirendra Kumar Jha, et. al.Dhirendra Kumar Jha ... Archana Tomar
01 Oct 2010
01 Oct 2010

A Fast Interactive Sequential Pattern Mining Algorithm Based on Memory Indexing
Jia-Dong Ren ... Jun-Sheng Zong
-
Jia-Dong Ren, et. al.Jia-Dong Ren ... Jun-Sheng Zong
01 Jan 2006
01 Jan 2006

Top-k Closed Sequential Graph Pattern Mining
K Vijay Bhaskar ... K Thammi Reddy
International Journal of Information Engineering and Electronic Business | VOL. 8
K Vijay Bhaskar, et. al.K Vijay Bhaskar ... K Thammi Reddy
08 Jul 2016
International Journal of Information Engineering and Electronic Business | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Finding sequential patterns with TF-IDF metrics in health-care databases

Abstract

Talk to us

Similar Papers

More From: Acta Universitatis Sapientiae, Informatica