Frequent Association Rules Research Articles

The number of documents on the Internet has grown exponentially, making it difficult for users to find the documents or information they need. Special techniques are required to retrieve documents that are relevant to user queries. One such technique is Information Retrieval (IR): the process of finding data (generally text documents) that matches an information need within a collection of documents stored on a computer. A common problem in IR is poorly formulated queries, caused by users' limited ability to express their needs in a query. Researchers have proposed various solutions to this limitation, one of which is Query Expansion (QE). Methods that have been applied to QE include Ontology, Latent Semantic Indexing (LSI), Local Co-Occurrence, Relevance Feedback, Concept Based, and WordNet / Synonym Mapping. However, these methods still have limitations, one of which is that they do not expose how the occurrence of words or phrases in the document collection is related. To overcome this limitation, this study proposes an approach to QE that uses the FP-Growth algorithm to search for frequent itemsets and Association Rules (AR). AR is applied to QE to capture how the occurrence of one word or term relates to the occurrence of another in the document collection, and the resulting terms are used to expand user queries. The main contribution of this study is the use of association rules with FP-Growth on the document collection to find relationships between word occurrences, which are then used to expand the user's original query in IR. QE performance is evaluated using recall, precision, and F-measure. The results indicate that using AR for QE can improve the relevance of the retrieved documents, with average recall, precision, and F-measure values of 94.44%, 89.98%, and 92.07%, respectively. Compared with IR without QE, IR with QE increased recall by 25.65%, precision by 1.93%, and F-measure by 15.78%.
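The idea in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' implementation: for brevity it counts co-occurring term pairs by brute force instead of running FP-Growth, and the function names, thresholds, and toy documents are all assumptions made for illustration. It derives single-term rules with support and confidence, uses them to expand a query, and computes precision, recall, and F-measure for a retrieved set.

```python
from collections import Counter
from itertools import combinations


def mine_pair_rules(docs, min_support=0.5, min_confidence=0.6):
    """Return {antecedent: [(consequent, confidence), ...]} for term pairs.

    Each document is treated as one transaction (a set of terms).
    Brute-force pair counting stands in for FP-Growth here.
    """
    n = len(docs)
    transactions = [set(d.lower().split()) for d in docs]
    item_counts = Counter()
    pair_counts = Counter()
    for t in transactions:
        item_counts.update(t)
        pair_counts.update(combinations(sorted(t), 2))

    rules = {}
    for (a, b), count in pair_counts.items():
        if count / n < min_support:  # keep frequent pairs only
            continue
        for x, y in ((a, b), (b, a)):
            confidence = count / item_counts[x]  # estimate of P(y | x)
            if confidence >= min_confidence:
                rules.setdefault(x, []).append((y, confidence))
    return rules


def expand_query(query, rules, max_new_terms=2):
    """Append the highest-confidence rule consequents to the query terms."""
    terms = query.lower().split()
    candidates = {}
    for t in terms:
        for y, confidence in rules.get(t, []):
            if y not in terms:
                candidates[y] = max(confidence, candidates.get(y, 0.0))
    ranked = sorted(candidates, key=lambda y: (-candidates[y], y))
    return terms + ranked[:max_new_terms]


def precision_recall_f1(retrieved, relevant):
    """Set-based precision, recall, and F-measure for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    p = hits / len(retrieved) if retrieved else 0.0
    r = hits / len(relevant) if relevant else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f


# Toy collection (made up for illustration, not from the study).
docs = [
    "query expansion retrieval",
    "query expansion ranking",
    "frequent itemset mining",
    "query retrieval",
]
rules = mine_pair_rules(docs)
expanded = expand_query("query", rules)
print(expanded)
```

On a real collection, a frequent-itemset miner such as FP-Growth would replace the pair counter, since enumerating all term pairs becomes expensive as the vocabulary grows.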


In knowledge discovery studies, association rule mining has been extensively studied as a way to discover hidden knowledge and relationships among sets of items in a transactional dataset. Most research on association rule mining focuses on discovering frequent patterns based on the most frequent items occurring in the dataset, while the extraction of rare rules has received less attention. In medical dataset studies, the discovery of rare association rules (RARs) is more challenging but potentially valuable, because it can surface rare and unusual knowledge for physicians in addition to frequent association rules. Hence, the aim of this paper is to discover non-frequent or rare-unusual association rules (RUARs) from a stroke medical dataset to provide potentially meaningful knowledge to domain users. A discretization method needs to be applied as a data preprocessing step before generating rules. To the best of our knowledge, few studies have focused on the role of discretization results in supporting the extraction of a better quantity and quality of RUARs, particularly for medical datasets. In addition, the extracted RUARs are expected to provide potential new and unusual insights into stroke risk patterns. This paper applies a mutual information measure to discretize a stroke examination dataset collected from a medical center in Taiwan. An interval merging method is proposed to simplify the discrete form and enrich the quality of the generated rules. Rare association rules, with relatively low support, are then generated using the Apriori-Rare method. A filtering process is applied to the content of the rule itemsets to discover the expected set of RUARs for physicians. Furthermore, the extracted RUARs are analyzed based on their relative risk values with respect to the occurrence of stroke. Results indicate that mutual information discretization outperformed traditional discretization methods in how well the discretization scheme supports the extraction of RUARs, yielding better quantity and quality measurements for further analysis from a medical point of view. Moreover, the proposed method produced a relatively higher number of RUARs. The knowledge of unusual rule patterns from rare association rules might provide potential new and unusual insights for medical practitioners and increase awareness of stroke examination results.
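To make the notions of low support and relative risk in the abstract above concrete, here is a minimal Python sketch. It is not the Apriori-Rare method and not the paper's pipeline: it enumerates small itemsets by brute force, keeps those whose joint support with the outcome falls in an assumed low-support band, and scores each with relative risk, defined as the outcome rate among records containing the itemset divided by the rate among records that do not. The function name, thresholds, item labels, and records are all hypothetical.

```python
from itertools import combinations


def rare_rules_with_risk(records, outcome, min_support=0.1,
                         max_support=0.4, max_len=2):
    """Return {itemset: (support, relative_risk)} for rare itemset -> outcome rules.

    support       = fraction of records containing the itemset AND the outcome
    relative_risk = P(outcome | itemset present) / P(outcome | itemset absent)
    """
    n = len(records)
    items = sorted({i for r in records for i in r if i != outcome})
    results = {}
    for k in range(1, max_len + 1):
        for itemset in combinations(items, k):
            s = set(itemset)
            exposed = [r for r in records if s <= r]
            unexposed = [r for r in records if not s <= r]
            both = sum(outcome in r for r in exposed)
            support = both / n
            # Keep only rules in the low-support ("rare") band.
            if not (min_support <= support < max_support):
                continue
            if not unexposed:
                continue  # relative risk undefined without a comparison group
            risk_exposed = both / len(exposed)
            risk_unexposed = sum(outcome in r for r in unexposed) / len(unexposed)
            rr = (risk_exposed / risk_unexposed
                  if risk_unexposed > 0 else float("inf"))
            results[itemset] = (support, rr)
    return results


# Toy records with discretized attributes (made up, not from the study).
records = [
    {"bp=high", "age=old", "stroke"},
    {"bp=high", "age=old", "stroke"},
    {"bp=high", "age=young"},
    {"bp=normal", "age=old"},
    {"bp=normal", "age=young"},
    {"bp=normal", "age=young"},
    {"bp=normal", "age=old", "stroke"},
    {"bp=normal", "age=young"},
    {"bp=normal", "age=young"},
    {"bp=normal", "age=young"},
]
rules = rare_rules_with_risk(records, outcome="stroke")
for itemset, (support, rr) in sorted(rules.items()):
    print(itemset, round(support, 2), round(rr, 2))
```

A relative risk well above 1 flags an itemset whose presence coincides with a higher outcome rate in this toy data; it says nothing about causation, and any real analysis would need the clinically validated discretization the abstract describes.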


Articles published on Frequent Association Rules

Map Reduce Based Association Rule Mining from Big Data

Utilizing graphics processing unit to accelerate drug-symptom association mining

Fast Dimensional Analysis for Root Cause Investigation in a Large-Scale Service Environment

Determination of Temporal Association Rules Pattern Using Apriori Algorithm

Privacy-preserving frequent itemset mining in vertically partitioned database using symmetric homomorphic encryption scheme

An improved Frequent Pattern Mining in Sustainable Learning Practice using Generalized Association Rules

Various Research Opportunities in High Utility Itemset Mining

Assessment Method of the Economic Vulnerability of the Coastal Zone Based on the Analytic Hierarchy Process

Identification of Risk Factors for Early Childhood Diseases Using Association Rules Algorithm with Feature Reduction

A data-driven approximate dynamic programming approach based on association rule learning: Spacecraft autonomy as a case study

A Survey of Parallel Sequential Pattern Mining

An Intelligent Decision in Smart Systems Using A Weighted Frequent Itemset Mining Algorithm

Using Localized Features for Analyzing College Students’ Imagination

Query Expansion in Information Retrieval using Frequent Pattern (FP) Growth Algorithm for Frequent Itemset Search and Association Rules Mining

Evaluation of relationships between onychomycosis and vascular diseases using sequential pattern mining

Rare association rule mining from incremental databases

Hiding sensitive itemsets without side effects

Applying mutual information for discretization to support the discovery of rare-unusual association rule in cerebrovascular examination dataset

ARM–AMO: An efficient association rule mining algorithm based on animal migration optimization
