The Hash base Apriori Technique for Association Rule Mining and Data Sanitization

Nitin Pundlik Jagtap

doi:10.18090/samriddhi.v14i04.04

Nitin Pundlik Jagtap

Open Access

PDF Available

https://doi.org/10.18090/samriddhi.v14i04.04

Copy DOI

Export

Save

Cite

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Under different circumstances, private information is exposed, and it must be sanitised before even being shared to address privacy issues. Data mining techniques can collect large amounts of data in a short amount of time. The information gathered by the powerful machine learning techniques may identify the most sensitive content, which pertains to an individual or organization. The degree of sensitivity of data belonging to a business or an agency might vary. Only approved individuals and organizations have access to this information. As a result, using access limitations to confirm the security of complicated data is not a complete operation. It can have an impact on the utility of a data mining solution, and the user may be able to re-identify sensitive data. To introduce instruments to find a mechanism for the security of confidential information. Finding ways to secure confidential data by developing data mining tools and procedures that can be applied to databases, despite the fact that this diminishes the data mining results' trust worthiness. We proposed a data sanitization strategy in this article that uses a frequent itemset classification approach with a modified apriori algorithm. The problem is to maintain intelligence information for vital arrangements while preventing the numerous exposures of company rule mining at the same time. Data sanitization strategy is used to perform a thorough investigation of numerous sequential pattern algorithms for ensuring the privacy of large amounts of data. Our research shows that our approach is efficient, scalable, and provides meaningful correction when compared to other methods used in existing systems. Keywords: Association Rule Mining, Classification, Frequent itemset, data privacy, data hiding, rule generation, support, confidence and transactional dataset

Full Text