Abstract

Under different circumstances, private information is exposed, and it must be sanitised before even being shared to address privacy issues. Data mining techniques can collect large amounts of data in a short amount of time. The information gathered by the powerful machine learning techniques may identify the most sensitive content, which pertains to an individual or organization. The degree of sensitivity of data belonging to a business or an agency might vary. Only approved individuals and organizations have access to this information. As a result, using access limitations to confirm the security of complicated data is not a complete operation. It can have an impact on the utility of a data mining solution, and the user may be able to re-identify sensitive data. To introduce instruments to find a mechanism for the security of confidential information. Finding ways to secure confidential data by developing data mining tools and procedures that can be applied to databases, despite the fact that this diminishes the data mining results' trust worthiness. We proposed a data sanitization strategy in this article that uses a frequent itemset classification approach with a modified apriori algorithm. The problem is to maintain intelligence information for vital arrangements while preventing the numerous exposures of company rule mining at the same time. Data sanitization strategy is used to perform a thorough investigation of numerous sequential pattern algorithms for ensuring the privacy of large amounts of data. Our research shows that our approach is efficient, scalable, and provides meaningful correction when compared to other methods used in existing systems.
 Keywords: Association Rule Mining, Classification, Frequent itemset, data privacy, data hiding, rule generation, support, confidence and transactional dataset

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call