Abstract
Utility mining emerges as an important topic in data mining field. Mining high utility itemsets from databases refers to finding the itemsets with high profits. The meaning of itemset utility is interestingness, importance, or profitability of an item to users. Mining high utility itemsets from a transactional database refers to the discovery of itemsets with high utility like profits. Although several relevant algorithm has been proposed in recent years, they incur the problem of producing a large number of candidate itemsets for high utility itemsets. Such a large number of candidate itemsets degrades the mining performance in terms of execution time and space requirement. The situation may become worse when the database contains lots of long transactions or long high utility itemsets. This research work focuses on efficient method for high utility itemset mining, to reduce number of overestimated itemsets and to reduce the search space in order to improve the performance of high utility itemset mining. Key Terms: Candidate itemset, frequent itemset, high utility itemset, utility mining, data mining. I. Introduction Now a days computers are used widely in different areas. Fast reliable and unlimited secondary storage provides a perfect environment for the users to collect and store large amount of the data. Computers are also used to extract the useful information from the mass of data. This is called as the knowledge discovery or Data mining. Data mining is the process of revealing previously unknown and potentially useful information from large databases. The primary goal is to discover the hidden patterns in the data and to extract previously unknown interesting patterns. Discovering useful patterns hidden in a database plays an essential role in several data mining tasks, like frequent pattern mining and high utility pattern mining. Among them the fundamental research topic is frequent pattern mining which can be applied to different kinds of databases such as transactional database, streaming database and time series database and many applications domains like bioinformatics, web click-stream analysis and mobile environment.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.