Abstract
Frequent item sets is one of the most investigated fields of data mining. The significant feature is to find new techniques to reduce candidate item sets in order to generate frequent item sets efficiently. This paper introduces an efficient algorithm called Enhanced Candidate Generation for Frequent item set Generation (ECG for FIG) for finding frequent item sets from large databases. The existing algorithm for frequent item set generation scan the original database more than once, use more storage space, take more processing time. The proposed algorithm gives a solution to this by representing the transactions in the database with decimal numbers instead of binary values and strings. The original database is scanned only once and is converted into an equivalent decimal value to reduce the storage space. The subset generation concept is used to generate frequent item sets. Thus the proposed algorithm reduces the scanning time, processing time and the storage space respectively. When compared with the existing algorithms, the proposed algorithm takes very less execution time and memory. When implemented the algorithm using java and tested with WEKA tool, for 400 transactions of twenty five items, ECG for FIG is taking only 800 bytes of memory and 2000000000 ns (two seconds), whereas all the other above mentioned algorithms are taking 20800 bytes of memory and more than two seconds.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.