Effect of Count Estimation in Finding Frequent Itemsets over Online Transactional Data Streams

Joong Hyuk Chang, Won Suk Lee

doi:10.1007/s11390-005-0007-3

Abstract

A data stream is a massive, unbounded sequence of data elements continuously generated at a rapid rate. For this reason, most algorithms for data streams sacrifice the correctness of their results for fast processing time. The processing time is greatly influenced by the amount of information that must be maintained. This issue becomes more serious in finding frequent itemsets or in frequency counting over an online transactional data stream, since there can be a large number of itemsets to monitor. We have proposed a method called the estDec method for finding frequent itemsets over an online data stream. To reduce the number of monitored itemsets in this method, monitoring the count of an itemset is delayed until its support is large enough for it to become a frequent itemset in the near future. For this purpose, the count of an itemset must be estimated. Consequently, how to estimate the count of an itemset is a critical issue in minimizing memory usage as well as processing time. In this paper, the effects of various count estimation methods for finding frequent itemsets are analyzed in terms of mining accuracy, memory usage, and processing time.
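As a rough illustration of delayed monitoring, the sketch below uses one simple estimator: the smallest count among an itemset's (n-1)-subsets, which is an upper bound on the itemset's own count because an itemset cannot occur more often than any of its subsets. The function names, the `sig_support` threshold, and the sample counts are hypothetical and are not taken from the paper; the estimators it actually compares may differ.

```python
from itertools import combinations


def estimate_count(itemset, counts):
    """Upper-bound estimate for an unmonitored itemset (size >= 2).

    Assumption: `counts` maps frozensets of already-monitored itemsets to
    their current counts. A subset that is not monitored contributes 0.
    """
    items = sorted(itemset)
    subsets = combinations(items, len(items) - 1)
    return min(counts.get(frozenset(s), 0) for s in subsets)


def should_monitor(itemset, counts, total, sig_support):
    """Start monitoring only when the estimated support reaches the threshold."""
    return estimate_count(itemset, counts) / total >= sig_support


# Hypothetical counts of monitored 2-itemsets after 100 transactions.
counts = {
    frozenset({"a", "b"}): 40,
    frozenset({"a", "c"}): 25,
    frozenset({"b", "c"}): 30,
}

candidate = {"a", "b", "c"}
estimated = estimate_count(candidate, counts)
start_at_20 = should_monitor(candidate, counts, total=100, sig_support=0.2)
start_at_30 = should_monitor(candidate, counts, total=100, sig_support=0.3)
```

A tighter estimator lets fewer itemsets cross the threshold prematurely, which is the trade-off between memory usage and accuracy that the abstract refers to.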

Journal: Journal of Computer Science and Technology
Publication Date: Jan 1, 2005
Citations: 4

Similar Papers

An Efficient Closed Frequent Item Sets Mining Algorithm-For Mining Closed Frequent Item Sets from Data Streams
Venu Madhav Kuthadi ... Rajalakshmi Selvaraj
Journal of Computational and Theoretical Nanoscience | VOL. 13
01 Oct 2016

Finding frequent itemsets over online data streams
Joong Hyuk Chang ... Won Suk Lee
Information and Software Technology | VOL. 48
09 Aug 2005

CP-tree: An adaptive synopsis structure for compressing frequent itemsets over online data streams
Se Jung Shin ... Won Suk Lee
Information Sciences | VOL. 278
24 Mar 2014

Mining Frequent Itemsets with Normalized Weight in Continuous Data Streams
Young-Hee Kim ... Won-Young Kim
Journal of Information Processing Systems | VOL. 6
31 Mar 2010
