Mining Frequent Itemsets from Online Data Streams: Comparative Study

Hebatallah Mohamed, Mohamed Abd, Ahmed Sharaf

doi:10.14569/ijacsa.2013.040717

Open Access

https://doi.org/10.14569/ijacsa.2013.040717

Abstract

Online mining of data streams poses many more challenges than mining static databases. In addition to the one-scan nature of the task, the unbounded memory requirement, the high arrival rate of data streams, and the combinatorial explosion of itemsets exacerbate the mining task. The high complexity of the frequent itemset mining problem hinders the application of stream mining techniques. In this review, we present a comparative study of almost all the algorithms, to the best of our knowledge, for mining frequent itemsets from online data streams. All of these techniques sacrifice the accuracy of their results because of the relatively limited storage, and therefore always produce approximate results.

Highlights

Data generation rates in some data sources have become faster than ever before
F. et al, 2006], data streams are further classified into: 1) offline data streams, which are characterized by discontinuity or regular bulk arrivals [Manku G. and Motwani R., 2002], such as a bulk addition of new transactions in a data warehouse system, and 2) online data streams, which are characterized by real-time updated data that arrive one by one over time, such as continuously generated transactions in a network monitoring system
W is time-based if W consists of a sequence of fixed-length time units, where a variable number of transactions may arrive within each time unit
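
The time-based window W described in the last highlight can be illustrated with a short Python sketch. It is only a minimal illustration under stated assumptions, not an algorithm from the paper: the class name `TimeBasedWindow`, the parameters `unit_length`, `num_units`, `min_support`, and `max_size`, and the brute-force counting of itemsets are all hypothetical choices made for the example, and timestamps are assumed to arrive in non-decreasing order.

```python
from collections import Counter, deque
from itertools import combinations


class TimeBasedWindow:
    """Window W made of the most recent `num_units` fixed-length time units.

    Each unit may hold any number of transactions (including very few),
    which is what distinguishes it from a window of a fixed transaction count.
    """

    def __init__(self, unit_length, num_units):
        self.unit_length = unit_length  # length of one time unit (assumed parameter)
        self.num_units = num_units      # how many units W spans (assumed parameter)
        self.units = deque()            # (unit_index, [transactions]), oldest first

    def add(self, timestamp, transaction):
        # Map the timestamp to the index of the time unit it falls into.
        unit_index = int(timestamp // self.unit_length)
        if not self.units or self.units[-1][0] != unit_index:
            self.units.append((unit_index, []))
        self.units[-1][1].append(frozenset(transaction))
        # Slide the window: drop units older than the last `num_units` units.
        while self.units[0][0] <= unit_index - self.num_units:
            self.units.popleft()

    def transactions(self):
        return [t for _, unit in self.units for t in unit]

    def frequent_itemsets(self, min_support, max_size=2):
        # Brute-force counting, for illustration only; real stream miners
        # maintain compact summaries instead of recounting the window.
        txns = self.transactions()
        counts = Counter()
        for t in txns:
            for k in range(1, max_size + 1):
                for itemset in combinations(sorted(t), k):
                    counts[itemset] += 1
        threshold = min_support * len(txns)
        return {i: c for i, c in counts.items() if c >= threshold}


window = TimeBasedWindow(unit_length=10, num_units=2)
window.add(1, {"a", "b"})
window.add(3, {"a"})
window.add(12, {"a", "b"})
print(window.frequent_itemsets(min_support=0.6))
```

Because the window is defined by time rather than by transaction count, the number of transactions it holds changes as the arrival rate changes, so the absolute support threshold is recomputed from the current window contents.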

Summary

INTRODUCTION

Data generation rates in some data sources have become faster than ever before. Examples include network traffic analysis, Web click-stream mining, network intrusion detection, sensor networks, web logs, and online transaction analysis. This rapid generation of continuous streams of information has challenged the storage, computation, and communication capabilities of computing systems. Data streams differ from the conventional stored relation model in several ways: 1) Continuity: data arrive continuously at a high rate. Traditional frequent-itemset mining methods for static data usually read the database more than once, so they cannot be applied directly to data stream mining [Pauray S. and Tsai M., 2009].
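
To make the one-scan, bounded-memory constraint concrete, the following Python sketch counts items in a single pass using Lossy Counting, the approximate counting scheme of Manku and Motwani (2002). It is a hedged illustration rather than a method described in this summary: it handles single items only, not itemsets, and the class name `LossyCounter` and the parameter names `epsilon` and `support` are assumptions made for the example.

```python
import math


class LossyCounter:
    """One-pass approximate frequency counts for single items.

    Stored counts underestimate true counts by at most epsilon * n,
    where n is the number of items seen so far.
    """

    def __init__(self, epsilon):
        self.epsilon = epsilon
        self.width = math.ceil(1 / epsilon)  # bucket width
        self.n = 0                           # items seen so far
        self.entries = {}                    # item -> [count, max_undercount]

    def add(self, item):
        self.n += 1
        bucket = math.ceil(self.n / self.width)  # id of the current bucket
        if item in self.entries:
            self.entries[item][0] += 1
        else:
            # A new entry may have been missed in up to (bucket - 1) earlier buckets.
            self.entries[item] = [1, bucket - 1]
        # At each bucket boundary, prune entries that cannot be frequent.
        if self.n % self.width == 0:
            self.entries = {
                k: v for k, v in self.entries.items() if v[0] + v[1] > bucket
            }

    def frequent(self, support):
        # Report items whose stored count reaches (support - epsilon) * n.
        threshold = (support - self.epsilon) * self.n
        return {k: v[0] for k, v in self.entries.items() if v[0] >= threshold}


counter = LossyCounter(epsilon=0.25)
for item in ["a", "b", "a", "c", "a", "d", "a", "e"]:
    counter.add(item)
print(counter.frequent(support=0.5))
```

The stream is read exactly once and infrequent entries are discarded at bucket boundaries, which keeps memory bounded at the cost of approximate counts. This is the same trade of accuracy for storage that the abstract attributes to the surveyed algorithms.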

BACKGROUND

Landmark model

Fading model

Sliding window model

Results

CONCLUSION

FUTURE WORK


Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2013
Citations: 3	License type: cc-by
