Abstract

Event detection by discovering frequent itemsets is very popular in sensor network communities. However, the recorded data is often a probability rather than a determined value in a really productive environment as sensed data is often affected by noise. In this paper, we study to detect events by finding frequent patterns over probabilistic sensor data under the Possible World Semantics. This is technically challenging as probabilistic records can generate an exponential number of possible worlds. Although several efficient algorithms are proposed in the literature, it is still difficult to mine probabilistic maximal frequent items (PMFIs) in large uncertain database due to the high time complexity. To address this issue, we employ approximate idea to further reduce the time complexity from O(nlogn) to O(n) and propose a two-step solution (Aproximation Probabilistic Frequent Itemset-MAX, APFI-MAX in short) including PMFI candidates generation and PMFIs confirmation. We also provide the necessary proofs of our approximation method to make APFI-MAX more solid and convincing. Finally, extensive experiments have been conducted on synthetic and real databases, demonstrating that the proposed APFI-MAX always running faster than state-of-art methods under different parameter settings.

Highlights

  • Event detection or monitoring is a key application for the environmental surveillance in sensor networks [2], [3]

  • We present a top-down probabilistic maximal frequent items (PMFIs) confirmation framework APFI-MAX, which is proved more efficient than state-of-art framework Top-Down Inheritance of Support Algorithm (TODIS)-MAX [8]

  • Since probability mass function of itemsets must be calculated in TODIS-MAX which is time consuming, we propose an approximation of pmf but high efficient for calculation inspired by the Central Limit Theorem

Read more

Summary

INTRODUCTION

Event detection or monitoring is a key application for the environmental surveillance in sensor networks [2], [3]. Such as underground coal mine monitoring [5] and moving object search [6], etc We name this noise-affected sensed data as probabilistic or uncertain data. S. Chen et al.: Approximation of Probabilistic Maximal Frequent Itemset Mining Over Uncertain Sensed Data. We study how to extract the FMPIs over uncertain database efficiently and employ approximate idea to further reduce the time complexity. We give and prove the bound for the support expectation of real PMFIs. In the second step, we present a top-down PMFIs confirmation framework APFI-MAX, which is proved more efficient than state-of-art framework TODIS-MAX [8]. Since probability mass function (pmf) of itemsets must be calculated in TODIS-MAX which is time consuming, we propose an approximation of pmf but high efficient for calculation inspired by the Central Limit Theorem.

RELATED WORK
ALGORITHM DESIGN
GENERATE CANDIDATES
EXPECTATION AND VARIANCE ESTIMATION
EXPERIMENTAL EVALUATION
ACCURACY EVALUATION
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call