Abstract

With the advances of wireless sensor networks, they yield massive volumes of disparate, dynamic and geographically-distributed and heterogeneous data. The data mining community has attempted to extract knowledge from the huge amount of data that they generate. However, previous mining work in WSNs has focused on supporting simple relational data structures, like one table per network, while there is a need for more complex data structures. This deficiency motivates XML, which is the current de facto format for the data exchange and modeling of a wide variety of data sources over the web, to be used in WSNs in order to encourage the interchangeability of heterogeneous types of sensors and systems. However, mining XML data for WSNs has two challenging issues: one is the endless data flow; and the other is the complex tree structure. In this paper, we present several new definitions and techniques related to association rule mining over XML data streams in WSNs. To the best of our knowledge, this work provides the first approach to mining XML stream data that generates frequent tree items without any redundancy.

Highlights

  • Wireless sensor networks (WSNs) have been identified as an important research area for the 21st century [1]

  • Throughout the paper, we focus on embedded subtrees from the dataset of XML stream data and use them in providing the definitions for association rules

  • Since we focus on the rule detection from XML stream data, each XML document corresponds to a set of XML data in a sink node, and the data stream is a continuous sequence of XML data blocks

Read more

Summary

Introduction

Wireless sensor networks (WSNs) have been identified as an important research area for the 21st century [1]. The frequency of errors in the outputs generated by the online algorithms should be constricted to be as small as possible Due to these differences, previous multiple-pass data mining techniques presented for traditional data sets cannot be directly applied to the domain of mining the stream data. To the best of our knowledge, our proposed scheme is the first approach to mining XML stream data in the sense that it generates frequent tree items without any redundancy (see Section 4 for the definition of a tree item). The rest of this paper is organized as follows: Section 2 discusses prior work related to mining association rules from sensor data and XML data.

Related Work
Association Rules for Relational Data
XML Data Structure
A Framework for XML Stream Data Mining
Item Sets
Association Rules
Support and Confidence
Rule 3:
XB2 -support:
Mining XML Stream Association Rules with the Label Projection Approach
Scheme and Construction of Label Projection
Pruning and Deriving from Ldic
Correlating Concrete Contents with Label Lists
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call