Abstract

In this paper, we propose mining maximal frequent patterns from univariate uncertain data. Univariate uncertain data refers to cases where each attribute in a transaction is associated with a quantitative interval and a probability density function that assigns a probability to each value in the interval. The number of frequent U2 patterns (i.e. frequent patterns of univariate uncertain data) is usually very large. To return a concise and informative mining result to users, we propose mining maximal frequent U2 patterns (MFU2Ps). A maximal frequent U2 pattern is a frequent U2 pattern without any frequent superset. The three proposed algorithms, MU2P-Miner, U2GenMax, and U2MAFIA, are different in terms of both the data formats used to store transactions and the structures used to store the MFU2Ps which are found during the mining process. The experiment results show that different algorithms excel when applied to different datasets and settings. We have applied the proposed algorithms to univariate uncertain data comprising measurements of the air quality and weather conditions in Taiwan; the derived MFU2Ps show that the air quality in Taiwan is usually good (unless a sand storm affects the island) and the weather is often wet.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.