Abstract
AbstractAs data stream grows exponentially, the aggregate query technique is widely used since it can rapidly obtain the summary information. Typical approximate aggregate query methods, like sliding-window, random sampling, wavelet, sketch index structure, histogram, etc., all evaluate the quality of the algorithms by the average size of query errors and ignore the maximum relative error, which determines the availability of the methods. Regarding this issue, this paper proposes the Reasonable Histogram (RH) method to improve the classic aggregate query method AMH. Based on the analysis of AMH errors’ mathematical characteristics, we build an aggregate query mathematical model based on the Kalman filter, using the optimal estimate of the buckets’ average frequency to calculate the aggregate values of the anomalous points, so as to restrain the maximum relative error.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.