Abstract
Classification query over data streams can both improve query efficiency and return results in the best matching state. The main difficulty is achieving the optimal matching degree between the query and the data. The traditional classification query method for data streams is based on keyword matching; it performs well for a single query condition, but when there are several query conditions its efficiency is low and its matching degree is poor. To address this, a classification query optimization method for data streams is proposed based on an improved TFIDF algorithm. The information entropy between data characteristics and the information entropy within a characteristic are used as weighting factors for the classification query, and the nonlinear mapping ability of a neural network is used to compute the weights and to fuzzify the TFIDF algorithm, thereby solving the classification query problem for data streams. Experiments on classification queries against an actual database show that the proposed algorithm greatly improves query efficiency and has good application value.

Introduction
Data streams are a major achievement of the development of computer technology. Data processing technology for data streams, and in particular the maturing of classification query technology, has made the management and querying of information possible. Over the past few decades, classification query technology for data streams has developed rapidly, and its various characteristics have been continuously improved. However, given the massive volume of modern data, classification query on data streams remains a serious bottleneck for the management and development of modern data.
The multi-polarization of data characteristics, the massive amount of data, and the fuzziness of data information make classification query on modern data streams more and more difficult.

Related principle of classification query optimization algorithm in data stream
Description of improved TFIDF method
Given a probability distribution $P = (p_1, p_2, \ldots, p_n)$, the information entropy for data stream transmission is defined as:
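As an illustration of the entropy of a distribution $P$, the following is a minimal Python sketch that assumes the standard Shannon form $H(P) = -\sum_{i=1}^{n} p_i \log_2 p_i$. The base-2 logarithm, the function names `shannon_entropy` and `empirical_distribution`, and the way probabilities are estimated from observed values are all assumptions made for illustration; they are not taken from the paper and may differ from the authors' formulation.

```python
import math
from collections import Counter


def shannon_entropy(probs, base=2.0):
    """Shannon entropy of a discrete distribution (assumed form, see lead-in).

    `probs` must be non-negative and sum to 1. Terms with p == 0 are
    skipped, following the usual convention 0 * log(0) = 0.
    """
    if any(p < 0 for p in probs):
        raise ValueError("probabilities must be non-negative")
    if not math.isclose(sum(probs), 1.0, abs_tol=1e-9):
        raise ValueError("probabilities must sum to 1")
    return -sum(p * math.log(p, base) for p in probs if p > 0)


def empirical_distribution(items):
    """Estimate a distribution from observed values by relative frequency.

    Hypothetical helper: this is one simple way to obtain P from a window
    of a data stream; it is not the estimator described in the paper.
    """
    counts = Counter(items)
    total = sum(counts.values())
    return [count / total for count in counts.values()]


if __name__ == "__main__":
    # A hypothetical window of values for one characteristic of the stream.
    window = ["a", "b", "a", "c", "a", "b", "d", "a"]
    p = empirical_distribution(window)
    print("P =", p)
    print("H(P) =", shannon_entropy(p))
```

Under this assumed definition, a characteristic whose values are spread evenly has high entropy, while a characteristic dominated by one value has entropy near zero, which is what makes entropy usable as a weighting factor.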