Abstract

Data mining is concerned with the analysis of data for finding patterns and regularities in the data sets. Statistics is a mathematical science concerned with the collection, analysis, interpretation or explanation, and presentation of data. Statistics plays a very important role in the process of data mining analysis and equally visualization of data plays a very important role in decision making process. Instance Based Learning Streams is an instance-based learning algorithm used to perform regression analysis on data streams. The algorithm is able to handle large data streams with less memory and computational power. The paper aims at the implementation of Instance Based Learning Streams as an extension to the massive online analysis framework for data stream mining to develop a regression model. The study reveals that the regression analysis could be performed not only on small data sets but also on data streams as in the present case but the method of analysis will be different in the two cases. In the case of small data set the regression models are linear, multiple and polynomial, while in the case of data streams the entire analysis is performed under the massive online analysis framework by taking the two evaluation parameters basic regression performance evaluator and windows regression performance evaluator. This finding is first of its kind in literature.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.