Abstract

Concept drift refers to sudden changes in the fundamental structure of the streaming data distribution over time. The core objective of concept drift research is to develop techniques and strategies for detect, understand, and adapt data streaming drifts. Data research has shown that if concept drift is not handled properly, machine learning in such an environment would provide subpar learning outcomes. In this paper, a historical Isolated Forest (HIF) is presented that depends on a decision tree, which split the data streaming into chunks and each chuck considers a region in the tree. HIF is employed to detect concept drifts and adapt this region with current changes. Which HIF stores previously generated models and employs the most similar model of each concept drift distribution as the current model until generate the best performance model. HIF doesn’t stop the main system model when retraining a new model, which HIF is divided into three primary parallel blocks: detection block, similarity block (online block), and retraining block (offline block). For several authentic data sets (three data sets), our suggested algorithm was verified and contrasted. the accuracy and execution speed were specifically assessed, and memory usage. The experimental results demonstrate that our modifications use fewer resources and have comparable or greater detection accuracy than the original IForestASD.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.