Abstract

Abstract Mining data streams require to cope with time, data size and possible concept drift constraints. Even more challenging is the case where, apart from the above, one has to deal with imbalanced data. Mining non stationary and imbalanced data streams is a relatively new area of research. In this paper, we propose the Gene Expression Programming (GEP) classifier with drift detection and data reuse for mining imbalanced data streams. GEP is used to evolve a complex expression tree returning predictions. Drift detector role is to signal the occurrence of drift which triggers inducing a new learner. Data reuse mechanism allows for improving the balance between minority and majority instances in a subset of data used for evolving the learner. The proposed approach is validated experimentally. The experiment results confirm that our classifier produces high-quality predictions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.