Abstract
Concept drift has been a widely concerned problem in data analysis and many important achievements have been obtained. However, there are few discussions on concept drift detection in inconsistent information system that is very common in various application fields. In this paper, a concept drift detection algorithm based on the decision distribution (IDP-DD) is proposed. To begin, we present the notion of decision distribution function by means of inclusion degree and delineate the decision empirical vector. Subsequently, we expound upon decision empirical distance and analyze the limits characteristics of sampling; thirdly, we display the algorithm implementation process of IDP-DD. Lastly, we evaluate the efficacy of IDP-DD with a some commonly employed datasets and a real-word example dataset. Theory analysis and experiment verification indicate that the algorithm IDP-DD can not only quantify the severity of concept drift, but also visually present the region where concept drift occurs. At the same time, combined with sampling method, this algorithm does not need to traverse all samples at one time, so the complexity and calculation cost can be effectively reduced. So IDP-DD is more stable and can effectively facilitate data-driven decision problems.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.