Abstract

Class imbalance and overlap coupling is the primary cause of performance degradation for classifiers. Unfortunately, it always occurs. In this paper, we propose a hybrid sampling method derived from optimized generative adversarial network and natural neighbor search (HD-GNNS) for this scenario. The new approach considers both the global distribution and local distribution of the minority class to improve the data distribution fundamentally. First, natural neighbor search with Fisher’s discriminant ratio is conducted to screen overlapped sample subset and remove noise samples. It effectively overcomes the parameter sensitivity by adaptively determining the search radius. Then, an encoder with squeeze and excite block is introduced into generative adversarial network, and the structure of generative adversarial network is optimized with cross-layer and low-rank matrix. It better captures the distribution characteristics of minority samples in overlapped subset for oversampling. Afterwards, the local density of majority samples in overlapped subset is calculated by the aforementioned natural neighbor search method, and Thornton’s Separation Index is used to implement under-sampling adaptively. We evaluate the proposed approach on 1 artificial dataset, 14 UCI datasets and 8 real-word datasets. The experimental results show that the proposed HD-GNNS exhibits more impressive performance compared to other benchmark methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call