Abstract
Learning a classifier from an imbalanced dataset is an important problem in data mining and machine learning. Because an imbalanced dataset carries more information from the majority classes than from the minority classes, a classifier tends to over-fit the former and under-fit the latter. Previous attempts to address the problem have focused on increasing the learning sensitivity to the minority classes and/or rebalancing the sample sizes among classes before learning. However, how to efficiently identify the optimal mix of rebalanced sample sizes remains an unresolved problem. Owing to non-linear relationships between attributes and class labels, merely rebalancing sample sizes rarely yields optimal results. Moreover, a brute-force search for the perfect combination is known to be NP-hard, so a smarter heuristic is required. In this paper, we propose the notion of swarm fusion to address the problem: using stochastic swarm heuristics to cooperatively optimize the mixtures. Compared with conventional rebalancing methods, e.g., linear search, our fusion approach finds a close-to-optimal mix with improved accuracy and reliability. Most importantly, it is found to run faster than other coupled swarm optimization techniques and iterative methods. In our experiments, we first compared the proposed solution with traditional methods on thirty publicly available imbalanced datasets. Using a neural network as the base learner, the proposed method outperforms the traditional methods by up to 69% in terms of the credibility of the learned classifiers. Secondly, we wrapped the proposed swarm fusion method around a decision tree; notably, it defeated six state-of-the-art methods on ten imbalanced datasets in all of the evaluation metrics that we considered.
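To make the idea of swarm-guided rebalancing concrete, the following is a minimal sketch, not the authors' swarm fusion algorithm itself: a generic particle swarm search over per-class oversampling ratios, scored by cross-validated balanced accuracy with a decision tree. The helper names (resample_with_ratios, fitness, pso_search), the PSO coefficients, and the ratio bounds are illustrative assumptions, not details taken from the paper.

```python
# Sketch: PSO over per-class oversampling ratios (assumed setup, not the paper's exact method).
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import cross_val_score

def resample_with_ratios(X, y, ratios, rng):
    """Randomly oversample each class by its ratio (1.0 = keep original size)."""
    parts_X, parts_y = [], []
    for cls, r in zip(np.unique(y), ratios):
        idx = np.where(y == cls)[0]
        n_new = max(1, int(round(r * len(idx))))
        picked = rng.choice(idx, size=n_new, replace=True)
        parts_X.append(X[picked])
        parts_y.append(y[picked])
    return np.vstack(parts_X), np.concatenate(parts_y)

def fitness(ratios, X, y, rng):
    """Score one candidate mix: rebalance, then cross-validate a decision tree."""
    Xr, yr = resample_with_ratios(X, y, ratios, rng)
    clf = DecisionTreeClassifier(random_state=0)
    return cross_val_score(clf, Xr, yr, cv=3, scoring="balanced_accuracy").mean()

def pso_search(X, y, n_particles=10, n_iter=20, seed=0):
    """Particle swarm search for a good vector of per-class oversampling ratios."""
    rng = np.random.default_rng(seed)
    dim = len(np.unique(y))                              # one ratio per class
    pos = rng.uniform(1.0, 5.0, (n_particles, dim))      # oversampling ratios >= 1
    vel = np.zeros_like(pos)
    pbest = pos.copy()
    pbest_fit = np.array([fitness(p, X, y, rng) for p in pos])
    gbest = pbest[pbest_fit.argmax()].copy()
    for _ in range(n_iter):
        r1, r2 = rng.random(pos.shape), rng.random(pos.shape)
        vel = 0.7 * vel + 1.5 * r1 * (pbest - pos) + 1.5 * r2 * (gbest - pos)
        pos = np.clip(pos + vel, 1.0, 10.0)
        fit = np.array([fitness(p, X, y, rng) for p in pos])
        improved = fit > pbest_fit
        pbest[improved], pbest_fit[improved] = pos[improved], fit[improved]
        gbest = pbest[pbest_fit.argmax()].copy()
    return gbest, pbest_fit.max()
```

The sketch uses a single plain PSO loop; the paper's contribution is the cooperative fusion of multiple swarm heuristics over the rebalancing mixture, which would replace the single velocity update above.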