Noisy Training Set Research Articles

With the popularity of online payment, how to perform credit card fraud detection more accurately has also become a hot issue. And with the emergence of the adaptive boosting algorithm (Adaboost), credit card fraud detection has started to use this method in large numbers, but the traditional Adaboost is prone to overfitting in the presence of noisy samples. Therefore, in order to alleviate this phenomenon, this paper proposes a new idea: using the number of consecutive sample misclassifications to determine the noisy samples, while constructing a penalty factor to reconstruct the sample weight assignment. Firstly, the theoretical analysis shows that the traditional Adaboost method is overfitting in a noisy training set, which leads to the degradation of classification accuracy. To this end, the penalty factor constructed by the number of consecutive misclassifications of samples is used to reconstruct the sample weight assignment to prevent the classifier from over-focusing on noisy samples, and its reasonableness is demonstrated. Then, by comparing the penalty strength of the three different penalty factors proposed in this paper, a more reasonable penalty factor is selected. Meanwhile, in order to make the constructed model more in line with the actual requirements on training time consumption, the Adaboost algorithm with adaptive weight trimming (AWTAdaboost) is used in this paper, so the penalty factor-based AWTAdaboost (PF_AWTAdaboost) is finally obtained. Finally, PF_AWTAdaboost is experimentally validated against other traditional machine learning algorithms on credit card fraud datasets and other datasets. The results show that the PF_AWTAdaboost method has better performance, including detection accuracy, model recall and robustness, than other methods on the credit card fraud dataset. And the PF_AWTAdaboost method also shows excellent generalization performance on other datasets. From the experimental results, it is shown that the PF_AWTAdaboost algorithm has better classification performance.

One of the key and difficult points in seismic data processing is seismic data denoising. Under the influence of the acquisition environment, the collected seismic data usually have low signal-to-noise ratio (SNR) and low resolution. In desert region of western China, different from other regions, the random noise there has complex characteristics of non-Gaussian, non-stationary and non-linearity. Its main frequency is quite low and its spectral overlap with those of effective signals. Moreover, large amount of data and intelligent requirements make traditional denoising methods encounter difficulties. In addition, the existing deep learning denoising methods also reveal two problems: first, they can only effectively suppress simple seismic noise such as Gaussian noise. But for the case of desert noise, they usually mistakenly judge noise as effective signals and destroy signal characteristic structure. Second, what is more important, in supervised-based methods paired noisy and pure data are usually used as training set. However, there is no noise-free data in actual desert seismic data, which severely limits the processing performance of supervised-based networks. Thus, a new parameter-shared variational auto-encoding adversarial network (PS-VAAN) is proposed for desert seismic data denoising in this paper. This new method includes two encoders, two generators and two discriminators. It can realize domain conversion from noisy data domain to pure data domain. We construct relatively complete unpaired pure and noisy data training sets for training, and design reconstruction loss function and adversarial loss function to optimize network parameters. Moreover, cycle-consistency loss is introduced to make sure two domains mapping into the same space. Compared with the state-of-art denoising methods on synthetic and actual seismic records, the proposed method not only has superior denoising ability, but also has strong feature extraction and fitting ability to recover effective signals with almost no energy loss.

Noisy Training Set Research Articles

Articles published on Noisy Training Set

Knockoffs-SPR: Clean Sample Selection in Learning With Noisy Labels.

Uncertainty Quantification of a Machine Learning Model for Identification of Isolated Nonlinearities with Conformal Prediction

A balanced random learning strategy for CNN based Landsat image segmentation under imbalanced and noisy labels

Multi-proxy feature learning for robust fine-grained visual recognition

A Credit Card Fraud Model Prediction Method Based on Penalty Factor Optimization AWTadaboost

Attentive-Adaptive Network for Hyperspectral Images Classification With Noisy Labels

Parameter-shared variational auto-encoding adversarial network for desert seismic data denoising in Northwest China

Training redefinition with entropy-based structure set density for supervised hyperspectral imagery classification

Training deep retrieval models with noisy datasets: Bag exponential loss

Hyperspectral Classification With Noisy Label Detection via Superpixel-to-Pixel Weighting Distance

Robust Learning of Mislabeled Training Samples for Remote Sensing Image Scene Classification

Set2Model networks: Learning discriminatively to learn generative models

TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions.

Multispectral Image Analysis using Decision Trees

A new hybrid semi-supervised algorithm for text classification with class-based semantics

Knowledge base population using semantic label propagation

Unconfused ultraconservative multiclass algorithms

Enhanced regulatory sequence prediction using gapped k-mer features.

A novel virtual sample generation method based on Gaussian distribution

Weight decay backpropagation for noisy data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Noisy Training Set Research Articles

Articles published on Noisy Training Set

Knockoffs-SPR: Clean Sample Selection in Learning With Noisy Labels.

Uncertainty Quantification of a Machine Learning Model for Identification of Isolated Nonlinearities with Conformal Prediction

A balanced random learning strategy for CNN based Landsat image segmentation under imbalanced and noisy labels

Multi-proxy feature learning for robust fine-grained visual recognition

A Credit Card Fraud Model Prediction Method Based on Penalty Factor Optimization AWTadaboost

Attentive-Adaptive Network for Hyperspectral Images Classification With Noisy Labels

Parameter-shared variational auto-encoding adversarial network for desert seismic data denoising in Northwest China

Training redefinition with entropy-based structure set density for supervised hyperspectral imagery classification

Training deep retrieval models with noisy datasets: Bag exponential loss

Hyperspectral Classification With Noisy Label Detection via Superpixel-to-Pixel Weighting Distance

Robust Learning of Mislabeled Training Samples for Remote Sensing Image Scene Classification

Set2Model networks: Learning discriminatively to learn generative models

TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions.

Multispectral Image Analysis using Decision Trees

A new hybrid semi-supervised algorithm for text classification with class-based semantics

Knowledge base population using semantic label propagation

Unconfused ultraconservative multiclass algorithms

Enhanced regulatory sequence prediction using gapped k-mer features.

A novel virtual sample generation method based on Gaussian distribution

Weight decay backpropagation for noisy data