Cross-entropy Training Research Articles

Node classification in graph learning faces significant challenges due to imbalanced data, particularly for under-represented samples from minority classes. To address this issue, existing methods often rely on synthetic minority over-sampling techniques, which introduce additional complexity during model training. We introduce GraphECC, an approach that addresses numerical anomalies in large-scale datasets by replacing the traditional cross-entropy (CE) loss function with an Enhanced Complementary Classifier (ECC) loss function, a novel modification to the CCE loss. This alteration ensures computational stability and mitigates potential numerical anomalies by incorporating a slight offset in the denominator during the computation of the complementary probability distribution. In this paper, we present ECC as a novel training paradigm that offers “imbalance defense for free”, improving node classification accuracy without extra procedures. The ECC approach optimizes model probabilities for the ground-truth class, akin to the cross-entropy method. Additionally, it neutralizes probabilities associated with incorrect classes through a “guided” term, achieving a balanced trade-off between the two aspects. Experimental results demonstrate that our proposed method not only enhances model robustness but also surpasses the widely used cross-entropy training objective. Moreover, we demonstrate the versatility of our method by integrating it with various well-known adversarial training techniques, resulting in significant gains in robustness. Notably, our approach enhances model robustness without compromising performance, distinguishing it from previous attempts. The code for GraphECC can be accessed at https://github.com/12chen20/GraphECC.
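
The abstract above does not give the ECC formula, so the following is only a rough sketch of the general idea it describes: a cross-entropy term for the ground-truth class, plus a term that flattens the probabilities of the incorrect classes, with a small offset in the denominator of the complementary distribution. The function name `complement_style_loss`, the offset `eps`, the `weight` parameter, and the way the two terms are combined are all assumptions made for illustration; they are not taken from the GraphECC code.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def complement_style_loss(logits, target, eps=1e-7, weight=1.0):
    """Hypothetical complement-style loss for a single sample.

    Assumed form (not the paper's exact definition):
        loss = CE(target) - weight * H(complement distribution)
    where the complement distribution renormalizes the incorrect-class
    probabilities by (1 - p_target + eps).
    """
    probs = softmax(logits)
    p_true = probs[target]

    # Cross-entropy term: pushes up the ground-truth probability.
    ce = -math.log(p_true + eps)

    # The eps offset keeps the denominator non-zero when p_true rounds to 1.
    denom = 1.0 - p_true + eps

    # Entropy of the incorrect classes; larger means flatter.
    comp_entropy = 0.0
    for j, p in enumerate(probs):
        if j == target:
            continue
        q = p / denom
        if q > 0.0:
            comp_entropy -= q * math.log(q)

    # Subtracting the entropy rewards flat incorrect-class probabilities.
    return ce - weight * comp_entropy


if __name__ == "__main__":
    # A very confident prediction: without eps the denominator would be 0.
    print(complement_style_loss([100.0, 0.0, 0.0], target=0))
```

In this sketch, setting `weight=0.0` recovers plain cross-entropy (up to the `eps` offset), which makes it easy to compare the two objectives on the same logits.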

Recently, a series of attempts have incorporated spatial attention mechanisms into the task of image captioning, achieving a remarkable improvement in the quality of generated captions. However, the traditional spatial attention mechanism uses latent and delayed semantic representations to decide which area deserves more attention, resulting in inaccurate semantic guidance and the introduction of redundant information. To optimize the spatial attention mechanism, we propose the Semantic Guidance Attention (SGA) mechanism in this article. Specifically, SGA uses semantic word representations to provide intuitive semantic guidance that focuses accurately on semantically related regions. Moreover, we reduce the difficulty of generating fluent sentences by updating the attention information in time. Separately, the beam search algorithm is widely used to predict words during sequence generation. This algorithm generates a sentence according to the probabilities of words, so it tends to produce a generic sentence and discard more distinctive captions. To overcome this limitation, we design the Consensus Selection (CS) strategy to choose the most descriptive and informative caption, which is selected by the semantic similarity of captions instead of the probabilities of words. The consensus caption is the one with the highest cumulative semantic similarity with respect to the reference captions. Our proposed model (SGA-CS) is validated on Flickr30k and MSCOCO, where it outperforms state-of-the-art approaches. To the best of our knowledge, SGA-CS is the first attempt to jointly produce semantic attention guidance and select descriptive captions for image captioning tasks, achieving one of the best performance ratings among cross-entropy training methods.
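
The selection rule described above (pick the candidate with the highest cumulative similarity to the reference captions) can be sketched as follows. The abstract does not say which semantic similarity measure is used, so this sketch substitutes a simple token-level Jaccard overlap purely as a stand-in; the function names `jaccard` and `consensus_select` and the example captions are hypothetical.

```python
def jaccard(a, b):
    """Token-set overlap between two captions (a stand-in similarity)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def consensus_select(candidates, references, similarity=jaccard):
    """Return the candidate with the highest summed similarity to references.

    Word probabilities are not used here; only caption-level similarity
    decides the winner. Ties go to the earliest candidate.
    """
    if not candidates:
        raise ValueError("candidates must not be empty")
    return max(
        candidates,
        key=lambda c: sum(similarity(c, r) for r in references),
    )


if __name__ == "__main__":
    candidates = ["a dog", "a brown dog runs on grass"]
    references = [
        "a brown dog runs across the grass",
        "brown dog running on grass",
    ]
    # The generic caption "a dog" overlaps less with the references.
    print(consensus_select(candidates, references))
```

Any other caption-level similarity can be passed through the `similarity` argument, which is where a learned semantic measure would plug in.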

Articles published on Cross-entropy Training

Transforming the bootstrap: using transformers to compute scattering amplitudes in planar N=4 super Yang–Mills theory

Enhancing Brain Stroke Detection: A Novel Deep Neural Network with Weighted Binary Cross Entropy Training

GraphECC: Enhancing GraphSMOTE with enhanced complementary classifier

Transformers in High-Frequency Trading

Curricular-balanced long-tailed learning

Multi-Loss U-Net Reformulation as an Efficient Solution to the Colony-Forming Unit Counting Problem.

Deep Motion Prior for Weakly-Supervised Temporal Action Localization.

Cross-Entropy Training of DNN Ensemble Acoustic Models for Low-Resource ASR

Image Captioning via Semantic Guidance Attention and Consensus Selection Strategy

A Continuous Relaxation of Beam Search for End-to-End Training of Neural Sequence Models

Nonrecurrent Neural Structure for Long-Term Dependence

A Bayesian Framework for XML Information Retrieval: Searching and Learning with the INEX Collection
