Loss Metrics Research Articles

Evaluating classification accuracy is a key component of the training and validation stages of thematic map production, and the choice of metric has profound implications for both the success of the training process and the reliability of the final accuracy assessment. We explore key considerations in selecting and interpreting loss and assessment metrics in the context of data imbalance, which arises when the classes have unequal proportions within the dataset or landscape being mapped. The challenges involved in calculating single, integrated measures that summarize classification success, especially for datasets with considerable data imbalance, have led to much confusion in the literature. This confusion arises from a range of issues, including a lack of clarity over the redundancy of some accuracy measures, the importance of calculating final accuracy from population-based statistics, the effects of class imbalance on accuracy statistics, and the differing roles of accuracy measures when used for training and final evaluation. In order to characterize classification success at the class level, users typically generate averages from the class-based measures. These averages are sometimes generated at the macro-level, by taking averages of the individual-class statistics, or at the micro-level, by aggregating values within a confusion matrix, and then, calculating the statistic. We show that the micro-averaged producer’s accuracy (recall), user’s accuracy (precision), and F1-score, as well as weighted macro-averaged statistics where the class prevalences are used as weights, are all equivalent to each other and to the overall accuracy, and thus, are redundant and should be avoided. Our experiment, using a variety of loss metrics for training, suggests that the choice of loss metric is not as complex as it might appear to be, despite the range of choices available, which include cross-entropy (CE), weighted CE, and micro- and macro-Dice. The highest, or close to highest, accuracies in our experiments were obtained by using CE loss for models trained with balanced data, and for models trained with imbalanced data, the highest accuracies were obtained by using weighted CE loss. We recommend that, since weighted CE loss used with balanced training is equivalent to CE, weighted CE loss is a good all-round choice. Although Dice loss is commonly suggested as an alternative to CE loss when classes are imbalanced, micro-averaged Dice is similar to overall accuracy, and thus, is particularly poor for training with imbalanced data. Furthermore, although macro-Dice resulted in models with high accuracy when the training used balanced data, when the training used imbalanced data, the accuracies were lower than for weighted CE. In summary, the significance of this paper lies in its provision of readers with an overview of accuracy and loss metric terminology, insight regarding the redundancy of some measures, and guidance regarding best practices.

Read full abstract

Background: Multi-output Time series forecasting is a complex problem that requires handling interdependencies and interactions between variables. Traditional statistical approaches and machine learning techniques often struggle to predict such scenarios accurately. Advanced techniques and model reconstruction are necessary to improve forecasting accuracy in complex scenarios. Objective: This study proposed an Encoder-Decoder network to address multi-output time series forecasting challenges by simultaneously predicting each output. This objective is to investigate the capabilities of the Encoder-Decoder architecture in handling multi-output time series forecasting tasks. Methods: This proposed model utilizes a 1-Dimensional Convolution Neural Network with Bidirectional Long Short-Term Memory, specifically in the encoder part. The encoder extracts time series features, incorporating a residual connection to produce a context representation used by the decoder. The decoder employs multiple unidirectional LSTM modules and Linear transformation layers to generate the outputs each time step. Each module is responsible for specific output and shares information and context along the outputs and steps. Results: The result demonstrates that the proposed model achieves lower error rates, as measured by MSE, RMSE, and MAE loss metrics, for all outputs and forecasting horizons. Notably, the 6-hour horizon achieves the highest accuracy across all outputs. Furthermore, the proposed model exhibits robustness in single-output forecast and transfer learning, showing adaptability to different tasks and datasets. Conclusion: The experiment findings highlight the successful multi-output forecasting capabilities of the proposed model in time series data, with consistently low error rates (MSE, RMSE, MAE). Surprisingly, the model also performs well in single-output forecasts, demonstrating its versatility. Therefore, the proposed model effectively various time series forecasting tasks, showing promise for practical applications. Keywords: Bidirectional Long Short-Term Memory, Convolutional Neural Network, Encoder-Decoder Networks, Multi-output forecasting, Multi-step forecasting, Time-series forecasting

Read full abstract

Loss Metrics Research Articles

Related Topics

Articles published on Loss Metrics

An Analysis of Loss Functions for Heavily Imbalanced Lesion Segmentation.

Emotion Detection from Photos Using MobleNet-based Deep Learning

Evaluation of a multi-function in-ear device performance in the presence of impulse noise using acoustic test fixtures

Investigating the sensitivity of losses to time-dependent components of seismic risk modeling

Selecting and Interpreting Multiclass Loss and Accuracy Assessment Metrics for Classifications with Class Imbalance: Guidance and Best Practices

Analysis of Investors’ Choices in Technology Companies

A Deep Learning Approach for the Automated Classification of Geomagnetically Induced Current Scalograms

An evaluation of machine learning approaches for early diagnosis of autism spectrum disorder

Multi-scale and multi-refinement framework for seismic risk assessment of urban areas

Research on incentive mechanisms for anti-heterogeneous federated learning based on reputation and contribution

Application of Machine Learning in Estimating Milk Yield According to the Phenotypic and Pedigree Data of Holstein-Friesian Cattle in Serbia

A Study on The Improvement of Information Loss Metrics in Real-Time Stream Data Anonymization

Fully Automated Analysis of Muscle Architecture from B-Mode Ultrasound Images with DL_Track_US

Surpassing early stopping: A novel correlation-based stopping criterion for neural networks

Postoperative Hemodynamics of Total Knee Arthroplasty Unaffected by Cementless Approach under Contemporary Patient Blood Management Protocol: A Propensity Score-Matched Study.

Transforming Text Generation in NLP: Deep Learning with GPT Models and 2023 Twitter Corpus Using Transformer Architecture

Enhancing Image Quality: A Comparative Study of Spatial, Frequency Domain, and Deep Learning Methods

Enhancing Multi-Output Time Series Forecasting with Encoder-Decoder Networks

Vision Transformers and Transfer Learning Approaches for Arabic Sign Language Recognition

The Impact of Early-Stage Chronic Kidney Disease on Weight Loss Outcomes After Gastric Bypass

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Loss Metrics Research Articles

Related Topics

Articles published on Loss Metrics

An Analysis of Loss Functions for Heavily Imbalanced Lesion Segmentation.

Emotion Detection from Photos Using MobleNet-based Deep Learning

Evaluation of a multi-function in-ear device performance in the presence of impulse noise using acoustic test fixtures

Investigating the sensitivity of losses to time-dependent components of seismic risk modeling

Selecting and Interpreting Multiclass Loss and Accuracy Assessment Metrics for Classifications with Class Imbalance: Guidance and Best Practices

Analysis of Investors’ Choices in Technology Companies

A Deep Learning Approach for the Automated Classification of Geomagnetically Induced Current Scalograms

An evaluation of machine learning approaches for early diagnosis of autism spectrum disorder

Multi-scale and multi-refinement framework for seismic risk assessment of urban areas

Research on incentive mechanisms for anti-heterogeneous federated learning based on reputation and contribution

Application of Machine Learning in Estimating Milk Yield According to the Phenotypic and Pedigree Data of Holstein-Friesian Cattle in Serbia

A Study on The Improvement of Information Loss Metrics in Real-Time Stream Data Anonymization

Fully Automated Analysis of Muscle Architecture from B-Mode Ultrasound Images with DL_Track_US

Surpassing early stopping: A novel correlation-based stopping criterion for neural networks

Postoperative Hemodynamics of Total Knee Arthroplasty Unaffected by Cementless Approach under Contemporary Patient Blood Management Protocol: A Propensity Score-Matched Study.

Transforming Text Generation in NLP: Deep Learning with GPT Models and 2023 Twitter Corpus Using Transformer Architecture

Enhancing Image Quality: A Comparative Study of Spatial, Frequency Domain, and Deep Learning Methods

Enhancing Multi-Output Time Series Forecasting with Encoder-Decoder Networks

Vision Transformers and Transfer Learning Approaches for Arabic Sign Language Recognition

The Impact of Early-Stage Chronic Kidney Disease on Weight Loss Outcomes After Gastric Bypass