<b>2724</b> <h3><b>Introduction:</b></h3> In developing artificial intelligence (AI) algorithms for nuclear medicine applications, several pitfalls are frequently encountered that can impede progress, lead to erroneous findings, and ultimately limit the clinical utility of algorithms. The AI Task Force of the Society of Nuclear Medicine and Molecular Imaging has identified pitfalls that commonly afflict AI algorithm development, and we provide suggestions on how best to avoid them. <h3><b>Methods:</b></h3> Here we address three of the most common and detrimental pitfalls that affect AI algorithm development, including for applications within nuclear medicine: 1) exaggerated estimates of algorithm performance (reproducibility); 2) algorithms with acceptable performance in only limited populations (generalizability); and 3) algorithms that are poorly matched to the clinical need (suitability). <h3><b>Results:</b></h3> To address the challenge of poor reproducibility (i.e., the inability to replicate previous research findings), algorithms must be evaluated on datasets that are independent of the training data. Developmental datasets should be partitioned into training and holdout testing cohorts, and performance measurements should be reported on the withheld test cohort. For all but large datasets, cross-validation methods such as nested cross-validation should be used. Data leakage, in which information from the test set influences the model’s training, should be avoided. For hypothesis testing, statistical power analysis should be used to determine the sample size of the test cohort, and preplanned statistical analyses can help avoid “p-hacking”. Code and models should be made publicly available and be sufficient to enable replication. The reporting of results in the literature should be thorough and transparent, and we recommend the use of reporting checklists.
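The reproducibility protocol above (holdout partitioning, nested cross-validation, and leakage avoidance) can be sketched in code. This is a minimal illustration, assuming scikit-learn and a synthetic feature set standing in for imaging-derived features; the classifier, fold counts, and hyperparameter grid are illustrative assumptions, not a recommendation.

```python
# Sketch of a leakage-free evaluation protocol (assumes scikit-learn).
from sklearn.datasets import make_classification
from sklearn.model_selection import (train_test_split, GridSearchCV,
                                     cross_val_score, StratifiedKFold)
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression

# Synthetic features/labels standing in for a developmental dataset.
X, y = make_classification(n_samples=300, n_features=20, random_state=0)

# 1) Partition into training and withheld test cohorts BEFORE any fitting.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0)

# 2) Keep preprocessing inside the pipeline so the scaler is fit only on
#    training folds -- fitting it on all data would leak test information.
pipe = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

# 3) Nested cross-validation on the training cohort: the inner loop tunes
#    the regularization strength, the outer loop estimates performance.
inner = GridSearchCV(pipe, {"logisticregression__C": [0.1, 1.0, 10.0]},
                     cv=StratifiedKFold(5, shuffle=True, random_state=0))
outer_scores = cross_val_score(
    inner, X_train, y_train,
    cv=StratifiedKFold(5, shuffle=True, random_state=1))

# 4) Report final performance once, on the withheld test cohort only.
final_model = inner.fit(X_train, y_train)
test_accuracy = final_model.score(X_test, y_test)
```

The key design point is that the withheld test cohort is touched exactly once, after all model selection is complete, so the reported accuracy is not inflated by tuning decisions.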
To address the challenge of poor generalizability, developmental datasets should be collected from diverse sources, representing the anticipated variability of the real-world clinical population, including images from different scanner technologies. Data samples should be collected from groups that might be vulnerable to biases, and subgroup analyses should be performed. Dataset shift should be assessed by evaluating the trained model on external cohorts from different institutions. To address the challenge of poor suitability, development teams should include not just AI experts but also clinical domain experts and stakeholders, including physicians and technologists, so that the algorithm’s output can be best aligned with the clinical need. For applications in which algorithm explainability is deemed beneficial to users, algorithms should be designed so that the model’s predictions are interpretable and explainable. The confidence associated with the algorithm output should be provided whenever possible. <h3><b>Conclusions:</b></h3> The recent growth of interest in AI has led to many promising technologies in nuclear medicine but also poses several challenges, including algorithms with poor reproducibility, poor generalizability, and poor suitability for clinical tasks. By following best practices for AI algorithm development, these challenges can be overcome and developers can realize the promise of AI while avoiding the pitfalls.
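The subgroup analysis recommended above can be sketched as follows. This is a minimal illustration using NumPy with simulated predictions; the "scanner" attribute, group sizes, and error rate are hypothetical assumptions chosen only to show how a per-subgroup performance gap is surfaced.

```python
# Hedged sketch of a subgroup analysis: compare accuracy across patient
# subgroups (here a hypothetical scanner-type attribute) to flag
# performance gaps that would undermine generalizability.
import numpy as np

rng = np.random.default_rng(0)

# Simulated labels and a scanner attribute per case (hypothetical data).
labels = rng.integers(0, 2, size=200)
predictions = labels.copy()
scanners = np.where(np.arange(200) < 150, "scanner_A", "scanner_B")

# Simulate degraded performance on the under-represented scanner_B cohort
# by flipping ~30% of its predictions.
flip = (scanners == "scanner_B") & (rng.random(200) < 0.3)
predictions[flip] = 1 - predictions[flip]

# Per-subgroup accuracy: the core of the subgroup analysis.
subgroup_accuracy = {
    s: float(np.mean(predictions[scanners == s] == labels[scanners == s]))
    for s in np.unique(scanners)
}

# A large between-subgroup gap is a warning sign of bias or dataset shift.
gap = abs(subgroup_accuracy["scanner_A"] - subgroup_accuracy["scanner_B"])
```

The same per-subgroup comparison applies directly to dataset-shift evaluation: replace the scanner attribute with an institution label and compare performance on the internal cohort against each external cohort.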