Context: The increasing use of artificial neural network (ANN) classifiers in systems, especially safety-critical systems (SCSs), requires ensuring their robustness against out-of-distribution (OOD) shifts in operation, i.e., changes in the underlying data distribution relative to the data used to train the classifier. However, measuring the robustness of classifiers in operation with only unlabeled data is challenging. Additionally, machine learning engineers may need to compare different models, or versions of the same model, and switch to the most robust version.

Objective: This paper explores the problem of dynamic robustness evaluation for automated model selection. We aim to find efficient and effective metrics for evaluating and comparing the robustness of multiple ANN classifiers using unlabeled operational data.

Methods: To quantitatively measure differences between model outputs and assess robustness under OOD shifts using unlabeled data, we adopt distance-based metrics. We empirically compare five such metrics suited to high-dimensional data such as images. The selected metrics are Wasserstein distance (WD), maximum mean discrepancy (MMD), Hellinger distance (HL), the Kolmogorov–Smirnov statistic (KS), and Kullback–Leibler divergence (KL), all known for their efficacy in quantifying differences between distributions. We evaluate these metrics on 20 state-of-the-art models (ten CIFAR10-based, five CIFAR100-based, and five ImageNet-based models) from a widely used robustness benchmark (RobustBench), using data perturbed with corruptions of various types and magnitudes to mimic real-world OOD shifts.

Results: Our findings reveal that the WD metric outperforms the others when ranking the CIFAR10- and CIFAR100-based models, while the KS metric performs best for the ImageNet-based models. MMD can be used as a reliable second option in both cases.

Conclusion: This study highlights the effectiveness of distance-based metrics in ranking models' robustness for automated model selection. It also emphasizes the importance of further research on dynamic robustness evaluation.
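The abstract describes the distance-based metrics only at a high level. As an illustrative sketch, and not the paper's exact experimental protocol, the Python snippet below shows one way such scores could be computed from model softmax outputs and used to rank candidate models on unlabeled operational data. The helper names (`avg_wasserstein`, `rbf_mmd`, `rank_models`), the per-class 1D Wasserstein averaging, the RBF-kernel MMD estimator, and the use of a smaller output-distribution shift as a proxy for higher robustness are all assumptions made for this sketch.

```python
import numpy as np
from scipy.stats import wasserstein_distance

def avg_wasserstein(ref_probs, op_probs):
    """Average per-class 1D Wasserstein distance between softmax outputs
    on reference (in-distribution) data and on operational data.
    (Illustrative choice; the paper may apply WD differently.)"""
    return float(np.mean([
        wasserstein_distance(ref_probs[:, c], op_probs[:, c])
        for c in range(ref_probs.shape[1])
    ]))

def rbf_mmd(x, y, gamma=1.0):
    """Biased (V-statistic) estimate of squared MMD with an RBF kernel."""
    def gram(a, b):
        sq = np.sum(a**2, 1)[:, None] + np.sum(b**2, 1)[None, :] - 2.0 * a @ b.T
        return np.exp(-gamma * sq)
    return float(gram(x, x).mean() + gram(y, y).mean() - 2.0 * gram(x, y).mean())

def rank_models(models, ref_x, op_x, metric=avg_wasserstein):
    """Rank candidate models by the distance between their output
    distributions on reference data and on unlabeled operational data;
    a smaller shift is taken here as a proxy for higher robustness.
    `models` maps a model name to a callable returning softmax probabilities."""
    scores = {name: metric(f(ref_x), f(op_x)) for name, f in models.items()}
    return sorted(scores.items(), key=lambda kv: kv[1])
```

In practice, the metric passed to `rank_models` would be whichever one ranks models most faithfully for the dataset family at hand, e.g., WD for the CIFAR-based models and KS for the ImageNet-based models according to the reported results.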