Real-world Domains Research Articles

Abstract Deep learning algorithms often are developed and trained on a training dataset and deployed on test datasets. Any systematic difference between the training and a test dataset may severely degrade the final algorithm performance on the test dataset -- what is known as the domain shift problem. This issue is prevalent in many scientific domains where algorithms are trained on simulated data but applied to real-world datasets. Typically, the domain shift problem is solved through various domain adaptation methods. However, these methods are often tailored to a specific downstream task and may not easily generalize to different tasks. This work explores the feasibility of using an alternative way to solve the domain shift problem that is not specific to any downstream algorithm. The proposed approach relies on modern Unpaired Image-to-Image translation techniques, designed to find translations between different image domains in a fully unsupervised fashion. In this study, the approach is applied to a domain shift problem commonly encountered in Liquid Argon Time Projection Chamber detector research when seeking a way to translate samples between two differently distributed LArTPC detector datasets deterministically. This translation allows for mapping real-world data into the simulated data domain where the downstream algorithms can be run with much less domain-shift-related performance degradation. Conversely, using the translation from the simulated data in a real-world domain can increase the realism of the simulated dataset and reduce the magnitude of any systematic uncertainties. We adapted several popular UI2I translation algorithms to work on scientific data and demonstrated the viability of these techniques for solving the domain shift problem with LArTPC detector data. To facilitate further development of domain adaptation techniques for scientific datasets, the "Simple Liquid-Argon Track Samples" (SLATS) dataset used in this study also is published.

Read full abstract

Classically, in classification and clustering problems, an individual is exclusively assigned to a class (labeled data) or a cluster (unlabeled data). However, the assignment to a single class or cluster may be too strict in several real-world domains. In many cases, an individual may belong to multiple classes or clusters at the same time. On the other hand, we are starting to see machine learning algorithms that solve the problem of multi-label classification and multiple cluster assignment, but there are no algorithms that solve both problems simultaneously. Classic semi-supervised algorithms can work with labeled and unlabeled data simultaneously, but these types of algorithms assign individuals to a single class or a single cluster. Today, artificial intelligence faces the challenge of developing semi-supervised learning algorithms to work with semi-labeled data that can be assigned to different classes or clusters at the same time. In particular, this paper proposes a semi-supervised learning algorithm to fill this gap, which can solve the problems of multi-label classification and multiple cluster assignment simultaneously, on a semi-labeled dataset. This proposal is based on the LAMDA algorithm (Learning Algorithm for Multivariate Data Analysis), which calculates the degree of membership of a data to a group/class. In this work, a membership threshold is defined, which allows individuals to be assigned to classes or clusters that have a membership greater than this membership threshold. Thus, the main contribution of this work is the development of a semi-supervised algorithm that can process semi-labeled datasets to assign them to multiple classes and/or clusters. Furthermore, the work defines a metric to evaluate its efficiency in a semi-supervised context, called Multi Label-Cluster Index (MULCI). This proposal is tested on several datasets from the domains of multi-label classification or multi-assignment clustering, or a combination of both, showing very encouraging results. Very good quality metrics results are achieved in multiclass and multicluster tasks.

Read full abstract

Real-world Domains Research Articles

Related Topics

Articles published on Real-world Domains

Characterizing and Understanding HGNN Training on GPUs

Unpaired Image Translation to Mitigate Domain Shift in Liquid Argon Time Projection Chamber Detector Responses

An Overview Of Deep Deterministic Policy Gradient Algorithm And Applications

Graph pooling in graph neural networks: methods and their applications in omics studies

A semi-supervised learning algorithm for multi-label classification and multi-assignment clustering problems based on a Multivariate Data Analysis

From Single-Objective to Bi-Objective Maximum Satisfiability Solving

Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization.

The educational competition optimizer

Investigating the role of flight phase and task difficulty on low-time pilot performance, gaze dynamics and subjective situation awareness during simulated flight.

Multivariate Time Series Change-Point Detection with a Novel Pearson-like Scaled Bregman Divergence.

AGDF-Net: Learning Domain Generalizable Depth Features With Adaptive Guidance Fusion.

A containerised approach for multiform robotic applications.

MEAformer: An all-MLP transformer with temporal external attention for long-term time series forecasting

Scaling Offline Evaluation of Reinforcement Learning Agents through Abstraction

Siamese Autoencoder Architecture for the Imputation of Data Missing Not at Random

Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning (Abstract Reprint)

Continual Relation Extraction via Sequential Multi-Task Learning

Test-Time Domain Adaptation by Learning Domain-Aware Batch Normalization

Reinforcement learning applications in environmental sustainability: a review

An Effective Algorithm using Moth Flame Optimization (MFO) for Numerical Expression Solutions.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Real-world Domains Research Articles

Related Topics

Articles published on Real-world Domains

Characterizing and Understanding HGNN Training on GPUs

Unpaired Image Translation to Mitigate Domain Shift in Liquid Argon Time Projection Chamber Detector Responses

An Overview Of Deep Deterministic Policy Gradient Algorithm And Applications

Graph pooling in graph neural networks: methods and their applications in omics studies

A semi-supervised learning algorithm for multi-label classification and multi-assignment clustering problems based on a Multivariate Data Analysis

From Single-Objective to Bi-Objective Maximum Satisfiability Solving

Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization.

The educational competition optimizer

Investigating the role of flight phase and task difficulty on low-time pilot performance, gaze dynamics and subjective situation awareness during simulated flight.

Multivariate Time Series Change-Point Detection with a Novel Pearson-like Scaled Bregman Divergence.

AGDF-Net: Learning Domain Generalizable Depth Features With Adaptive Guidance Fusion.

A containerised approach for multiform robotic applications.

MEAformer: An all-MLP transformer with temporal external attention for long-term time series forecasting

Scaling Offline Evaluation of Reinforcement Learning Agents through Abstraction

Siamese Autoencoder Architecture for the Imputation of Data Missing Not at Random

Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning (Abstract Reprint)

Continual Relation Extraction via Sequential Multi-Task Learning

Test-Time Domain Adaptation by Learning Domain-Aware Batch Normalization

Reinforcement learning applications in environmental sustainability: a review

An Effective Algorithm using Moth Flame Optimization (MFO) for Numerical Expression Solutions.