Distribution Mismatch Research Articles

The out-of-distribution (OOD) detection task assumes samples that follow the distribution of training data as in-distribution (ID), while samples from other data distributions are considered OOD. In recent years, the OOD detection tasks have made significant progress since many studies observed that the distribution mismatch between training and real datasets can severely deteriorate the reliability of AI systems. Nevertheless, the lack of precise interpretation for the in-distribution (ID) limits the application of the OOD detection methods to real-world systems. To tackle this, we decompose the definition of the ID into texture and semantics, motivated by the demands of real-world scenarios. We also design new benchmarks to measure the robustness that OOD detection methods should have. Our proposed benchmark verifies not only the precision but also the robustness of the detection models. It is crucial to measure both factors in OOD detection as they indicate different traits of the model. For instance, precision is relevant to scenarios that detect minor cracks in the conveyor belt of a smart factory, whereas robustness pertains to maintaining performance under diverse weather conditions, as required by autonomous driving. To achieve a good balance between the OOD detection performance and robustness, our method takes a divide-and-conquer approach. Specifically, the proposed model first handles each component of the texture and semantics separately and then fuses these later. This philosophy is empirically proven by a series of benchmarks including both the proposed and the conventional counterpart. By decomposing the prior “unclear” definition of the ID into texture and semantic components, our novel approach better suits the demands of a reliable machine learning system, which requires robustness and consistent performance across varied scenarios. Unlike prior works, our approach does not rely on any extra datasets or labels. This prevents our proposed framework from being dependent on a particular dataset distribution.

Read full abstract

CNN model computation on edge devices is tightly restricted to the limited resource and power budgets, which motivates the low-bit quantization technology to compress CNN models into 4-bit or lower format to reduce the model size and increase hardware efficiency. Most current low-bit quantization methods use uniform quantization that maps weight and activation values onto evenly-distributed levels, which usually results in accuracy loss due to distribution mismatch. Meanwhile, some non-uniform quantization methods propose specialized representation that can better match various distribution shapes but are usually difficult to be efficiently accelerated on hardware. In order to achieve low-bit quantization with high accuracy and hardware efficiency, this paper proposes Universal Power-of-Two (UPoT), a novel low-bit quantization method that represents values as the addition of multiple power-of-two values selected from a series of subsets. By updating the subset contents, UPoT can provide adaptive quantization levels for various distributions. For each CNN model layer, UPoT automatically searches for the optimized distribution that minimizes the quantization error. Moreover, we design an efficient accelerator system with specifically optimized power-of-two multipliers and requantization units. Evaluations show that the proposed architecture can provide high-performance CNN inference with reduced circuit area and energy, and outperforms several mainstream CNN accelerators with higher ( <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$8\times $ </tex-math></inline-formula> – <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$65\times $ </tex-math></inline-formula> ) area efficiency and ( <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$2\times $ </tex-math></inline-formula> – <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$19\times $ </tex-math></inline-formula> ) energy efficiency. Further experiments of 4/3/2-bit quantization on ResNet18/50, MobileNet_V2 and EfficientNet models show that our UPoT can achieve high model accuracy which greatly outperform other state-of-the-art low-bit quantization methods by 0.3%–6%. The results indicate that our approach provides a highly-efficient accelerator for low-bit CNN model quantization with low hardware overheads and good model accuracy.

Read full abstract

Distribution Mismatch Research Articles

Related Topics

Articles published on Distribution Mismatch

Joint Energy-based Model for Semi-supervised Respiratory Sound Classification: A Method of Insensitive to Distribution Mismatch.

Across Sessions and Subjects Domain Adaptation for Building Robust Myoelectric Interface.

Improving Human Activity Recognition With Wearable Sensors Through BEE: Leveraging Early Exit and Gradient Boosting.

A Data-Driven Framework for Power System Event Type Identification via Safe Semi-Supervised Techniques

Self-supervised learning minimax entropy domain adaptation for the underwater target recognition

Enhanced 3D Pose Estimation in Multi-Person, Multi-View Scenarios through Unsupervised Domain Adaptation with Dropout Discriminator.

Estimating water pollution and economic cost embodied in the mining industry: An interprovincial analysis in China

Decomposing texture and semantic for out-of-distribution detection

Unsupervised Domain Adaptation with CycleGAN: Adapting Image Style and Content for Improved Cross-Domain Performance

An agile autonomous car driving assistance using hybrid optimization-based kernel support vector convolutional network

Designing optimal training sets for genomic prediction using adversarial validation with probit regression

Model-Based Offline Reinforcement Learning with Local Misspecification

DM²: Decentralized Multi-Agent Reinforcement Learning via Distribution Matching

Discriminable feature enhancement for unsupervised domain adaptation

Domain embedding transfer for unequal RGB-D image recognition

Translating AI to Clinical Practice: Overcoming Data Shift with Explainability.

Spatio‐temporal pattern and driving mechanisms of land use conflicts changes (2010–2018) in the Bohai Rim transition zone

Dataset Similarity to Assess Semisupervised Learning Under Distribution Mismatch Between the Labeled and Unlabeled Datasets

An Energy-and-Area-Efficient CNN Accelerator for Universal Powers-of-Two Quantization

Active Incremental Learning for Health State Assessment of Dynamic Systems With Unknown Scenarios

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Distribution Mismatch Research Articles

Related Topics

Articles published on Distribution Mismatch

Joint Energy-based Model for Semi-supervised Respiratory Sound Classification: A Method of Insensitive to Distribution Mismatch.

Across Sessions and Subjects Domain Adaptation for Building Robust Myoelectric Interface.

Improving Human Activity Recognition With Wearable Sensors Through BEE: Leveraging Early Exit and Gradient Boosting.

A Data-Driven Framework for Power System Event Type Identification via Safe Semi-Supervised Techniques

Self-supervised learning minimax entropy domain adaptation for the underwater target recognition

Enhanced 3D Pose Estimation in Multi-Person, Multi-View Scenarios through Unsupervised Domain Adaptation with Dropout Discriminator.

Estimating water pollution and economic cost embodied in the mining industry: An interprovincial analysis in China

Decomposing texture and semantic for out-of-distribution detection

Unsupervised Domain Adaptation with CycleGAN: Adapting Image Style and Content for Improved Cross-Domain Performance

An agile autonomous car driving assistance using hybrid optimization-based kernel support vector convolutional network

Designing optimal training sets for genomic prediction using adversarial validation with probit regression

Model-Based Offline Reinforcement Learning with Local Misspecification

DM²: Decentralized Multi-Agent Reinforcement Learning via Distribution Matching

Discriminable feature enhancement for unsupervised domain adaptation

Domain embedding transfer for unequal RGB-D image recognition

Translating AI to Clinical Practice: Overcoming Data Shift with Explainability.

Spatio‐temporal pattern and driving mechanisms of land use conflicts changes (2010–2018) in the Bohai Rim transition zone

Dataset Similarity to Assess Semisupervised Learning Under Distribution Mismatch Between the Labeled and Unlabeled Datasets

An Energy-and-Area-Efficient CNN Accelerator for Universal Powers-of-Two Quantization

Active Incremental Learning for Health State Assessment of Dynamic Systems With Unknown Scenarios