An Unsupervised Method for Industrial Image Anomaly Detection with Vision Transformer-Based Autoencoder.

Qiying Yang,Rongzuo Guo

doi:10.3390/s24082440

Abstract

Existing industrial image anomaly detection techniques predominantly utilize codecs based on convolutional neural networks (CNNs). However, traditional convolutional autoencoders are limited to local features, struggling to assimilate global feature information. CNNs' generalizability enables the reconstruction of certain anomalous regions. This is particularly evident when normal and abnormal regions, despite having similar pixel values, contain different semantic information, leading to ineffective anomaly detection. Furthermore, collecting abnormal image samples during actual industrial production poses challenges, often resulting in data imbalance. To mitigate these issues, this study proposes an unsupervised anomaly detection model employing the Vision Transformer (ViT) architecture, incorporating a Transformer structure to understand the global context between image blocks, thereby extracting a superior representation of feature information. It integrates a memory module to catalog normal sample features, both to counteract anomaly reconstruction issues and bolster feature representation, and additionally introduces a coordinate attention (CA) mechanism to intensify focus on image features at both spatial and channel dimensions, minimizing feature information loss and thereby enabling more precise anomaly identification and localization. Experiments conducted on two public datasets, MVTec AD and BeanTech AD, substantiate the method's effectiveness, demonstrating an approximate 20% improvement in average AUROC% at the image level over traditional convolutional encoders.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors	Publication Date: Apr 11, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An Unsupervised Method for Industrial Image Anomaly Detection with Vision Transformer-Based Autoencoder.

Abstract

Talk to us

Similar Papers

More From: Sensors

Lead the way for us

Similar Papers

Super-resolution reconstruction of binocular image based on multi-level fusion attention network
Lei Xu ... Huihui Song
Journal of Image and Graphics | VOL. 28
Lei Xu, et. al.Lei Xu ... Huihui Song
01 Jan 2023
Journal of Image and Graphics | VOL. 28

Feature semantic alignment and information supplement for Text-based person search
Hang Zhou ... Xuening Tian
Frontiers in Physics | VOL. 11
Hang Zhou, et. al.Hang Zhou ... Xuening Tian
19 May 2023
Frontiers in Physics | VOL. 11

TriConvUNeXt: A Pure CNN-Based Lightweight Symmetrical Network for Biomedical Image Segmentation.
Chao Ma ... Ziyang Wang
Journal of Imaging Informatics in Medicine | VOL. -
Chao Ma, et. al.Chao Ma ... Ziyang Wang
23 Apr 2024
Journal of Imaging Informatics in Medicine | VOL. -

Eff-PCNet: An Efficient Pure CNN Network for Medical Image Classification
Wenwen Yue ... Yongming Li
Applied Sciences | VOL. 13
Wenwen Yue, et. al.Wenwen Yue ... Yongming Li
14 Aug 2023
Applied Sciences | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Unsupervised Method for Industrial Image Anomaly Detection with Vision Transformer-Based Autoencoder.

Abstract

Talk to us

Similar Papers

More From: Sensors