Long-range Dependence Research Articles

Ceramic substrates serve as the foundational material for numerous electronic devices, and their surface quality directly affects performance and longevity. Therefore, surface defect detection on ceramic substrates is an indispensable task during the manufacturing process. Ceramic substrate surface defects exhibit low contrast, along with intraclass difference and interclass similarity, posing substantial challenges for automated visual inspection. Existing deep learning-based methods utilize carefully crafted backbone networks and feature pyramid structures that perceive multilevel information to deal with the above challenges. However, these methods still suffer from insufficient extraction of discriminative features and low effectiveness in feature fusion, thereby hindering the detection performance. To address these issues, we propose the profound feature exploration and interaction network (PFEI-Net). Initially, we propose the discriminative feature mining backbone (DFM-backbone) to progressively explore the effective discriminative features. In the DFM-backbone, the global information grated perception module is devised to construct long-range feature dependencies and aggregate rich features, whereas the contextual semantic flexible extraction module is designed to meticulously perceive contextual semantic information in a targeted manner, enhancing the comprehensive representation of the discriminative features. Then, we propose the multipath semantic interaction guided-feature pyramid network (MSIG-FPN) to facilitate the fusion and interaction of the valuable information. In the MSIG-FPN, the hierarchical refocus–reaggregation module is constructed to refocus on task-relevant features and provide reliable guidance for the network, whereas the cross-stage semantic deep fusion module is designed to integrate multi-level contextual information deeply across stages and is employed to construct the semantic complementary paths, thereby preventing the loss of crucial features. Finally, built upon the backbone of DFM-backbone and MSIG-FPN, we establish the PFEI-Net, achieving accurate detection of surface defects on ceramic substrates. Experimental results highlight that the PFEI-Net achieves excellent performance with a remarkable 89.3 % mean average precision (mAP) and an inference time of 26.1 ms, which offers a promising and viable solution for automated surface defect detection of ceramic substrate.

Read full abstract

Convolutional neural networks (CNNs) have made a significant contribution to hyperspectral image (HSI) generation. However, capturing long-range dependencies can be challenging with CNNs due to the limitations of their local receptive fields, which can lead to distortions in fused images. Transformers excel at capturing long-range dependencies but have limited capacity for handling fine details. Additionally, priorwork has often overlooked the extraction of global features during the image preprocessing stage, resulting in the potential loss of fine details. To address these issues, we propose a hybrid cross-multiscale spectral-spatial Transformer (HCMSST) that combines the advantages of CNNs in feature extraction and Transformers in capturing long-range dependencies. To fully extract and retain local and global information in the shallow feature extraction phase, the network incorporatesCNNs with a staggered cascade-dense residual block (SCDRB). This block employs staggered residuals to establish direct connections bothwithin and between branches and integrates attention modules to enhance the response to important features. This approach facilitates unrestricted information exchange and fosters deeper feature representations. To address the limitationsof Transformer in processing fine details, we introduce multiscale spatial-spectral coding-decoding structures to obtain comprehensive spatial-spectral features, which are utilized to capture the long-range dependencies via the cross-multiscale spectral-spatial Transformer (CMSST). Further, the CMSST incorporates a cross-level dual-stream feature interaction strategy that integrates spatial and spectral features from different levels and then feeds the fused features back to their corresponding branches for information interaction. Experimental results indicate that the proposed HCMSST achieves superior performance compared to many state-of-the-art (SOTA) methods. Specifically, HCMSST reduces the ERGAS metric by 3.05% compared to the SOTA methods on the CAVE dataset, while on the Harvard dataset, it achieves a 2.69% reduction in ERGAS compared to the SOTA results.

Read full abstract

Long-range Dependence Research Articles

Related Topics

Articles published on Long-range Dependence

2S-SGCN: A two-stage stratified graph convolutional network model for facial landmark detection on 3D data

PFEI-Net: A profound feature exploration and interaction network for ceramic substrate surface defect detection

HTD-TS3: Weakly Supervised Hyperspectral Target Detection Based on Transformer via Spectral-Spatial Similarity.

A frequency channel-attention based vision Transformer method for bearing fault identification across different working conditions

A flexible 2.5D medical image segmentation approach with in-slice and cross-slice attention

Exploring drug-target interaction prediction on cold-start scenarios via meta-learning-based graph transformer.

Graph-enhanced visual representations and question-guided dual attention for visual question answering

A method based on hybrid cross-multiscale spectral-spatial transformer network for hyperspectral and multispectral image fusion

Pan-Mamba: Effective pan-sharpening with state space model

MIMO-Uformer: A Transformer-Based Image Deblurring Network for Vehicle Surveillance Scenarios

ACMamba: A State Space Model-Based Approach for Multi-Weather Degraded Image Restoration

TTMGNet: Tree Topology Mamba-Guided Network Collaborative Hierarchical Incremental Aggregation for Change Detection

A Feature-Driven Inception Dilated Network for Infrared Image Super-Resolution Reconstruction

A spatial hierarchical network learning framework for drug repositioning allowing interpretation from macro to micro scale

Efficient Metal Corrosion Area Detection Model Combining Convolution and Transformer

Multi-Modal LiDAR Point Cloud Semantic Segmentation with Salience Refinement and Boundary Perception

A newton interpolation network for smoke semantic segmentation

CNN-Informer: A hybrid deep learning model for seizure detection on long-term EEG

Global induced local network for infrared: dim small target detection

Multi-label dental disorder diagnosis based on MobileNetV2 and swin transformer using bagging ensemble classifier

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Long-range Dependence Research Articles

Related Topics

Articles published on Long-range Dependence

2S-SGCN: A two-stage stratified graph convolutional network model for facial landmark detection on 3D data

PFEI-Net: A profound feature exploration and interaction network for ceramic substrate surface defect detection

HTD-TS3: Weakly Supervised Hyperspectral Target Detection Based on Transformer via Spectral-Spatial Similarity.

A frequency channel-attention based vision Transformer method for bearing fault identification across different working conditions

A flexible 2.5D medical image segmentation approach with in-slice and cross-slice attention

Exploring drug-target interaction prediction on cold-start scenarios via meta-learning-based graph transformer.

Graph-enhanced visual representations and question-guided dual attention for visual question answering

A method based on hybrid cross-multiscale spectral-spatial transformer network for hyperspectral and multispectral image fusion

Pan-Mamba: Effective pan-sharpening with state space model

MIMO-Uformer: A Transformer-Based Image Deblurring Network for Vehicle Surveillance Scenarios

ACMamba: A State Space Model-Based Approach for Multi-Weather Degraded Image Restoration

TTMGNet: Tree Topology Mamba-Guided Network Collaborative Hierarchical Incremental Aggregation for Change Detection

A Feature-Driven Inception Dilated Network for Infrared Image Super-Resolution Reconstruction

A spatial hierarchical network learning framework for drug repositioning allowing interpretation from macro to micro scale

Efficient Metal Corrosion Area Detection Model Combining Convolution and Transformer

Multi-Modal LiDAR Point Cloud Semantic Segmentation with Salience Refinement and Boundary Perception

A newton interpolation network for smoke semantic segmentation

CNN-Informer: A hybrid deep learning model for seizure detection on long-term EEG

Global induced local network for infrared: dim small target detection

Multi-label dental disorder diagnosis based on MobileNetV2 and swin transformer using bagging ensemble classifier