Deep-learning-based segmentation of pancreatic cancer pathology images can effectively assist pathologists and thereby improve treatment outcomes. However, compared to conventional image segmentation tasks, the large tissue structures in pathology images demand a larger receptive field. Methods based on dilated convolutions or attention mechanisms can enlarge the receptive field, but they fail to capture long-range feature dependencies, while directly applying self-attention to capture such dependencies incurs prohibitive computational complexity. To address these challenges, we introduce a channel and spatial self-attention (CS) module that efficiently captures both channel-wise and spatial long-range feature dependencies in pancreatic cancer pathology images. Specifically, the CS module consists of an adaptive channel self-attention module and a window-shift spatial self-attention module: the former adaptively pools features to a fixed size to capture long-range channel dependencies, while the latter captures spatial long-range dependencies in a window-based manner. Additionally, we propose a re-weighted cross-entropy loss to mitigate the performance impact of the long-tailed class distribution. Our method surpasses state-of-the-art approaches on both our Pancreatic Cancer Pathology Image (PCPI) dataset and the GlaS challenge dataset, achieving an mDice of 73.93% and an mIoU of 59.42% on PCPI.
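The re-weighting idea behind the proposed loss can be sketched as follows. The abstract does not state the exact weighting formula, so this minimal NumPy sketch assumes a common inverse-class-frequency scheme (weights normalized to sum to the number of classes); the paper's actual scheme may differ.

```python
import numpy as np

def reweighted_cross_entropy(logits, labels, class_counts):
    """Cross-entropy where each class weight is the inverse of its
    frequency, normalized so the weights sum to the number of classes.
    Illustrative sketch; not the paper's exact formulation."""
    freq = class_counts / class_counts.sum()
    weights = 1.0 / freq
    weights = weights / weights.sum() * len(weights)
    # Numerically stable softmax over the class dimension.
    z = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(z) / np.exp(z).sum(axis=1, keepdims=True)
    n = labels.shape[0]
    # Rare (tail) classes receive larger weights, so errors on them
    # contribute more to the loss.
    losses = -weights[labels] * np.log(probs[np.arange(n), labels] + 1e-12)
    return losses.mean()

# Toy long-tailed setup: class 0 is common (90 pixels), class 1 is rare (10).
counts = np.array([90.0, 10.0])
logits = np.array([[2.0, 0.5],
                   [0.2, 1.5]])
labels = np.array([0, 1])
loss = reweighted_cross_entropy(logits, labels, counts)
```

With these counts, the rare class receives a weight of 1.8 versus 0.2 for the common class, countering the tendency of an unweighted loss to be dominated by the head class.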