Fusion Performance Research Articles

Abstract This paper designs a lightweight high-precision transmission line component detection model, named grouped dense, monotonic self-regularized, and partial faster convolution, pruning, and distillation optimized—you only look once (GMPPD-YOLO), in transmission line inspection. It addresses the issue of low detection accuracy of target detection algorithms due to the complex background, large differences in target shape, location, texture, etc, as well as diversified and smaller defects in insulator and vibration hammer images taken by unmanned aerial vehicles from multiple angles. To enhance the model’s feature extraction capabilities in complex backgrounds and across different scales, the grouped dense C3 dense feature extraction module was designed, enabling the model to more effectively handle diverse defect forms. Simultaneously, the monotonic self-regularized pyramid pooling–fast (MSPPF) module is proposed to enhance the model’s capability to process multi-scale information. Additionally, the partial-faster C3 feature awareness module is designed to improve feature fusion performance, enhancing the model’s ability to perceive features at different scales. Finally, channel pruning was used to reduce redundant parameters, and knowledge distillation was employed to compensate for the accuracy loss caused by pruning. This approach further compressed the model size while ensuring its detection performance. The experimental results demonstrate that compared to the original YOLOv5s algorithm, the proposed GMPPD-YOLO algorithm achieves a reduction in parameters by 68.4%, a decrease in Giga floating-point operations per second by 58.2%, and a reduction in the model size by 66.4%, while achieving an increase in precision by 1%, mAP50 by 1.1%, and mAP95 by 0.4%. This confirms the significant potential of the GMPPD-YOLO algorithm for deployment in real-time drone-based power transmission line inspections.

Read full abstract

Multimodal Sentiment Analysis (MSA) holds extensive applicability owing to its capacity to analyze and interpret users' emotions, feelings, and perspectives by integrating complementary information from multiple modalities. However, inefficient and unbalanced cross-modal information fusion substantially undermines the accuracy and reliability of MSA models. Consequently, a critical challenge in the field now lies in effectively assessing the information integration capabilities of these models to ensure balanced and equitable processing of multimodal data. In this paper, a Disentanglement-based Variable Auto-Encoder (DVAE) is proposed for systematically assessing fusion performance and investigating the factors that facilitate multimodal fusion. Specifically, a distribution constraint module is presented to decouple the fusion matrices and generate multiple low-dimensional and trustworthy disentangled latent vectors that adhere to the authentic unimodal input distribution. In addition, a combined loss term is modified to effectively balance inductive bias, signal reconstruction, and distribution constraint items to facilitate the optimization of neural network weights and parameters. Utilizing the proposed evaluation method, we can evaluate the fusion performance of multimodal models by contrasting the classification degradation ratio derived from disentangled hidden representations and joint representations. Experiments conducted with eight state-of-the-art multimodal fusion methods on the CMU-MOSEI and CMU-MOSEI benchmark datasets demonstrate that DVAE is capable of effectively evaluating the effects of multimodal fusion. Moreover, the comparative experimental results indicate that the equalizing effect among various advanced mechanisms in multimodal sentiment analysis, as well as the single-peak characteristic of the ground label distribution, both contribute significantly to multimodal data fusion.

Read full abstract

Fusion Performance Research Articles

Related Topics

Articles published on Fusion Performance

MEEAFusion: Multi-Scale Edge Enhancement and Joint Attention Mechanism Based Infrared and Visible Image Fusion.

Fault detection method for transmission line components based on lightweight GMPPD-YOLO

Flat-top plasma operational space of the STEP power plant

High-Cycle Fatigue Performance of Laser Powder Bed Fusion Ti-6Al-4V Alloy with Inherent Internal Defects: A Critical Literature Review

Enhancing three-source cross-modality image fusion with improved DenseNet for infrared polarization and visible light images

LFDT-Fusion: A latent feature-guided diffusion Transformer model for general image fusion

IMQFusion: Infrared and visible image fusion via implicit multi-resolution preservation and query aggregation

Disentangled variational auto-encoder for multimodal fusion performance analysis in multimodal sentiment analysis

One improved YOLOX-s algorithm for lightweight section-steel surface defect detection

Biomimetic Porous Ti6Al4V Implants: A Novel Interbody Fusion Cage via Gel-Casting Technique to Promote Spine Fusion.

Observation of alpha-particles in recent D–T experiments on JET

Verification of the Laser Powder Bed Fusion Performance of 2024 Aluminum Alloys Modified Using Nano-LaB6.

YOLOv8 Model for Weed Detection in Wheat Fields Based on a Visual Converter and Multi-Scale Feature Fusion.

CUFNet: A fusion network based on cross-reconstruction uniqueness for visible and infrared images

Performance of an inertial electrostatic confinement fusion device having a multi-grid configuration

Build Orientation Effect on Bending Fatigue Performance and Impact Toughness of Laser Powder Bed Fusion Manufactured Ti6Al4V Without Heat Treatment

ReLAP-Net: Residual Learning and Attention Based Parallel Network for Hyperspectral and Multispectral Image Fusion

A Novel Model for Instance Segmentation and Quantification of Bridge Surface Cracks-The YOLOv8-AFPN-MPD-IoU.

Axial confinement of wire array Z-pinch precursor plasmas by a pulsed magnetic mirror field

Comparative study on the process, anisotropy, and mechanical performance of laser powder bed fusion fabricated truss-lattice structures with different unit cell designs

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Fusion Performance Research Articles

Related Topics

Articles published on Fusion Performance

MEEAFusion: Multi-Scale Edge Enhancement and Joint Attention Mechanism Based Infrared and Visible Image Fusion.

Fault detection method for transmission line components based on lightweight GMPPD-YOLO

Flat-top plasma operational space of the STEP power plant

High-Cycle Fatigue Performance of Laser Powder Bed Fusion Ti-6Al-4V Alloy with Inherent Internal Defects: A Critical Literature Review

Enhancing three-source cross-modality image fusion with improved DenseNet for infrared polarization and visible light images

LFDT-Fusion: A latent feature-guided diffusion Transformer model for general image fusion

IMQFusion: Infrared and visible image fusion via implicit multi-resolution preservation and query aggregation

Disentangled variational auto-encoder for multimodal fusion performance analysis in multimodal sentiment analysis

One improved YOLOX-s algorithm for lightweight section-steel surface defect detection

Biomimetic Porous Ti6Al4V Implants: A Novel Interbody Fusion Cage via Gel-Casting Technique to Promote Spine Fusion.

Observation of alpha-particles in recent D–T experiments on JET

Verification of the Laser Powder Bed Fusion Performance of 2024 Aluminum Alloys Modified Using Nano-LaB6.

YOLOv8 Model for Weed Detection in Wheat Fields Based on a Visual Converter and Multi-Scale Feature Fusion.

CUFNet: A fusion network based on cross-reconstruction uniqueness for visible and infrared images

Performance of an inertial electrostatic confinement fusion device having a multi-grid configuration

Build Orientation Effect on Bending Fatigue Performance and Impact Toughness of Laser Powder Bed Fusion Manufactured Ti6Al4V Without Heat Treatment

ReLAP-Net: Residual Learning and Attention Based Parallel Network for Hyperspectral and Multispectral Image Fusion

A Novel Model for Instance Segmentation and Quantification of Bridge Surface Cracks-The YOLOv8-AFPN-MPD-IoU.

Axial confinement of wire array Z-pinch precursor plasmas by a pulsed magnetic mirror field

Comparative study on the process, anisotropy, and mechanical performance of laser powder bed fusion fabricated truss-lattice structures with different unit cell designs