ViT (Vision Transformer) is a model proposed by a Google team in 2020 that applies the transformer architecture to image classification. Although it was not the first work to apply transformers to visual tasks, its simple, effective, and scalable design made it a milestone in the application of transformers to computer vision and spurred a large body of follow-up research. The core conclusion of the original ViT paper is that, given sufficient pre-training data, ViT outperforms CNNs, overcoming the limitation imposed by the transformer's lack of inductive bias, and achieves better transfer performance on downstream tasks. When the training dataset is not large enough, however, ViT usually performs worse than ResNets of comparable size, because the transformer lacks the inductive bias (prior knowledge, i.e., assumptions built into the architecture) that CNNs possess. Through its innovative architecture and strong performance, ViT continues to advance the field of computer vision while still facing challenges and leaving room for improvement; as research deepens and the technology matures, ViT is expected to play a greater role in practical applications. This article explores the advantages and applicability of the ViT model and constructs a hybrid visual model, combining convolutional feature extraction with a transformer encoder, to improve generalization across different types of datasets, demonstrating the hybrid approach's value in improving ViT's performance.
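To make the hybrid idea concrete, the following is a minimal illustrative sketch in PyTorch, not the exact architecture used in this article: a small convolutional stem supplies the inductive bias by producing a feature map, and each spatial position of that map is treated as a token for a standard transformer encoder with a class token. The class name `HybridViT`, the stem layout, and all hyperparameters are assumptions chosen for brevity.

```python
import torch
import torch.nn as nn

class HybridViT(nn.Module):
    """Illustrative hybrid model: a CNN stem extracts a feature map whose
    spatial positions serve as tokens for a transformer encoder."""

    def __init__(self, num_classes=10, embed_dim=192, depth=4, num_heads=3):
        super().__init__()
        # Convolutional stem (provides CNN-style inductive bias):
        # a 32x32 input is downsampled twice to an 8x8 feature map.
        self.stem = nn.Sequential(
            nn.Conv2d(3, embed_dim // 2, kernel_size=3, stride=2, padding=1),
            nn.BatchNorm2d(embed_dim // 2),
            nn.ReLU(inplace=True),
            nn.Conv2d(embed_dim // 2, embed_dim, kernel_size=3, stride=2, padding=1),
        )
        num_tokens = 8 * 8  # spatial positions of the stem output for a 32x32 image
        self.cls_token = nn.Parameter(torch.zeros(1, 1, embed_dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, num_tokens + 1, embed_dim))
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=embed_dim, nhead=num_heads, dim_feedforward=embed_dim * 4,
            batch_first=True, norm_first=True,
        )
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=depth)
        self.head = nn.Linear(embed_dim, num_classes)

    def forward(self, x):
        feat = self.stem(x)                       # (B, C, H', W')
        tokens = feat.flatten(2).transpose(1, 2)  # (B, H'*W', C): one token per position
        cls = self.cls_token.expand(x.size(0), -1, -1)
        tokens = torch.cat([cls, tokens], dim=1) + self.pos_embed
        tokens = self.encoder(tokens)
        return self.head(tokens[:, 0])            # classify from the class token

# Example usage: a batch of four 32x32 RGB images mapped to class logits.
logits = HybridViT()(torch.randn(4, 3, 32, 32))
print(logits.shape)  # torch.Size([4, 10])
```

The key design point the sketch highlights is that the convolutional stem, rather than a plain linear patch projection, supplies local spatial priors, which is the usual motivation for hybrid CNN-transformer models on smaller datasets.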