Multi-scale Features Research Articles

In scenarios where global navigation satellite systems (GNSSs) and radio navigation systems are denied, vision-based autonomous landing (VAL) for fixed-wing unmanned aerial vehicles (UAVs) becomes essential. Accurate and real-time runway detection in VAL is vital for providing precise positional and orientational guidance. However, existing research faces significant challenges, including insufficient accuracy, inadequate real-time performance, poor robustness, and high susceptibility to disturbances. To address these challenges, this paper introduces a novel single-stage, anchor-free, and decoupled vision-based runway detection framework, referred to as YOLO-RWY. First, an enhanced data augmentation (EDA) module is incorporated to perform various augmentations, enriching image diversity, and introducing perturbations that improve generalization and safety. Second, a large separable kernel attention (LSKA) module is integrated into the backbone structure to provide a lightweight attention mechanism with a broad receptive field, enhancing feature representation. Third, the neck structure is reorganized as a bidirectional feature pyramid network (BiFPN) module with skip connections and attention allocation, enabling efficient multi-scale and across-stage feature fusion. Finally, the regression loss and task-aligned learning (TAL) assigner are optimized using efficient intersection over union (EIoU) to improve localization evaluation, resulting in faster and more accurate convergence. Comprehensive experiments demonstrate that YOLO-RWY achieves AP50:95 scores of 0.760, 0.611, and 0.413 on synthetic, real nominal, and real edge test sets of the landing approach runway detection (LARD) dataset, respectively. Deployment experiments on an edge device show that YOLO-RWY achieves an inference speed of 154.4 FPS under FP32 quantization with an image size of 640. The results indicate that the proposed YOLO-RWY model possesses strong generalization and real-time capabilities, enabling accurate runway detection in complex and challenging visual environments, and providing support for the onboard VAL systems of fixed-wing UAVs.

Artificial intelligence (AI) empowered edge computing has given rise to a new paradigm and effectively facilitated the promotion and development of multimedia applications. The speech assistant is one of the significant services provided by multimedia applications, which aims to offer intelligent interactive experiences between humans and machines. However, malicious attackers may exploit spoofed speeches to deceive speech assistants, posing great challenges to the security of multimedia applications. The limited resources of multimedia terminal devices hinder their ability to effectively load speech spoofing detection models. Furthermore, processing and analyzing speech in the cloud can result in poor real-time performance and potential privacy risks. Existing speech spoofing detection methods rely heavily on annotated data and exhibit poor generalization capabilities for unseen spoofed speeches. To address these challenges, this paper first proposes the Coordinate Attention Network (CA2Net) that consists of coordinate attention blocks and Res2Net blocks. CA2Net can simultaneously extract temporal and spectral speech feature information and represent multi-scale speech features at a granularity level. Besides, a contrastive learning-based speech spoofing detection framework named GEMINI is proposed. GEMINI can be effectively deployed on edge nodes and autonomously learn speech features with strong generalization capabilities. GEMINI first performs data augmentation on speech signals and extracts conventional acoustic features to enhance the feature robustness. Subsequently, GEMINI utilizes the proposed CA2Net to further explore the discriminative speech features. Then, a tensor-based multi-attention comparison model is employed to maximize the consistency between speech contexts. GEMINI continuously updates CA2Net with contrastive learning, which enables CA2Net to effectively represent speech signals and accurately detect spoofed speeches. Extensive experiments on the ASVspoof2019 dataset show that GEMINI reduces the Equal Error Rate and tandem Detection Cost Function by up to 96.75% and 96.35% in the physical access scenario, and by up to 86.62% and 87.71% in the logical access scenario compared to peer methods.

Multi-scale Features Research Articles

Related Topics

Articles published on Multi-scale Features

Self-supervised monocular depth estimation based on improved dense network and wavelet decomposition

AI-based automated breast cancer segmentation in ultrasound imaging based on Attention Gated Multi ResU-Net

Welding Defect Monitoring Based on Multi-Scale Feature Fusion of Molten Pool Videos.

A Study on Enhancing the Visual Fidelity of Aviation Simulators Using WGAN-GP for Remote Sensing Image Color Correction

CEDNet: A cascade encoder–decoder network for dense prediction

Multistage Spatial-Spectral Fusion Network for Spectral Super-Resolution.

YOLO-RWY: A Novel Runway Detection Model for Vision-Based Autonomous Landing of Fixed-Wing Unmanned Aerial Vehicles

Contour Detection Algorithm for α Phase Structure of TB6 Titanium Alloy fused with Multi-Scale Fretting Features

An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion

Semantic Segmentation of Satellite Images for Landslide Detection Using Foreground-Aware and Multi-Scale Convolutional Attention Mechanism.

New Energy Vehicle Tire Defect Detection Algorithm Based on Improved YOLOv8

Classification and extraction method of hidden dangers along railway lines based on semantic segmentation network

MSE-TCN: Multi-scale temporal convolutional network with channel attention for open-set gas classification

A Lightweight Strip Steel Surface Defect Detection Network Based on Improved YOLOv8

ZeroD-fender: A Resource-aware IoT Malware Detection Engine via Fine-grained Side-channel Analysis

Sparse-View Photoacoustic Reconstruction Method for Diabetic Retinopathy Using Feature Fusion Network.

Image Super-Resolution Reconstruction Based on Dense Residual Attention and Multi-Scale Feature Fusion

Multi-Scale Feature Attention Fusion for Image Splicing Forgery Detection

DLCH-YOLO: An Object Detection Algorithm for Monitoring the Operation Status of Circuit Breakers in Power Scenarios

Contrastive Learning based Speech Spoofing Detection for Multimedia Security in Edge Intelligence

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-scale Features Research Articles

Related Topics

Articles published on Multi-scale Features

Self-supervised monocular depth estimation based on improved dense network and wavelet decomposition

AI-based automated breast cancer segmentation in ultrasound imaging based on Attention Gated Multi ResU-Net

Welding Defect Monitoring Based on Multi-Scale Feature Fusion of Molten Pool Videos.

A Study on Enhancing the Visual Fidelity of Aviation Simulators Using WGAN-GP for Remote Sensing Image Color Correction

CEDNet: A cascade encoder–decoder network for dense prediction

Multistage Spatial-Spectral Fusion Network for Spectral Super-Resolution.

YOLO-RWY: A Novel Runway Detection Model for Vision-Based Autonomous Landing of Fixed-Wing Unmanned Aerial Vehicles

Contour Detection Algorithm for α Phase Structure of TB6 Titanium Alloy fused with Multi-Scale Fretting Features

An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion

Semantic Segmentation of Satellite Images for Landslide Detection Using Foreground-Aware and Multi-Scale Convolutional Attention Mechanism.

New Energy Vehicle Tire Defect Detection Algorithm Based on Improved YOLOv8

Classification and extraction method of hidden dangers along railway lines based on semantic segmentation network

MSE-TCN: Multi-scale temporal convolutional network with channel attention for open-set gas classification

A Lightweight Strip Steel Surface Defect Detection Network Based on Improved YOLOv8

ZeroD-fender: A Resource-aware IoT Malware Detection Engine via Fine-grained Side-channel Analysis

Sparse-View Photoacoustic Reconstruction Method for Diabetic Retinopathy Using Feature Fusion Network.

Image Super-Resolution Reconstruction Based on Dense Residual Attention and Multi-Scale Feature Fusion

Multi-Scale Feature Attention Fusion for Image Splicing Forgery Detection

DLCH-YOLO: An Object Detection Algorithm for Monitoring the Operation Status of Circuit Breakers in Power Scenarios

Contrastive Learning based Speech Spoofing Detection for Multimedia Security in Edge Intelligence