Fusion Of Local Features Research Articles

Citrus fruits hold pivotal positions within the agricultural sector. Accurate yield estimation for citrus fruits is crucial in orchard management, especially when facing challenges of fruit occlusion due to dense foliage or overlapping fruits. This study addresses the issues of low detection accuracy and the significant instances of missed detections in citrus fruit detection algorithms, particularly in scenarios of occlusion. It introduces AG-YOLO, an attention-based network designed to fuse contextual information. Leveraging NextViT as its primary architecture, AG-YOLO harnesses its ability to capture holistic contextual information within nearby scenes. Additionally, it introduces a Global Context Fusion Module (GCFM), facilitating the interaction and fusion of local and global features through self-attention mechanisms, significantly improving the model’s occluded target detection capabilities. An independent dataset comprising over 8000 outdoor images was collected for the purpose of evaluating AG-YOLO’s performance. After a meticulous selection process, a subset of 957 images meeting the criteria for occlusion scenarios of citrus fruits was obtained. This dataset includes instances of occlusion, severe occlusion, overlap, and severe overlap, covering a range of complex scenarios. AG-YOLO demonstrated exceptional performance on this dataset, achieving a precision (P) of 90.6%, a mean average precision (mAP)@50 of 83.2%, and an mAP@50:95 of 60.3%. These metrics surpass existing mainstream object detection methods, confirming AG-YOLO’s efficacy. AG-YOLO effectively addresses the challenge of occlusion detection, achieving a speed of 34.22 frames per second (FPS) while maintaining a high level of detection accuracy. This speed of 34.22 FPS showcases a relatively faster performance, particularly evident in handling the complexities posed by occlusion challenges, while maintaining a commendable balance between speed and accuracy. AG-YOLO, compared to existing models, demonstrates advantages in high localization accuracy, minimal missed detection rates, and swift detection speed, particularly evident in effectively addressing the challenges posed by severe occlusions in object detection. This highlights its role as an efficient and reliable solution for handling severe occlusions in the field of object detection.

Gait recognition has received widespread attention due to its non-intrusive recognition mechanism. Currently, most gait recognition methods use appearance-based recognition methods, and such methods are easily affected by occlusions when facing complex environments, which in turn affects the recognition accuracy. With the maturity of pose estimation techniques, model-based gait recognition methods have received more and more attention due to their robustness in complex environments. However, the current model-based gait recognition methods mainly focus on modeling the global feature information in the spatial dimension, ignoring the importance of local features and their influence on recognition accuracy. Meanwhile, in the temporal dimension, these methods usually use single-scale temporal information extraction, which does not take into account the inconsistency of the motion cycles of the limbs when a human body is walking (e.g., arm swing and leg pace), leading to the loss of some limb temporal information. To solve these problems, we propose a gait recognition network based on a Global–Local Graph Convolutional Network, called GaitMGL. Specifically, we introduce a new spatio-temporal feature extraction module, MGL (Multi-scale Temporal and Global–Local Spatial Extraction Module), which consists of GLGCN (Global–Local Graph Convolutional Network) and MTCN (Multi-scale Temporal Convolutional Network). GLGCN models both global and local features, and extracts global–local motion information. MTCN, on the other hand, takes into account the inconsistency of local limb motion cycles, and facilitates multi-scale temporal convolution to capture the temporal information of limb motion. In short, our GaitMGL solves the problems of loss of local information and loss of temporal information at a single scale that exist in existing model-based gait recognition networks. We evaluated our method on three publicly available datasets, CASIA-B, Gait3D, and GREW, and the experimental results show that our method demonstrates surprising performance and achieves an accuracy of 63.12% in the dataset GREW, exceeding all existing model-based gait recognition networks.

Fusion Of Local Features Research Articles

Related Topics

Articles published on Fusion Of Local Features

Medical Image Segmentation with Dual-Encoding and Multi-Level Feature Adaptive Fusion

YOLOv5s fabric defect detection model with mixed attention mechanism

Fusion of deep and local gradient-based features for multimodal finger knuckle print identification

Composite descriptor based on contour and appearance for plant species identification

FE-FAIR: Feature-Enhanced Fused Attention for Image Super-Resolution

A graph convolutional network with dynamic weight fusion of multi-scale local features for diabetic retinopathy grading

Prediction of lncRNA and disease associations based on residual graph convolutional networks with attention mechanism

A New Automated Prognostic Prediction Method Based on Multi-Sequence Magnetic Resonance Imaging for Hepatic Resection of Colorectal Cancer Liver Metastases.

Hyper-S3NN: Spatial–spectral spiking neural network for hyperspectral image classification

Affine medical image registration with fusion feature mapping in local and global.

A hybrid ResNet-ViT approach to bridge the global and local features for myocardial infarction detection

Knee cartilage MR images segmentation based on multi-dimensional hybrid convolutional neural network

Volumetric Imitation Generative Adversarial Networks for Anatomical Human Body Modeling.

Aspect-level multimodal sentiment analysis based on co-attention fusion

AG-YOLO: A Rapid Citrus Fruit Detection Algorithm with Global Context Fusion

DeepmdQCT: A multitask network with domain invariant features and comprehensive attention mechanism for quantitative computer tomography diagnosis of osteoporosis

GaitMGL: Multi-Scale Temporal Dimension and Global–Local Feature Fusion for Gait Recognition

LGCDA: Predicting CircRNA-Disease Association Based on Fusion of Local and Global Features.

A Multi-Task Transformer with Local-Global Feature Interaction and Multiple Tumoral Region Guidance for Breast Cancer Diagnosis.

Attention Guided Food Recognition via Multi-Stage Local Feature Fusion

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Fusion Of Local Features Research Articles

Related Topics

Articles published on Fusion Of Local Features

Medical Image Segmentation with Dual-Encoding and Multi-Level Feature Adaptive Fusion

YOLOv5s fabric defect detection model with mixed attention mechanism

Fusion of deep and local gradient-based features for multimodal finger knuckle print identification

Composite descriptor based on contour and appearance for plant species identification

FE-FAIR: Feature-Enhanced Fused Attention for Image Super-Resolution

A graph convolutional network with dynamic weight fusion of multi-scale local features for diabetic retinopathy grading

Prediction of lncRNA and disease associations based on residual graph convolutional networks with attention mechanism

A New Automated Prognostic Prediction Method Based on Multi-Sequence Magnetic Resonance Imaging for Hepatic Resection of Colorectal Cancer Liver Metastases.

Hyper-S3NN: Spatial–spectral spiking neural network for hyperspectral image classification

Affine medical image registration with fusion feature mapping in local and global.

A hybrid ResNet-ViT approach to bridge the global and local features for myocardial infarction detection

Knee cartilage MR images segmentation based on multi-dimensional hybrid convolutional neural network

Volumetric Imitation Generative Adversarial Networks for Anatomical Human Body Modeling.

Aspect-level multimodal sentiment analysis based on co-attention fusion

AG-YOLO: A Rapid Citrus Fruit Detection Algorithm with Global Context Fusion

DeepmdQCT: A multitask network with domain invariant features and comprehensive attention mechanism for quantitative computer tomography diagnosis of osteoporosis

GaitMGL: Multi-Scale Temporal Dimension and Global–Local Feature Fusion for Gait Recognition

LGCDA: Predicting CircRNA-Disease Association Based on Fusion of Local and Global Features.

A Multi-Task Transformer with Local-Global Feature Interaction and Multiple Tumoral Region Guidance for Breast Cancer Diagnosis.

Attention Guided Food Recognition via Multi-Stage Local Feature Fusion