Feature Fusion Module Research Articles

Fruit maturity is the main factor affecting the quality and yield of Camellia oil. At present, accurate and efficient detection of the maturity level of Camellia oleifera fruits in orchards and enabling selective picking for harvesting robots is a crucial issue. Aiming at the challenge of low detection efficiency of Camellia oleifera fruit maturity and complex object detection models that are difficult to deploy to Camellia oleifera selective harvesting robots, a modified lightweight You Only Look Once model (YOLO-LM) based on YOLOv7-tiny is proposed. Specifically, three Criss-Cross Attention (CCA) modules are introduced to the backbone network to reduce the influence of leaves and branches occlusion as well as mutual occlusion between fruits. Then, the Adaptively Spatial Feature Fusion (ASFF) module is applied as the head to solve the limitation of inconsistency across different feature scales on the feature pyramid. Furthermore, the GSConv is introduced to replace the standard convolution of the Neck network to enable the model to simultaneously preserve model accuracy and reduce model complexity. Experiment results demonstrate that the precision, recall, mAP@0.5, parameters, FLOPs, and model size of YOLO-LM reached 93.96 %, 93.32 %, 93.18 %, 10.17 million, 19.46 G, and 19.82 MB, which outperformed YOLOv3, Faster R-CNN (VGG16), Faster R-CNN (ResNet50), YOLOv4, and YOLOv5m. Compared to YOLOv3-tiny, YOLOv4-tiny, YOLOv5s, YOLOv7-tiny, and YOLOv8s, the mAP@0.5 of YOLO-LM increased by 6.24 %, 6.11 %, 2.52 %, 2.45 %, and 0.57 %, respectively. Meanwhile, the parameters and model size of YOLO-LM are slightly higher than that of YOLOv3-tiny, YOLOv4-tiny, YOLOv5s and YOLOv7-tiny, and significantly lower than that of YOLOv8s. In addition, the detection heatmaps of YOLOv7-tiny and YOLO-LM demonstrate these optimization operations contribute to the enhanced performance of maturity detection for Camellia oleifera fruits within natural orchard environments. Overall, the YOLO-LM model can be used for the maturity detection of Camellia oleifera fruits in orchards, which is expected to provide theoretical reference for orchard yield estimation, growth monitoring, planting management optimization and the development of Camellia oleifera selective harvesting robots.

Black soil is a precious soil resource, yet it is severely affected by gully erosion, which is one of the most serious manifestations of land degradation. The determination of the location and shape of gullies is crucial for the work of gully erosion control. Traditional field measurement methods consume a large amount of human resources, so it is of great significance to use artificial intelligence techniques to automatically extract gullies from satellite remote sensing images. This study obtained the gully distribution map of the southwestern region of the Dahe Bay Farm in Inner Mongolia through field investigation and measurement and created a gully remote sensing dataset. We designed a multi-scale content structure feature extraction network to analyze remote sensing images and achieve automatic gully extraction. The multi-layer information obtained through the resnet34 network is input into the multi-scale structure extraction module and the multi-scale content extraction module designed by us, respectively, obtained richer intrinsic information about the image. We designed a structure content fusion network to further fuse structural features and content features and improve the depth of the model’s understanding of the image. Finally, we designed a muti-scale feature fusion module to further fuse low-level and high-level information, enhance the comprehensive understanding of the model, and improve the ability to extract gullies. The experimental results show that the multi-scale content structure feature extraction network can effectively avoid the interference of complex backgrounds in satellite remote sensing images. Compared with the classic semantic segmentation models, DeepLabV3+, PSPNet, and UNet, our model achieved the best results in several evaluation metrics, the F1 score, recall rate, and intersection over union (IoU), with an F1 score of 0.745, a recall of 0.777, and an IoU of 0.586. These results proved that our method is a highly automated and reliable method for extracting gullies from satellite remote sensing images, which simplifies the process of gully extraction and provides us with an accurate guide to locate the location of gullies, analyze the shape of gullies, and then provide accurate guidance for gully management.

Feature Fusion Module Research Articles

Related Topics

Articles published on Feature Fusion Module

S2VSNet: Single stage V-shaped network for image deraining & dehazing

HPIDN: A Hierarchical prior-guided iterative denoising network with global–local fusion for enhancing low-dose CT images

Skin Lesion Segmentation through Generative Adversarial Networks with Global and Local Semantic Feature Awareness

MLFEU-NET: A Multi-scale Low-level Feature Enhancement Unet for breast lesions segmentation in ultrasound images

BCL-Former: Localized Transformer Fusion with Balanced Constraint for polyp image segmentation

3D Point Cloud Semantic Segmentation based on Multi-scale Dense Nested Networks

TDAD: Self-supervised industrial anomaly detection with a two-stage diffusion model

Detection of Camellia oleifera fruit maturity in orchards based on modified lightweight YOLO

Exploiting K-Space in Magnetic Resonance Imaging Diagnosis: Dual-Path Attention Fusion for K-Space Global and Image Local Features.

A Multi-Scale Content-Structure Feature Extraction Network Applied to Gully Extraction

GPAC-YOLOv8: lightweight target detection for fire scenarios

A Chinese Nested Named Entity Recognition Model for Chicken Disease Based on Multiple Fine-Grained Feature Fusion and Efficient Global Pointer

Hybrid multiple instance learning network for weakly supervised medical image classification and localization

Specific Emitter Identification Algorithm Based on Time–Frequency Sequence Multimodal Feature Fusion Network

Multi-feature fusion dehazing based on CycleGAN

Prediction of mechanical properties of rolled steel based on dual-attention multiscale convolutional neural network

HPNet: Text Detection Network with Hybrid Attention and Pixel Aggregation for Irregularly-Shaped Nearby Texts

Efficient object detector via dynamic prior and dynamic feature fusion

Brain tumor segmentation by combining MultiEncoder UNet with wavelet fusion.

Structure-Guided Image Inpainting Based on Multi-Scale Attention Pyramid Network

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Feature Fusion Module Research Articles

Related Topics

Articles published on Feature Fusion Module

S2VSNet: Single stage V-shaped network for image deraining & dehazing

HPIDN: A Hierarchical prior-guided iterative denoising network with global–local fusion for enhancing low-dose CT images

Skin Lesion Segmentation through Generative Adversarial Networks with Global and Local Semantic Feature Awareness

MLFEU-NET: A Multi-scale Low-level Feature Enhancement Unet for breast lesions segmentation in ultrasound images

BCL-Former: Localized Transformer Fusion with Balanced Constraint for polyp image segmentation

3D Point Cloud Semantic Segmentation based on Multi-scale Dense Nested Networks

TDAD: Self-supervised industrial anomaly detection with a two-stage diffusion model

Detection of Camellia oleifera fruit maturity in orchards based on modified lightweight YOLO

Exploiting K-Space in Magnetic Resonance Imaging Diagnosis: Dual-Path Attention Fusion for K-Space Global and Image Local Features.

A Multi-Scale Content-Structure Feature Extraction Network Applied to Gully Extraction

GPAC-YOLOv8: lightweight target detection for fire scenarios

A Chinese Nested Named Entity Recognition Model for Chicken Disease Based on Multiple Fine-Grained Feature Fusion and Efficient Global Pointer

Hybrid multiple instance learning network for weakly supervised medical image classification and localization

Specific Emitter Identification Algorithm Based on Time–Frequency Sequence Multimodal Feature Fusion Network

Multi-feature fusion dehazing based on CycleGAN

Prediction of mechanical properties of rolled steel based on dual-attention multiscale convolutional neural network

HPNet: Text Detection Network with Hybrid Attention and Pixel Aggregation for Irregularly-Shaped Nearby Texts

Efficient object detector via dynamic prior and dynamic feature fusion

Brain tumor segmentation by combining MultiEncoder UNet with wavelet fusion.

Structure-Guided Image Inpainting Based on Multi-Scale Attention Pyramid Network