Complex Background Environment Research Articles

Semantic segmentation of remote sensing images is a significant research direction in digital image processing. Complex backgrounds, irregular object sizes and shapes, and the similar appearance of different categories make remote sensing image segmentation challenging. Traditional convolutional-neural-network-based models often ignore spatial information in the feature extraction stage and pay little attention to global context. Because spatial context is important in complex remote sensing images, the segmentation quality of these models leaves room for improvement. In addition, neural networks with superior segmentation performance often consume substantial computational resources. To address these issues, this paper proposes a model that combines a modified multiscale deformable convolutional neural network (mmsDCNN) with a dense conditional random field (DenseCRF). First, we designed the mmsDCNN, a lightweight multiscale deformable convolutional network with a large receptive field, to generate a preliminary prediction probability for each pixel. Its output is a coarse segmentation map that has the same size as the input image and contains rich multiscale features. Then, we propose a multi-level DenseCRF model, defined at both the superpixel level and the pixel level, which makes full use of the image's context information at different levels and further refines the coarse mmsDCNN result. Specifically, we converted the pixel-level preliminary probability map into a superpixel-level predicted probability map using the simple linear iterative clustering (SLIC) algorithm and defined the potential function of the DenseCRF model on this basis. Furthermore, we added a pixel-level potential function constraint term to the superpixel-based Gaussian potential function to obtain a combined Gaussian potential function, which lets the model consider features at various scales and prevents poor superpixel segmentation from degrading the final result. To restore object contours more clearly, we used the Sketch Tokens edge detection algorithm to extract edge contour features of the image and fused them into the potential function of the DenseCRF model. Finally, extensive experiments on the Potsdam and Vaihingen datasets demonstrated that the proposed model exhibited significant advantages compared to current state-of-the-art models.
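
The conversion from a pixel-level probability map to a superpixel-level one can be illustrated with a small sketch. The abstract does not say how probabilities are aggregated within a superpixel, so the per-class mean used here is an assumption, and the function name `superpixel_probabilities` and the toy inputs are hypothetical. The superpixel label map is taken as given; in practice it would come from a SLIC implementation.

```python
def superpixel_probabilities(prob_map, labels):
    """Average per-pixel class probabilities inside each superpixel.

    prob_map: H x W x C nested lists of class probabilities.
    labels:   H x W nested lists of superpixel ids (e.g. produced by SLIC).
    Returns {superpixel_id: [mean probability per class]}.
    Mean pooling is an assumption, not the paper's stated rule.
    """
    sums = {}
    counts = {}
    for prob_row, label_row in zip(prob_map, labels):
        for probs, sp in zip(prob_row, label_row):
            if sp not in sums:
                sums[sp] = [0.0] * len(probs)
                counts[sp] = 0
            for c, p in enumerate(probs):
                sums[sp][c] += p
            counts[sp] += 1
    return {sp: [s / counts[sp] for s in total] for sp, total in sums.items()}


# Toy 2x2 image with two classes and two superpixels
# (left column is superpixel 0, right column is superpixel 1).
prob_map = [
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.7, 0.3], [0.4, 0.6]],
]
labels = [
    [0, 1],
    [0, 1],
]
sp_probs = superpixel_probabilities(prob_map, labels)
```

Each superpixel then carries a single class distribution, which is the kind of quantity a superpixel-level potential function could be defined on.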

Natural breeding scenes are characterized by a large number of cows, complex lighting, and a complex background environment, which makes dairy cow estrus behavior detection difficult. Existing research on cow estrus behavior detection works well in ideal environments with a small number of cows but has low inference speed and accuracy in natural scenes. To improve the inference speed and accuracy of cow estrus behavior detection in natural scenes, this paper proposes a detection method based on an improved YOLOv5 with stronger detection ability for complex environments and multi-scale objects. First, the atrous spatial pyramid pooling (ASPP) module is employed to optimize the YOLOv5l network at multiple scales, which improves the model’s receptive field and its ability to perceive global contextual multiscale information. Second, a cow estrus behavior detection model is constructed by combining the channel-attention mechanism and a deep-asymmetric-bottleneck module. Last, K-means clustering is performed to obtain new anchors, and complete intersection over union (CIoU) is used to introduce the relative ratio between the predicted box and the true box of the cow mounting into the box regression function, improving the scale invariance of the model. Multiple cameras were installed in a natural breeding scene containing 200 cows to capture videos of cows mounting. A total of 2668 images were obtained from 115 videos of cow mounting events for the training set, and 675 images were obtained from 29 videos of cow mounting events for the test set. The training set was augmented by the mosaic method to increase the diversity of the dataset. The experimental results show that the improved model achieved an average accuracy of 94.3%, a precision of 97.0%, and a recall of 89.5%, all higher than those of mainstream models such as YOLOv5, YOLOv3, and Faster R-CNN. The ablation experiments show that the ASPP, new anchors, C3SAB, and C3DAB designed in this study can improve the accuracy of the model by 5.9%. Furthermore, the model had the highest accuracy when the ASPP dilated convolution was set to (1,5,9,13) and the loss function was set to CIoU. Class activation maps were used to visualize the model’s feature extraction results and to explain the model’s region of interest for cow images in natural scenes, which demonstrates the effectiveness of the model. The proposed model can therefore improve the accuracy of cow estrus event detection. Additionally, the model’s inference speed was 71 frames per second (fps), which meets the requirements of fast and accurate detection of cow estrus events in natural scenes and all-weather conditions.
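
For readers unfamiliar with CIoU, the following is a minimal sketch of the commonly used definition: IoU minus a normalized center-distance term minus an aspect-ratio consistency term. It is not taken from the paper's code. The function name `ciou`, the `(x1, y1, x2, y2)` box format, and the `eps` stabilizer are assumptions made for illustration.

```python
import math


def ciou(box_a, box_b, eps=1e-9):
    """Complete IoU between two boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    aw, ah = ax2 - ax1, ay2 - ay1
    bw, bh = bx2 - bx1, by2 - by1

    # Plain IoU from the overlap rectangle.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = aw * ah + bw * bh - inter
    iou = inter / (union + eps)

    # Squared distance between box centers.
    rho2 = ((ax1 + ax2) / 2 - (bx1 + bx2) / 2) ** 2 \
         + ((ay1 + ay2) / 2 - (by1 + by2) / 2) ** 2

    # Squared diagonal of the smallest box enclosing both.
    cw = max(ax2, bx2) - min(ax1, bx1)
    ch = max(ay2, by2) - min(ay1, by1)
    c2 = cw ** 2 + ch ** 2 + eps

    # Aspect-ratio consistency term and its trade-off weight.
    v = (4 / math.pi ** 2) * (math.atan(bw / (bh + eps)) - math.atan(aw / (ah + eps))) ** 2
    alpha = v / (1 - iou + v + eps)

    return iou - rho2 / c2 - alpha * v


same = ciou((0, 0, 2, 2), (0, 0, 2, 2))      # identical boxes
apart = ciou((0, 0, 1, 1), (2, 2, 3, 3))     # disjoint boxes
```

A regression loss is typically taken as `1 - ciou(pred, target)`. Unlike plain IoU, the value stays informative for non-overlapping boxes, because the center-distance term still changes as the boxes move apart.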

Articles published on Complex Background Environment

Semantic Segmentation of Remote Sensing Imagery Based on Multiscale Deformable CNN and DenseCRF

Application of FSVD Algorithm to Airborne Gamma Detection of Trace Radionuclides in the Process of a High Radon Background

A High-Precision Detection Method of Apple Leaf Diseases Using Improved Faster R-CNN

A novel Dynahead-Yolo neural network for the detection of landslides with variable proportions using remote sensing images

Segmentation and Evaluation of Crack Image From Aircraft Fuel Tank via Atrous Spatial Pyramid Fusion and Hybrid Attention Network

An Improved Differentiable Binarization Network for Natural Scene Street Sign Text Detection

Research on Grape-Planting Structure Perception Method Based on Unmanned Aerial Vehicle Multispectral Images in the Field

Seedling maize counting method in complex backgrounds based on YOLOV5 and Kalman filter tracking algorithm

Adaptive CFAR Method for SAR Ship Detection Using Intensity and Texture Feature Fusion Attention Contrast Mechanism

Dandelion segmentation with background transfer learning and RGB-attention module

Power Line Scene Recognition Based on Convolutional Capsule Network with Image Enhancement

Study on adaptive infrared camouflage of novel positive temperature coefficient (PTC) materials in space

Detection Method of Cow Estrus Behavior in Natural Scenes Based on Improved YOLOv5

Research on SLAM Road Sign Observation Based on Particle Filter

Improved YOLOv5 network method for remote sensing image-based ground objects recognition

Fusion of RGB, optical flow and skeleton features for the detection of lameness in dairy cows

Vision-Based Power Line Segmentation With an Attention Fusion Network

Intelligent Image Saliency Detection Method Based on Convolution Neural Network Combining Global and Local Information

Optimization of Artistic Image Segmentation Algorithm Based on Feed Forward Neural Network under Complex Background Environment

Quantum Transfer Learning Approach for Deepfake Detection
