ResNet Module Research Articles

Owing to their small size, light weight, and simple operation, unmanned aerial vehicles (UAVs) have been widely used, and capturing high-resolution aerial images in a variety of environments is becoming increasingly convenient. Existing target-detection methods for UAV aerial images perform poorly when faced with challenges such as small targets, dense arrangement, sparse distribution, and complex backgrounds. To address these problems, we propose several improvements to YOLOv5l. Specifically, three feature-extraction modules based on asymmetric convolutions are proposed: the Asymmetric ResNet (ASResNet) module, the Asymmetric Enhanced Feature Extraction (AEFE) module, and the Asymmetric Res2Net (ASRes2Net) module. According to the characteristics of each module, the residual blocks at different positions in the YOLOv5 backbone were replaced accordingly. An Improved Efficient Channel Attention (IECA) module was added after Focus, and Group Spatial Pyramid Pooling (GSPP) was used to replace the Spatial Pyramid Pooling (SPP) module. In addition, the K-Means++ algorithm was used to obtain more accurate anchor boxes, and the new EIOU-NMS method was used to improve the postprocessing ability of the model. Finally, ablation experiments, comparative experiments, and visualization of results were performed on five datasets: CIFAR-10, PASCAL VOC, VEDAI, VisDrone 2019, and Forklift. These verified the effectiveness of the improved strategies and the superiority of the proposed method (YOLO-UAV). Compared with YOLOv5l, the backbone of the proposed method increased the top-1 accuracy of the classification task by 7.20% on the CIFAR-10 dataset. The mean average precision (mAP) of the proposed method on the four object-detection datasets was improved by 5.39%, 5.79%, 4.46%, and 8.90%, respectively.
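The abstract does not spell out how EIOU-NMS works, so the following is only a rough sketch under stated assumptions: it takes the commonly cited EIoU formulation (IoU minus center-distance, width, and height penalties normalized by the smallest enclosing box) and uses it as the overlap measure in ordinary greedy non-maximum suppression. The function names `eiou` and `eiou_nms`, the `(x1, y1, x2, y2)` box format, and the 0.5 threshold are illustrative choices, not taken from the paper.

```python
def eiou(a, b):
    """EIoU-style similarity between two boxes in (x1, y1, x2, y2) format.

    Assumed formulation: IoU minus three penalties (center distance,
    width difference, height difference), each normalized by the
    smallest box enclosing both inputs.
    """
    ax1, ay1, ax2, ay2 = a
    bx1, by1, bx2, by2 = b

    # Plain intersection over union.
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    iou = inter / union if union > 0 else 0.0

    # Width and height of the smallest enclosing box.
    cw = max(ax2, bx2) - min(ax1, bx1)
    ch = max(ay2, by2) - min(ay1, by1)
    if cw <= 0 or ch <= 0:
        return iou  # degenerate boxes: no meaningful penalty terms

    # Differences in center position, width, and height.
    dx = ((ax1 + ax2) - (bx1 + bx2)) / 2.0
    dy = ((ay1 + ay2) - (by1 + by2)) / 2.0
    dw = (ax2 - ax1) - (bx2 - bx1)
    dh = (ay2 - ay1) - (by2 - by1)

    center_penalty = (dx * dx + dy * dy) / (cw * cw + ch * ch)
    width_penalty = (dw * dw) / (cw * cw)
    height_penalty = (dh * dh) / (ch * ch)
    return iou - center_penalty - width_penalty - height_penalty


def eiou_nms(boxes, scores, threshold=0.5):
    """Greedy NMS that suppresses a box when its EIoU with a kept box exceeds threshold.

    Returns the indices of the kept boxes, highest score first.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(eiou(boxes[i], boxes[k]) <= threshold for k in keep):
            keep.append(i)
    return keep


# Hypothetical detections: two heavily overlapping boxes and one far away.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]
kept = eiou_nms(boxes, scores)
```

Because the penalties only ever lower the score relative to plain IoU, two boxes with the same overlap but different centers or shapes are less likely to be merged, which is one plausible reason such a measure could help with densely arranged targets.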

Semantic image segmentation based on neural networks is increasingly used in computer systems for various purposes. Despite significant successes in this field, one of the most important unsolved problems is determining the type and parameters of the convolutional neural networks that form the encoder and decoder. This research developed a procedure that adapts the neural network encoder and decoder to the following conditions of the segmentation problem: image size; number of color channels; minimum permissible segmentation accuracy; maximum permissible computational complexity of segmentation; the need to label segments; the need to select several segments; the need to select deformed, displaced, and rotated objects; maximum permissible computational complexity of training the neural network model; and permissible training period of the neural network model. Implementing the procedure consists of forming the basic mathematical support, constructing the main blocks, and building the general scheme of the procedure. The procedure was verified experimentally on examples of semantic segmentation of images containing objects such as cars. The experimental results show that the proposed procedure makes it possible, without complex long-term experiments, to build a neural network model that, with a sufficiently short training period, achieves an image segmentation accuracy of about 0.8, which corresponds to the best systems of a similar purpose. Further research on improving the methodological support of neural network segmentation of raster images should focus on the justified use of modern modules and mechanisms in the encoder and decoder, adapted to the significant conditions of the given task. For example, the ResNet module allows the depth of the neural network to be increased by mitigating the vanishing-gradient effect, and the Inception module reduces the number of weight coefficients and supports processing objects of different sizes.
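The point about depth and gradients can be illustrated with a toy scalar calculation. This is a minimal sketch, not the architecture from the article: it assumes each layer has a small local derivative (0.01 here, an arbitrary illustrative value) and compares the gradient through a plain stack, where derivatives multiply directly, with the gradient through residual blocks of the form y = x + F(x), where each block contributes a factor of 1 + F'(x).

```python
def plain_gradient(local_grads):
    """Gradient of output w.r.t. input for a plain stack of layers.

    By the chain rule this is the product of each layer's local derivative.
    """
    g = 1.0
    for d in local_grads:
        g *= d
    return g


def residual_gradient(local_grads):
    """Gradient for a stack of residual blocks y = x + F(x).

    The identity shortcut adds 1 to each block's local derivative,
    so every factor in the product is (1 + F'(x)).
    """
    g = 1.0
    for d in local_grads:
        g *= 1.0 + d
    return g


depth = 20                    # illustrative depth
local_grads = [0.01] * depth  # assumed small per-layer derivative

plain = plain_gradient(local_grads)        # shrinks toward zero with depth
residual = residual_gradient(local_grads)  # stays on the order of 1
print(f"plain: {plain:.3e}, residual: {residual:.3f}")
```

In the plain stack the gradient collapses as layers are added, while the shortcut keeps a path along which it passes through largely unchanged, which is why residual connections make deeper networks trainable.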

Articles published on ResNet Module

A Novel Multi-Feature Fusion Model Based on Pre-Trained Wav2vec 2.0 for Underwater Acoustic Target Recognition

Eye Disease Net: an algorithmic model for rapid diagnosis of diseases.

A Multi-Target Detection Method Based on Improved U-Net for UWB MIMO Through-Wall Radar

Research on Big Data-Driven Urban Traffic Flow Prediction Based on Deep Learning

Dual Residual Denoising Autoencoder with Channel Attention Mechanism for Modulation of Signals

Real-Time Detection of Mango Based on Improved YOLOv4

Target Detection Method of UAV Aerial Imagery Based on Improved YOLOv5

Dilated convolution based RCNN using feature fusion for Low-Altitude aerial objects

PROCEDURE FOR USING NEURAL NETWORKS FOR SEGMENTATION OF RASTER IMAGES

Human interaction recognition method based on parallel multi-feature fusion network

Lemon‐YOLO: An efficient object detection method for lemons in the natural environment

Improved protein model quality assessment by integrating sequential and pairwise features using deep learning.

End-to-End Classification Network for Ice Sheet Subsurface Targets in Radar Imagery

Three-dimensional rapid registration and reconstruction of multi-view rigid objects based on end-to-end deep surface model

CASR: a context-aware residual network for single-image super-resolution

