Normal Convolution Research Articles

Urban forestry is regarded as the green infrastructure of cities, the analysis of which is of great significance for research on statistical analysis, such as greening, carbon sink, etc. on the basis of remote sensing detection. However, current urban forestry detection algorithms are focused on typical machine learning, whose results are limited by the manually selected features. Therefore, an innovative detection algorithm for analyzing urban forestry is proposed. First, the simple linear iterative clustering algorithm is adopted to segment the dataset, and the edge information of the object is preserved as much as possible. Second, a new network structure is constructed on the basis of Mask-RCNN to refine the detection results: (1) dilated convolution is adopted to replace the normal convolution of C1 in backbone (ResNet101) to increase the size of receptive field; (2) convolutional block attention module is introduced in C2 to give more weight to the shallow features such as edges and colors; and (3) atrous spatial pyramid pooling (ASPP) is added to branch of C3, enhancing the ability of multiscale fusion. With the effect of ASPP on detection results at different positions of backbone compared, the results show that the more backward the ASPP is, the more obvious the trend of negative correlation between network convergence speed and detection accuracy (C3: 89.2%, C4: 84.3%, and C5: 81.6%) is. The accuracy of random forest and support vector machines is also compared, with the former of 69.4% and the latter of 71.2%. In addition, the method of transfer learning to unmanned aerial vehicle data is used in this research, with the accuracy of 81.6%. The result shows that the improved segmentation model features better ability of learning shallow features than the original, able to accurately extract urban forestry from remote sensing images, which provides the reference for the further research on urban forestry detection, mapping, as well as the multisource remote sensing data fusion.

Multiple and heterogenous Earth observation (EO) platforms are broadly used for a wide array of applications, and the integration of these diverse modalities facilitates better extraction of information than using them individually. The detection capability of the multispectral unmanned aerial vehicle (UAV) and satellite imagery can be significantly improved by fusing with ground hyperspectral data. However, variability in spatial and spectral resolution can affect the efficiency of such dataset's fusion. In this study, to address the modality bias, the input data was projected to a shared latent space using cross-modal generative approaches or guided unsupervised transformation. The proposed adversarial networks and variational encoder-based strategies used bi-directional transformations to model the cross-domain correlation without using cross-domain correspondence. It may be noted that an interpolation-based convolution was adopted instead of the normal convolution for learning the features of the point spectral data (ground spectra). The proposed generative adversarial network-based approach employed dynamic time wrapping based layers along with a cyclic consistency constraint to use the minimal number of unlabeled samples, having cross-domain correlation, to compute a cross-modal generative latent space. The proposed variational encoder-based transformation also addressed the cross-modal resolution differences and limited availability of cross-domain samples by using a mixture of expert-based strategy, cross-domain constraints, and adversarial learning. In addition, the latent space was modelled to be composed of modality independent and modality dependent spaces, thereby further reducing the requirement of training samples and addressing the cross-modality biases. An unsupervised covariance guided transformation was also proposed to transform the labelled samples without using cross-domain correlation prior. The proposed latent space transformation approaches resolved the requirement of cross-domain samples which has been a critical issue with the fusion of multi-modal Earth observation data. This study also proposed a latent graph generation and graph convolutional approach to predict the labels resolving the domain discrepancy and cross-modality biases. Based on the experiments over different standard benchmark airborne datasets and real-world UAV datasets, the developed approaches outperformed the prominent hyperspectral panchromatic sharpening, image fusion, and domain adaptation approaches. By using specific constraints and regularizations, the network developed was less sensitive to network parameters, unlike in similar implementations. The proposed approach illustrated improved generalizability in comparison with the prominent existing approaches. In addition to the fusion-based classification of the multispectral and hyperspectral datasets, the proposed approach was extended to the classification of hyperspectral airborne datasets where the latent graph generation and convolution were employed to resolve the domain bias with a small number of training samples. Overall, the developed transformations and architectures will be useful for the semantic interpretation and analysis of multimodal data and are applicable to signal processing, manifold learning, video analysis, data mining, and time series analysis, to name a few.

Normal Convolution Research Articles

Articles published on Normal Convolution

Vehicle detection method based on adaptive multi-scale feature fusion network

Topology-Aware Convolutional Neural Network for Efficient Skeleton-Based Action Recognition

Keypoint Message Passing for Video-Based Person Re-identification

EDR-Net: Lightweight Deep Neural Network Architecture for Detecting Referable Diabetic Retinopathy.

Image Desaturation for SDO/AIA Using Mixed Convolution Network

Urban forestry detection by deep learning method with GaoFen-2 remote sensing images

Scaling of Neural‐Network Quantum States for Time Evolution

Deformable Convolutional Networks for Multimodal Human Activity Recognition Using Wearable Sensors

Deep Convolutional Networks With Tunable Speed–Accuracy Tradeoff for Human Activity Recognition Using Wearables

An Image Edge Detection Algorithm Based on Multi-Feature Fusion

Automatic 12-Leading Electrocardiogram Classification Network with Deformable Convolution.

Evolution of ICTs-empowered-identification: A general re-ranking method for person re-identification

Multimodal Earth observation data fusion: Graph-based approach in shared latent space

A Novel Brain Image Segmentation Method Using an Improved 3D U-Net Model

Liver tumor segmentation from computed tomography images using multiscale residual dilated encoder‐decoder network

Subcortical band heterotopia and pachygyria with cognitive deterioration in an elderly patient

ParallelNet: multiple backbone network for detection tasks on thigh bone fracture

A Spectral Grouping and Attention-Driven Residual Dense Network for Hyperspectral Image Super-Resolution

A New Generative Neural Network for Bearing Fault Diagnosis with Imbalanced Data

Lightweight Ship Detection Methods Based on YOLOv3 and DenseNet

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Normal Convolution Research Articles

Articles published on Normal Convolution

Vehicle detection method based on adaptive multi-scale feature fusion network

Topology-Aware Convolutional Neural Network for Efficient Skeleton-Based Action Recognition

Keypoint Message Passing for Video-Based Person Re-identification

EDR-Net: Lightweight Deep Neural Network Architecture for Detecting Referable Diabetic Retinopathy.

Image Desaturation for SDO/AIA Using Mixed Convolution Network

Urban forestry detection by deep learning method with GaoFen-2 remote sensing images

Scaling of Neural‐Network Quantum States for Time Evolution

Deformable Convolutional Networks for Multimodal Human Activity Recognition Using Wearable Sensors

Deep Convolutional Networks With Tunable Speed–Accuracy Tradeoff for Human Activity Recognition Using Wearables

An Image Edge Detection Algorithm Based on Multi-Feature Fusion

Automatic 12-Leading Electrocardiogram Classification Network with Deformable Convolution.

Evolution of ICTs-empowered-identification: A general re-ranking method for person re-identification

Multimodal Earth observation data fusion: Graph-based approach in shared latent space

A Novel Brain Image Segmentation Method Using an Improved 3D U-Net Model

Liver tumor segmentation from computed tomography images using multiscale residual dilated encoder‐decoder network

Subcortical band heterotopia and pachygyria with cognitive deterioration in an elderly patient

ParallelNet: multiple backbone network for detection tasks on thigh bone fracture

A Spectral Grouping and Attention-Driven Residual Dense Network for Hyperspectral Image Super-Resolution

A New Generative Neural Network for Bearing Fault Diagnosis with Imbalanced Data

Lightweight Ship Detection Methods Based on YOLOv3 and DenseNet