Layered Decoding Research Articles

For real-world simulation, terrain models must combine various types of information on material and texture in terrain reconstruction for the three-dimensional numerical simulation of terrain. However, the construction of such models using the conventional method often involves high costs in both manpower and time. Therefore, this study used a convolutional neural network (CNN) architecture to classify material in multispectral remote sensing images to simplify the construction of future models. Visible light (i.e., RGB), near infrared (NIR), normalized difference vegetation index (NDVI), and digital surface model (DSM) images were examined.This paper proposes the use of the robust U-Net (RUNet) model, which integrates multiple CNN architectures, for material classification. This model, which is based on an improved U-Net architecture combined with the shortcut connections in the ResNet model, preserves the features of shallow network extraction. The architecture is divided into an encoding layer and a decoding layer. The encoding layer comprises 10 convolutional layers and 4 pooling layers. The decoding layer contains four upsampling layers, eight convolutional layers, and one classification convolutional layer. The material classification process in this study involved the training and testing of the RUNet model. Because of the large size of remote sensing images, the training process randomly cuts subimages of the same size from the training set and then inputs them into the RUNet model for training. To consider the spatial information of the material, the test process cuts multiple test subimages from the test set through mirror padding and overlapping cropping; RUNet then classifies the subimages. Finally, it merges the subimage classification results back into the original test image.The aerial image labeling dataset of the National Institute for Research in Digital Science and Technology (Inria, abbreviated from the French Institut national de recherche en sciences et technologies du numérique) was used as well as its configured dataset (called Inria-2) and a dataset from the International Society for Photogrammetry and Remote Sensing (ISPRS). Material classification was performed with RUNet. Moreover, the effects of the mirror padding and overlapping cropping were analyzed, as were the impacts of subimage size on classification performance. The Inria dataset achieved the optimal results; after the morphological optimization of RUNet, the overall intersection over union (IoU) and classification accuracy reached 70.82% and 95.66%, respectively. Regarding the Inria-2 dataset, the IoU and accuracy were 75.5% and 95.71%, respectively, after classification refinement. Although the overall IoU and accuracy were 0.46% and 0.04% lower than those of the improved fully convolutional network, the training time of the RUNet model was approximately 10.6 h shorter. In the ISPRS dataset experiment, the overall accuracy of the combined multispectral, NDVI, and DSM images reached 89.71%, surpassing that of the RGB images. NIR and DSM provide more information on material features, reducing the likelihood of misclassification caused by similar features (e.g., in color, shape, or texture) in RGB images. Overall, RUNet outperformed the other models in the material classification of remote sensing images. The present findings indicate that it has potential for application in land use monitoring and disaster assessment as well as in model construction for simulation systems.

Read full abstract

• In order to reestablish the random initialization input weights of ESN, a single layer ESN with a global reversible AE (GRAE) algorithm is proposed to ensure that the outputs of GRAE are remarkably correlated with the input data. • The reservoir layer with a reversible function is calculated by pulling the ESN output back and injecting it into the reservoir layer. Thus, the feature learning is enriched by additional information, which results in a better performance. • The GRAE which is based on ridge regression to solve a least squares problem is entirely different from existing back-propagation (BP) based AEs. So, the training speed of GRAE is many times faster than that of the BP based AEs. • In existing ESN autoencoder methods, all weights in the reservoir layer are generated randomly. The simple cycle connection is utilized in the reservoir layer of ESN to make sure the reservoir weights can be generated deterministically. An echo state network (z) can provide an efficient dynamic solution for predicting time series problems. However, in most cases, ESN models are applied for predictions rather than classifications. The applications of ESN in time series classification (TSC) problems have yet to be fully studied. Moreover, the conventional randomly generated ESN is unlikely to be optimal because of the randomly generated input and reservoir weights, which are not always guaranteed to be optimal. Randomly generating all layer weights is improper, because a purely random layer might destroy the useful features. To overcome this disadvantage, this study provides a new input weight establishment framework of ESN based on autoencoder (AE) theory for TSC tasks. A global reversible AE (GRAE) algorithm is proposed to reestablish the random initialization input weights of the ESN. In existing ESN-AEs, the output weights obtained in the encoding process are directly reused as the initial input weights. By contrast, in GRAE, the reservoir layer with a reversible activation function is calculated by pulling the decoding layer output back and injecting it into the reservoir layer. Thus, feature learning is enriched by additional information, which results in improved performance. The current weights of the encoding layer are iteratively replaced by the decoding layer to ensure that the outputs of the GRAE are remarkably correlated with the input data. Visualization analyses and experiments of the input weights on a massive set of UCR time series datasets indicate that the proposed GRAE method can considerably improve the original two-layer ESN-based classifiers and the proposed GRAE-ESN classifier yields better performance compared with traditional state-of-the-art TSC classifiers. Furthermore, the proposed method can provide comparable performance and considerably faster training speed compared with three deep learning classifiers.

Read full abstract

Layered Decoding Research Articles

Related Topics

Articles published on Layered Decoding

Continuous conditional random field convolution for point cloud segmentation

Modeling feedback representations in ventral visual cortex using a generative adversarial autoencoder

GCRFLDA: scoring lncRNA-disease associations using graph convolution matrix completion with conditional random field.

Attention Res-UNet with Guided Decoder for semantic segmentation of brain tumors

LDPC Decoder Design Using Compensation Scheme of Group Comparison for 5G Communication Systems

TSU-net: Two-stage multi-scale cascade and multi-field fusion U-net for right ventricular segmentation

Global-Attention-Based Neural Networks for Vision Language Intelligence

Double attention U-Net for brain tumor MR image segmentation

ONE-MINIUM-ONLY BASIC-SET TRELLIS MIN-MAX DECODER ARCHITECTURE FOR NONBINARY LDPC CODE

Polar Coding for Channels With Deletions

A novel convolutional neural network architecture of multispectral remote sensing images for automatic material classification

A generative image inpainting network based on the attention transfer network across layer mechanism

Automated Generation of Novel Fragments Using Screening Data, a Dual SMILES Autoencoder, Transfer Learning and Syntax Correction

Polarization image fusion with self-learned fusion strategy

An Efficient Transformer Decoder with Compressed Sub-layers

Dermo-DOCTOR: A framework for concurrent skin lesion detection and recognition using a deep convolutional neural network with end-to-end dual encoders

Echo state network with a global reversible autoencoder for time series classification

Enhanced U-Net: A Feature Enhancement Network for Polyp Segmentation.

A novel weight initialization with adaptive hyper-parameters for deep semantic segmentation

Jazz Bass Transcription Using a U-Net Architecture

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Layered Decoding Research Articles

Related Topics

Articles published on Layered Decoding

Continuous conditional random field convolution for point cloud segmentation

Modeling feedback representations in ventral visual cortex using a generative adversarial autoencoder

GCRFLDA: scoring lncRNA-disease associations using graph convolution matrix completion with conditional random field.

Attention Res-UNet with Guided Decoder for semantic segmentation of brain tumors

LDPC Decoder Design Using Compensation Scheme of Group Comparison for 5G Communication Systems

TSU-net: Two-stage multi-scale cascade and multi-field fusion U-net for right ventricular segmentation

Global-Attention-Based Neural Networks for Vision Language Intelligence

Double attention U-Net for brain tumor MR image segmentation

ONE-MINIUM-ONLY BASIC-SET TRELLIS MIN-MAX DECODER ARCHITECTURE FOR NONBINARY LDPC CODE

Polar Coding for Channels With Deletions

A novel convolutional neural network architecture of multispectral remote sensing images for automatic material classification

A generative image inpainting network based on the attention transfer network across layer mechanism

Automated Generation of Novel Fragments Using Screening Data, a Dual SMILES Autoencoder, Transfer Learning and Syntax Correction

Polarization image fusion with self-learned fusion strategy

An Efficient Transformer Decoder with Compressed Sub-layers

Dermo-DOCTOR: A framework for concurrent skin lesion detection and recognition using a deep convolutional neural network with end-to-end dual encoders

Echo state network with a global reversible autoencoder for time series classification

Enhanced U-Net: A Feature Enhancement Network for Polyp Segmentation.

A novel weight initialization with adaptive hyper-parameters for deep semantic segmentation

Jazz Bass Transcription Using a U-Net Architecture