LiDAR-based 3D object detectors are widely used in autonomous driving and robotic systems. Efficient voxel-based models must downsample their feature maps to reduce computation, which discards fine geometric information and limits detection accuracy. To address this problem, this paper presents PVB-SSD, a point-voxel and bird's-eye-view representation aggregation network for single-stage 3D object detection, in which a positional-information input branch generates Fourier embedding features from the original point cloud to compensate for the lost geometry. A global-former module integrates these Fourier embedding features with bird's-eye-view features extracted by a 3D convolutional backbone. Because spatial-level features are progressively replaced by semantic-level features in the deeper layers of a network, a window-transformer spatial-semantic aggregation module fuses the two dynamically. Extensive experiments on the KITTI, Waymo, and nuScenes datasets show that our model achieves strong accuracy with relatively low computational cost.
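To illustrate the positional branch described above, the following is a minimal sketch, not the paper's implementation, of a NeRF-style Fourier feature embedding applied to raw point coordinates in PyTorch; the number of frequency bands, the fixed power-of-two frequencies, and the learnable output projection are all illustrative assumptions.

```python
import torch
import torch.nn as nn

class FourierEmbedding(nn.Module):
    """Map raw (x, y, z) point coordinates to high-frequency Fourier features.

    Hypothetical sketch of a Fourier positional branch: the band count and
    the linear projection are assumptions, not the authors' exact design.
    """

    def __init__(self, in_dim: int = 3, num_bands: int = 8, out_dim: int = 64):
        super().__init__()
        # Fixed frequencies 2^0 ... 2^(num_bands - 1), not learned.
        self.register_buffer("freqs", 2.0 ** torch.arange(num_bands))
        # One sin and one cos per band per input coordinate, then project.
        self.proj = nn.Linear(in_dim * num_bands * 2, out_dim)

    def forward(self, xyz: torch.Tensor) -> torch.Tensor:
        # xyz: (N, 3) raw point coordinates.
        angles = xyz.unsqueeze(-1) * self.freqs                   # (N, 3, num_bands)
        feats = torch.cat([angles.sin(), angles.cos()], dim=-1)  # (N, 3, 2 * num_bands)
        return self.proj(feats.flatten(start_dim=1))             # (N, out_dim)

# Usage: embed 1024 random points.
points = torch.rand(1024, 3)
emb = FourierEmbedding()(points)
print(emb.shape)  # torch.Size([1024, 64])
```

Mapping coordinates through multiple sinusoidal frequencies lets downstream layers represent fine positional variation that plain (x, y, z) inputs, once voxelized and downsampled, can no longer express.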