Multimodal Convolutional Neural Network Research Articles

The gold standard for assessing whether a baby is at risk of oxygen deprivation during childbirth is continuous monitoring of the fetal heart rate with cardiotocography (CTG). The aim is to identify babies who could benefit from an emergency operative delivery (e.g., Cesarean section) to prevent death or permanent brain injury. The long, dynamic, and complex CTG patterns are poorly understood, and their interpretation is known to have high false-positive and false-negative rates. Visual interpretation by clinicians is challenging, and reliable, accurate fetal monitoring in labor remains an enormous unmet medical need. In this work, we applied deep learning methods to achieve data-driven automated CTG evaluation. Multimodal Convolutional Neural Network (MCNN) and Stacked MCNN models were used to analyze the largest available database of routinely collected CTG and linked clinical data (comprising more than 35,000 births). We also assessed in detail the impact of signal quality on MCNN performance. On a large hold-out testing set from Oxford ($n = 4429$ births), MCNN improved the prediction of cord acidemia at birth compared with Clinical Practice and previous computerized approaches. On two external datasets, MCNN performed better than current feature extraction-based methods. Our group is the first to apply deep learning to the analysis of CTG. We conclude that MCNN models hold potential for the prediction of cord acidemia at birth and that further work is warranted. Despite these advances, our deep learning models are currently not suitable for the detection of severe fetal injury in the absence of cord acidemia, a heterogeneous, small, and poorly understood group. We suggest that the most promising way forward is a hybrid approach to CTG interpretation in labor, in which different diagnostic models estimate the risk for different types of fetal compromise, combining clinical knowledge with data-driven analyses.

Automated methods for prostate cancer (PCa) diagnosis in multi-parametric magnetic resonance imaging (MP-MRI) are critical for reducing the need for manual interpretation of radiographs while helping to improve diagnostic accuracy (Artan et al 2010 IEEE Trans. Image Process. 19 2444–55, Litjens et al 2014 IEEE Trans. Med. Imaging 33 1083–92, Liu et al 2013 SPIE Medical Imaging (International Society for Optics and Photonics) p 86701G, Moradi et al 2012 J. Magn. Reson. Imaging 35 1403–13, Niaf et al 2014 IEEE Trans. Image Process. 23 979–91, Niaf et al 2012 Phys. Med. Biol. 57 3833, Peng et al 2013a SPIE Medical Imaging (International Society for Optics and Photonics) p 86701H, Peng et al 2013b Radiology 267 787–96, Wang et al 2014 BioMed. Res. Int. 2014). This paper presents an automated method based on multimodal convolutional neural networks (CNNs) for two PCa diagnostic tasks: (1) distinguishing between cancerous and noncancerous tissues and (2) distinguishing between clinically significant (CS) and indolent PCa. Specifically, our multimodal CNNs fuse apparent diffusion coefficients (ADCs) and T2-weighted MP-MRI images (T2WIs). To fuse ADCs and T2WIs effectively, we design a new similarity loss function that enforces the extraction of consistent features from both ADCs and T2WIs. The similarity loss is combined with the conventional classification loss functions and integrated into the back-propagation procedure of CNN training. The similarity loss enables better fusion results than existing methods because the feature learning processes of both modalities are mutually guided, jointly helping the CNN to ‘see’ the true visual patterns of PCa. The classification results of the multimodal CNNs are further combined with the results based on handcrafted features using a support vector machine classifier.
To achieve satisfactory accuracy for clinical use, we comprehensively investigate three critical factors that could greatly affect the performance of our multimodal CNNs but have not been carefully studied previously. (1) Given limited training data, how can the data be augmented in sufficient number and variety for fine-tuning deep CNNs for PCa diagnosis? (2) How can multimodal MP-MRI information be effectively combined in CNNs? (3) What is the impact of different CNN architectures on the accuracy of PCa diagnosis? Experimental results on extensive clinical data from 364 patients, with a total of 463 PCa lesions and 450 identified noncancerous image patches, demonstrate that our system can achieve a sensitivity of 89.85% and a specificity of 95.83% for distinguishing cancer from noncancerous tissues, and a sensitivity of 100% and a specificity of 76.92% for distinguishing indolent PCa from CS PCa. These results are significantly superior to those of the state-of-the-art method relying on handcrafted features.
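
The combination of a similarity loss with the conventional classification losses, as described in the abstract above, can be illustrated with a minimal sketch. The abstract does not give the exact form of the similarity loss or how the terms are weighted, so everything here is an assumption made for illustration: the similarity term is taken to be the mean squared difference between the ADC and T2WI feature vectors, the classification loss is softmax cross-entropy per modality, and the function names and the `weight` parameter are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Conventional classification loss for one sample."""
    return -math.log(softmax(logits)[label])


def similarity_loss(feat_adc, feat_t2w):
    """Assumed form: mean squared difference between the two
    modality feature vectors (not taken from the paper)."""
    diffs = [(a - b) ** 2 for a, b in zip(feat_adc, feat_t2w)]
    return sum(diffs) / len(diffs)


def combined_loss(logits_adc, logits_t2w, feat_adc, feat_t2w, label, weight=0.5):
    """Classification loss of both branches plus a weighted
    similarity term; `weight` is a hypothetical hyperparameter."""
    cls = cross_entropy(logits_adc, label) + cross_entropy(logits_t2w, label)
    return cls + weight * similarity_loss(feat_adc, feat_t2w)


# Toy values: two-class logits and 3-dimensional features per modality.
loss = combined_loss([2.0, 0.5], [1.5, 0.2], [0.1, 0.4, 0.9], [0.2, 0.3, 1.0], label=0)
print(round(loss, 4))
```

Under these assumptions, the similarity term is zero when both branches produce identical features and grows as they diverge, so minimizing the combined loss pushes the two modalities toward consistent representations while each branch still learns to classify.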

Articles published on Multimodal Convolutional Neural Network

Multimodal Convolutional Neural Networks to Detect Fetal Compromise During Labor and Delivery

RGB-D-Based Object Recognition Using Multimodal Convolutional Neural Networks: A Survey

Multimodal Deep Convolutional Neural Networks Applied to Video Expression Recognition

Learning Audio–Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification

Enhancing semantic image retrieval with limited labeled examples via deep learning

A Deep Multi-Modal CNN for Multi-Instance Multi-Label Image Classification

RGB-D Scene Classification via Multi-modal Feature Learning

Effect of fusing features from multiple DCNN architectures in image classification

FOREST COVER CLASSIFICATION USING GEOSPATIAL MULTIMODAL DATA

Automated diagnosis of prostate cancer in multi-parametric MRI based on multimodal convolutional neural networks

Detection and Localization of Robotic Tools in Robot-Assisted Surgery Videos Using Deep Neural Networks for Region Proposal and Detection
