Multimodal deep learning for disaster social media analysis plays an important role in emergency response and recovery. In real-world deployments, however, sudden disaster events may differ from the training data, requiring the multimodal network to recognize them as unknown classes rather than misclassify them as known ones. Previous studies have focused primarily on model accuracy in a closed-world setting and cannot directly detect unknown classes. We therefore propose a novel multimodal model for categorizing disaster-related social media in an open-world environment. Our method uses pre-trained unimodal models as encoders for each modality and fuses their outputs with a cross-attention module to obtain a joint representation. For open-world detection, we employ a multitask classifier comprising a closed-world classifier and an open-world classifier: the closed-world classifier is trained on the original data to classify known classes, while the open-world classifier determines whether an input belongs to a known class. We further propose a sample generation strategy that models the distribution of unknown samples from known data, enabling the open-world classifier to identify them. Experiments on two public datasets, CrisisMMD and MHII, show that the proposed method outperforms existing baselines in crisis information classification.
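The abstract does not give implementation details, so the following is a minimal PyTorch sketch of the described architecture under stated assumptions: text and image features from pre-trained unimodal encoders are fused with cross-attention, and a multitask head pairs a closed-world classifier over known classes with a binary known/unknown (open-world) classifier. All module names, dimensions, and the choice of `nn.MultiheadAttention` are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class CrossAttentionFusion(nn.Module):
    """Hypothetical cross-attention fusion of text and image features."""
    def __init__(self, dim=768, num_heads=8):
        super().__init__()
        # Text tokens attend to image patches; equal feature dims are assumed.
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, text_feats, image_feats):
        # text_feats: (B, T, dim) tokens from a pre-trained text encoder
        # image_feats: (B, P, dim) patches from a pre-trained image encoder
        fused, _ = self.attn(query=text_feats, key=image_feats, value=image_feats)
        # Residual connection, then mean-pool tokens into a joint representation.
        return self.norm(text_feats + fused).mean(dim=1)  # (B, dim)

class MultitaskClassifier(nn.Module):
    """Closed-world head over known classes plus a binary known/unknown head."""
    def __init__(self, dim=768, num_known_classes=8):
        super().__init__()
        self.closed_head = nn.Linear(dim, num_known_classes)  # known-class logits
        self.open_head = nn.Linear(dim, 2)                    # known vs. unknown

    def forward(self, joint_repr):
        # joint_repr: (B, dim) output of the fusion module
        return self.closed_head(joint_repr), self.open_head(joint_repr)
```

At inference, one plausible decision rule consistent with the abstract is to assign the closed-world label only when the open-world head predicts "known", and otherwise reject the input as an unknown class.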