Multimodal Data Research Articles

Multimodal retrieval has received widespread attention because it can supply large volumes of related data to support the development of computational social systems (CSSs). However, existing works still face the following challenges: 1) they rely on a tedious manual labeling process when extended to CSS, which not only introduces subjective errors but also consumes considerable time and labor; 2) they train only on strongly aligned data and neglect adjacency information, which leads to poor robustness and makes the semantic heterogeneity gap difficult to bridge; and 3) they map features into real-valued forms, which results in high storage costs and low retrieval efficiency. To address these issues in turn, we have designed a web-knowledge-driven multimodal retrieval framework called unsupervised and robust graph convolutional hashing (URGCH).
The specific implementation is as follows. First, a “secondary semantic self-fusion” approach is proposed, which extracts semantically rich features through pretrained neural networks, constructs a joint semantic matrix through semantic fusion, and eliminates manual labeling. Second, an “adaptive computing” approach is designed to construct enhanced semantic graph features by infusing neighborhood knowledge and uses graph convolutional networks for knowledge fusion coding, which enables URGCH to bridge the semantic modality gap while obtaining satisfactory robustness. Third, combined with hash learning, the multimodal data are mapped into binary codes, which reduces storage requirements and improves retrieval efficiency. Finally, we perform extensive experiments on the web dataset. The results show that URGCH exceeds other baselines by about 1%–3.7% in mean average precision (MAP), displays superior performance in all aspects, and can provide multimodal data retrieval services to CSS.
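
The hashing and evaluation steps in this abstract can be illustrated with a minimal sketch. It is not the URGCH method: it assumes the simplest possible binarization (thresholding features at zero), ranks items by Hamming distance, and computes average precision for one query. The function names, toy feature values, and relevance labels are all hypothetical.

```python
import numpy as np


def binarize(features: np.ndarray) -> np.ndarray:
    """Threshold real-valued features at zero to obtain 0/1 hash bits."""
    return (np.asarray(features) > 0).astype(np.uint8)


def hamming_rank(query_code: np.ndarray, db_codes: np.ndarray):
    """Rank database items by Hamming distance to the query, closest first."""
    dists = np.count_nonzero(db_codes != query_code, axis=1)
    order = np.argsort(dists, kind="stable")  # stable: ties keep database order
    return order, dists


def average_precision(ranked_relevance) -> float:
    """Average precision of one ranked list of 0/1 relevance flags."""
    rel = np.asarray(ranked_relevance, dtype=float)
    n_relevant = rel.sum()
    if n_relevant == 0:
        return 0.0
    precision_at_k = np.cumsum(rel) / np.arange(1, len(rel) + 1)
    return float((precision_at_k * rel).sum() / n_relevant)


# Toy real-valued embeddings (hypothetical values, 4 items x 4 dimensions).
db_features = np.array([
    [0.5, -1.0, 2.0, -0.3],
    [-0.2, 0.7, 1.1, -0.9],
    [0.9, -0.4, 0.3, -0.1],
    [-1.0, 1.0, -1.0, 1.0],
])
query_features = np.array([0.1, -0.2, 0.3, -0.4])
db_relevant = np.array([1, 1, 0, 0])  # hypothetical ground truth for this query

db_codes = binarize(db_features)
order, dists = hamming_rank(binarize(query_features), db_codes)
ap = average_precision(db_relevant[order])
print("ranking:", order.tolist(), "distances:", dists.tolist(), "AP:", round(ap, 4))
```

MAP is the mean of this per-query average precision over all queries. Each 4-dimensional item here needs 4 bits as a hash code versus 128 bits as float32 values, which is the storage saving the abstract refers to.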

e23253 Background: This study attempted to address limitations in the A-PROACCT model developed by Stein et al. (JOP, 2023) to predict the rate of acute care events within 30 days (ACE30) of initial chemotherapy administration (ICA). In that model, the four most important predictors were drug category (immunotherapy), cancer type (gastrointestinal), age group (<40), and BMI category (underweight), and only linear relationships were considered. Among patients for whom an ACE30 was predicted by their first and second models, only 0.18 and 0.23, respectively, actually had an ACE30, a low positive predictive value (PPV). We attempted to improve the PPV with advanced modeling techniques. Methods: Using Orlando Health data from February 2012 to April 2021, random sampling was used to split the dataset into training (n = 10,838) and validation (n = 4,645) sets. For the training data, resampling was employed to obtain a balanced sample with equal numbers of observations followed and not followed by an ACE30. We included only variables used in Stein (drug category, age group, ED visits and hospitalizations in the prior year, cancer type, insurance category, number of anti-cancer agents, race, and BMI category). Logistic regression with an L1 penalty, XGBoost (a nonparametric tree-based algorithm that accounts for nonlinear relationships), and artificial neural networks were used to develop and evaluate predictive models. Different sampling methods (bootstrap and SMOTE) as well as cut-off thresholds for high-risk groups were tested. Results: Based on evaluation with the validation data set, the best performing approach was an XGBoost model with SMOTE resampling of the training data. The four most important predictors were ED visits in the prior year, payor category (self-insured), cancer type (genitourinary), and hospitalization in the prior year. This model reported a PPV of 0.27. Many other combinations of the methodologies described above were tested, and the PPV varied between 0.19 and 0.27.
Of the 282 ICAs identified as high-risk by the best model, 76 (27.0%) had an ACE30. Of the 4346 ICAs identified as low-risk, 533 (12.3%) had an ACE30. The difference between the high-risk and low-risk ACE30 rates was statistically significant (p < 0.0001). In comparison, applying the Stein et al. models to our data yielded PPVs of 0.27 and 0.25, which are essentially at the same level as our best trained model. Conclusions: Using the variables included in A-PROACCT and a variety of machine learning models, advanced sampling methodologies, and cut-off thresholds, the best results were similar to those obtained using basic logistic regression. This suggests that improving the prediction of acute care events following chemotherapy administration requires incorporating more extensive multimodal data, such as vital signs, performance status, patient-reported outcomes, socioeconomic factors, laboratory and radiology results, remote patient monitoring, and wearable device data.
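
The reported PPV and group rates can be reproduced from the counts given in the abstract. The sketch below also derives two quantities the abstract does not report (the negative predictive value and the share of ACE30 events that were flagged as high-risk) and runs a pooled two-proportion z-test. The abstract does not name the significance test that was used, so the z-test is an assumption chosen for illustration.

```python
import math

# Counts taken from the abstract (validation data, best model).
high_risk_n, high_risk_events = 282, 76
low_risk_n, low_risk_events = 4346, 533

# PPV: share of ICAs flagged high-risk that were followed by an ACE30.
ppv = high_risk_events / high_risk_n
# ACE30 rate among ICAs flagged low-risk.
low_risk_rate = low_risk_events / low_risk_n
# Derived here, not reported in the abstract.
npv = 1 - low_risk_rate
sensitivity = high_risk_events / (high_risk_events + low_risk_events)


def two_proportion_z_test(x1: int, n1: int, x2: int, n2: int):
    """Pooled two-sided z-test for the difference between two proportions."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail
    return z, p_value


z, p_value = two_proportion_z_test(
    high_risk_events, high_risk_n, low_risk_events, low_risk_n
)
print(f"PPV={ppv:.3f} low-risk rate={low_risk_rate:.3f} "
      f"NPV={npv:.3f} sensitivity={sensitivity:.3f}")
print(f"z={z:.2f} p={p_value:.2e}")
```

Under these assumptions the p-value falls well below 0.0001, in line with the abstract, while the derived sensitivity shows that most ACE30 events occurred in the group labeled low-risk.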

Articles published on Multimodal Data

The affordances of artificial intelligence-based tools for supporting 21st-century skills:

Toward Enhanced Prediction of High‐Impact Solar Energetic Particle Events Using Multimodal Time Series Data Fusion Models

A Web Knowledge-Driven Multimodal Retrieval Method in Computational Social Systems: Unsupervised and Robust Graph Convolutional Hashing

Integrating 4 Methods (In4M) to evaluate physical function in patients with cancer: Results of a comprehensive digital health study.

Adversarial Learning Based Node-Edge Graph Attention Networks for Autism Spectrum Disorder Identification.

Using memes and emoji-scales in a web survey: experimental assessment of consequences for multimodal cognitive effort and data quality

MGIML: Cancer Grading With Incomplete Radiology-Pathology Data via Memory Learning and Gradient Homogenization.

Deep Learning Model for Estimation of LV Ejection Fraction from Echocardiogram

Refining the 30-day emergency department visit after initial chemotherapy administration A-PROACCT model.

Enhanced Day-Ahead Electricity Price Forecasting Using a Convolutional Neural Network–Long Short-Term Memory Ensemble Learning Approach with Multimodal Data Integration

MultiCogniGraph: A multimodal data fusion and graph convolutional network‐based multi‐hop reasoning method for large equipment fault diagnosis

Attention-Like Multimodality Fusion With Data Augmentation for Diagnosis of Mental Disorders Using MRI.

Learning Robust Representations of Tonic-Clonic Seizures With Cyclic Transformer.

Immediate traffic flow monitoring and management based on multimodal data in cloud computing

A multi-modal extraction integrated model for neuropsychiatric disorders classification

Graph convolutional network with attention mechanism improve major depressive depression diagnosis based on plasma biomarkers and neuroimaging data

Application of emotion recognition technology in psychological counseling for college students

Predicting glaucoma before onset using a large language model chatbot

DMGM: deformable-mechanism based cervical cancer staging via MRI multi-sequence. Funding: Natural Science Foundation of China (U1934221, 61773323) and Sichuan Science and Technology Program (2019YFG0345, 2019YJ0210).

Protocol for the Development and Analysis of the Oxford and Reading Cognitive Comorbidity, Frailty and Ageing Research Database-Electronic Patient Records (ORCHARD-EPR)
