Discovery Logo
Sign In
Search
Paper
Search Paper
R Discovery for Libraries Pricing Sign In
  • Home iconHome
  • My Feed iconMy Feed
  • Search Papers iconSearch Papers
  • Library iconLibrary
  • Explore iconExplore
  • Ask R Discovery iconAsk R Discovery Star Left icon
  • Literature Review iconLiterature Review NEW
  • Chat PDF iconChat PDF Star Left icon
  • Citation Generator iconCitation Generator
  • Chrome Extension iconChrome Extension
    External link
  • Use on ChatGPT iconUse on ChatGPT
    External link
  • iOS App iconiOS App
    External link
  • Android App iconAndroid App
    External link
  • Contact Us iconContact Us
    External link
  • Paperpal iconPaperpal
    External link
  • Mind the Graph iconMind the Graph
    External link
  • Journal Finder iconJournal Finder
    External link
Discovery Logo menuClose menu
  • Home iconHome
  • My Feed iconMy Feed
  • Search Papers iconSearch Papers
  • Library iconLibrary
  • Explore iconExplore
  • Ask R Discovery iconAsk R Discovery Star Left icon
  • Literature Review iconLiterature Review NEW
  • Chat PDF iconChat PDF Star Left icon
  • Citation Generator iconCitation Generator
  • Chrome Extension iconChrome Extension
    External link
  • Use on ChatGPT iconUse on ChatGPT
    External link
  • iOS App iconiOS App
    External link
  • Android App iconAndroid App
    External link
  • Contact Us iconContact Us
    External link
  • Paperpal iconPaperpal
    External link
  • Mind the Graph iconMind the Graph
    External link
  • Journal Finder iconJournal Finder
    External link
features
  • Audio Papers iconAudio Papers
  • Paper Translation iconPaper Translation
  • Chrome Extension iconChrome Extension
Content Type
  • Journal Articles iconJournal Articles
  • Conference Papers iconConference Papers
  • Preprints iconPreprints
  • Seminars by Cassyni iconSeminars by Cassyni
More
  • R Discovery for Libraries iconR Discovery for Libraries
  • Research Areas iconResearch Areas
  • Topics iconTopics
  • Resources iconResources

Related Topics

  • Deep Feature Learning
  • Deep Feature Learning

Articles published on Multimodal Learning

Authors
Select Authors
Journals
Select Journals
Duration
Select Duration
5233 Search results
Sort by
Recency
  • New
  • Research Article
  • 10.1016/j.ergon.2026.103919
Towards accurate prediction of drivers’ mental workload during continuous driving tasks with multimodal measures and machine learning regression models
  • May 1, 2026
  • International Journal of Industrial Ergonomics
  • Yuchen Ji + 5 more

Towards accurate prediction of drivers’ mental workload during continuous driving tasks with multimodal measures and machine learning regression models

  • New
  • Research Article
  • 10.1016/j.comnet.2026.112179
A systematic literature survey of machine learning approaches to cyber data breach detection: Current research issues and future directions
  • May 1, 2026
  • Computer Networks
  • Paul Ntim Yeboah + 4 more

Although several machine learning driven solutions are deemed to be effective at detecting data breaches, the recent proliferation in data breach incidents resulting from cyber attacks on computer networks demands an updated, thorough analysis of Machine Learning (ML) based data breach countermeasures to identify research gaps and guide future studies. In view of this, this study employs a systematic approach and draws insight from 89 research articles to classify machine learning based data breach countermeasures using eight criteria namely learning tasks, learning classifiers, datasets, feature engineering methods, multimodal approaches, pre-training approaches and performance. In classifying the studies, we: (a) propose a taxonomy of feature extraction and representation to classify studies using ten sub-criteria, (b) classify multimodal machine learning approaches used in the studies into three fusion sub-criteria: namely early fusion, intermediate fusion and late fusion, (c) show a comparison of studies based on pre-training techniques employed such as pre-text objective, learning model and data used in pre-training, (d) classify the datasets used in the study evaluation into two categories: real dataset and simulated dataset and (e) evaluate studies by detection performance and effectiveness against data breaches on unknown and obfuscated network traffic. To aid the literature identification, we analyse forty recent incidents and obtain prevalent cyber attack vectors of data breaches, which we present as the general workflow for data breaches due to cyber attacks. Finally, we highlight the research issues associated with existing ML-based data breach countermeasures and recommend future research directions.

  • New
  • Research Article
  • 10.1016/j.bspc.2026.109466
A multi-modal fusion-based deep learning with finetuned LLaMA 3 for lung disease diagnosis using PACS radiology reports
  • May 1, 2026
  • Biomedical Signal Processing and Control
  • J Lefty Joyson + 2 more

A multi-modal fusion-based deep learning with finetuned LLaMA 3 for lung disease diagnosis using PACS radiology reports

  • New
  • Research Article
  • 10.1016/j.inffus.2025.104019
Multi-modal collaborative learning with vision foundation model prompt boosts 3D semi-supervised semantic segmentation
  • May 1, 2026
  • Information Fusion
  • Xiang He + 5 more

Multi-modal collaborative learning with vision foundation model prompt boosts 3D semi-supervised semantic segmentation

  • New
  • Research Article
  • 10.1016/j.asoc.2026.114946
Entity alignment based on multi-modal representation learning and constrained weakly-supervised clustering
  • May 1, 2026
  • Applied Soft Computing
  • Ruifeng Xie + 2 more

Entity alignment based on multi-modal representation learning and constrained weakly-supervised clustering

  • New
  • Research Article
  • 10.1016/j.autcon.2026.106851
Scenario-based multimodal deep learning framework for simultaneous detection of construction accident causal factors and risk evaluation
  • May 1, 2026
  • Automation in Construction
  • Jaehui Son + 3 more

Scenario-based multimodal deep learning framework for simultaneous detection of construction accident causal factors and risk evaluation

  • New
  • Research Article
  • 10.1016/j.ejogrb.2026.115018
Preoperative identification of deep myometrial invasion in endometrial cancer: a multicenter MRI study with a vision foundation model-enhanced multimodal deep learning framework.
  • May 1, 2026
  • European journal of obstetrics, gynecology, and reproductive biology
  • Peijun Li + 16 more

Preoperative identification of deep myometrial invasion in endometrial cancer: a multicenter MRI study with a vision foundation model-enhanced multimodal deep learning framework.

  • New
  • Research Article
  • 10.1016/j.jbi.2026.105001
Explainable multimodal deep learning models for variable-length sequences in critically ill patients.
  • May 1, 2026
  • Journal of biomedical informatics
  • Jennifer Martin + 9 more

Explainable multimodal deep learning models for variable-length sequences in critically ill patients.

  • New
  • Research Article
  • 10.1016/j.compstruc.2026.108216
Transformer self-attention encoder–decoder with multimodal deep learning for response time series forecasting and digital twin support in wind structural health monitoring
  • May 1, 2026
  • Computers & Structures
  • Feiyu Zhou + 1 more

Transformer self-attention encoder–decoder with multimodal deep learning for response time series forecasting and digital twin support in wind structural health monitoring

  • New
  • Research Article
  • 10.1016/j.cmpb.2026.109265
Correlative analysis between ocular surface features and carotid plaque : A multimodal machine learning framework.
  • May 1, 2026
  • Computer methods and programs in biomedicine
  • Shichen Zhang + 3 more

Correlative analysis between ocular surface features and carotid plaque : A multimodal machine learning framework.

  • New
  • Research Article
  • 10.1016/j.engappai.2026.114357
A multi-modal multi-task learning network for intelligent parameter measurement in gas–liquid two-phase flow
  • May 1, 2026
  • Engineering Applications of Artificial Intelligence
  • Hanqing Chen + 8 more

A multi-modal multi-task learning network for intelligent parameter measurement in gas–liquid two-phase flow

  • New
  • Research Article
  • 10.11591/edulearn.v20i2.21735
Enhancing special needs literacy: insights from Indonesian inclusive schools
  • May 1, 2026
  • Journal of Education and Learning (EduLearn)
  • Redite Kurniawan + 2 more

The study focuses on the inclusive school’s approach to literacy development for special needs students (SNS). The research aims to delve into the inclusive school’s methodologies concerning literacy development, particularly tailored to cater to the diverse needs of special education students. The study was conducted in 17 inclusive schools across 3 provinces in Indonesia, all of which have demonstrated remarkable progress in literacy development. This comprehensive investigation involved meticulously surveying these schools to gather insights into their practices and achievements. The findings of the study highlight several patterns in the approach to literacy development in inclusive schools. A comprehensive inclusive literacy development model is proposed, which emphasizes personalized planning, multimodal and experiential learning, differentiated instruction, recognition of achievements, and ongoing professional development. The study’s insights are relevant for educators, policymakers, and researchers interested in enhancing inclusive literacy practices for SNS in inclusive school settings schools. The practicality of the curriculum is seen from the aspect of good communication, cooperation, sufficient time, and the ease of understanding the material.

  • New
  • Research Article
  • 10.1016/j.ejca.2026.116679
MuTriM: A multiscale deep learning model integrating longitudinal radiomics and pathomic features for predicting recurrence and adjuvant radiation benefit in breast cancer.
  • May 1, 2026
  • European journal of cancer (Oxford, England : 1990)
  • Xiangxue Wang + 14 more

MuTriM: A multiscale deep learning model integrating longitudinal radiomics and pathomic features for predicting recurrence and adjuvant radiation benefit in breast cancer.

  • New
  • Research Article
  • 10.1016/j.cmpb.2026.109306
Integrating multimodal data and deep learning for functional assessment and rehabilitation prediction after cerebral hemorrhage.
  • May 1, 2026
  • Computer methods and programs in biomedicine
  • Xuemin Liu + 4 more

Integrating multimodal data and deep learning for functional assessment and rehabilitation prediction after cerebral hemorrhage.

  • New
  • Research Article
  • 10.1016/j.neunet.2025.108518
CILF-CIAE: CLIP-driven image-language fusion for correcting inverse age estimation.
  • May 1, 2026
  • Neural networks : the official journal of the International Neural Network Society
  • Yuntao Shou + 4 more

CILF-CIAE: CLIP-driven image-language fusion for correcting inverse age estimation.

  • New
  • Research Article
  • 10.1016/j.lmot.2026.102252
A longitudinal study of digital multimodal learning activities and achievement motivation: A superposition perspective from Chinese EFL learners
  • May 1, 2026
  • Learning and Motivation
  • Haining Zhang + 1 more

A longitudinal study of digital multimodal learning activities and achievement motivation: A superposition perspective from Chinese EFL learners

  • New
  • Research Article
  • 10.1016/j.neunet.2025.108525
MGML: A plug-and-play meta-guided multi-modal learning framework for incomplete multimodal brain tumor segmentation.
  • May 1, 2026
  • Neural networks : the official journal of the International Neural Network Society
  • Yulong Zou + 8 more

MGML: A plug-and-play meta-guided multi-modal learning framework for incomplete multimodal brain tumor segmentation.

  • New
  • Research Article
  • 10.1016/j.eswa.2026.131440
A two-stage multimodal learning framework based on text-driven vision pretraining and cross-modal feature fusion for thyroid ultrasound diagnosis
  • May 1, 2026
  • Expert Systems with Applications
  • Mengzhu Yu + 7 more

A two-stage multimodal learning framework based on text-driven vision pretraining and cross-modal feature fusion for thyroid ultrasound diagnosis

  • New
  • Research Article
  • 10.1016/j.inffus.2025.104062
Multimodal fusion with vision-language-action models for robotic manipulation: A systematic review
  • May 1, 2026
  • Information Fusion
  • Muhayy Ud Din + 4 more

• Provides a unified taxonomy that organizes more than 100 VLA architectures. • Maps 26 major VLA datasets using a framework based on task difficulty and modality richness. • Presents a large-scale quantitative analysis linking model design choices to normalized performance. • Demonstrates that diffusion-based decoders and hierarchical fusion significantly improve manipulation success. • Introduces the VLA-FEB benchmark with new metrics for measuring multimodal fusion quality and alignment. • Proposes an agentic VLA framework where LLM planners verify and re-plan actions using uncertainty-driven feedback for self-improving robotic autonomy. Vision Language Action (VLA) models represent a new frontier in robotics by unifying perception, reasoning, and control within a single multimodal learning framework. By integrating visual, linguistic, and action modalities, they enable multimodal fusion systems designed for instruction-driven manipulation and generalist autonomy. This systematic review synthesizes the state of the art in VLA research with an emphasis on architectures, algorithms, and applications relevant to robotic manipulation. We examine 102 models, 26 foundational datasets, and 12 simulation platforms, categorizing them according to their fusion strategies and integration mechanisms. Foundational datasets are evaluated using a novel criterion based on task complexity, modality richness, and dataset scale, allowing a comparative analysis of their suitability for generalist policy learning. We further introduce a structured taxonomy of fusion hierarchies and encoder-decoder families, together with a two-dimensional dataset characterization framework and a meta-analytic benchmarking protocol that quantitatively links design variables to empirical performance across benchmarks. Our analysis shows that hierarchical and late fusion architectures achieve the highest manipulation success and generalization, confirming the benefit of multi-level cross-modal integration. Diffusion-based decoders demonstrate superior cross-domain transfer and robustness compared to autoregressive heads. Dataset analysis highlights a persistent lack of benchmarks that combine high-complexity, multimodal, and long-horizon tasks, while existing simulators offer limited multimodal synchronization and real-to-sim consistency. To address these gaps, we propose the VLA Fusion Evaluation Benchmark to quantify fusion efficiency and alignment. Drawing on both academic and industrial advances, the review outlines future research directions in adaptive and modular fusion architectures, computational resource optimization, and the deployment of interpretable, resource-efficient robotic systems. We further propose a forward-looking agentic VLA paradigm where LLM planners integrate VLA skills as verifiable tools within a closed feedback loop for adaptive and self-improving robotic control. This work provides both a conceptual foundation and a quantitative roadmap for advancing embodied intelligence through multimodal information fusion across robotic domains. A public repository summarizing models, datasets, and simulators is available at: https://muhayyuddin.github.io/VLAs/ .

  • New
  • Research Article
  • 10.1016/j.bspc.2026.109643
A novel multimodal learning method for predicting treatment resistance in MPO-AAV with lung involvement
  • May 1, 2026
  • Biomedical Signal Processing and Control
  • Yinan Zhang + 5 more

A novel multimodal learning method for predicting treatment resistance in MPO-AAV with lung involvement

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • .
  • .
  • .
  • 10
  • 1
  • 2
  • 3
  • 4
  • 5

Popular topics

  • Latest Artificial Intelligence papers
  • Latest Nursing papers
  • Latest Psychology Research papers
  • Latest Sociology Research papers
  • Latest Business Research papers
  • Latest Marketing Research papers
  • Latest Social Research papers
  • Latest Education Research papers
  • Latest Accounting Research papers
  • Latest Mental Health papers
  • Latest Economics papers
  • Latest Education Research papers
  • Latest Climate Change Research papers
  • Latest Mathematics Research papers

Most cited papers

  • Most cited Artificial Intelligence papers
  • Most cited Nursing papers
  • Most cited Psychology Research papers
  • Most cited Sociology Research papers
  • Most cited Business Research papers
  • Most cited Marketing Research papers
  • Most cited Social Research papers
  • Most cited Education Research papers
  • Most cited Accounting Research papers
  • Most cited Mental Health papers
  • Most cited Economics papers
  • Most cited Education Research papers
  • Most cited Climate Change Research papers
  • Most cited Mathematics Research papers

Latest papers from journals

  • Scientific Reports latest papers
  • PLOS ONE latest papers
  • Journal of Clinical Oncology latest papers
  • Nature Communications latest papers
  • BMC Geriatrics latest papers
  • Science of The Total Environment latest papers
  • Medical Physics latest papers
  • Cureus latest papers
  • Cancer Research latest papers
  • Chemosphere latest papers
  • International Journal of Advanced Research in Science latest papers
  • Communication and Technology latest papers

Latest papers from institutions

  • Latest research from French National Centre for Scientific Research
  • Latest research from Chinese Academy of Sciences
  • Latest research from Harvard University
  • Latest research from University of Toronto
  • Latest research from University of Michigan
  • Latest research from University College London
  • Latest research from Stanford University
  • Latest research from The University of Tokyo
  • Latest research from Johns Hopkins University
  • Latest research from University of Washington
  • Latest research from University of Oxford
  • Latest research from University of Cambridge

Popular Collections

  • Research on Reduced Inequalities
  • Research on No Poverty
  • Research on Gender Equality
  • Research on Peace Justice & Strong Institutions
  • Research on Affordable & Clean Energy
  • Research on Quality Education
  • Research on Clean Water & Sanitation
  • Research on COVID-19
  • Research on Monkeypox
  • Research on Medical Specialties
  • Research on Climate Justice
Discovery logo
FacebookTwitterLinkedinInstagram

Download the FREE App

  • Play store Link
  • App store Link
  • Scan QR code to download FREE App

    Scan to download FREE App

  • Google PlayApp Store
FacebookTwitterTwitterInstagram
  • Universities & Institutions
  • Publishers
  • R Discovery PrimeNew
  • Ask R Discovery
  • Blog
  • Accessibility
  • Topics
  • Journals
  • Open Access Papers
  • Year-wise Publications
  • Recently published papers
  • Pre prints
  • Questions
  • FAQs
  • Contact us
Lead the way for us

Your insights are needed to transform us into a better research content provider for researchers.

Share your feedback here.

FacebookTwitterLinkedinInstagram
Cactus Communications logo

Copyright 2026 Cactus Communications. All rights reserved.

Privacy PolicyCookies PolicyTerms of UseCareers