Audiovisual Training Research Articles

Communication in the real world is inherently multimodal. When having a conversation, typically sighted and hearing people use both auditory and visual cues to understand one another. For example, objects may make sounds as they move in space, or we may use the movement of a person's mouth to better understand what they are saying in a noisy environment. Still, many neuroscience experiments rely on unimodal stimuli to understand encoding of sensory features in the brain. The extent to which visual information may influence encoding of auditory information and vice versa in natural environments is thus unclear. Here, we addressed this question by recording scalp electroencephalography (EEG) in 11 subjects as they listened to and watched movie trailers in audiovisual (AV), visual (V) only, and audio (A) only conditions. We then fit linear encoding models that described the relationship between the brain responses and the acoustic, phonetic, and visual information in the stimuli. We also compared whether auditory and visual feature tuning was the same when stimuli were presented in the original AV format versus when visual or auditory information was removed. In these stimuli, visual and auditory information was relatively uncorrelated, and included spoken narration over a scene as well as animated or live-action characters talking with and without their face visible. For this stimulus, we found that auditory feature tuning was similar in the AV and A-only conditions, and similarly, tuning for visual information was similar when stimuli were presented with the audio present (AV) and when the audio was removed (V only). In a cross prediction analysis, we investigated whether models trained on AV data predicted responses to A or V only test data similarly to models trained on unimodal data. Overall, prediction performance using AV training and V test sets was similar to using V training and V test sets, suggesting that the auditory information has a relatively smaller effect on EEG. In contrast, prediction performance using AV training and A only test set was slightly worse than using matching A only training and A only test sets. This suggests the visual information has a stronger influence on EEG, though this makes no qualitative difference in the derived feature tuning. In effect, our results show that researchers may benefit from the richness of multimodal datasets, which can then be used to answer more than one research question.

Read full abstract

The integration of information from different sensory modalities is a fundamental process that enhances perception and performance in real and virtual environments (VR). Understanding these mechanisms, especially during learning tasks that exploit novel multisensory cue combinations provides opportunities for the development of new rehabilitative interventions.This study aimed to investigate how functional brain changes support behavioural performance improvements during an audio-visual (AV) learning task. Twenty healthy participants underwent a 30 min daily VR training for four weeks. The task was an AV adaptation of a ‘scanning training’ paradigm that is commonly used in hemianopia rehabilitation. Functional magnetic resonance imaging (fMRI) and performance data were collected at baseline, after two and four weeks of training, and four weeks post-training.We show that behavioural performance, operationalised as mean reaction time reduction in VR, significantly improves. In separate tests in a controlled laboratory environment, we showed that the behavioural performance gains in the VR training environment transferred to a significant mean RT reduction for the trained AV voluntary task on a computer screen. Enhancements were observed in both the visual-only and AV conditions, with the latter demonstrating a faster response time supported by the presence of audio cues. The behavioural learning effect also transfers to two additional tasks that were tested: a visual search task and an involuntary visual task.Our fMRI results reveal an increase in functional activation (BOLD signal) in multisensory brain regions involved in early-stage AV processing: the thalamus, the caudal inferior parietal lobe and cerebellum. These functional changes were only observed for the trained, multisensory, task and not for unimodal visual stimulation. Functional activation changes in the thalamus were significantly correlated to behavioural performance improvements.This study demonstrates that incorporating spatial auditory cues to voluntary visual training in VR leads to augmented brain activation changes in multisensory integration, resulting in measurable performance gains across tasks. The findings highlight the potential of VR-based multisensory training as an effective method for enhancing cognitive function and as a potentially valuable tool in rehabilitative programmes.

Read full abstract

Audiovisual Training Research Articles

Related Topics

Articles published on Audiovisual Training

A comparison of EEG encoding models using audiovisual stimuli and their unimodal counterparts.

Audio-visual training and feedback to learn touch-based gestures

Perceptual Adaptation to Noise-Vocoded Speech by Lip-Read Information: No Difference between Dyslexic and Typical Readers.

The Effect of Providing Education to Patients Undergoing Coronary Angiography on Vital Signs

Assessment of a Teaching Module for Cardiac Auscultation of Horses by Veterinary Students.

Enhancing learning outcomes through multisensory integration: A fMRI study of audio-visual training in virtual reality

Performance of two educational approaches in increasing knowledge of high-school students about COVID-19 during the first wave of pandemic

Feasibility and acceptability of online opioid overdose education and naloxone distribution: Study protocol and preliminary results from a randomized pilot clinical trial

Audiovisual Training in Virtual Reality Improves Auditory Spatial Adaptation in Unilateral Hearing Loss Patients.

Feasibility of Developing Audiovisual Material for Training Needs in a Vietnam Orphanage: A Mixed-Method Design

The effects of the presence of a face and direct eye gaze on voice identity learning.

Prospective Media Translators in Audio-Visual Training: Towards a Critical Discourse Analysis of Gender-Bias in Subtitling

Design and Evaluation of a School-based Sustained Attention Training Program with Parental Involvement for Preschoolers in Rural China

Learning challenging L2 sounds via computer‐assisted training: Audiovisual training with an airflow model

Audio-Visual Training Improves Awareness and Willingness of Cervical Cancer Screening among Healthy Indian Women: Findings from a Survey.

Thanks or Tanks: Training with Tactile Cues Improves Learners’ Accuracy of English Interdental Consonants in an Oral Reading Task

Reducing the hemodialysis patient stress level through progressive relaxation

Lipreading: A Review of Its Continuing Importance for Speech Recognition With an Acquired Hearing Loss and Possibilities for Effective Training.

Patients With Breast Cancer Receiving Chemotherapy: Effects of Multisensory Stimulation Training on Cognitive Impairment.

African-American Lay Pastoral Care Facilitators’ Perspectives on Dementia Caregiver Education and Training

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Audiovisual Training Research Articles

Related Topics

Articles published on Audiovisual Training

A comparison of EEG encoding models using audiovisual stimuli and their unimodal counterparts.

Audio-visual training and feedback to learn touch-based gestures

Perceptual Adaptation to Noise-Vocoded Speech by Lip-Read Information: No Difference between Dyslexic and Typical Readers.

The Effect of Providing Education to Patients Undergoing Coronary Angiography on Vital Signs

Assessment of a Teaching Module for Cardiac Auscultation of Horses by Veterinary Students.

Enhancing learning outcomes through multisensory integration: A fMRI study of audio-visual training in virtual reality

Performance of two educational approaches in increasing knowledge of high-school students about COVID-19 during the first wave of pandemic

Feasibility and acceptability of online opioid overdose education and naloxone distribution: Study protocol and preliminary results from a randomized pilot clinical trial

Audiovisual Training in Virtual Reality Improves Auditory Spatial Adaptation in Unilateral Hearing Loss Patients.

Feasibility of Developing Audiovisual Material for Training Needs in a Vietnam Orphanage: A Mixed-Method Design

The effects of the presence of a face and direct eye gaze on voice identity learning.

Prospective Media Translators in Audio-Visual Training: Towards a Critical Discourse Analysis of Gender-Bias in Subtitling

Design and Evaluation of a School-based Sustained Attention Training Program with Parental Involvement for Preschoolers in Rural China

Learning challenging L2 sounds via computer‐assisted training: Audiovisual training with an airflow model

Audio-Visual Training Improves Awareness and Willingness of Cervical Cancer Screening among Healthy Indian Women: Findings from a Survey.

Thanks or Tanks: Training with Tactile Cues Improves Learners’ Accuracy of English Interdental Consonants in an Oral Reading Task

Reducing the hemodialysis patient stress level through progressive relaxation

Lipreading: A Review of Its Continuing Importance for Speech Recognition With an Acquired Hearing Loss and Possibilities for Effective Training.

Patients With Breast Cancer Receiving Chemotherapy: Effects of Multisensory Stimulation Training on Cognitive Impairment.

African-American Lay Pastoral Care Facilitators’ Perspectives on Dementia Caregiver Education and Training