What Can Computational Models Learn From Human Selective Attention? A Review From an Audiovisual Unimodal and Crossmodal Perspective.

Di Fu, Matthias Kerzel, Guochun Yang, Pablo Barros, Weizhi Nan, Haiyan Wu, Cornelius Weber, Xun Liu, Stefan Wermter

doi:10.3389/fnint.2020.00010

Abstract

Selective attention plays an essential role in information acquisition and utilization from the environment. In the past 50 years, research on selective attention has been a central topic in cognitive science. Compared with unimodal studies, crossmodal studies are more complex but necessary to solve real-world challenges in both human experiments and computational modeling. Although an increasing number of findings on crossmodal selective attention have shed light on humans' behavioral patterns and neural underpinnings, a much better understanding is still necessary to yield the same benefit for intelligent computational agents. This article reviews studies of selective attention in unimodal visual and auditory and crossmodal audiovisual setups from the multidisciplinary perspectives of psychology and cognitive neuroscience, and evaluates different ways to simulate analogous mechanisms in computational models and robotics. We discuss the gaps between these fields in this interdisciplinary review and provide insights about how to use psychological findings and theories in artificial intelligence from different perspectives.

Highlights

The real world is complex, uncertain, and rich in dynamic, ambiguous stimuli.
Song et al. (2017) conducted an experiment in mice using a task with audiovisual conflicts in which audition was required to dominate vision. They found that when a conflict occurred, co-activation of the primary visual and auditory cortices suppressed the vision-evoked response but maintained the audition-evoked response in the posterior parietal cortex (PPC).
The current review summarizes experimental findings, theories, and modeling approaches for audiovisual unimodal and crossmodal selective attention from the perspectives of psychology, neuroscience, and computer science.

Summary

INTRODUCTION

“The art of being wise is knowing what to overlook.” –William James, 1842-1910. The real world is complex, uncertain, and rich in dynamic, ambiguous stimuli. It is considered to be instinctive and spontaneous and often results in a reflexive saccade (Smith et al., 2004; Styles, 2006). Another point of view distinguishes between “covert” and “overt” orienting of attention: covert attention can attend to events or objects in the absence of eye movements, while overt attention guides the fovea directly to the stimulus with eye or head movements (Posner, 1980). The development and application of technical measurements and methods such as functional magnetic resonance imaging (fMRI), magnetoencephalography (MEG), and state-of-the-art artificial neural networks (ANN) and deep learning (DL) open up a new window for studies on humans, primates, and robots. Such new findings should be evaluated and integrated into the current framework. We aim to integrate the concepts, theories, and behavioral and neural mechanisms of selective attention studied with unimodal and crossmodal experimental designs. We discuss the current limitations and the future trends in the utilization and implications of human selective attention models in artificial intelligence.

DIFFERENT THEORIES AND MODELS OF SELECTIVE ATTENTION

Functional Neural Networks Model

Neural Oscillation Model

Free-Energy Model and Information Theory

Attention Mechanisms in Computer Science

Behavioral and Neural Mechanisms of Human Visual Selective Attention

Computational Models Based on Human Visual Selective Attention

Behavioral and Neural Mechanisms of Human Auditory Selective Attention

Computational Models for the Human Cocktail Party Problem Solution

Behavioral and Neural Mechanisms of Human Crossmodal Selective Attention

Computational Models Simulating Human Crossmodal Selective Attention

CONCLUDING REMARKS AND OUTSTANDING QUESTIONS

Limits Remain in Current Interdisciplinary Research

Future Directions for Interdisciplinary Research


Journal: Frontiers in Integrative Neuroscience
Publication Date: Feb 27, 2020
Citations: 14
License type: CC BY 4.0


