Connecting Deep Neural Networks to Physical, Perceptual, and Electrophysiological Auditory Signals.

Nicholas Huang,Mounya Elhilali,Malcolm Slaney

doi:10.3389/fnins.2018.00532

Nicholas Huang, Mounya Elhilali + Show 1 more

Open Access

https://doi.org/10.3389/fnins.2018.00532

Copy DOI

Abstract

Deep neural networks have been recently shown to capture intricate information transformation of signals from the sensory profiles to semantic representations that facilitate recognition or discrimination of complex stimuli. In this vein, convolutional neural networks (CNNs) have been used very successfully in image and audio classification. Designed to imitate the hierarchical structure of the nervous system, CNNs reflect activation with increasing degrees of complexity that transform the incoming signal onto object-level representations. In this work, we employ a CNN trained for large-scale audio object classification to gain insights about the contribution of various audio representations that guide sound perception. The analysis contrasts activation of different layers of a CNN with acoustic features extracted directly from the scenes, perceptual salience obtained from behavioral responses of human listeners, as well as neural oscillations recorded by electroencephalography (EEG) in response to the same natural scenes. All three measures are tightly linked quantities believed to guide percepts of salience and object formation when listening to complex scenes. The results paint a picture of the intricate interplay between low-level and object-level representations in guiding auditory salience that is very much dependent on context and sound category.

Highlights

Over the past few years, convolutional neural networks (CNNs) have revolutionized machine perception, in the domains of image understanding, speech and audio recognition, and multimedia analytics (Krizhevsky et al, 2012; Karpathy et al, 2014; Cai and Xia, 2015; Simonyan and Zisserman, 2015; He et al, 2016; Hershey et al, 2017; Poria et al, 2017)
A CNN is a form of a deep neural network (DNN) where most of the computation are done with trainable kernel that are slid over the entire input
The current study leverages the complex hierarchy afforded by CNNs trained on audio classification to explore parallels between network activation and auditory salience in natural sounds measured through a variety of modalities

Summary

Introduction

Over the past few years, convolutional neural networks (CNNs) have revolutionized machine perception, in the domains of image understanding, speech and audio recognition, and multimedia analytics (Krizhevsky et al, 2012; Karpathy et al, 2014; Cai and Xia, 2015; Simonyan and Zisserman, 2015; He et al, 2016; Hershey et al, 2017; Poria et al, 2017). A CNN is a form of a deep neural network (DNN) where most of the computation are done with trainable kernel that are slid over the entire input. These networks implement hierarchical architectures that mimic the biological structure of the human sensory system. They are organized in a series of processing layers that perform different transformations of the incoming signal, “learning” information in a distributed topology. By constraining the selectivity of units in these layers, nodes in the network have emergent “receptive fields,” allowing them to learn from local information in the input and structure processing in a distributed way; much like neurons in the brain have receptive fields with localized connectivity organized in topographic

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Neuroscience	Publication Date: Aug 14, 2018
Citations: 15	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Connecting Deep Neural Networks to Physical, Perceptual, and Electrophysiological Auditory Signals.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Neuroscience

Lead the way for us

Similar Papers

Deep distributed convolutional neural networks: Universality
Ding-Xuan Zhou
Analysis and Applications | VOL. 16
Ding-Xuan ZhouDing-Xuan Zhou
01 Nov 2018
Analysis and Applications | VOL. 16

Research on improved convolutional wavelet neural network
Jingwei Liu ... Jiaxin Li
Scientific Reports | VOL. 11
Jingwei Liu, et. al.Jingwei Liu ... Jiaxin Li
09 Sep 2021
Scientific Reports | VOL. 11

Brain tumor segmentation with deep convolutional symmetric neural network
Hao Chen ... Zhen Qin
Neurocomputing | VOL. 392
Hao Chen, et. al.Hao Chen ... Zhen Qin
24 Apr 2019
Neurocomputing | VOL. 392

Absolute distance measurement based on laser self-mixing interferometry and deep neural network
Jinyuan Yuan ... Werner H Hofmann
-
Jinyuan Yuan, et. al.Jinyuan Yuan ... Werner H Hofmann
27 Dec 2022
27 Dec 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Connecting Deep Neural Networks to Physical, Perceptual, and Electrophysiological Auditory Signals.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Neuroscience