Abstract

Perception is thought to be shaped by the environments for which organisms are optimized. These influences are difficult to test in biological organisms but may be revealed by machine perceptual systems optimized under different conditions. We investigated environmental and physiological influences on pitch perception, whose properties are commonly linked to peripheral neural coding limits. We first trained artificial neural networks to estimate fundamental frequency from biologically faithful cochlear representations of natural sounds. The best-performing networks replicated many characteristics of human pitch judgments. To probe the origins of these characteristics, we then optimized networks given altered cochleae or sound statistics. Human-like behavior emerged only when cochleae had high temporal fidelity and when models were optimized for naturalistic sounds. The results suggest pitch perception is critically shaped by the constraints of natural environments in addition to those of the cochlea, illustrating the use of artificial neural networks to reveal underpinnings of behavior.

Highlights

  • We developed a model of pitch perception by optimizing artificial neural networks to estimate the fundamental frequency of their acoustic input

  • The networks were trained on simulated auditory nerve representations of speech and music embedded in background noise

Introduction

Through optimization for the training task, the DNNs should learn to use whichever peripheral cues best allow them to extract F0. To make the F0 estimation task more difficult and to simulate naturalistic listening conditions, each speech or instrument excerpt in the training dataset was embedded in natural background noise. The signal-to-noise ratio for each training example was drawn uniformly between −10 dB and +10 dB. Noise source clips were taken from a subset of the AudioSet corpus [78], screened to remove nonstationary sounds (e.g., speech or music). To ensure the F0 estimation task remained well defined for the noisy stimuli, background noise clips were screened for periodicity by computing their autocorrelation functions. Noise clips whose normalized autocorrelation function contained peaks greater than 0.8 at lags greater than 1 ms were excluded.
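The two preprocessing steps above (the autocorrelation-based periodicity screen and mixing at a sampled SNR) can be sketched as follows. This is a minimal illustration, not the authors' code: the function names (`is_aperiodic`, `mix_at_snr`) and the use of `np.correlate` are assumptions; only the 0.8 peak threshold, the 1 ms lag cutoff, and the −10 to +10 dB SNR range come from the text.

```python
import numpy as np

def is_aperiodic(noise, sr, ac_threshold=0.8, min_lag_s=1e-3):
    """Return True if the clip passes the periodicity screen, i.e. its
    normalized autocorrelation has no peak above `ac_threshold` at lags
    greater than `min_lag_s` seconds (0.8 and 1 ms per the text)."""
    x = noise - np.mean(noise)
    # One-sided autocorrelation, normalized so the zero-lag value is 1.
    ac = np.correlate(x, x, mode="full")[len(x) - 1:]
    ac = ac / ac[0]
    min_lag = int(round(min_lag_s * sr))
    return np.max(ac[min_lag:]) < ac_threshold

def mix_at_snr(signal, noise, snr_db):
    """Scale `noise` so the mixture signal + noise has the requested SNR
    in dB, then return the mixture."""
    ps = np.mean(signal ** 2)
    pn = np.mean(noise ** 2)
    scale = np.sqrt(ps / (pn * 10.0 ** (snr_db / 10.0)))
    return signal + scale * noise

# Example: a pure tone fails the screen; white noise passes.
sr = 16000
t = np.arange(int(0.25 * sr)) / sr
tone = np.sin(2 * np.pi * 200.0 * t)
wn = np.random.default_rng(0).standard_normal(len(t))
mixture = mix_at_snr(tone, wn, np.random.default_rng(1).uniform(-10.0, 10.0))
```

In a training pipeline, `snr_db` would be drawn uniformly from [−10, +10] dB per example, and clips failing `is_aperiodic` would be discarded before mixing.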
