A Dynamical Model of Pitch Memory Provides an Improved Basis for Implied Harmony Estimation.

Ji Chul Kim

doi:10.3389/fpsyg.2017.00666

Abstract

Tonal melody can imply vertical harmony through a sequence of tones. Current methods for automatic chord estimation commonly use chroma-based features extracted from audio signals. However, the implied harmony of unaccompanied melodies can be difficult to estimate on the basis of chroma content in the presence of frequent nonchord tones. Here we present a novel approach to automatic chord estimation based on the human perception of pitch sequences. We use cohesion and inhibition between pitches in auditory short-term memory to differentiate chord tones and nonchord tones in tonal melodies. We model short-term pitch memory as a gradient frequency neural network, which is a biologically realistic model of auditory neural processing. The model is a dynamical system consisting of a network of tonotopically tuned nonlinear oscillators driven by audio signals. The oscillators interact with each other through nonlinear resonance and lateral inhibition, and the pattern of oscillatory traces emerging from the interactions is taken as a measure of pitch salience. We test the model with a collection of unaccompanied tonal melodies to evaluate it as a feature extractor for chord estimation. We show that chord tones are selectively enhanced in the response of the model, thereby increasing the accuracy of implied harmony estimation. We also find that, like other existing features for chord estimation, the performance of the model can be improved by using segmented input signals. We discuss possible ways to expand the present model into a full chord estimation system within the dynamical systems framework.

Highlights

Melody is a succession of pitched sounds arranged to form a coherent musical pattern (Bingham, 1910; Apel, 1969)
The model is driven by audio signals, and acoustic frequencies are transformed into a complex pattern of oscillations which we take as a measure of pitch salience
We describe the structure and function of the shortterm pitch memory model with an example. (The differential equations governing the dynamics of the model are given below, along with the parameter values used in this study, but understanding of the mathematical details is not required to comprehend the results and implications of this study.) The model consists of two layers of nonlinear oscillators tuned to a chromatic scale (Figure 1)

Summary

INTRODUCTION

Melody is a succession of pitched sounds arranged to form a coherent musical pattern (Bingham, 1910; Apel, 1969). To model the interaction of melodic pitches in auditory memory, we use a network of tonotopically tuned nonlinear oscillators. The model is driven by audio signals, and acoustic frequencies are transformed into a complex pattern of oscillations which we take as a measure of pitch salience. We use a generic mathematical form of nonlinear oscillation, called the canonical model, which describes oscillatory activities with complex-valued state variables (Kim and Large, 2015). When two Layer 2 oscillators in a simple frequency relationship have high amplitudes at the same time, the plastic connections between them quickly strengthen and let the oscillators reinforce each other through nonlinear resonance (i.e., mode-locking). Let us discuss how the pitch memory model can improve the estimation of implied harmony by selectively enhancing chord tones over nonchord tones. We are developing a GrFNN pitch estimator, and the future versions of the present model will include a pitch estimator and be able to handle signals containing complex sounds

Methods

Results and Discussion

GENERAL DISCUSSION

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Psychology	Publication Date: May 4, 2017
Citations: 5	License type: cc-by

R Discovery Prime

R Discovery Prime

A Dynamical Model of Pitch Memory Provides an Improved Basis for Implied Harmony Estimation.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Psychology

Lead the way for us

Similar Papers

Improving Audio Chord Estimation by Alignment and Integration of Crowd-Sourced Symbolic Music
Daphne Odekerken ... Hendrik Vincent Koops
Transactions of the International Society for Music Information Retrieval | VOL. 4
Daphne Odekerken, et. al.Daphne Odekerken ... Hendrik Vincent Koops
09 Nov 2021
Transactions of the International Society for Music Information Retrieval | VOL. 4

Semi-Supervised Neural Chord Estimation Based on a Variational Autoencoder With Latent Chord Labels and Features
Yiming Wu ... Eita Nakamura
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28
Yiming Wu, et. al.Yiming Wu ... Eita Nakamura
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28

Understanding Effects of Subjectivity in Measuring Chord Estimation Accuracy
Yizhao Ni ... Raul Santos-Rodriguez
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 21
Yizhao Ni, et. al.Yizhao Ni ... Raul Santos-Rodriguez
01 Dec 2013
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 21

Automatic Chord estimation on seventhsbass Chord vocabulary using deep neural network
Junqi Deng ... Yu-Kwong Kwok
-
Junqi Deng, et. al.Junqi Deng ... Yu-Kwong Kwok
01 Mar 2016
01 Mar 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Dynamical Model of Pitch Memory Provides an Improved Basis for Implied Harmony Estimation.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Psychology