Abstract

Music shares many similarities with language. Since most languages have clear syntactic structures (e.g., words must be arranged in a particular order, such as SVO), linguists have proposed various grammar theories through careful manual investigation of language data. The same holds for Western music. Although a single musical note (cf. a letter) has no meaning by itself, a cluster or pattern of multiple notes on the quantized time-frequency grid (cf. a word) can evoke an impression, and such short patterns are concatenated or superimposed (a property unique to music) to produce more complex meaning (cf. a sentence). We introduce several attempts to discover the latent structures underlying music from acoustic or symbolic data (music signals or musical scores) in an unsupervised manner. By integrating statistical acoustic and language models as in speech recognition, for example, it is possible not only to transcribe music but also to discover that particular note combinations form chords. A key feature of this approach is that both models are trained jointly from acoustic data alone. Recently, we have attempted to induce music grammars from polyphonic scores by leveraging state-of-the-art natural language processing techniques. This would contribute to automatic music transcription and computational musicology.
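To give a rough feel for how an acoustic model and a language model can be integrated at decoding time, the sketch below combines a per-frame acoustic log-likelihood with chord-transition probabilities (playing the role of the language model) in Viterbi decoding. This is a minimal illustration under assumed inputs, not the authors' actual method; the chord vocabulary size and the random toy models are hypothetical.

```python
# Hypothetical sketch: Viterbi decoding that fuses a per-frame acoustic
# likelihood with a chord-transition "language model" prior.
import numpy as np

def viterbi_decode(acoustic_loglik, log_trans, log_init):
    """acoustic_loglik: (T, K) log p(frame_t | chord_k)  -- acoustic model
       log_trans:       (K, K) log p(chord_j | chord_i)  -- language model
       log_init:        (K,)   log p(chord_k) at t = 0
       Returns the most probable chord-label sequence of length T."""
    T, K = acoustic_loglik.shape
    delta = log_init + acoustic_loglik[0]          # best score ending in each state
    backptr = np.zeros((T, K), dtype=int)
    for t in range(1, T):
        scores = delta[:, None] + log_trans        # (prev state, next state)
        backptr[t] = scores.argmax(axis=0)
        delta = scores.max(axis=0) + acoustic_loglik[t]
    path = np.zeros(T, dtype=int)
    path[-1] = delta.argmax()
    for t in range(T - 2, -1, -1):                 # trace back the best path
        path[t] = backptr[t + 1, path[t + 1]]
    return path

# Toy usage with K = 4 hypothetical chord classes and T = 8 frames.
rng = np.random.default_rng(0)
acoustic = np.log(rng.dirichlet(np.ones(4), size=8))
trans = np.log(rng.dirichlet(np.ones(4), size=4))
init = np.log(np.full(4, 0.25))
print(viterbi_decode(acoustic, trans, init))
```

In the unsupervised setting described in the abstract, both the acoustic emission model and the transition model would be estimated jointly from the audio itself rather than supplied as fixed inputs as in this toy example.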
