Acoustic modeling of spontaneous speech of Japanese preschool children

Izumi Shindo, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano, Tobias Cincarek

doi:10.1121/1.4787222

Open Access

https://doi.org/10.1121/1.4787222

Abstract

In recent years, demand for speech recognition of children's speech has been increasing. However, recognizing children's speech, especially that of preschool children (2 to 5 years of age), is very difficult. For example, recognition accuracy with the children's acoustic model provided by the Japanese Dictation Toolkit is only 21.4%. The many variations in child speech, including palatal sounds and pronunciation errors, degrade recognition performance. This paper investigates the characteristics of preschool children's speech using experimental data and proposes a recognition method that accounts for phonetic changes. A mapping between standard and altered pronunciations of words is determined. For the experiments, a large amount of spontaneous child speech (2 to 15 years of age) was collected with "Takemaru-kun," a speech-oriented public guidance system that is currently available. Adapting the acoustic model to preschool children's speech raises recognition performance to 49.2%. Allowing multiple pronunciation variations per word during recognition yields a further improvement to 52.0%.
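As a rough illustration of what allowing multiple pronunciation variations per word can look like in a recognition lexicon, the following Python sketch expands each standard pronunciation into a set of variants. It is only a sketch under stated assumptions: the substitution rules in `VARIANT_RULES`, the helper names `expand_pronunciations` and `build_lexicon`, and the example word are hypothetical and are not the mappings or the method reported by the authors.

```python
from itertools import product

# Hypothetical substitution rules: standard phoneme -> allowed realizations.
# Illustrative assumptions only, not the mappings determined in the paper.
VARIANT_RULES = {
    "s": ["s", "ch"],
    "ts": ["ts", "ch"],
    "r": ["r", "d"],
}


def expand_pronunciations(phonemes):
    """Return all pronunciations obtained by applying the rules per phoneme.

    The standard pronunciation comes first because each rule lists the
    standard phoneme before its altered realizations.
    """
    options = [VARIANT_RULES.get(p, [p]) for p in phonemes]
    return [list(combo) for combo in product(*options)]


def build_lexicon(standard_lexicon):
    """Map each word to its standard pronunciation plus generated variants."""
    return {
        word: expand_pronunciations(phonemes)
        for word, phonemes in standard_lexicon.items()
    }


# Hypothetical example entry: one word with its standard phoneme sequence.
standard = {"sakana": ["s", "a", "k", "a", "n", "a"]}
lexicon = build_lexicon(standard)

for word, variants in lexicon.items():
    for variant in variants:
        print(word, " ".join(variant))
```

A decoder that accepts several dictionary entries per word could then match either the standard or an altered form. Note that exhaustive expansion grows quickly for long words, so a practical lexicon would likely keep only variants that are actually observed in the data.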

Journal: The Journal of the Acoustical Society of America
Publication Date: Nov 1, 2006
License type: cc-by-nc-nd
