The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion

Zexin Cai, Ming Li, Danwei Cai, Xinzhong Liu, Haibin Zhong, Xiaoyi Qin

doi:10.1109/iscslp.2018.8706629

Abstract

This paper presents the acquisition of the Duke Kunshan University Jinan University Electromagnetic Articulography (DKU-JNU-EMA) database, which provides aligned acoustic and articulatory data on Mandarin and Chinese dialects. The database currently includes data from multiple speakers in Mandarin and three Chinese dialects, namely Cantonese, Hakka, and Teochew. There are 2–7 native speakers for each language or dialect. Acoustic data is recorded with one head-mounted close-talk microphone, while articulatory data is captured with the NDI Wave electromagnetic articulography research system. The DKU-JNU-EMA database is now in preparation for public release to help advance research in acoustic-to-articulatory inversion, speech production, dialect recognition, and experimental phonetics. Along with the database, we propose an acoustic-to-articulatory inversion baseline using deep neural networks. Moreover, we show that concatenating the dimension-reduced phoneme posterior probability feature with MFCC features at the feature level, as a tandem feature, enhances the performance of the inversion system.
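The tandem feature described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the phoneme posteriors are reduced or what the feature dimensions are, so everything here is an assumption for illustration: PCA on log-posteriors stands in for the dimension reduction, the sizes (13 MFCCs, 40 phoneme classes, 10 retained components) are hypothetical, and the inputs are random placeholder data rather than anything from the database. The helper names `pca_reduce` and `tandem_features` are likewise invented for this example.

```python
import numpy as np


def pca_reduce(features, n_components):
    """Project frames (rows) onto the top principal components."""
    centered = features - features.mean(axis=0, keepdims=True)
    # SVD of the centered data; the rows of vt are the principal directions.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


def tandem_features(mfcc, posteriors, n_components=10, eps=1e-8):
    """Concatenate MFCCs with dimension-reduced phoneme posteriors, frame by frame.

    Assumption: PCA on log-posteriors is used as the reduction step; the
    source abstract does not specify the method.
    """
    if mfcc.shape[0] != posteriors.shape[0]:
        raise ValueError("MFCC and posterior streams must have the same number of frames")
    log_post = np.log(posteriors + eps)  # eps guards against log(0)
    reduced = pca_reduce(log_post, n_components)
    return np.concatenate([mfcc, reduced], axis=1)


# Placeholder inputs (hypothetical sizes, random values).
rng = np.random.default_rng(0)
n_frames = 200
mfcc = rng.standard_normal((n_frames, 13))
logits = rng.standard_normal((n_frames, 40))
posteriors = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)

tandem = tandem_features(mfcc, posteriors, n_components=10)
print(tandem.shape)  # (200, 23)
```

In a setup like this, each tandem frame would then serve as the input to the inversion network in place of the MFCC frame alone; the two streams must be frame-aligned before concatenation.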

Similar Papers

Determining an Optimal Set of Flesh Points on Tongue, Lips, and Jaw for Continuous Silent Speech Recognition
Jun Wang ... Seongjun Hahm
01 Jan 2015

Integration of articulatory and spectrum features based on the hybrid HMM/BN modeling framework
Konstantin Markov ... Satoshi Nakamura
Speech Communication | VOL. 48
15 Aug 2005

The Electromagnetic Articulography Mandarin Accented English (EMA-MAE) corpus of acoustic and 3D articulatory kinematic data
An Ji ... Michael T Johnson
01 May 2014

Congruence of articulatory and acoustic variability
Alice Faber ... Julie M Brown
The Journal of the Acoustical Society of America | VOL. 101
01 May 1997
