Multi-label Classification Models for Detection of Phonetic Features in building Acoustic Models

Rupam Ojha, C Chandra Sekhar

doi:10.1109/ijcnn.2019.8851682

Abstract

Acoustic modeling in large vocabulary continuous speech recognition systems is commonly done by building models for subword units such as phonemes, syllables, or senones. In recent years, various end-to-end systems using acoustic models built at the grapheme or phoneme level have also been explored. These systems require large amounts of data, rely heavily on language models or a pronunciation dictionary for good recognition performance, or both. To reduce this dependence on data and external models, we explore the use of phonetic features in building acoustic models for speech recognition. Phonetic features describe a sound in terms of the human speech production mechanism. Multi-label classification models are built to detect the phonetic features in a given speech signal. The detected phonetic features are then used together with the acoustic features as input to models for phoneme identification. The effectiveness of the proposed approach is demonstrated on the TIMIT and Wall Street Journal corpora, where it improves on other phoneme recognition studies that use phonetic features.
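
The two-stage pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration only, not the authors' implementation: the feature inventory, the single linear layer with per-label sigmoid outputs, and the names `PHONETIC_FEATURES`, `detect_phonetic_features`, and `augment_frame` are all assumptions introduced here. The point it shows is that multi-label detection scores each phonetic feature independently (several can be active at once), and that the scores are appended to the acoustic frame before phoneme identification.

```python
import math

# Hypothetical phonetic feature inventory (illustrative only, not the paper's set).
PHONETIC_FEATURES = ["voiced", "nasal", "fricative", "vowel"]


def sigmoid(x):
    """Logistic function mapping a real-valued activation to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def detect_phonetic_features(acoustic_frame, weights, biases):
    """Multi-label detection: one independent sigmoid score per phonetic feature.

    A single linear layer stands in for whatever detector model is actually used;
    `weights` has one row per feature, `biases` one value per feature.
    """
    scores = []
    for w_row, b in zip(weights, biases):
        activation = sum(w * x for w, x in zip(w_row, acoustic_frame)) + b
        scores.append(sigmoid(activation))
    return scores


def augment_frame(acoustic_frame, feature_scores):
    """Append detected phonetic feature scores to the acoustic features.

    The result is the input vector for a downstream phoneme identification model.
    """
    return list(acoustic_frame) + list(feature_scores)


if __name__ == "__main__":
    frame = [0.5, -1.0, 2.0]  # toy 3-dimensional acoustic frame
    weights = [[0.0, 0.0, 0.0] for _ in PHONETIC_FEATURES]  # untrained placeholder
    biases = [0.0 for _ in PHONETIC_FEATURES]
    scores = detect_phonetic_features(frame, weights, biases)
    print(augment_frame(frame, scores))
```

Unlike a softmax over mutually exclusive classes, the per-label sigmoid lets a frame be scored as, say, both voiced and nasal, which is what makes the detection problem multi-label.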

Similar Papers

Deep Belief Neural Networks and Bidirectional Long-Short Term Memory Hybrid for Speech Recognition
Łukasz Brocki ... Krzysztof Marasek
Archives of Acoustics | VOL. 40
01 Jun 2015

Chapter 4 - Multilingual Acoustic Modeling
Tanja Schultz
Multilingual Speech Processing
01 Jan 2006

Graph-based semi-supervised acoustic modeling in DNN-based speech recognition
Yuzong Liu ... Katrin Kirchhoff
01 Dec 2014

Language-independent and language-adaptive acoustic modeling for speech recognition
Tanja Schultz ... Alex Waibel
Speech Communication | VOL. 35
25 Jun 2001
