Technique for Phoneme Set Selection for Automatic Russian Speech Recognition

Daria Vazhenina, Ирина Сергеевна Кипяткова, Konstantin Markov, Алексей Анатольевич Карпов

doi:10.15622/sp.36.6

Abstract

This paper describes the selection of the best phoneme set for Russian automatic speech recognition. For acoustic modeling, we describe a method that combines knowledge-based and statistical approaches to create several different phoneme sets. Applying this method to the Russian phonetic set of the International Phonetic Alphabet (IPA), we first reduced it to 47 phonological units and then derived several other phoneme sets containing between 27 and 47 phonological units. Speech recognition experiments with these sets showed that the reduced phoneme sets are better for the phoneme recognition task and equally good for word-level speech recognition. For the experiment with an extra-large vocabulary, we used a syntactico-statistical language model, which allowed us to achieve a word recognition accuracy of 73.1%. These results are comparable to the continuous Russian speech recognition quality reported to date by other organizations.

Highlights

We describe a method that combines knowledge-based and statistical approaches to create several different phoneme sets
Applying this method to the Russian phonetic set of the International Phonetic Alphabet (IPA), we first reduced it to 47 phonological units and then derived several other phoneme sets containing between 27 and 47 phonological units
The syntactico-statistical language model was created by adding grammatically connected word pairs that were separated by other words in the training corpus to the baseline bigram model
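
The last highlight can be illustrated with a minimal sketch. It assumes that the grammatical links have already been produced by some syntactic parser and are passed in as word-index pairs; the function names, the input format, and the choice to store each added pair in left-to-right word order are all assumptions made for illustration, not details taken from the paper.

```python
from collections import Counter


def bigram_counts(sentences):
    """Count adjacent word pairs (the baseline bigram statistics)."""
    counts = Counter()
    for words in sentences:
        counts.update(zip(words, words[1:]))
    return counts


def add_syntactic_pairs(counts, sentences, links):
    """Return a copy of `counts` extended with long-distance word pairs.

    `links` holds, for each sentence, a list of (i, j) word-index pairs
    that a parser marked as grammatically connected (hypothetical format).
    Only pairs separated by at least one other word are added, because
    adjacent pairs are already covered by the baseline bigrams.
    """
    extended = Counter(counts)
    for words, sentence_links in zip(sentences, links):
        for i, j in sentence_links:
            left, right = sorted((i, j))
            if right - left > 1:
                extended[(words[left], words[right])] += 1
    return extended


# Toy example: "cat" and "ran" are linked but separated by "quickly".
sentences = [["the", "cat", "quickly", "ran"]]
links = [[(1, 3), (0, 1)]]
baseline = bigram_counts(sentences)
extended = add_syntactic_pairs(baseline, sentences, links)
```

In this toy example the adjacent link (0, 1) adds nothing new, while the separated pair ("cat", "ran") appears only in the extended counts. How the added pairs are weighted and smoothed in the actual model is not stated in this text.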

Summary

Phonetic units

Using triphones can lead to a shortage of training data. Contexts are usually clustered with decision trees [14,15], using questions about whether the left or right phoneme has certain phonetic features (for example, whether the left/right phoneme is voiced). Table 2 lists the main differences between the decision tree question sets for English and Russian. The decision tree for Russian consists of 38 general questions plus one question for each unit of the phoneme set, asked separately for the left and the right context. The size of the phoneme set determines the number of context-independent models and also affects the number of context-dependent models. If their number is too small, system accuracy may drop, because acoustically similar models will be misrecognized (confused) more often.

Differences between the decision tree question sets for English and Russian

Questions added for Russian

Stressed, Unstressed

Number of phonological units

Number of states

Phoneme set
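
To make the effect of phoneme set size concrete, the sketch below computes a few quantities for a set of a given size. The `l-c+r` triphone naming and the `L_`/`R_` question patterns follow a common HTK-style convention and are an assumption here, as are the function names; the cubic triphone count is only a rough upper bound that ignores silence models and phoneme combinations that never occur. The 38 general questions mentioned in the text are not included, since the text does not say whether that figure counts left and right variants separately.

```python
def unit_questions(phonemes):
    """Build one left-context and one right-context question per unit.

    Each question is a (name, pattern) pair; the pattern matches triphone
    names written as left-center+right (assumed naming convention).
    """
    questions = []
    for p in phonemes:
        questions.append((f"L_{p}", f"{p}-*"))   # is the left phoneme p?
        questions.append((f"R_{p}", f"*+{p}"))   # is the right phoneme p?
    return questions


def inventory(n_units):
    """Summarize how model and question counts scale with set size."""
    return {
        "context_independent": n_units,          # one model per unit
        "triphone_upper_bound": n_units ** 3,    # every left/center/right combination
        "unit_questions": 2 * n_units,           # left + right question per unit
    }


# The smallest and largest set sizes mentioned in the abstract.
small = inventory(27)
large = inventory(47)
```

Under these assumptions, moving from 27 to 47 units raises the upper bound on distinct triphones from 19683 to 103823, which illustrates why a larger set makes training-data scarcity and context clustering more pressing.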


Journal: SPIIRAS Proceedings
Publication Date: Nov 14, 2014
License type: cc-by

