Syllable Error Rate Research Articles

Automatic speech recognition (ASR) can potentially help older adults and people with disabilities reduce their dependence on others and increase their participation in society. However, maxillectomy patients with reduced speech intelligibility may encounter some problems using such technologies. To investigate the accuracy of three commonly used ASR platforms when used by Japanese maxillectomy patients with and without their obturator placed. Speech samples were obtained from 29 maxillectomy patients with and without their obturator and 17 healthy volunteers. The samples were input into three speaker-independent speech recognition platforms and the transcribed text was compared with the original text to calculate the syllable error rate (SER). All participants also completed a conventional speech intelligibility test to grade their speech using Taguchi's method. A comprehensive articulation assessment of patients without their obturator was also performed. Significant differences in SER were observed between healthy and maxillectomy groups. Maxillectomy patients with an obturator showed a significant negative correlation between speech intelligibility scores and SER. However, for those without an obturator, no significant correlations were observed. Furthermore, for maxillectomy patients without an obturator, significant differences were found between syllables grouped by vowels. Syllables containing /i/, /u/ and /e/ exhibited higher error rates compared to those containing /a/ and /o/. Additionally, significant differences were observed when syllables were grouped by consonant place of articulation and manner of articulation. The three platforms performed well for healthy volunteers and maxillectomy patients with their obturator, but the SER for maxillectomy patients without their obturator was high, rendering the platforms unusable. System improvement is needed to increase accuracy for maxillectomy patients.

Read full abstract

In this article we present the design and the development of a knowledge based computational linguistic tool, Mlphon [em.el.foːɳ] for Malayalam language. Mlphon computationally models linguistic rules using finite state transducers and performs multiple functions including grapheme to phoneme (g2p) and phoneme to grapheme (p2g) conversions, syllabification, phonetic feature analysis and script grammar check. This open source software tool, released under MIT license, is developed as a one-stop solution to handle different speech related text processing tasks for automatic speech recognition, text to speech synthesis and non-speech natural language processing tasks including syllable subword based language modeling, phoneme diversity analysis and text sanity check. The tool is evaluated on a manually crafted gold standard lexicon. Mlphon performs orthographic syllabification with 99% accuracy with a syllable error rate of 0.62% on the gold standard lexicon. For grapheme to phoneme conversion task, overall phoneme recognition accuracy of 99% with a phoneme error rate of 0.55% is obtained on gold standard lexicon. Additionally an extrinsic evaluation of Mlphon is performed by employing the pronunciation lexicon created using Mlphon, in Malayalam automatic speech recognition (ASR) task. Performance analysis in terms of the computation time of lexicon creation process and the word error rate (WER) on ASR task are presented along with a comparison over other automated tools for lexicon creation. Pronunciation lexicons with more than 100k commonly used Malayalam words in phonemised and syllabified forms is created and they are published as open language resources along with this work. We also demonstrate the usage of Mlphon on different natural language processing applications - syllable subword ASR, assisted pronunciation learning, phoneme diversity analysis and text sanity check. Being a knowledge based solution with open source code, Mlphon can be adapted to other languages of similar script nature.

Read full abstract

Syllable Error Rate Research Articles

Related Topics

Articles published on Syllable Error Rate

Maxillectomy patients' speech and performance of contemporary speaker-independent automatic speech recognition platforms in Japanese.

Comparison of Automatic Speech Recognition System for School-aged Children’s Narratives: Naver Clova Speech and Google Speech-to-Text

Augmented-syllabification of n-gram tagger for Indonesian words and named-entities

Multi-Task Transformer with Adaptive Cross-Entropy Loss for Multi-Dialect Speech Recognition

ASR - VLSP 2021: An Efficient Transformer-based Approach for Vietnamese ASR Task

VLSP 2021 - ASR Challenge for Vietnamese Automatic Speech Recognition

ASR - VLSP 2021: Semi-supervised Ensemble Model for Vietnamese Automatic Speech Recognition

VLSP 2021 - TTS Challenge: Vietnamese Spontaneous Speech Synthesis

Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers

Syllabification Model of Indonesian Language Named-Entity Using Syntactic n-Gram

Phonological similarity-based backoff smoothing to boost a bigram syllable boundary detection

Flipping onsets to enhance syllabification

Incorporating syllabification points into a model of grapheme-to-phoneme conversion

Improving Myanmar Automatic Speech Recognition with Optimization of Convolutional Neural Network Parameters

IMPROVING MYANMAR AUTOMATIC SPEECH RECOGNITION WITH OPTIMIZATION OF CONVOLUTIONAL NEURAL NETWORK PARAMETERS

Indonesian graphemic syllabification using a nearest neighbour classifier and recovery procedure

Indonesian syllabification using a pseudo nearest neighbour rule and phonotactic knowledge

Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition

Articulation-Disordered Speech Recognition Using Speaker-Adaptive Acoustic Models and Personalized Articulation Patterns

Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Syllable Error Rate Research Articles

Related Topics

Articles published on Syllable Error Rate

Maxillectomy patients' speech and performance of contemporary speaker-independent automatic speech recognition platforms in Japanese.

Comparison of Automatic Speech Recognition System for School-aged Children’s Narratives: Naver Clova Speech and Google Speech-to-Text

Augmented-syllabification of n-gram tagger for Indonesian words and named-entities

Multi-Task Transformer with Adaptive Cross-Entropy Loss for Multi-Dialect Speech Recognition

ASR - VLSP 2021: An Efficient Transformer-based Approach for Vietnamese ASR Task

VLSP 2021 - ASR Challenge for Vietnamese Automatic Speech Recognition

ASR - VLSP 2021: Semi-supervised Ensemble Model for Vietnamese Automatic Speech Recognition

VLSP 2021 - TTS Challenge: Vietnamese Spontaneous Speech Synthesis

Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers

Syllabification Model of Indonesian Language Named-Entity Using Syntactic n-Gram

Phonological similarity-based backoff smoothing to boost a bigram syllable boundary detection

Flipping onsets to enhance syllabification

Incorporating syllabification points into a model of grapheme-to-phoneme conversion

Improving Myanmar Automatic Speech Recognition with Optimization of Convolutional Neural Network Parameters

IMPROVING MYANMAR AUTOMATIC SPEECH RECOGNITION WITH OPTIMIZATION OF CONVOLUTIONAL NEURAL NETWORK PARAMETERS

Indonesian graphemic syllabification using a nearest neighbour classifier and recovery procedure

Indonesian syllabification using a pseudo nearest neighbour rule and phonotactic knowledge

Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition

Articulation-Disordered Speech Recognition Using Speaker-Adaptive Acoustic Models and Personalized Articulation Patterns

Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language