Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech

Krerksak Likitsupin,Chai Wutiwiwatchai,Atiwong Suchato,Proadpran Punyabukkana

doi:10.4186/ej.2016.20.2.179

Krerksak Likitsupin, Chai Wutiwiwatchai + Show 2 more

Open Access

https://doi.org/10.4186/ej.2016.20.2.179

Copy DOI

Abstract

Segment-based speech recognition has shown to be a competitive alternative to the state-of-the-art HMM-based techniques. Its accuracies rely heavily on the quality of the segment graph from which the recognizer searches for the most likely recognition hypotheses. In order to increase the inclusion rate of actual segments in the graph, it is important to recover possible missing segments generated by segment-based segmentation algorithm. An aspect of this research focuses on determining the missing segments due to missed detection of segment boundaries. The acoustic discontinuities, together with manner-distinctive features are utilized to recover the missing segments. Another aspect of improvement to our segment-based framework tackles the restriction of having limited amount of training speech data which prevents the usage of more complex covariance matrices for the acoustic models. Feature dimensional reduction in the form of the Principal Component Analysis (PCA) is applied to enable the training of full covariance matrices and it results in improved segment-based phoneme recognition. Furthermore, to benefit from the fact that segment-based approach allows the integration of phonetic knowledge, we incorporate the probability of each segment being one type of sound unit of a certain specific common manner of articulation into the scoring of the segment graphs. Our experiment shows that, with the proposed improvements, our segment-based framework approximately increases the phoneme recognition accuracy by approximately 25% of the one obtained from the baseline segment-based speech recognition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Engineering Journal	Publication Date: May 18, 2016
Citations: 24	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech

Abstract

Talk to us

Similar Papers

More From: Engineering Journal

Lead the way for us

Similar Papers

A method of feature fusion and dimension reduction for knee joint pathology screening and separability evaluation criteria
Chunyi Ma ... Jianhua Yang
Computer Methods and Programs in Biomedicine | VOL. 224
Chunyi Ma, et. al.Chunyi Ma ... Jianhua Yang
30 Jun 2022
Computer Methods and Programs in Biomedicine | VOL. 224

Structured covariance principal component analysis for real-time onsite feature extraction and dimensionality reduction in hyperspectral imaging.
Jaime Zabalza ... Zhe Liu
Applied Optics | VOL. 53
Jaime Zabalza, et. al.Jaime Zabalza ... Zhe Liu
04 Jul 2014
Applied Optics | VOL. 53

Uncertainty and Resolution Analysis of 2D and 3D Inversion Models Computed from Geophysical Electromagnetic Data
Zhengyong Ren ... Thomas Kalscheuer
Surveys in Geophysics | VOL. 41
Zhengyong Ren, et. al.Zhengyong Ren ... Thomas Kalscheuer
24 Sep 2019
Surveys in Geophysics | VOL. 41

Flexible Bayesian Dynamic Modeling of Correlation and Covariance Matrices.
Shiwei Lan ... Andrew Holbrook
Bayesian analysis | VOL. 15
Shiwei Lan, et. al.Shiwei Lan ... Andrew Holbrook
21 Nov 2017
Bayesian analysis | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech

Abstract

Talk to us

Similar Papers

More From: Engineering Journal