Broad Phonetic Classification of ASR using Visual Based Features

Doaa Lehabik,Sameh Saad,Amr Gody,Mohamed Merzban

doi:10.21608/ejle.2020.24358.1003

Abstract

Abstract: This paper presents a novel method of classifying speech phonemes. Four hybrid techniques based on the acoustic-phonetic approach and pattern recognition approach are used to emphasize the principle idea of this research. The first hybrid model is constructed of fixed state, structured Hidden Markov Model, Gaussian Mixture, Mel scaled Best Tree Image, Convolution Neural network, Vector Quantization (FS-HMM-GM-MBTI-CNN-VQ). The second hybrid model is constructed of variable state, dynamically structured Hidden Markov Model, Gaussian Mixture, Mel scaled Best Tree Image, Convolution Neural network, Vector Quantization (VS-HMM-GM-MBTI-CNN-VQ). The third hybrid model is constructed of fixed state, structured Hidden Markov Model, Gaussian Mixture, Mel scaled Best Tree Image, Convolution Neural network (FS-HMM-GM-MBTI-CNN). The fourth hybrid model is constructed of variable state, dynamically structured Hidden Markov Model, Gaussian Mixture, Mel scaled Best Tree Image, Convolution Neural network (VS-HMM-GM-MBTI-CNN). TIMIT database is used in this paper. All phones are classified into five classes and segregated into Vowels, Plosives, Fricatives, Nasals, and Silences. The results show that using (VS-HMM-GM-MBTI-CNN-VQ) is an available method for classification of phonemes, with the potential for use in applications such as automatic speech recognition and automatic language identification. Competitive results are achieved especially in nasals, plosives, and silence high successive rates than others.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Broad Phonetic Classification of ASR using Visual Based Features

Abstract

Talk to us

Similar Papers

More From: The Egyptian Journal of Language Engineering

Lead the way for us

Journal: The Egyptian Journal of Language Engineering	Publication Date: Apr 1, 2020
License type: cc-by

Similar Papers

Automatic Database Segmentation using Hybrid Spectrum -Visual Approach

The Egyptian Journal of Language Engineering | VOL. 8

01 Sep 2021
The Egyptian Journal of Language Engineering | VOL. 8

High-accuracy recognition of gas–liquid two-phase flow patterns: A Flow–Hilbert–CNN hybrid model
Pan Zhang ... Jiang Bian
Geoenergy Science and Engineering | VOL. 230
Pan Zhang, et. al.Pan Zhang ... Jiang Bian
28 Jul 2023
Geoenergy Science and Engineering | VOL. 230

Comparative study of univariate and multivariate strategy for short-term forecasting of heat demand density: Exploring single and hybrid deep learning models
Sajad Salehi ... Luc Begnoche
Energy and AI | VOL. 16
Sajad Salehi, et. al.Sajad Salehi ... Luc Begnoche
24 Jan 2024
Energy and AI | VOL. 16

A hybrid model based on neural networks for biomedical relation extraction
Yijia Zhang ... Liang Yang
Journal of Biomedical Informatics | VOL. 81
Yijia Zhang, et. al.Yijia Zhang ... Liang Yang
27 Mar 2018
Journal of Biomedical Informatics | VOL. 81

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Broad Phonetic Classification of ASR using Visual Based Features

Abstract

Talk to us

Similar Papers

More From: The Egyptian Journal of Language Engineering