Phone based acoustic modeling for automatic speech recognition for Punjabi language

Wiqas Ghai,Navdeep Singh

doi:10.20396/joss.v3i1.15040

Abstract

Punjabi language is a tonal language belonging to an Indo-Aryan language family and has a number of speakers all around the world. Punjabi language has gained acceptability in the media & communication and therefore deserves to have a place in the growing field of automatic speech recognition which has been explored already for a number of other Indian and foreign languages successfully. Some work has been done in the field of isolated word speech recognition for Punjabi language, but only using whole word based acoustic models. A phone based approach has yet to be applied for Punjabi language speech recognition. This paper describes an automatic speech recognizer that recognizes isolated word speech and connected word speech using a triphone based acoustic model on the HTK 3.4.1 speech Engine and compares the performance with acoustic whole word model based ASR system. Word recognition accuracy of isolated word speech was 92.05% for acoustic whole word model based system and 97.14% for acoustic triphone model based system whereas word recognition accuracy of connected word speech was 87.75% for acoustic whole word model based system and 91.62% for acoustic triphone model based system.

Highlights

Speech is generated when vibrating vocal cords create puffs of air
The phone based acoustic model approach is new to the Punjabi language automatic speech recognition
This paper focuses on implementing an ASR for recognizing isolated word and connected word speech in the Punjabi Language

Summary

Introduction

Speech is generated when vibrating vocal cords create puffs of air. These puffs result in air pressure variations and it is due to these variations that the sensation of hearing develops. Automatic speech recognition [1, 18] is a process of transforming a speech signal (Figure 1) to a text which closely matches the input speech signal This technique is being used extensively in application areas such as: voice user interface, voice interactive response, enhancing social interactive capability of handicapped people, learning a foreign language etc. The acoustic model is used to represent the different ways a word of a particular language can sound. It makes use of audio recordings along with their transcriptions and compiles these two to produce statistical representations. The language model provides the context information to a speech recognition system It models the way the words are connected to form a sentence. The prior probability of the word, i.e. P (W), is provided by the language model, whereas the observation likelihood, i.e. P (X|W), is provided by the acoustic model

Acoustic Phone Model

Punjabi Language

Previous Work

Implementation

Phase 1

Phase 2

Findings

Conclusion & Future Work

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Speech Sciences	Publication Date: Feb 5, 2021
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Phone based acoustic modeling for automatic speech recognition for Punjabi language

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Speech Sciences

Lead the way for us

Similar Papers

Efficient training of acoustic models for reverberation-robust medium-vocabulary automatic speech recognition
Armin Sehr ... Roland Maas
-
Armin Sehr, et. al.Armin Sehr ... Roland Maas
01 May 2014
01 May 2014

ETEH: Unified Attention-Based End-to-End ASR and KWS Architecture
Gaofeng Cheng ... Runyan Yang
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
Gaofeng Cheng, et. al.Gaofeng Cheng ... Runyan Yang
01 Jan 2021
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30

A study on multilingual acoustic modeling for large vocabulary ASR
Hui Lin ... Li Deng
-
Hui Lin, et. al.Hui Lin ... Li Deng
01 Apr 2009
01 Apr 2009

Ensemble acoustic modeling in automatic speech recognition
Xin Chen
-
Xin ChenXin Chen
01 Jan 2010
01 Jan 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Phone based acoustic modeling for automatic speech recognition for Punjabi language

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Speech Sciences