Speaker-Independent Automatic Speech Recognition System for Mobile Phone Applications in Punjabi

Puneet Mittal,Navdeep Singh

doi:10.1007/978-3-319-67934-1_33

Abstract

Speaker-independent Automatic Speech Recognition (ASR) system based mobile phone applications are gaining popularity due to technological advancements and accessibility. Speech based applications may provide mobile phone accessibility and comfort to people performing activities where hand-free phone access is desirable e.g. drivers, athletes, machine operators etc. Similarly, users with disabilities like low vision, blindness and physically challenged may use it as an assistive technology. Development of ASR system for a specific language needs accurate, reliable and efficient acoustic model having language-specific pronunciation dictionary. Punjabi language is one of the popular languages worldwide having more than 150 million speakers. Three acoustic models- continuous, semi-continuous and phonetically-tied are developed based on three pronunciation dictionaries- word, sub-word and character based. Analysis of performance results validate Punjabi language principle “One word one sound” by having better accuracy and reliability for character based pronunciation dictionary than others. Further, phonetically-tied model outperforms others in terms of accuracy, word error rate and size due to reasonable number of Gaussians.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speaker-Independent Automatic Speech Recognition System for Mobile Phone Applications in Punjabi

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Arabic Dialectical Speech Recognition in Mobile Communication Services
Qiru Zhou ... Imed Zitouni
-
Qiru Zhou, et. al.Qiru Zhou ... Imed Zitouni
01 Nov 2008
01 Nov 2008

Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems
Kartik Audhkhasi ... Shrikanth S Narayanan
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
Kartik Audhkhasi, et. al.Kartik Audhkhasi ... Shrikanth S Narayanan
01 Mar 2014
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22

A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks
Jun Du ... Yanhui Tu
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24
Jun Du, et. al.Jun Du ... Yanhui Tu
01 Aug 2016
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24

A Comparative Study on Selecting Acoustic Modeling Units for WFST-based Mongolian Speech Recognition
Wang Yonghe ... Feilong Bao
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22
Wang Yonghe, et. al.Wang Yonghe ... Feilong Bao
13 Oct 2023
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speaker-Independent Automatic Speech Recognition System for Mobile Phone Applications in Punjabi

Abstract

Talk to us

Similar Papers