Development and Analysis of Speech Recognition Systems for Assamese Language Using HTK

Himangshu Sarma, Utpal Sharma, Navanath Saharia

doi:10.1145/3137055

Abstract

Language analysis is essential for native speakers to connect with the digital world. Assamese is a relatively unexplored language. In this report, we analyze different aspects of speech-to-text processing, from building a speech corpus and defining syllable rules to developing a speech search engine for Assamese. We collected and transcribed about 20 hours of speech in three modes (read, extempore, and conversation). We also discuss issues and challenges faced during development of the corpus. We developed an automatic syllabification model with 11 rules for the Assamese language, which achieved an accuracy of more than 95%. We found 12 different syllable patterns, of which 5 are the most frequent. The maximum syllable length found is four letters. Using the Hidden Markov Model Toolkit (HTK) 3.5, we built our speech recognition model on a deep-learning-based neural network and obtained 78.05% accuracy for automatic transcription of Assamese speech.
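The abstract does not say how the transcription accuracy was computed. As a rough illustration only, the sketch below assumes a word-level score of the kind HTK's HResults tool reports, where accuracy is 100 * (N - D - S - I) / N for N reference tokens with D deletions, S substitutions, and I insertions. The function names and the sample tokens are hypothetical, and the alignment uses uniform edit costs, a simplification compared with HTK's weighted alignment.

```python
from typing import List, Tuple


def align_counts(ref: List[str], hyp: List[str]) -> Tuple[int, int, int, int]:
    """Return (hits, substitutions, deletions, insertions) from a
    minimum-edit-distance alignment with uniform costs (an assumption;
    HTK itself weights the error types differently)."""
    rows, cols = len(ref) + 1, len(hyp) + 1
    # Each cell holds (total_errors, hits, subs, dels, ins) for ref[:i] vs hyp[:j].
    table = [[(0, 0, 0, 0, 0)] * cols for _ in range(rows)]
    for i in range(1, rows):
        table[i][0] = (i, 0, 0, i, 0)  # only deletions so far
    for j in range(1, cols):
        table[0][j] = (j, 0, 0, 0, j)  # only insertions so far
    for i in range(1, rows):
        for j in range(1, cols):
            e, h, s, d, n = table[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diag = (e, h + 1, s, d, n)      # match
            else:
                diag = (e + 1, h, s + 1, d, n)  # substitution
            e, h, s, d, n = table[i - 1][j]
            dele = (e + 1, h, s, d + 1, n)      # deletion
            e, h, s, d, n = table[i][j - 1]
            ins = (e + 1, h, s, d, n + 1)       # insertion
            table[i][j] = min(diag, dele, ins, key=lambda t: t[0])
    _, h, s, d, n = table[-1][-1]
    return h, s, d, n


def accuracy(ref: List[str], hyp: List[str]) -> float:
    """Accuracy in percent: 100 * (N - D - S - I) / N."""
    if not ref:
        raise ValueError("reference transcription must not be empty")
    _, s, d, i = align_counts(ref, hyp)
    return 100.0 * (len(ref) - d - s - i) / len(ref)


# Hypothetical placeholder tokens, not data from the paper.
reference = "a b c d".split()
hypothesis = "a x c d e".split()
print(accuracy(reference, hypothesis))
```

Because insertions are penalized, this score can fall below the plain percentage of correctly recognized tokens, and it can even go negative for very noisy output.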

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing
Publication Date: Oct 18, 2017
Citations: 9