Robust Perceptual Wavelet Packet Features for Recognition of Continuous Kannada Speech

Mahadevaswamy Mahadevaswamy, D J Ravi

doi:10.1007/s11277-021-08736-1

Open Access

Abstract

An automatic speech recognition (ASR) system is built for continuous Kannada speech recognition. The acoustic and language models are created with the Kaldi toolkit. The speech database is collected from native male and female Kannada speakers. Of the collected speech data, 80% is used for training the acoustic models and 20% is used for system testing. System performance is reported in terms of Word Error Rate (WER). Feature extraction combines Wavelet Packet Decomposition with a Mel filter bank. The proposed features perform slightly better than conventional features such as MFCC and PLP in terms of Word Recognition Accuracy (WRA) and WER under uncontrolled conditions. On the speech corpus collected in the Kannada language, the proposed features show a 1.79% improvement in WRA over the baseline features.
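To make the wavelet packet idea concrete, here is a minimal sketch of per-frame subband log-energy features. It is not the authors' method: it assumes a Haar wavelet and a uniform (full) packet tree, whereas the paper describes a Mel filter like decomposition whose tree layout is not given here. The function names `haar_step`, `wavelet_packet_leaves`, and `log_subband_energies` are hypothetical.

```python
import math


def haar_step(x):
    """Split an even-length sequence into Haar low-pass and high-pass halves."""
    s = math.sqrt(2.0)
    low = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    high = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return low, high


def wavelet_packet_leaves(frame, levels):
    """Return the leaf nodes of a full wavelet packet tree.

    Unlike the plain wavelet transform, every node (low and high) is split
    again at each level. Leaves are returned in natural (filter) order,
    which is not the same as increasing frequency order.
    """
    if len(frame) % (2 ** levels) != 0:
        raise ValueError("frame length must be divisible by 2**levels")
    nodes = [list(frame)]
    for _ in range(levels):
        next_nodes = []
        for node in nodes:
            low, high = haar_step(node)
            next_nodes.extend([low, high])
        nodes = next_nodes
    return nodes


def log_subband_energies(frame, levels=3, floor=1e-10):
    """One log-energy value per leaf subband; `floor` avoids log(0)."""
    leaves = wavelet_packet_leaves(frame, levels)
    return [math.log(sum(c * c for c in leaf) + floor) for leaf in leaves]


# A constant frame puts all of its energy in the all-low-pass leaf.
features = log_subband_energies([1.0] * 8, levels=3)
print(len(features))  # 8
```

A Mel-like variant would split low-frequency nodes more deeply than high-frequency ones so that subband widths roughly follow the Mel scale; the uniform tree above is only the simplest starting point.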

Highlights

The frequent pauses between the speech sounds of a speech signal are a unique characteristic that distinguishes it from all other signals
The database consists of 3 sets for the Kannada language: isolated digits (0-9), isolated words, and continuous Kannada speech consisting of spontaneous spoken Kannada sentences
The database consists of 3 sets for the English language: isolated digits 0-9 (TIMIT), isolated words (TIMIT), and continuous English speech (LibriSpeech)

Summary

D J Ravi, Vidyavardhaka College of Engineering

The acoustic and language models are created with the Kaldi toolkit. The speech database is collected from native male and female Kannada speakers. Of the collected speech data, 75% is used for training the acoustic models and 25% is used for system testing. System performance is reported in terms of Word Error Rate (WER). The proposed features perform slightly better than conventional features such as MFCC and PLP in terms of WRA and WER under uncontrolled conditions. On the speech corpus collected in the Kannada language, the proposed features show a 1.79% improvement in WRA over the baseline features.
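For readers unfamiliar with the metrics, the sketch below computes WER as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. Treating WRA as 100% minus WER is an assumption; the text here does not define WRA. The function names and the placeholder sentences are hypothetical and are not taken from the Kannada corpus.

```python
def word_error_rate(reference, hypothesis):
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


def word_recognition_accuracy(reference, hypothesis):
    """Assumed definition: WRA (%) = 100 - WER (%)."""
    return 100.0 - 100.0 * word_error_rate(reference, hypothesis)


# One substitution (b -> x) and one deletion (d) over four reference words.
print(word_error_rate("a b c d", "a x c"))  # 0.5
```

Under this assumed definition WRA can become negative when the hypothesis contains many insertions, since WER is not bounded by 1.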

INTRODUCTION

RELATED WORKS

PROPOSED FEATURES

Theoretical Background of Wavelet Transforms

Mel Filter like WP Decomposition

PERFORMANCE ANALYSIS

DATABASE The Kannada speech Database consisting of isolated digits from

21 Acoustic Model

RESULTS

CONCLUSION

Journal: Wireless Personal Communications	Publication Date: Jul 21, 2021
Citations: 4	License type: cc-by

Similar Papers

WITHDRAWN: Robust Continuous Kannada speech recognition using Kaldi toolkit
Mahadevaswamy ... D J Ravi
Materials today. Proceedings | VOL. -
01 Mar 2021

Morpheme-Based and Factored Language Modeling for Amharic Speech Recognition
Martha Yifiru Tachbelie ... Wolfgang Menzel
01 Jan 2010

Mel scaled M-band wavelet filter bank for speech recognition
Prashant Upadhyaya ... M R Abidi
International Journal of Speech Technology | VOL. 21
07 Aug 2018

Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia
Andreas Widjaja ... Vincent Elbert Budiman
Jurnal Teknik Informatika dan Sistem Informasi | VOL. 6
10 Aug 2020
