Abstract
Automatic speech recognition (ASR) is an active field of research. ASR performance can be degraded by factors such as environmental noise, channel distortion, and speech rate variability. Speech rate variability is one of the important factors affecting the accuracy of a speech recognition system (SRS). In this work, the speech signal is categorized as slow, normal, or fast speech using features such as sound intensity level, time duration, and root mean square value. This paper addresses improving the performance of an SRS by applying time normalization to the speech signal. The proposed model is compared with a baseline syllable-based SRS.
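The abstract does not specify how the features are computed or how time normalization is carried out, so the following is only a minimal sketch under stated assumptions. It computes the three named features for a list of amplitude samples and stretches or compresses a signal to a target length by linear interpolation. The function names, the reference amplitude `ref`, and the choice of linear interpolation are illustrative assumptions, not the method of the paper. Plain interpolation also shifts pitch, which a practical system would likely avoid.

```python
import math


def rms(samples):
    """Root mean square of a non-empty sequence of amplitude samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def intensity_db(samples, ref=1.0):
    """Level in dB relative to `ref` (a hypothetical reference amplitude)."""
    # Clamp to a tiny positive value so silence does not hit log10(0).
    return 20.0 * math.log10(max(rms(samples), 1e-12) / ref)


def duration_s(samples, sample_rate):
    """Duration of the signal in seconds."""
    return len(samples) / sample_rate


def time_normalize(samples, target_len):
    """Resample to `target_len` samples by linear interpolation.

    Assumption: this stands in for whatever time normalization the
    paper applies; it changes pitch along with duration.
    """
    n = len(samples)
    if n == 0 or target_len < 1:
        raise ValueError("need a non-empty signal and target_len >= 1")
    if n == 1 or target_len == 1:
        return [samples[0]] * target_len
    step = (n - 1) / (target_len - 1)
    out = []
    for i in range(target_len):
        pos = i * step
        lo = min(int(pos), n - 2)   # left neighbour index
        frac = pos - lo             # distance from left neighbour
        out.append(samples[lo] * (1.0 - frac) + samples[lo + 1] * frac)
    return out
```

Under these assumptions, a fast utterance would be lengthened and a slow one shortened toward a common target length before recognition. The rule that maps the three features to the slow, normal, and fast classes is not given in the abstract and is therefore left out.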