Abstract

This paper addresses the application in speech recognition of simulated telephone speech, which is generated from clean speech by approximately mimicking actual telephone channel conditions. Maximum Likelihood Linear Regression (MLLR) algorithm was performed to conduct experiments on evaluating the performances of HMM recognizers, which were trained from clean speech and from generated telephone data, respectively. The test and adaptation data were recorded by piping clean speech through local telephone network. The experiments without adaptation report that the simulation models trained on generated data can give an obviously higher rate than the clean speech. The adaptation performances show that MLLR lends itself to further improve the recognition performance of telephone recognition system. The results show that telephone speech recognition performance can be effectively improved using the generated data, and its generating method can reduce the acoustic mismatch between training and testing data that was induced by the shortage of actual telephone speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.