Abstract

This paper presents “Google” Lithuanian speech recognition efficiency evaluation research. For the experiment it was chosen method that consists of three parts: (1) to process all voice records without adding any noise; (2) process all voice records with several different types of noise, modified so as to get some predefined signal-to-noise ratio (SNR); (3) after one month reprocess all voice records without any additional noise and to assess improvements in the quality of the speech recognition. It was chosen WER metrics for speech recognition quality assessment. Analyzing the results of the experiment it was observed that the greatest impact on the quality of speech recognition has a SNR and speech type (most recognizable is isolated words, the worst - spontaneous speech). Meanwhile, characteristics such as the gender of the speaker, smooth speech, speech speed, speech volume does not make any significant influence on speech recognition quality.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.