“Google” Lithuanian Speech Recognition Efficiency Evaluation Research

Donatas Sipavičius, Rytis Maskeliūnas

doi:10.1007/978-3-319-46254-7_49

Abstract

This paper evaluates the efficiency of “Google” Lithuanian speech recognition. The experimental method consists of three parts: (1) processing all voice records without any added noise; (2) processing all voice records with several different types of noise, scaled to obtain a predefined signal-to-noise ratio (SNR); and (3) reprocessing all voice records after one month, again without added noise, to assess improvements in speech recognition quality. Word error rate (WER) was chosen as the metric for assessing speech recognition quality. The results show that SNR and speech type have the greatest impact on recognition quality: isolated words are recognized best and spontaneous speech worst. Meanwhile, characteristics such as the speaker's gender, speech fluency, speech rate, and speech volume do not have any significant influence on speech recognition quality.
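The two quantities the method relies on, WER and a predefined SNR, can be illustrated with a minimal sketch. The abstract does not say how the authors computed WER or scaled the noise, so everything below is an assumption: the function names `word_error_rate` and `mix_at_snr` are hypothetical, WER follows the common definition (word-level edit distance divided by the number of reference words), and SNR is taken as the ratio of average speech power to average noise power in decibels.

```python
import math


def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Word-level Levenshtein distance, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


def mix_at_snr(speech, noise, snr_db):
    """Scale the noise so speech power / noise power matches snr_db, then add it."""
    if not speech:
        raise ValueError("speech must not be empty")
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    p_speech = sum(s * s for s in speech) / len(speech)
    p_noise = sum(n * n for n in noise) / len(noise)
    if p_speech == 0 or p_noise == 0:
        raise ValueError("speech and noise must both have non-zero power")
    # Target noise power is p_speech / 10^(snr_db / 10); gain is applied to amplitude.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))
    return [s + gain * n for s, n in zip(speech, noise)]


if __name__ == "__main__":
    # Illustrative transcripts only; not data from the paper.
    print(word_error_rate("labas rytas visiems", "labas vakaras visiems"))
    print(mix_at_snr([1.0, -1.0, 1.0, -1.0], [0.5, 0.5, -0.5, -0.5], 10.0))
```

Here the samples are plain Python lists of floats to keep the sketch dependency-free; a real pipeline would read audio files and most likely use an array library instead. Note that WER can exceed 1.0 when the hypothesis contains many inserted words.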
