Abstract

The level of development of information technology makes it possible to use speech recognition technologies in a wide range of human life and activities. It is very convenient to use the voice interface: voice search for the necessary documents, dialing a phone number, managing IOT devices, voice navigation, simple text dictation. Since the natural language interface provides an additional convenience for a person when typing, sending voice messages has become common among users. In this case, voice messages are audio files. But it is not always available and convenient for the recipient to listen to such messages. This problem can be solved with the help of an automatic speech recognition system (ASR). The article describes the stages and elements of the process of processing and recognition of natural language by audio signal. Modern technologies of automatic speech recognition and problems with choosing among them are indicated. Modern automatic speech recognition (ASR) systems understand fully spontaneous speech that is natural, not memorized, contains signs of stuttering or even minor errors. At the same time, they are still too expensive to develop from scratch. So companies are faced with a choice between using the cloud API for ASR developed by the tech giants and using open source solutions. The analysis of the latest research and publications on the processing of voice data is considered. A software solution for automatic conversion of voice messages into text is proposed. The interface to the voice signal delivery system is proposed to be made as a chat bot in the messenger. The article presents the main components of the system, the algorithm of the chat bot, modern technologies for the development, implementation and configuration of the chat bot in the messenger

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call