Information system for converting audio in Ukrainian language into its textual representation using NLP methods and machine learning

Yurii Tyshchuk, Olha Vlasenko, Victoria Vysotska

doi:10.23939/sisn2022.12.023

Abstract

Speech recognition involves various models, methods and algorithms for analysing and processing the user’s recorded voice. This allows people to control systems that support some form of speech recognition. A speech-to-text conversion system is a type of speech recognition that uses spoken data for further processing. Such a system processes an audio file in several stages, which rely on electroacoustic means, filtering algorithms that isolate relevant sounds in the audio file, electronic data arrays for the selected language, and mathematical models that assemble the most likely words from phonemes. With speech-to-text conversion, people whose professions involve typing large amounts of text on a keyboard can significantly speed up and ease their work and reduce stress. Such systems also help businesses: as remote work becomes more popular, companies need tools to record meetings and organize them as written text. The object of the research is the process of converting Ukrainian-language audio into written text based on NLP and machine learning methods. The subject of the research is file processing algorithms for extracting relevant sounds and recognizing phonemes, as well as mathematical models for recognizing an array of phonemes as specific words. The purpose of the work is to design and develop an information system for converting Ukrainian-language audio into written text based on the Ukrainian Speech-to-text Web application, a technology for accurate and easy analysis of Ukrainian-language audio files and their subsequent transcription into text. The application supports loading files from the file system, recording from the microphone, and saving the analysed data.
The article also describes the design stages and the general architecture of a system for converting Ukrainian-language audio into written text. Experimental testing of the developed system showed that the number of words does not affect the accuracy of the conversion algorithm; the observed decrease in accuracy was small and was caused by the complexity of the words and by the low quality of the microphone, and hence of the recorded file.
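
The abstract reports conversion accuracy but does not say how it was measured. As an illustration only, the sketch below assumes a common choice for speech-to-text evaluation: word error rate (WER), computed as the word-level edit distance between a reference transcript and the recognized text, divided by the number of reference words. The function names `word_error_rate` and `word_accuracy` and the sample phrases are hypothetical and are not taken from the article.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: the metric is illustrative; the article does not state
    which accuracy measure its experiments used.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from hypothesis
                curr[j - 1] + 1,     # extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def word_accuracy(reference: str, hypothesis: str) -> float:
    """Accuracy as 1 - WER, clamped at zero for very noisy output."""
    return max(0.0, 1.0 - word_error_rate(reference, hypothesis))


if __name__ == "__main__":
    # Hypothetical example: one of four words is recognized incorrectly.
    expected = "добрий день як справи"
    recognized = "добрий день як справа"
    print(word_accuracy(expected, recognized))
```

Because the distance is normalized by the reference length, the score is comparable across recordings with different numbers of words, which is the kind of comparison the experimental finding above relies on.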


More From: Vìsnik Nacìonalʹnogo unìversitetu "Lʹvìvsʹka polìtehnìka". Serìâ Ìnformacìjnì sistemi ta merežì


