МЕТОДИ ПІДВИЩЕННЯ ЯКОСТІ ПЕРЕТВОРЕННЯ МОВИ НА ТЕКСТ В СИСТЕМАХ БІОМЕТРИЧНОЇ АУТЕНТИФІКАЦІЇ

V Korchynskyi,I Vynogradov,S Staikuca,O Shvets,I Bielova

doi:10.36994/2788-5518-2023-01-05-13

Abstract

The article discusses the methods and algorithms of speech-to-text conversion, modern open and commercial systems for creating systems, as well as the use of these technologies in the field of cyber security. It is proposed to create a high-quality speech-to-text conversion system. An analysis of the mathematical algorithms used to reduce the error rate, which makes it possible to create unique voice prints and increase protection against forgery, has been carried out. The structure of modern speech-to-text conversion systems is described. By changing datasets, parameters of hidden Markov models, a high-quality dictionary of phonemes, and the use of language models, there is an opportunity to reduce the percentage of errors in language recognition, as well as the use of a system for multilingualism such as "surzhyk". The mathematical methods of assessing the quality of the system of speech to text (WER), as well as various methods of calculation, which is important for their further improvement and optimization, are considered. The structure of modern systems is considered, namely, signal pre-processing, feature extraction, acoustic modeling, speech modeling, decoding, post-processing. For each of the stages, study vectors have been proposed that can reduce the error rate of the system as a whole. Reducing speech recognition errors and the ability to fake a voice is achieved using various methods: deep neural networks, hidden Markov models, Baum-Welch algorithm, N-gram models, models with attention, creation of a high-quality phonemes dictionary, dataset, and fillers. Speech-to-text conversion technology can be used in biometric authentication systems to detect and analyze the unique features of the user's voice. However, modern speech-to-text conversion systems for Ukrainian, Russian, and "surzhyk" need improvement in acoustic and language units. Scientific works, which are devoted to research and optimization of these systems for biometric authentication, do not fully cover these issues. This became the reason for further research in this direction, so this work aims to create a speech recognition system with a minimum error rate.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

МЕТОДИ ПІДВИЩЕННЯ ЯКОСТІ ПЕРЕТВОРЕННЯ МОВИ НА ТЕКСТ В СИСТЕМАХ БІОМЕТРИЧНОЇ АУТЕНТИФІКАЦІЇ

Abstract

Talk to us

Similar Papers

More From: Інфокомунікаційні та комп’ютерні технології

Lead the way for us

Similar Papers

GPU-Aware Genetic Estimation of Hidden Markov Models for Workload Classification Problems
Alfredo Cuzzocrea ... Nicola Timeus
-
Alfredo Cuzzocrea, et. al.Alfredo Cuzzocrea ... Nicola Timeus
01 Jun 2016
01 Jun 2016

A Variable Initialization Approach to the EM Algorithm for Better Estimation of the Parameters of Hidden Markov Model Based Acoustic Modeling of Speech Signals
Md Shamsul Huda ... John Yearwood
-
Md Shamsul Huda, et. al.Md Shamsul Huda ... John Yearwood
01 Jan 2006
01 Jan 2006

Genetic algorithm based simultaneous optimization of feature subsets and hidden Markov model parameters for discrimination between speech and non-speech events
Yan-Xiong Li ... Sam Kwong
International Journal of Speech Technology | VOL. 13
Yan-Xiong Li, et. al.Yan-Xiong Li ... Sam Kwong
17 Apr 2010
International Journal of Speech Technology | VOL. 13

Estimating HMM Parameters Using Particle Swarm Optimisation
Somnuk Phon-Amnuaisuk
-
Somnuk Phon-AmnuaisukSomnuk Phon-Amnuaisuk
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

МЕТОДИ ПІДВИЩЕННЯ ЯКОСТІ ПЕРЕТВОРЕННЯ МОВИ НА ТЕКСТ В СИСТЕМАХ БІОМЕТРИЧНОЇ АУТЕНТИФІКАЦІЇ

Abstract

Talk to us

Similar Papers

More From: Інфокомунікаційні та комп’ютерні технології