Abstract

This paper addresses the problem of speaker verification by means of voice time series comparison. The aim is to determine the orders of mel-frequency cepstral coefficients (MFCCs) that most accurately describe the difference between an authentic voice and an artificially generated copy, for further use as input to a neural network model in a resource-limited environment. To achieve this goal, the following tasks were accomplished: a conceptual model of the technology for determining the similarity threshold of two audio series was developed; the orders of mel-frequency cepstral coefficients with the most characteristic differences between the original recording and the generated voice were determined on the basis of neural network analysis; an experimental study of the dependence of the execution time and computational load on the constructed feature vector when assessing the degree of similarity of two time series was conducted; and the optimal similarity threshold was determined on the basis of the chosen dataset. The developed model of the technology for determining the similarity threshold was tested on a dataset that combines the DEEP-VOICE dataset with our own recordings. Applying the developed technology yielded a 43% improvement when using the selected MFCC orders compared to using all of them. Based on the experimental studies, the DTW acceptance threshold was set at 0.37.
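A minimal sketch of the pipeline described above, assuming librosa for MFCC extraction and DTW alignment. The particular subset of MFCC orders, the sampling rate, and the cost normalization are illustrative assumptions; only the 0.37 acceptance threshold comes from the abstract, and it presupposes that scores are scaled as in the paper's own procedure.

```python
# Sketch only: MFCC-subset extraction + DTW comparison against a threshold.
import librosa
import numpy as np

SELECTED_ORDERS = [1, 2, 3, 5, 7]   # hypothetical subset of informative MFCC orders
DTW_THRESHOLD = 0.37                # acceptance threshold reported in the abstract

def mfcc_features(path, sr=16000, n_mfcc=13):
    """Extract MFCCs from an audio file and keep only the selected orders."""
    y, sr = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    return mfcc[SELECTED_ORDERS, :]

def dtw_score(path_a, path_b):
    """DTW cost between two recordings, normalized by warping-path length."""
    a, b = mfcc_features(path_a), mfcc_features(path_b)
    D, wp = librosa.sequence.dtw(X=a, Y=b, metric="euclidean")
    # Length normalization makes the score comparable across recordings of
    # different durations; any further scaling to match the paper's 0.37
    # threshold is an assumption not specified in the abstract.
    return D[-1, -1] / len(wp)

def is_authentic_match(path_a, path_b, threshold=DTW_THRESHOLD):
    """Accept the pair as the same authentic speaker if the score is below the threshold."""
    return dtw_score(path_a, path_b) <= threshold
```

In practice, restricting the feature vector to a few MFCC orders reduces both the DTW matrix size and the per-frame distance cost, which is what makes the comparison feasible in a resource-limited environment.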
