Development of the method of automatic determination of the speaker gender on the basis of joint evaluation of frequency moments of basic tons and formant frequencies

Sergey Omelchenko

doi:10.15587/2312-8372.2018.134977

Abstract

The object of research is the methods of recognizing the speaker gender by means of speech signals. One of the most problematic places is insufficient knowledge of the choice of signs and decisive rules. This is necessary to increase the probability of correct recognition and noise immunity of gender recognition by voice signals in conditions of interference. It is also important to simplify the implementation of algorithms for recognizing the speaker gender. For recognition of the speaker gender, a new set of classification characteristics is selected, including the joint use of estimates of the average value of the pitch frequency, its kurtosis coefficient, estimates of the mean values of the formants and their asymmetry coefficients. In the course of the research, the method of statistical testing of the proposed algorithms on a personal computer is used. The experiments are carried out using real audio signals input from a microphone into a personal computer for both female and male representatives, and recorded as separate files. For this purpose, 10 standards of 10 words are used for each of the 5 female speakers and 5 male speakers. Based on the results of statistical tests for an algorithm involving the joint use of estimates of the mean value of the pitch frequency, its kurtosis coefficient, estimates of the mean values of the formants and their asymmetry coefficients, an average probability of correct recognition is obtained 1. With the additional action of additive noise of the Gaussian type, white noise and the ratio of the signal/noise q=20, for such algorithm the probability of correct recognition is experimentally obtained – 0.8. For the decision algorithm, which uses only estimates of the average value of the pitch frequency and its kurtosis coefficient, an average probability of correct recognition is estimated at 0.9. This indicates more noise immunity of such algorithms. In the future, the use of the obtained results not only for Russian and Ukrainian languages, but also for a number of foreign languages is supposed.

Highlights

Algorithms for recognizing the speaker gender are necessary for solving a number of applied problems
The results of determining the speaker gender are used in systems of adaptive word recognition and speech phonemes, identification and verification of speakers, since recognition of the speaker gender allows significantly narrowing the range of values accepted by the signs
In the system [9], Gaussian mixtures are constructed for Mel-cepstral coefficients (MFCC)

Summary

Introduction

Algorithms for recognizing the speaker gender are necessary for solving a number of applied problems. The results of determining the speaker gender are used in systems of adaptive word recognition and speech phonemes, identification and verification of speakers, since recognition of the speaker gender allows significantly narrowing the range of values accepted by the signs. Dimensions of the larynx, vocal folds and muscles that control their fluctuations, are different for men and women. This gives grounds for searching for distinctive features in the parameters of the voice excitation pulses and the digital filter of the speech formation model. It is important to investigate the methods of recognizing the speaker gender using speech signals

The object of research and its technological audit

The aim and objectives of research

Research of existing solutions of the problem

Methods of research

Research results

SWOT analysis of research results

Conclusions

Findings

Objective

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Development of the method of automatic determination of the speaker gender on the basis of joint evaluation of frequency moments of basic tons and formant frequencies

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Technology audit and production reserves

Lead the way for us

Journal: Technology audit and production reserves	Publication Date: Jan 23, 2018
License type: cc-by

Similar Papers

Automatic determination of a speaker’s gender based on the Cauchy distribution in the octave frequency band
Sergey Omelchenko
ScienceRise | VOL. 1
Sergey OmelchenkoSergey Omelchenko
16 Jul 2019
ScienceRise | VOL. 1

Computational spectral imaging technology with high light utilization for space target detection
Yanli Liu ... Junhong Su
-
Yanli Liu, et. al.Yanli Liu ... Junhong Su
27 Mar 2022
27 Mar 2022

Vocal Attractiveness Increases by Averaging
Laetitia Bruckert ... Pascal Belin
Current biology : CB | VOL. 20
Laetitia Bruckert, et. al.Laetitia Bruckert ... Pascal Belin
01 Jan 2009
Current biology : CB | VOL. 20

АНАЛИЗ МЕТОДОВ ПОСТКЛАССИФИКАЦИОННОЙ ОБРАБОТКИ МНОГОКАНАЛЬНЫХ ИЗОБРАЖЕНИЙ
Ирина Карловна Васильева ... Владимир Васильевич Лукин
RADIOELECTRONIC AND COMPUTER SYSTEMS | VOL. -
Ирина Карловна Васильева, et. al.Ирина Карловна Васильева ... Владимир Васильевич Лукин
23 Mar 2019
RADIOELECTRONIC AND COMPUTER SYSTEMS | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Development of the method of automatic determination of the speaker gender on the basis of joint evaluation of frequency moments of basic tons and formant frequencies

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Technology audit and production reserves