Abstract

The object of research is methods for recognizing speaker gender from speech signals. One of the most problematic aspects is insufficient knowledge about the choice of features and decision rules. Such knowledge is needed to increase the probability of correct recognition and the noise immunity of voice-based gender recognition under interference. It is also important to simplify the implementation of gender recognition algorithms. For recognizing the speaker gender, a new set of classification features is selected that jointly uses estimates of the mean pitch frequency, its kurtosis coefficient, the mean formant values, and the formant asymmetry (skewness) coefficients. The proposed algorithms are evaluated by statistical testing on a personal computer. The experiments use real audio signals from female and male speakers, input from a microphone into a personal computer and recorded as separate files. For this purpose, 10 standards (reference samples) of 10 words are used for each of the 5 female and 5 male speakers. In the statistical tests, the algorithm that jointly uses estimates of the mean pitch frequency, its kurtosis coefficient, the mean formant values, and their asymmetry coefficients achieves an average probability of correct recognition of 1. With additive white Gaussian noise at a signal-to-noise ratio of q=20, the experimentally obtained probability of correct recognition for this algorithm is 0.8. For the decision algorithm that uses only estimates of the mean pitch frequency and its kurtosis coefficient, the average probability of correct recognition is estimated at 0.9. This indicates that such pitch-only algorithms are more noise-immune.
In the future, the results are expected to be applied not only to Russian and Ukrainian but also to a number of other languages.
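
The feature set and the noise test described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that per-frame pitch and formant estimates are already available as lists of frequencies in Hz, that the kurtosis coefficient is taken in excess form (m4/m2^2 - 3), that the asymmetry coefficient is m3/m2^1.5, and that q is a linear signal-to-noise power ratio. The abstract does not specify any of these conventions, and all function names are hypothetical.

```python
import math
import random
from statistics import fmean


def central_moment(values, order):
    """Population central moment of the given order."""
    mu = fmean(values)
    return fmean([(v - mu) ** order for v in values])


def skewness(values):
    """Asymmetry coefficient m3 / m2**1.5 (0.0 for a constant track)."""
    m2 = central_moment(values, 2)
    return central_moment(values, 3) / m2 ** 1.5 if m2 > 0 else 0.0


def kurtosis(values):
    """Kurtosis coefficient, excess form m4 / m2**2 - 3 (an assumption)."""
    m2 = central_moment(values, 2)
    return central_moment(values, 4) / m2 ** 2 - 3.0 if m2 > 0 else 0.0


def feature_vector(pitch_hz, formants_hz):
    """Return [mean F0, kurtosis F0, mean F1, skew F1, mean F2, skew F2, ...].

    pitch_hz    -- per-frame pitch estimates in Hz
    formants_hz -- one list of per-frame estimates per formant
    """
    features = [fmean(pitch_hz), kurtosis(pitch_hz)]
    for track in formants_hz:
        features += [fmean(track), skewness(track)]
    return features


def add_white_gaussian_noise(signal, snr, seed=None):
    """Add white Gaussian noise so that signal power / noise power == snr.

    snr is treated as a linear power ratio; this is an assumption.
    """
    rng = random.Random(seed)
    signal_power = fmean([s * s for s in signal])
    sigma = math.sqrt(signal_power / snr)
    return [s + rng.gauss(0.0, sigma) for s in signal]
```

A decision rule would then compare such feature vectors, computed from clean or noise-corrupted recordings, against the stored standards for female and male speakers. The abstract does not describe that rule, so it is left out of the sketch.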

Highlights

  • Algorithms for recognizing the speaker gender are necessary for solving a number of applied problems

  • The results of speaker gender determination are used in systems for adaptive recognition of words and speech phonemes and for speaker identification and verification, since knowing the speaker gender significantly narrows the range of values that the features can take

  • In the system [9], Gaussian mixtures are constructed for Mel-frequency cepstral coefficients (MFCC)


Summary

Introduction

Algorithms for recognizing the speaker gender are necessary for solving a number of applied problems. The results of speaker gender determination are used in systems for adaptive recognition of words and speech phonemes and for speaker identification and verification, since knowing the speaker gender significantly narrows the range of values that the features can take. The dimensions of the larynx, the vocal folds, and the muscles that control their vibration differ between men and women. This gives grounds for seeking distinctive features in the parameters of the voice excitation pulses and of the digital filter in the speech production model. It is therefore important to investigate methods for recognizing speaker gender from speech signals.

The object of research and its technological audit
The aim and objectives of research
Research of existing solutions of the problem
Methods of research
Research results
SWOT analysis of research results
Conclusions
Findings
Objective