Voice Gender Research Articles

Some cochlear implant (CI) users are fitted with a CI in each ear ("bilateral"), while others have a CI in one ear and a hearing aid in the other ("bimodal"). Presently, evaluation of the benefits of bilateral or bimodal CI fitting does not take into account the integration of frequency information across the ears. This study tests the hypothesis that CI listeners, especially bimodal CI users, with a more precise integration of frequency information across ears ("sharp binaural pitch fusion") will derive greater benefit from voice gender differences in a multi-talker listening environment. Twelve bimodal CI users and twelve bilateral CI users participated. First, binaural pitch fusion ranges were measured using the simultaneous, dichotic presentation of reference and comparison stimuli (electric pulse trains for CI ears and acoustic tones for HA ears) in opposite ears, with reference stimuli fixed and comparison stimuli varied in frequency/electrode to find the range perceived as a single sound. Direct electrical stimulation was used in implanted ears through the research interface, which allowed selective stimulation of one electrode at a time, and acoustic stimulation was used in the non-implanted ears through the headphone. Second, speech-on-speech masking performance was measured to estimate masking release by voice gender difference between target and maskers (VGRM). The VGRM was calculated as the difference in speech recognition thresholds of target sounds in the presence of same-gender or different-gender maskers. Voice gender differences between target and masker talkers improved speech recognition performance for the bimodal CI group, but not the bilateral CI group. The bimodal CI users who benefited the most from voice gender differences were those who had the narrowest range of acoustic frequencies that fused into a single sound with stimulation from a single electrode from the CI in the opposite ear. There was no similar voice gender difference benefit of narrow binaural fusion range for the bilateral CI users. The findings suggest that broad binaural fusion reduces the acoustical information available for differentiating individual talkers in bimodal CI users, but not for bilateral CI users. In addition, for bimodal CI users with narrow binaural fusion who benefit from voice gender differences, bilateral implantation could lead to a loss of that benefit and impair their ability to selectively attend to one talker in the presence of multiple competing talkers. The results suggest that binaural pitch fusion, along with an assessment of residual hearing and other factors, could be important for assessing bimodal and bilateral CI users.

Compared to other features of the human body, voice is quite complex and dynamic, in a sense that a speech can be spoken in various languages with different accents and in different emotional states. Recognizing the gender, i.e. male or female from the voice of an individual, is by all accounts a minor errand for human beings. Similar goes for speaker identification if we are well accustomed with the speaker for a long time. Our ears function as the front end, accepting the sound signs which our cerebrum processes and settles on our disposition. Although being trivial for us, it becomes a challenging task to mimic for any computing device. Automatic gender, emotion and speaker identification systems have many applications in surveillance, multimedia technology, robotics and social media. In this paper, we propose a Golden Ratio-aided Neural Network (GRaNN) architecture for the said purposes. As deciding the number of units for each layer in deep NN is a challenging issue, we have done this using the concept of Golden Ratio. Prior to that, an optimal subset of features are selected from the feature vector extracted, common for all three tasks, from spectral images obtained from the input voice signals. We have used a wrapper-filter framework where minimum redundancy maximum relevance selected features are fed to Mayfly algorithm combined with adaptive beta hill climbing (AbetaHC) algorithm. Our model achieves accuracies of 99.306% and 95.68% for gender identification in RAVDESS and Voice Gender datasets, 95.27% for emotion identification in RAVDESS dataset and 67.172% for speaker identification in RAVDESS dataset. Performance comparison of this model with existing models on the publicly available datasets confirms its superiority over those models. Results also ensure that we have chosen the common feature set meticulously, which works equally well on three different pattern classification tasks. The proposed wrapper-filter framework reduces the feature dimension significantly, thereby lessening the storage requirement and training time. Finally, strategically selecting the number units in each layer in NN help increases the overall performance of all three pattern classification tasks.

Voice Gender Research Articles

Related Topics

Articles published on Voice Gender

Differential Effects of Binaural Pitch Fusion Range on the Benefits of Voice Gender Differences in a "Cocktail Party" Environment for Bimodal and Bilateral Cochlear Implant Users.

Towards a History of Syriac Rhetoric in Late Antiquity

Multiplicity in the experience of voice-hearing: A phenomenological inquiry

Gender voices in Afghanistan primary and secondary school English textbooks

Stimulation Rate and Voice Pitch Perception in Cochlear Implants.

Vocal Depersonalization in Scat Singing

Do Men Have No Need for “Feminist” Artificial Intelligence? Agentic and Gendered Voice Assistants in the Light of Basic Psychological Needs

A Hybrid Ensemble Stacking Model for Gender Voice Recognition Approach

GRaNN: feature selection with golden ratio-aided neural network for emotion, gender and speaker identification from voice signals

Phonological but not lexical processing alters the perceptual weighting of mean fundamental frequency and vocal-tract length cues for voice gender categorisation

Exploring racial and gender disparities in voice biometrics

Gender Identification from Speech Recognition Using Machine Learning Techniques and Convolutional Neural Networks

Gendered Voices and Ideological ‎Communications: A Case from the Film Industry‎

Gender expansive listeners utilize a non-binary, multidimensional conception of gender to inform voice gender perception

Perception of indexical cues in speech by children and adults with and without dyslexia: Regional dialect and gender identification.

Covid-19 and Domestic Violence in Kashmir: Gender Voices.

Feminization Laryngoplasty: 17-Year Review on Long-Term Outcomes, Safety, and Technique.

Effect of AI Chatbots´ Gender in the Business Context: Empirical Evidence from Debt Collection

Perception of Social Odor and Gender-Related Differences Investigated Through the Use of Transfer Entropy and Embodied Medium.

Modern Muslims’ Online Struggle: Countering Islamophobia One Tweet at a Time

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Voice Gender Research Articles

Related Topics

Articles published on Voice Gender

Differential Effects of Binaural Pitch Fusion Range on the Benefits of Voice Gender Differences in a "Cocktail Party" Environment for Bimodal and Bilateral Cochlear Implant Users.

Towards a History of Syriac Rhetoric in Late Antiquity

Multiplicity in the experience of voice-hearing: A phenomenological inquiry

Gender voices in Afghanistan primary and secondary school English textbooks

Stimulation Rate and Voice Pitch Perception in Cochlear Implants.

Vocal Depersonalization in Scat Singing

Do Men Have No Need for “Feminist” Artificial Intelligence? Agentic and Gendered Voice Assistants in the Light of Basic Psychological Needs

A Hybrid Ensemble Stacking Model for Gender Voice Recognition Approach

GRaNN: feature selection with golden ratio-aided neural network for emotion, gender and speaker identification from voice signals

Phonological but not lexical processing alters the perceptual weighting of mean fundamental frequency and vocal-tract length cues for voice gender categorisation

Exploring racial and gender disparities in voice biometrics

Gender Identification from Speech Recognition Using Machine Learning Techniques and Convolutional Neural Networks

Gendered Voices and Ideological ‎Communications: A Case from the Film Industry‎

Gender expansive listeners utilize a non-binary, multidimensional conception of gender to inform voice gender perception

Perception of indexical cues in speech by children and adults with and without dyslexia: Regional dialect and gender identification.

Covid-19 and Domestic Violence in Kashmir: Gender Voices.

Feminization Laryngoplasty: 17-Year Review on Long-Term Outcomes, Safety, and Technique.

Effect of AI Chatbots´ Gender in the Business Context: Empirical Evidence from Debt Collection

Perception of Social Odor and Gender-Related Differences Investigated Through the Use of Transfer Entropy and Embodied Medium.

Modern Muslims’ Online Struggle: Countering Islamophobia One Tweet at a Time