Abstract

Numerous studies have investigated automatic speech recognition tasks, such as content-based speech recognition, using machine learning techniques such as deep learning. In general, each speech sample carries four main human-based attributes: content, emotion, gender, and speaker identity. Among these, content has the lowest correlation with the other three. However, when classifying speech samples with respect to one attribute, models typically ignore the presence of the unrelated attributes. This study shows that information about these non-content attributes is not always useful and can cause a content-based speech classifier to significantly underperform. Moreover, it is possible to weaken the effect of one, two, or three of these attributes, and the order in which they are weakened is crucial. For this purpose, two-input, two-output autoencoders are proposed as a feature-extraction method; these networks are specifically designed to reduce the information that the unwanted attributes (one, two, or three of them) contribute to the features. The change in classifier performance caused by using these pre-trained autoencoders makes it possible to rank the negative effect of each human-based attribute. Based on the results obtained, gender has the most negative effect on the performance of content-based speech recognition models, and serial weakening gives the best results when the attributes are considered in the following order: gender, speaker identity, and emotion.
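The abstract does not spell out how the two-input, two-output autoencoders are constructed, so the following is only a minimal sketch of one plausible reading: pairs of samples that share content but differ in an unwanted attribute (here, gender) are cross-reconstructed, so the shared encoding is pushed to retain content and shed the attribute. PyTorch, the layer sizes, the pairing scheme, and the cross-reconstruction loss are all assumptions for illustration, not the paper's actual design.

```python
# Hypothetical sketch of a two-input, two-output autoencoder; not the
# authors' implementation. Assumes fixed-size speech feature vectors
# (e.g., flattened MFCC frames of dimension FEAT_DIM).
import torch
import torch.nn as nn

FEAT_DIM = 256   # assumed flattened feature size
CODE_DIM = 64    # assumed bottleneck size

class TwoInTwoOutAE(nn.Module):
    def __init__(self, feat_dim=FEAT_DIM, code_dim=CODE_DIM):
        super().__init__()
        # Shared encoder maps both inputs into one bottleneck space.
        self.encoder = nn.Sequential(
            nn.Linear(feat_dim, 128), nn.ReLU(),
            nn.Linear(128, code_dim),
        )
        # One decoder per output branch.
        self.decoder_a = nn.Sequential(
            nn.Linear(code_dim, 128), nn.ReLU(),
            nn.Linear(128, feat_dim),
        )
        self.decoder_b = nn.Sequential(
            nn.Linear(code_dim, 128), nn.ReLU(),
            nn.Linear(128, feat_dim),
        )

    def forward(self, x_a, x_b):
        z_a, z_b = self.encoder(x_a), self.encoder(x_b)
        # Assumed cross-reconstruction: each branch decodes the code of
        # the *other* input, so only information shared by the pair
        # (ideally the content) survives in the bottleneck.
        return self.decoder_a(z_b), self.decoder_b(z_a)

model = TwoInTwoOutAE()
loss_fn = nn.MSELoss()
x_a = torch.randn(8, FEAT_DIM)   # e.g., a batch from male speakers
x_b = torch.randn(8, FEAT_DIM)   # same content, female speakers (assumed pairing)
out_a, out_b = model(x_a, x_b)
loss = loss_fn(out_a, x_a) + loss_fn(out_b, x_b)
loss.backward()
```

Under this reading, weakening two or three attributes serially would amount to chaining such pre-trained autoencoders, feeding the bottleneck features of one stage into the next, in the order the paper reports as best: gender, then speaker identity, then emotion.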
