Abstract
Objective: Humans with their developed senses can easily ascertain a person’s gender just by listening to a few uttered words and it does not take any conscious additional effort to do so, however, a machine cannot do the same unless trained. This research proposes a real time system to identify a person’s gender from their voice. Methods: Features are extracted from the dataset and checked for outliers. Then a baseline classifier is constructed to measure performance of the different models. Next the dataset is prepared for training and five machine learning models, Decision Tree Classifier, Random Forest Classifier, K Nearest Neighbours, Support Vector Machine and Gaussian Naive Bayes Classifier are applied. Finally, real time prediction is done by taking speech input and analysing it against the trained model, after input of speech the gender along with accuracy of prediction is displayed within 1.37s. Results: A maximum accuracy score of 88.19% is obtained using SVM. Additionally, the juxtaposition of the feature importance graph highlights the two most important features which fuel this classification. A combination of these features is then studied to design a less complex system and it is observed that using just MFCCs and Chroma Vector a near optimal accuracy score of 87.78% is obtained. Conclusion: Identification of gender prior to applying speech recognition and emotion recognition algorithm can help in reduction of the search space. Further, using only MFCC and Chroma Vector can make the system memory efficient and yet provide near optimal accuracy. The system can be used as an authentication mechanism and can be installed in public places.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Recent Advances in Computer Science and Communications
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.