Kurdish Spoken Dialect Recognition Using X-Vector Speaker Embedding

Arash Amani,Hadi Veisi,Mohammad Mohammadamini

doi:10.1007/978-3-030-87802-3_5

Abstract

This paper presents a dialect recognition system for the Kurdish language using speaker embeddings. Two main goals are followed in this research: first, we investigate the availability of dialect information in speaker embeddings, then this information is used for spoken dialect recognition in the Kurdish language. Second, we introduce a public dataset for Kurdish spoken dialect recognition named Zar. The Zar dataset comprises 16,385 utterances in 49h-36min for five dialects of the Kurdish language (Northern Kurdish, Central Kurdish, Southern Kurdish, Hawrami, and Zazaki). The dialect recognition is done with x-vector speaker embedding which is trained for speaker recognition using Vox-celeb1 and Voxceleb2 datasets. After that, the extracted x-vectors are used to train support vector machine (SVM) and decision tree classifiers for dialect recognition. The results are compared with an i-vector system that is trained specifically for Kurdish spoken dialect recognition. In both systems (i-vector and x-vector), the SVM classifier with 86% of precision results in better performance. Our results show that the information preserved in the speaker embeddings can be used for automatic dialect recognition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Kurdish Spoken Dialect Recognition Using X-Vector Speaker Embedding

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Dataset for the recognition of Kurdish sound dialects
Karwan M Hama Rawf ... Karzan J Ghafoor
Data in Brief | VOL. 53
Karwan M Hama Rawf, et. al.Karwan M Hama Rawf ... Karzan J Ghafoor
22 Feb 2024
Data in Brief | VOL. 53

Real and Complex Wavelet Transform Using Singular Value Decomposition for Malaysian Speaker and Accent Recognition
Rokiah Abdullah ... Hariharan Muthusamy
-
Rokiah Abdullah, et. al.Rokiah Abdullah ... Hariharan Muthusamy
06 Aug 2020
06 Aug 2020

Kurdish Dialect Recognition using 1D CNN
Karzan J Ghafoor ... Sarkhel H Taher
ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY | VOL. 9
Karzan J Ghafoor, et. al.Karzan J Ghafoor ... Sarkhel H Taher
15 Oct 2021
ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY | VOL. 9

Automated microseismic event detection with machine learning
Zhengguang Zhao
-
Zhengguang ZhaoZhengguang Zhao
08 Oct 2021
08 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Kurdish Spoken Dialect Recognition Using X-Vector Speaker Embedding

Abstract

Talk to us

Similar Papers