An Optimal Feature Parameter Set Based on Gated Recurrent Unit Recurrent Neural Networks for Speech Segment Detection

Özlem Batur Di̇nler,Nizamettin Aydin

doi:10.3390/app10041273

Abstract

Speech segment detection based on gated recurrent unit (GRU) recurrent neural networks for the Kurdish language was investigated in the present study. The novelties of the current research are the utilization of a GRU in Kurdish speech segment detection, creation of a unique database from the Kurdish language, and optimization of processing parameters for Kurdish speech segmentation. This study is the first attempt to find the optimal feature parameters of the model and to form a large Kurdish vocabulary dataset for a speech segment detection based on consonant, vowel, and silence (C/V/S) discrimination. For this purpose, four window sizes and three window types with three hybrid feature vector techniques were used to describe the phoneme boundaries. Identification of the phoneme boundaries using a GRU recurrent neural network was performed with six different classification algorithms for the C/V/S discrimination. We have demonstrated that the GRU model has achieved outstanding speech segmentation performance for characterizing Kurdish acoustic signals. The experimental findings of the present study show the significance of the segment detection of speech signals by effectively utilizing hybrid features, window sizes, window types, and classification models for Kurdish speech.

Highlights

Speech is a sign that contains a lot of personal information
According to the results presented in the tables, the best accuracy was obtained using EZDDMFCC
The results based on three different window types gave the highest accuracy values of 97.25% for the Hamming windows, 97.04% for the Hanning windows, and 97.60% for the rectangular windows for male speakers, and 95.10% for the Hamming windows, 95.63% for the Hanning windows, and

Summary

Introduction

Speech is a sign that contains a lot of personal information. Together with the developing technology, a wide variety of applications are developed using the information obtained from this speech signal. Segmentation represents a procedure of breaking down a speech signal into smaller acoustic units. It is possible to define speech segmentation as the procedure of finding limits in a natural spoken language between words, syllables, or phonemes [1]. Speech segment detection is one of the most commonly used technologies for recognizing spoken words and expressions and converting them into a format that can be understood by machines, especially computers

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Feb 13, 2020
Citations: 28	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An Optimal Feature Parameter Set Based on Gated Recurrent Unit Recurrent Neural Networks for Speech Segment Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Predicting and optimizing multirow film cooling with trenches using gated recurrent unit neural network
Yaning Wang ... Zirui Wang
Physics of Fluids | VOL. 34
Yaning Wang, et. al.Yaning Wang ... Zirui Wang
01 Apr 2022
Physics of Fluids | VOL. 34

A comparative study on long short-term memory and gated recurrent unit neural networks in fault diagnosis for chemical processes using visualization
Somayeh Mirzaei ... Kuang-Yi Chu
Journal of the Taiwan Institute of Chemical Engineers | VOL. 130
Somayeh Mirzaei, et. al.Somayeh Mirzaei ... Kuang-Yi Chu
17 Aug 2021
Journal of the Taiwan Institute of Chemical Engineers | VOL. 130

Application of recurrent neural network in prognosis of peritoneal dialysis
...
Beijing da xue xue bao. Yi xue ban = Journal of Peking University. Health sciences | VOL. 51
, et. al. ...
18 Jun 2019
Beijing da xue xue bao. Yi xue ban = Journal of Peking University. Health sciences | VOL. 51

Short-Term Wind Speed and Temperature Forecasting Model Based on Gated Recurrent Unit Neural Networks
Fahad Radhi Alharbi ... Denes Csala
-
Fahad Radhi Alharbi, et. al.Fahad Radhi Alharbi ... Denes Csala
05 Oct 2021
05 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Optimal Feature Parameter Set Based on Gated Recurrent Unit Recurrent Neural Networks for Speech Segment Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences