Deep features-based dialect and mood recognition using Assamese telephonic speech

Kandarpa Kumar Sarma,Mridusmita Sharma

doi:10.1504/ijict.2020.10031570

Abstract

Learning aided methods are popular for designing automatic speech recognition (ASR) systems. Majority of works have used shallow models in combination with mel frequency cepstral coefficients (MFCC) and other features for speech recognition applications. Although these shallow models are effective but incorporating deep features in the mechanism for speech processing applications is necessary to increase the efficiency. Despite of considerable amount of works on the design of deep learning topologies and training paradigms in supervised domain, very few works have concentrated on deep features which are essential to capture detailed information of speech. This work focuses on the generation of deep features using stacked auto-encoder for normal and time shifted telephonic speech samples in Assamese language with mood and dialect variations. Experimental results show that the deep features learned by the stacked auto-encoder performs better while it is configured for Assamese speech recognition with mood and dialect variations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep features-based dialect and mood recognition using Assamese telephonic speech

Abstract

Talk to us

Similar Papers

More From: International Journal of Information and Communication Technology

Lead the way for us

Similar Papers

Deep features-based dialect and mood recognition using Assamese telephonic speech
Mridusmita Sharma ... Kandarpa Kumar Sarma
International Journal of Information and Communication Technology | VOL. 17
Mridusmita Sharma, et. al.Mridusmita Sharma ... Kandarpa Kumar Sarma
01 Jan 2020
International Journal of Information and Communication Technology | VOL. 17

Learning aided mood and dialect recognition using telephonic speech
Mridusmita Sharma ... Kandarpa Kumar Sarma
-
Mridusmita Sharma, et. al.Mridusmita Sharma ... Kandarpa Kumar Sarma
01 Dec 2016
01 Dec 2016

Performance Analysis of various Front-end and Back End Amalgamations for Noise-robust DNN-based ASR
Mohit Dua ... Vinam Agrawal
Recent Advances in Computer Science and Communications | VOL. 14
Mohit Dua, et. al.Mohit Dua ... Vinam Agrawal
01 Dec 2021
Recent Advances in Computer Science and Communications | VOL. 14

Prosodic Feature-Based Discriminatively Trained Low Resource Speech Recognition System
Taniya Hasija ... Kalpna Guleria
Sustainability | VOL. 14
Taniya Hasija, et. al.Taniya Hasija ... Kalpna Guleria
06 Jan 2022
Sustainability | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep features-based dialect and mood recognition using Assamese telephonic speech

Abstract

Talk to us

Similar Papers

More From: International Journal of Information and Communication Technology