Language Identification System Research Articles

Spoken language identification is the process of recognising language in an audio segment and is the precursor for several technologies such as automatic call routing, language recognition, multilingual conversation, language parsing, and sentimental analysis. Language identification has become a challenging task for low-resource languages like Kashmiri and Ladakhi spoken in the UT’s of Jammu and Kashmir (JK) and Ladakh, India. This is mainly due to speaker variations like duration, moderator, and ambiance particularly when training and testing are done on different datasets whilst analysing the accuracy of language identification system in actual implementation, thus producing low accuracy results. In order to tackle this problem, we propose a hybrid convolutional bi-directional gated recurrent unit (Bi-GRU) utilising the effects of both static and dynamic behaviour of the audio signal in order to achieve better results as compared to state-of-the-art models. The audio signals are first converted into two-dimensional structures called Mel-spectrograms to represent the frequency distribution over time. To investigate the spectral behaviour of audio signals, we employ a convolutional neural network (CNN) that perceives Mel-spectrograms in multiple dimensions. The CNN-learned feature vector serves as input to the Bi-GRU that maintains the dynamic behaviour of the audio signal. Experiments are done on six spoken languages, i.e. Ladakhi, Kashmiri, Hindi, Urdu, English, and Dogri. The data corpora used for experimentation are the International Institute of Information Technology Hyderabad-Indian Language Speech Corpus (IIITH-ILSC) and the self-created data corpus for the Ladakhi language. The model is tested on two datasets, i.e. speaker-dependent and speaker-independent. Results show that when validating the efficiency of our proposed model on both speaker-dependent and speaker-independent datasets, we achieve optimal accuracies of 99% and 91%, respectively, thus achieving promising results in comparison to the state-of-the-art models available.

Read full abstract

Objective: Spoken language identification being the fore-front of language recognition tasks and most significant medium of communication has to be enhanced in order to improve the accuracy of recently developed spoken language recognition systems. The purpose of this paper is to enhance the Spoken Language Identification (SLID) model using hybrid machine learning with deep learning model for regionally spoken languages of Jammu & Kashmir (JK) and Ladakh. Method: Initially, the speech signals of different languages of JK and Ladakh are manually collected from diverse sources, and it is preprocessed using Spectral Noise Gate (SNG) filtering technique. Once the speech signals are pre-processed, the feature extraction is performed by the cepstral features like Mel-frequency Cepstral Coefficients (MFCCs), Relative Spectral Transform-Perceptual Linear Prediction (RASTA-PLP), and spectral features like spectral roll off, spectral flatness. Findings: From this feature extraction, the length of the feature vector seems to be long, and it is required to reduce the feature length. Hence, optimal feature selection is accomplished using the new meta-heuristic algorithm termed Adaptive Distance-based Tunicate Swarm Algorithm (AD-TSA) by considering the minimum correlation as objective. Finally, the language identification is handled by the hybrid classifier termed Improved Support Vector Machine-Recurrent Neural Network (ISVM-RNN). Novelty: The identification learning algorithm is enhanced by the AD-TSA by considering the minimum correlation as objective among features in order to get minimum number of features that are sufficient for language identification process. The efficiency of the proposed hybrid approach is validated by simulating the experiment on a user-defined language database of JK and Ladakh speech signals in the working platform of Python. Keywords: Language Identification; Kashmir Languages; Optimal Feature Selection; Improved Support Vector MachineRecurrent Neural Network; Adaptive DistanceBased Tunicate Swarm Algorithm

Read full abstract

Language Identification System Research Articles

Related Topics

Articles published on Language Identification System

Generative adversarial networks for whispered to voiced speech conversion: a comparative study

Convolutional neural network based language identification system: A spectrogram based approach

Improved Arithmetic Optimization Algorithm with Transfer Learning based Arabic Sign Language Identification System

Implementation of Sibi Sign Language Realtime Detection Program (Case Studi At Sekolah Luar Biasa Negeri 1 Tabanan)

Leveraging BERT to Improve Spoken Language Identification of Code-Switching Speech

A comparison of cepstral and spectral features using recurrent neural network for spoken language identification

Towards audio-based identification of Ethio-Semitic languages using recurrent neural network

A Hybrid Convolutional Bi-Directional Gated Recurrent Unit System for Spoken Languages of JK and Ladakhi

Identifying languages in a novel dataset: ASMR-whispered speech.

Sentiment analysis based offensive language identification system for code-mixed data

Review of Automatic Language Identification System in Indian languages from the Non-Uniform Region

Improved Support Vector-Recurrent Neural Network with Optimal Feature Selection-based Spoken Language Identification System

Cross-corpora spoken language identification with domain diversification and generalization

Spoken language identification using a genetic-based fusion approach to combine acoustic and universal phonetic results

An Overview of Indian Spoken Language Recognition from Machine Learning Perspective

Language Identification-Based Evaluation of Single Channel Speech Separation of Overlapped Speeches

A comprehensive approach for performance evaluation of Indian language identification systems

Spoken Language Identification System Using Convolutional Recurrent Neural Network

Spoken language identification in unseen channel conditions using modified within-sample similarity loss

A Systematic Review on Language Identification of Code-Mixed Text: Techniques, Data Availability, Challenges, and Framework Development

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Language Identification System Research Articles

Related Topics

Articles published on Language Identification System

Generative adversarial networks for whispered to voiced speech conversion: a comparative study

Convolutional neural network based language identification system: A spectrogram based approach

Improved Arithmetic Optimization Algorithm with Transfer Learning based Arabic Sign Language Identification System

Implementation of Sibi Sign Language Realtime Detection Program (Case Studi At Sekolah Luar Biasa Negeri 1 Tabanan)

Leveraging BERT to Improve Spoken Language Identification of Code-Switching Speech

A comparison of cepstral and spectral features using recurrent neural network for spoken language identification

Towards audio-based identification of Ethio-Semitic languages using recurrent neural network

A Hybrid Convolutional Bi-Directional Gated Recurrent Unit System for Spoken Languages of JK and Ladakhi

Identifying languages in a novel dataset: ASMR-whispered speech.

Sentiment analysis based offensive language identification system for code-mixed data

Review of Automatic Language Identification System in Indian languages from the Non-Uniform Region

Improved Support Vector-Recurrent Neural Network with Optimal Feature Selection-based Spoken Language Identification System

Cross-corpora spoken language identification with domain diversification and generalization

Spoken language identification using a genetic-based fusion approach to combine acoustic and universal phonetic results

An Overview of Indian Spoken Language Recognition from Machine Learning Perspective

Language Identification-Based Evaluation of Single Channel Speech Separation of Overlapped Speeches

A comprehensive approach for performance evaluation of Indian language identification systems

Spoken Language Identification System Using Convolutional Recurrent Neural Network

Spoken language identification in unseen channel conditions using modified within-sample similarity loss

A Systematic Review on Language Identification of Code-Mixed Text: Techniques, Data Availability, Challenges, and Framework Development