Identification of Indian languages using multi-level spectral and prosodic features

V Ramu Reddy,K Sreenivasa Rao,Sudhamay Maity

doi:10.1007/s10772-013-9198-0

Abstract

In this paper spectral and prosodic features extracted from different levels are explored for analyzing the language specific information present in speech. In this work, spectral features extracted from frames of 20 ms (block processing), individual pitch cycles (pitch synchronous analysis) and glottal closure regions are used for discriminating the languages. Prosodic features extracted from syllable, tri-syllable and multi-word (phrase) levels are proposed in addition to spectral features for capturing the language specific information. In this study, language specific prosody is represented by intonation, rhythm and stress features at syllable and tri-syllable (words) levels, whereas temporal variations in fundamental frequency (F 0 contour), durations of syllables and temporal variations in intensities (energy contour) are used to represent the prosody at multi-word (phrase) level. For analyzing the language specific information in the proposed features, Indian language speech database (IITKGP-MLILSC) is used. Gaussian mixture models are used to capture the language specific information from the proposed features. The evaluation results indicate that language identification performance is improved with combination of features. Performance of proposed features is also analyzed on standard Oregon Graduate Institute Multi-Language Telephone-based Speech (OGI-MLTS) database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Identification of Indian languages using multi-level spectral and prosodic features

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: May 31, 2013
Citations: 70

Similar Papers

Pitch synchronous and glottal closure based speech analysis for language recognition
K Sreenivasa Rao ... V Ramu Reddy
International Journal of Speech Technology | VOL. 16
K Sreenivasa Rao, et. al.K Sreenivasa Rao ... V Ramu Reddy
12 Apr 2013
International Journal of Speech Technology | VOL. 16

Multilingual Speech Corpus in Low-Resource Eastern and Northeastern Indian Languages for Speaker and Language Identification
Joyanta Basu ... Tapan Kumar Basu
Circuits, Systems, and Signal Processing | VOL. 40
Joyanta Basu, et. al.Joyanta Basu ... Tapan Kumar Basu
20 Apr 2021
Circuits, Systems, and Signal Processing | VOL. 40

IITKGP-MLILSC speech database for language identification
Sudhamay Maity ... Dipanjan Nandi
-
Sudhamay Maity, et. al.Sudhamay Maity ... Dipanjan Nandi
01 Feb 2012
01 Feb 2012

Language Identification Using Spectral Features
K Sreenivasa Rao ... Sudhamay Maity
-
K Sreenivasa Rao, et. al.K Sreenivasa Rao ... Sudhamay Maity
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identification of Indian languages using multi-level spectral and prosodic features

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology