Text- and speech-based phonotactic models for spoken language identification of Basque and Spanish

Víctor G Guijarrubia,M Inés Torres

doi:10.1016/j.patrec.2009.11.014

Abstract

This paper presents a series of spoken language identification experiments involving Spanish and Basque. Spanish and Basque are both official languages in the Basque Country, a region located in northern Spain. We focused our research on the study of several phonotactic-based methodologies, analysing at the same time the performance of phonotactic models trained from text and speech samples and the use of phone and phone sequences as decoding units. Although we focus mainly on Spanish–Basque identification, the analysis is later extended to English, so that more generic conclusions can be drawn. From the bilingual results, we can conclude that the text-based phonotactic models can perform similarly to the audio-based ones when applied to read speech. Moreover, when using task-specific information it is also possible to achieve a high accuracy. The use of phone sequences as decoding units results, in most of the cases, in a decrease in performance and appears to be useful when constraining the phone decoders to those sequences. Similar conclusions can be drawn from the trilingual experiments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Text- and speech-based phonotactic models for spoken language identification of Basque and Spanish

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: Nov 26, 2009
Citations: 8

Similar Papers

Comparative Study of Several Phonotactic-Based Approaches to Spanish-Basque Language Identification
Víctor G Guijarrubia ... M Inés Torres
-
Víctor G Guijarrubia, et. al.Víctor G Guijarrubia ... M Inés Torres
01 Jan 2008
01 Jan 2008

Significance of neural phonotactic models for large-scale spoken language identification
Brij Mohan Lal Srivastava ... Hari Vydana
-
Brij Mohan Lal Srivastava, et. al.Brij Mohan Lal Srivastava ... Hari Vydana
01 May 2017
01 May 2017

Automatic segmentation and labeling of speech
A. Ljolje ... M.D. Riley
-
A. Ljolje, et. al.A. Ljolje ... M.D. Riley
01 Jan 1991
01 Jan 1991

Language revitalization and the normalization of Basque: a study of teacher perceptions and expectations in the Basque Country
Concepción Valadez ... Nahia Intxausti
Current Issues in Language Planning | VOL. 16
Concepción Valadez, et. al.Concepción Valadez ... Nahia Intxausti
09 Sep 2014
Current Issues in Language Planning | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Text- and speech-based phonotactic models for spoken language identification of Basque and Spanish

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters