Multilingual and unsupervised subword modeling for zero-resource languages

Enno Hermann, Herman Kamper, Sharon Goldwater

doi:10.1016/j.csl.2020.101098


Open Access

https://doi.org/10.1016/j.csl.2020.101098


Abstract

Subword modeling for zero-resource languages aims to learn low-level representations of speech audio without using transcriptions or other resources from the target language (such as text corpora or pronunciation dictionaries). A good representation should capture phonetic content and abstract away from other types of variability, such as speaker differences and channel noise. Previous work in this area has primarily focused on unsupervised learning from target language data only, and has been evaluated only intrinsically. Here we directly compare multiple methods, including some that use only target language speech data and some that use transcribed speech from other (non-target) languages, and we evaluate them using two intrinsic measures as well as a downstream unsupervised word segmentation and clustering task. We find that combining two existing target-language-only methods yields better features than either method alone. Nevertheless, even better results are obtained by extracting target language bottleneck features using a model trained on other languages. Cross-lingual training using just one other language is enough to provide this benefit, but multilingual training helps even more. In addition to these results, which hold across both intrinsic measures and the extrinsic task, we discuss the qualitative differences between the different types of learned features.


Journal: Computer Speech & Language
Publication Date: Apr 17, 2020
Citations: 22
License type: cc-by-nc-nd


