Spoken Language Translation System Research Articles

This paper describes the development of a multilingual and multigenre manually annotated speech dataset, freely available to the research community as ground truth for the evaluation of automatic transcription systems and spoken language translation systems. The dataset includes two video genres--television broadcast news and talk-shows--and covers Flemish, English, German, and Italian, for a total of about 35 h of television speech. Besides segmentation and orthographic transcription, we added a very rich annotation on the audio signal, both at the linguistic level (e.g. filled pauses, pronunciation errors, disfluencies, speech in a foreign language) and at the acoustic level (e.g. background noise and different types of non-speech events). Furthermore, a subset of the transcriptions is translated in four directions, namely Flemish to English, German to English, German to Italian and English to Italian. The development of this dataset was organized in several phases, relying on expert transcribers as well as involving non-expert contributors through crowdsourcing. We first conducted a feasibility study to test and compare two methods for crowdsourcing speech transcription on broadcast news data. These methods are based on different transcription processes (i.e. parallel vs. iterative) and incorporate two different quality control mechanisms. With both methods, we achieved near-expert transcription quality--in terms of word error rate--for English, German and Italian data. Instead, for Flemish data we were not able to get a sufficient response from the crowd to complete the offered transcription tasks. The results obtained demonstrate that the viability of methods for crowdsourcing speech transcription significantly depends on the target language. This paper provides a detailed comparison of the results obtained with the two crowdsourcing methods tested, describes the main characteristics of the final ground truth resource created as well as the methodology adopted, and the guidelines prepared for its development.

India possesses a large variety of languages and dialects spoken in different parts of the country. These languages possess some unique linguistic, phonological, and phonetic properties different from European languages. Research is being done in several of Indian languages—such as Hindi, Bangla, etc. to study the articulatory, acoustic—phonetic and prosodic nature for the purpose of creating standards of phonetic representation of phonemes and Pronunciation Lexicon in Indian Languages. Comprehensive and task specific language corpora, speech databases in laboratory as well as in mobile communication situation and the tools/technics required for processing of speech signals are being developed. The emphasis is on developing multilingual human-machine interaction systems. Some of the recently developed systems include multi-lingual speech recognition system for voice enabled services, multilingual text to speech synthesis system, speaker and language identification system for general purpose and forensic applications. Recognition of emotions in spoken speech, spoken language translation system, etc. The paper presents an overview of such studies conducted in various laboratories, academic institutions, and industries in India pertaining to these areas. The technologies used for data collection, processing, and recognition/ synthesis, etc., utilized and status of the development have been mentioned.

Spoken Language Translation System Research Articles

Related Topics

Articles published on Spoken Language Translation System

Creating a ground truth multilingual dataset of news and talk show transcriptions through crowdsourcing

Vollautomatisches Dolmetschen – Möglichkeiten und Grenzen

Lightly supervised word-sense translation-error detection and resolution in an interactive conversational spoken language translation system

An overview of the development of resources, techniques, and, systems for Indian spoken languages

A-STAR: Toward translating Asian spoken languages

An ARM-Based System-on-a-Programmable-Chip Architecture for Spoken Language Translation

Paraphrasing spoken Chinese using a paraphrase corpus

Toward Practical Spoken Language Translation

Analyzing inflectional morphology in a spoken language translation system

Detection of sentence accents in a speech recognition system.

Opportunities for advanced speech processing in military computer-based systems

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Spoken Language Translation System Research Articles

Related Topics

Articles published on Spoken Language Translation System

Creating a ground truth multilingual dataset of news and talk show transcriptions through crowdsourcing

Vollautomatisches Dolmetschen – Möglichkeiten und Grenzen

Lightly supervised word-sense translation-error detection and resolution in an interactive conversational spoken language translation system

An overview of the development of resources, techniques, and, systems for Indian spoken languages

A-STAR: Toward translating Asian spoken languages

An ARM-Based System-on-a-Programmable-Chip Architecture for Spoken Language Translation

Paraphrasing spoken Chinese using a paraphrase corpus

Toward Practical Spoken Language Translation

Analyzing inflectional morphology in a spoken language translation system

Detection of sentence accents in a speech recognition system.

Opportunities for advanced speech processing in military computer-based systems