A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation

Gaurav Kumar,Daniel Povey,Jan Trmal,Sanjeev Khudanpur,Graeme Blackwood

doi:10.18653/v1/d15-1218

Abstract

Speech translation is conventionally carried out by cascading an automatic speech recognition (ASR) and a statistical machine translation (SMT) system. The hypotheses chosen for translation are based on the ASR system’s acoustic and language model scores, and typically optimized for word error rate, ignoring the intended downstream use: automatic translation. In this paper, we present a coarseto-fine model that uses features from the ASR and SMT systems to optimize this coupling. We demonstrate that several standard features utilized by ASR and SMT systems can be used in such a model at the speech-translation interface, and we provide empirical results on the Fisher Spanish-English speech translation corpus.

Highlights

Speech translation is the process of translating speech in the source language to text or speech in the target language
This paper presents a featurized model which performs the job of hypothesis selection from the outputs of the Automatic Speech Recognition (ASR) system for the input to the statistical machine translation (SMT) system
We present a general framework in which hypothesis selection can be carried out using knowledge from the ASR and the SMT system

Summary

Introduction

Speech translation is the process of translating speech in the source language to text or speech in the target language. Step three involves training and tuning a Statistical Machine Translation (SMT) system and decoding the output extracted through the speech translation interface. There may exist hypotheses that a trained SMT system may find easier to translate and produce better translations for than the ones that are deemed best based on the ASR acoustic and language model scores. 2. Coarse-to-fine grained decoding : An intermediate model which acts as an interface and is a weak (coarse) version of the downstream process may be able to select better hypotheses. A weak translation decoder can be used as the interface to estimate the expected translation quality of an ASR hypothesis This method of hypothesis selection should be able to incorporate features from the ASR and the SMT system. Optimization for hypothesis selection at the Speech-Translation interface should be conducted using phrases as the basic unit instead of words

Coarse-to-Fine Speech Translation

A simple model : Maximum Spanning Phrases

A general featurized model for hypothesis selection

A discussion about related techniques

Training

Features

Results

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2015
Citations: 16	License type: cc-by

Similar Papers

Some insights from translating conversational telephone speech
Gaurav Kumar ... Daniel Povey
-
Gaurav Kumar, et. al.Gaurav Kumar ... Daniel Povey
01 May 2014
01 May 2014

Training, Enhancing, Evaluating and Using MT Systems with Comparable Data
Bogdan Babych ... Sabine Hunsicker
-
Bogdan Babych, et. al.Bogdan Babych ... Sabine Hunsicker
01 Jan 2019
01 Jan 2019

QCRI's Live Speech Translation System
Fahim Dalvi ... Ahmed Ali
-
Fahim Dalvi, et. al.Fahim Dalvi ... Ahmed Ali
01 Jan 2018
01 Jan 2018

Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems
Kartik Audhkhasi ... Andreas M Zavou
IEEE/ACM transactions on audio, speech, and language processing | VOL. 22
Kartik Audhkhasi, et. al.Kartik Audhkhasi ... Andreas M Zavou
01 Mar 2014
IEEE/ACM transactions on audio, speech, and language processing | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation

Abstract

Highlights

Summary

Talk to us

Similar Papers