Optimized large vocabulary WFST speech recognition system

Yuhong Guo,Yujing Si,Jielin Pan,Ta Li,Yonghong Yan

doi:10.1109/fskd.2012.6234200

Abstract

Speech recognition decoder is an important part of large vocabulary speech recognition application. The speed and the accuracy is the main concern of its application. Recently, weighted finite state transducers (WFST) has become the dominant description of decoding network. However, the large memory and time cost of constructing the final WFST decoding network is the bottleneck of this technique. The goal of this article is to construct a tight, flexible WFST decoding network as well as a fast, scalable decoder. A tight representation of silence in speech is proposed and the decoding algorithm with improved pruning strategies is also suggested. The experimental results show that the proposed network presentation will cut off 37% memory cost and 19% time cost of constructing the final decoding network. And with the decoding strategies of WFST feature specified beams the proposed decoder's efficiency and accuracy are also significantly improved.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimized large vocabulary WFST speech recognition system

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A study of large vocabulary speech recognition decoding using finite-state graphs
Zhijian Ou ... Ji Xiao
-
Zhijian Ou, et. al.Zhijian Ou ... Ji Xiao
01 Nov 2010
01 Nov 2010

UNFOLD
Reza Yazdani ... Antonio González
-
Reza Yazdani, et. al.Reza Yazdani ... Antonio González
14 Oct 2017
14 Oct 2017

WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding
Björn Hoffmeister ... Hermann Ney
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 20
Björn Hoffmeister, et. al.Björn Hoffmeister ... Hermann Ney
01 Feb 2012
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 20

A synchronized pruning composition algorithm of weighted finite state transducers for large vocabulary speech recognition
Zhiyang He ... Ping Lv
-
Zhiyang He, et. al.Zhiyang He ... Ping Lv
01 Dec 2012
01 Dec 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimized large vocabulary WFST speech recognition system

Abstract

Talk to us

Similar Papers