Abstract

In a Korean connected digit recogniser, duration modelling is necessary to reduce insertion and deletion errors due to monophonemic digits, which cannot usually be corrected even by discriminative training algorithms. In the Letter the authors incorporate context-dependent word duration modelling directly in a decoding algorithm to reduce those errors. While incorporating duration information in the postprocessing stage shows little improvements over a baseline system, the proposed method reduces word error rates by ~10% for unknown length decoding when both maximum likelihood estimation and generalised probabilistic descent training are used.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call