Abstract

In this demonstration, we show a real-time transcription of TV broadcast news in Japanese using a very large vocabulary speech recognition system developed at BBN Technologies. Both signal processing and speech recognition are run on a commodity notebook computer. Transcription word error rate is about 1.5% with average word latency less than 2 seconds. The high recognition accuracy in real time is achieved by a fast, and with low latency, 2-pass Byblos recognizer utilizing good acoustic and language models trained on the NHK Broadcast News Corpus.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call