Automatic acoustic segmentation in N-best list rescoring for lecture speech recognition

Peng Shen, Hisashi Kawai, Xugang Lu

doi:10.1109/iscslp.2016.7918409

Abstract

Speech segmentation is important in automatic speech recognition (ASR) and machine translation (MT). In N-best list rescoring in particular, generating N-best lists that contain as many candidates as possible from a decoding lattice requires proper utterance segmentation. In lecture speech recognition, only long audio recordings are provided, without any utterance segmentation information. In addition, the recordings contain not only speech but also other acoustic events, e.g., laughter and applause. Traditional speech segmentation algorithms for ASR focus on acoustic cues, while text segmentation algorithms for MT rely mainly on linguistic cues. In this study, we propose a three-stage speech segmentation framework that integrates both acoustic and linguistic cues. We tested the segmentation framework on lecture speech recognition. Our results showed the effectiveness of the proposed segmentation algorithm.

Similar Papers

Integration of Speech Recognition and Machine Translation in Computer-Assisted Translation
Shahram Khadivi ... Hermann Ney
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 16
01 Nov 2008

Speech fine structure contains critical temporal cues to support speech segmentation
Xiangbin Teng ... David Poeppel
NeuroImage | VOL. 202
01 Sep 2019

A semi-Markov model for speech segmentation with an utterance-break prior
Mark Sinclair ... Peter Bell
14 Sep 2014

Automatic acoustic segmentation for speech recognition on broadcast recordings
Gang Peng ... Mei-Yuh Hwang
27 Aug 2007
