Abstract

The authors present the concept of a 'segmental neural net' (SNN) for phonetic modeling in continuous speech recognition (CSR) and demonstrate how than can be used with a multiple hypothesis (or N-Best) paradigm to combine different CSR systems. In particular, they have developed a system that combines the SNN with a hidden Markov model (HMM) system. They believe that this is the first system incorporating a neural network for which the performance has exceeded the state of the art in large-vocabulary, continuous speech recognition. To take advantage of the training and decoding speed of HMMs, the authors have developed a novel hybrid SNN/HMM system that combines the advantages of both types of approaches. In this hybrid system, use is made of the N-best paradigm to generate likely phonetic segmentations, which are then scored by the SNN. The HMM and SNN scores are then combined to optimize performance. >

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.