Optimizing acoustic models for commercial speech recognition using foreground scores and data weighting

D Boies,M Weintraub,Su-Lin Wu Su-Lin Wu,B Strope

doi:10.1109/icassp.2004.1326111

Optimizing acoustic models for commercial speech recognition using foreground scores and data weighting

D Boies, M Weintraub + Show 2 more

Open Access

https://doi.org/10.1109/icassp.2004.1326111

Copy DOI

Publication Date: May 17, 2004

Citations: 9

Affiliation: Nuance Communications (United States)

#Acoustic Models For Speech Recognition #Models For Speech Recognition + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper describes a data-driven technique for optimizing the acoustic models for speech recognition systems that target commercial applications over telephones. Frame-averaged foreground log-likelihoods (foreground scores) correlate to recognition errors. These scores are used together with gender to optimize data weighting for the acoustic model. This process is interpreted as increasing the priors and associated parameters for poorly modeled data. The score-based optimization leads to about 7% fewer semantic errors on a live evaluation set collected after the last data used to estimate the acoustic model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.