A computationally efficient approach for acoustic class specific VTLN using regression tree

Shakti P Rath

doi:10.1007/s10772-018-9518-5

A computationally efficient approach for acoustic class specific VTLN using regression tree

Shakti P Rath

https://doi.org/10.1007/s10772-018-9518-5

Copy DOI

Journal: International Journal of Speech Technology

Publication Date: Jun 20, 2018

Affiliation: Institute for Infocomm Research

#Vocal Tract Length Normalization #Wall Street Journal + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

In this paper a novel frame-work for acoustic class specific vocal tract length normalization (VTLN) is developed. Unlike the computationally expensive grid search involved in conventional VTLN, the proposed technique works in the joint paradigm of linear transform VTLN and the txpectation maximization algorithm, and uses Regression class tree for robustness. Experimental results are demonstrated on two wall street journal (WSJ) test sets Nov92 eval and Dev-93 with the acoustic model being trained on the WSJ-284 set. It is found that the proposed acoustic class specific VTLN provides consistent improvements in word accuracies in comparison to the conventional VTLN which uses single warp-factor for spectral warping.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: International Journal of Speech Technology

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.