Abstract
Confidence measures~(CM) plays an important role in spoken term detection~(STD) systems. Traditionally, confidence measures based on multi-layer perceptron~(MLP) is computed via accumulating the frame-level phone posterior probabilities, where only short acoustic context information is used and some useful information from linguistic constraints is lost. In this paper, we propose two approaches to calculate the MLP-based CMs which can integrate language prior as well as sentence-level acoustic context. Experimental results show that the proposed approaches outperform the traditional one significantly. Moreover, fusing our proposed CM with the HMM/GMM based CM shows further performance improvement to our baseline system.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.