Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models

Sung-Lin Yeh,Hao Tang

doi:10.1109/icassp49357.2023.10094772

Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models

Sung-Lin Yeh, Hao Tang

Open Access

https://doi.org/10.1109/icassp49357.2023.10094772

Copy DOI

Publication Date: Jun 4, 2023

Affiliation: University of Edinburgh

#Neural Hidden Markov Models #Discrete Latent Variable Models + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

While discrete latent variable models have had great success in self-supervised learning, most models assume that frames are independent. Due to the segmental nature of phonemes in speech perception, modeling dependencies among latent variables at the frame level can potentially improve the learned representations on phonetic-related tasks. In this work, we assume Markovian dependencies among latent variables, and propose to learn speech representations with neural hidden Markov models. Our general framework allows us to compare to self-supervised models that assume independence, while keeping the number of parameters fixed. The added dependencies improve the accessibility of phonetic information, phonetic segmentation, and the cluster purity of phones, showcasing the benefit of the assumed dependencies.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.