End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection

Yuki Takashima, Paola Garcia, Yusuke Fujita, Shota Horiguchi, Shinji Watanabe, Kenji Nagamatsu

doi:10.1109/slt48900.2021.9383555

Open Access

Publication Date: Jan 19, 2021

Citations: 41

Affiliation: Hitachi (Japan), Johns Hopkins University

#Speaker Diarization #Speech Activity Detection

Abstract

We present a conditional multitask learning method for end-to-end neural speaker diarization (EEND). EEND systems have shown promising performance compared with traditional clustering-based methods, especially on overlapping speech. To further improve EEND performance, we propose a novel multitask learning framework that solves speaker diarization and a desired subtask while explicitly modeling the dependency between the tasks. Using the probabilistic chain rule, we optimize speaker diarization conditioned on speech activity detection and overlap detection, both of which are subtasks of speaker diarization. Experimental results show that the proposed method can leverage a subtask to model speaker diarization effectively and that it outperforms conventional EEND systems in terms of diarization error rate.
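
To make the chain-rule idea in the abstract concrete, the following is a minimal sketch of a conditional multitask loss, not the authors' implementation. It assumes the factorization P(sad | x) · P(ovl | sad, x) · P(spk | sad, ovl, x), and the ordering of the subtasks is an assumption. The function names (`bce`, `subtask_labels`, `conditional_multitask_loss`) are hypothetical. The probability arrays stand in for the outputs of a network that has already been fed the conditioning signals. Permutation-invariant training, which EEND systems typically need to resolve speaker ordering, is omitted for brevity.

```python
import numpy as np

def bce(p, y, eps=1e-7):
    """Mean binary cross-entropy between probabilities p and 0/1 labels y."""
    p = np.clip(p, eps, 1.0 - eps)
    return float(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))

def subtask_labels(spk):
    """Derive frame-level subtask labels from (frames, speakers) activity labels.

    Speech activity: at least one speaker is active in the frame.
    Overlap: at least two speakers are active in the frame.
    """
    n_active = spk.sum(axis=1)
    sad = (n_active >= 1).astype(float)
    ovl = (n_active >= 2).astype(float)
    return sad, ovl

def conditional_multitask_loss(p_sad, p_ovl, p_spk, spk):
    """Negative log-likelihood of the assumed chain-rule factorization.

    The log of a product of conditionals is a sum of logs, so the joint
    loss is the sum of one cross-entropy term per factor.
    (Assumption: p_ovl and p_spk were produced by a model that received
    the preceding subtask signals as conditioning input.)
    """
    sad, ovl = subtask_labels(spk)
    return bce(p_sad, sad) + bce(p_ovl, ovl) + bce(p_spk, spk)

# Toy example: 3 frames, 2 speakers (silence, single speaker, overlap).
spk = np.array([[0, 0], [1, 0], [1, 1]], dtype=float)
sad, ovl = subtask_labels(spk)
uninformed = conditional_multitask_loss(
    np.full(3, 0.5), np.full(3, 0.5), np.full((3, 2), 0.5), spk
)
perfect = conditional_multitask_loss(sad, ovl, spk, spk)
```

In this toy setup the subtask labels come for free from the diarization labels, which is what makes speech activity and overlap detection natural subtasks: no extra annotation is needed to add their loss terms.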
