Abstract
Despite the recent prevalence of keyword spotting (KWS) in smart home, open-vocabulary KWS remains a keen but unmet need among the users. Conventional open-vocabulary KWS systems are difficult to obtain a high wake-up rate and low false alarms simultaneously due to the lack of specific data for model optimisation. In this letter, a light-weight neural keyword confidence estimation module (KCEM) for the second detection part in the open-vocabulary KWS system is proposed, which utilises the transformer structure to calculate the confidence by fusing the keyword embedding and the acoustic feature obtained from the KWS model. KCEM method is evaluated on a self-collected open-vocabulary KWS test set, yielding equally efficient performance compared with typical confidence estimation methods, a reduction in false reject rate by 34% and 29% relative under clean and noisy conditions, respectively, at 0.04 false alarms per hour.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.