Bone- and air-conduction speech combination method for speaker recognition

Satoru Tsuge,Shingo Kuroiwa

doi:10.1504/ijbm.2019.10016806

Abstract

In this paper, first, we report speaker recognition performance using bone-conduction speech based on an i-vector-based speaker recognition system, which is the current state-of-the-art method. In addition, we propose three speaker recognition methods combining bone-conduction speech and air-conduction speech: a feature combination method, a speaker model combination method, and a similarity score combination method. To evaluate the proposed methods, we conducted speaker recognition experiments using a part of a large speech corpus constructed by the National Research Institute of Police Science, Japan. Experimental results show the bone-conduction speech performs almost the same as the air-conduction speech when the enrolment data and evaluation data are collected in the same session. In addition, all proposed methods improved the speaker recognition performance of air- and bone-conduction speech in the experiments. From these results, we conclude that fusing air- and bone-conduction speech improves the speaker recognition performance.

Full Text