Authors: Satoru Tsuge; Shingo Kuroiwa
Addresses: Department of Information Systems, School of Informatics, Daido University, 10-3 Takiharu-cho, Minami-ku, Nagoya 457-8530, Japan ' Department of Applied and Cognitive Informatics, Graduate School of Engineering, Chiba University, 1-33, Yayoi-cho, Inage-ku, Chiba, 263-8522, Japan
Abstract: In this paper, first, we report speaker recognition performance using bone-conduction speech based on an i-vector-based speaker recognition system, which is the current state-of-the-art method. In addition, we propose three speaker recognition methods combining bone-conduction speech and air-conduction speech: a feature combination method, a speaker model combination method, and a similarity score combination method. To evaluate the proposed methods, we conducted speaker recognition experiments using a part of a large speech corpus constructed by the National Research Institute of Police Science, Japan. Experimental results show the bone-conduction speech performs almost the same as the air-conduction speech when the enrolment data and evaluation data are collected in the same session. In addition, all proposed methods improved the speaker recognition performance of air- and bone-conduction speech in the experiments. From these results, we conclude that fusing air- and bone-conduction speech improves the speaker recognition performance.
Keywords: speaker recognition; bone-conduction speech; air-conduction speech; i-vector; personal authentication systems; biometrics; bone-conductive microphone; condenser microphone; speech processing; session variability; new speech sensors.
International Journal of Biometrics, 2019 Vol.11 No.1, pp.35 - 49
Available online: 11 Oct 2018 *Full-text access for editors Access for subscribers Purchase this article Comment on this article