Title: VAD, feature extraction and modelling techniques for speaker recognition: a review

Authors: Spoorti J. Jainar; Pritam Limbaji Sale; B.G. Nagaraja

Addresses: Department of E&CE, Visvesvaraya Technological University, Belagavi – 18, Karnataka, India ' Department of E&CE, Visvesvaraya Technological University, Belagavi – 18, Karnataka, India ' Department of E&CE, Jain Institute of Technology, Davangere – 03, Karnataka, India

Abstract: This paper reviews an automatic speaker recognition technology, with an emphasis on state-of-the-art voice activity detection (VAD), feature extraction and speaker-modelling techniques that have emerged during the last few years. Researchers in the field of speaker recognition have made a few attempts to recognise the speaker in the language mismatch environment and limited data condition.To address robustness issues, we also elaborate language mismatch and limited data speaker recognition. Further, this paper identified some issues with the existing speaker recognition systems and also investigated areas of possible improvements in speaker recognition field. We conclude the paper with a discussion on the possible future directions.

Keywords: VAD; voice activity detection; speaker identification; speaker verification; language mismatch; limited data; multilingual; features; modelling techniques.

DOI: 10.1504/IJSISE.2020.113552

International Journal of Signal and Imaging Systems Engineering, 2020 Vol.12 No.1/2, pp.1 - 18

Received: 26 Apr 2018
Accepted: 13 May 2019

Published online: 08 Mar 2021 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article