Authors: R. Hemavathi; R. Kumara Swamy
Addresses: Department of Electronics and Communication Engineering, Siddaganga Institute of Technology (affiliated to Visvesvaraya Technological University, Belagavi), Tumakuru, India; Department of Electronics and Communication Engineering, Siddaganga Institute of Technology (affiliated to Visvesvaraya Technological University, Belagavi), Tumakuru, India
Abstract: Speaker verification is the process of verifying a person's identity from his/her utterance. The major challenge in deploying automatic speaker verification in security applications is spoofing attacks. Speaker verification systems can be spoofed using pre-recorded (replayed) speech, synthetic speech, and voice-converted speech. Hence, there is a need to develop spoof detection systems in order to make voice biometrics viable for security applications. This paper explores time-frequency representations obtained using a gammatone filterbank and the constant Q transform for detecting presentation attacks on automatic speaker verification. The experiments are carried out on the ASVspoof 2017 database, and the results are compared with state-of-the-art replay speech detection systems based on cepstral features.
Keywords: speaker verification; replay speech; spoofing attack; time-frequency representation.
International Journal of High Performance Computing and Networking, 2020 Vol.16 No.2/3, pp.170 - 177
Received: 30 Jun 2020
Accepted: 23 Oct 2020
Published online: 12 Jan 2021