Title: Spectro-temporal features for audio replay attack detection

Authors: R. Hemavathi; R. Kumara Swamy

Addresses: Department of Electronics and Communication Engineering, Siddaganga Institute of Technology (affiliated to Visvesvaraya Technological University, Belagavi), Tumakuru, India; Department of Electronics and Communication Engineering, Siddaganga Institute of Technology (affiliated to Visvesvaraya Technological University, Belagavi), Tumakuru, India

Abstract: Speaker verification is the process of verifying a person's identity from his or her utterance. The major challenge in deploying automatic speaker verification in security applications is spoofing attacks. Speaker verification systems can be spoofed using pre-recorded speech, synthetic speech, and voice-converted speech. Hence, spoof detection systems are needed to make voice biometrics viable for security applications. This paper explores time-frequency representations obtained using a gammatone filterbank and the constant Q transform for detecting presentation attacks on automatic speaker verification. The experiments are carried out on the ASVspoof 2017 database, and the results are compared with state-of-the-art replay speech detection systems based on cepstral features.

Keywords: speaker verification; replay speech; spoofing attack; time-frequency representation.

DOI: 10.1504/IJHPCN.2020.112704

International Journal of High Performance Computing and Networking, 2020 Vol.16 No.2/3, pp.170 - 177

Received: 30 Jun 2020
Accepted: 23 Oct 2020
Published online: 28 Jan 2021
