Article: Method to integrate speaker identification, speech recognition, and information retrieval algorithms for speaker-based information retrieval Journal: International Journal of Knowledge Engineering and Data Mining (IJKEDM) 2022 Vol.7 No.3/4 pp.234 - 251 Abstract: This article proposes speakers' voice-based information (audio and video) retrieval systems, which combines speaker identification, speech recognition, and information retrieval algorithms. Information retrieval systems encompass system structure and a way to query the system for information retrieval. This article illustrates both, including how it is deployed on top of existing systems. The input to the system is a speaker voice sample and a text query. Based on the speaker's voice, the size of the corpus is reduced, and based on the text query, documents are retrieved and ranked. For the speaker identification, we used the LPC coefficient, for voice recognition, we used a Python speech recognition library, and for ranking, we used cosine similarity and TF-IDF. Other algorithms can replace any intermediate modules depending on the system, like crime investigation, news analysis, and lecture retrieval. We demonstrated the proposed method on simulated data generated from online websites. Inderscience Publishers - linking academia, business and industry through research

You can view the full text of this article for free using the link below.

Title: Method to integrate speaker identification, speech recognition, and information retrieval algorithms for speaker-based information retrieval

Authors: Muhammad Muneeb

Addresses: Department of Electrical Engineering and Computer Science, Khalifa University of Science, Technology and Research (KUSTAR), Abu Dhabi, UAE

Abstract: This article proposes speakers' voice-based information (audio and video) retrieval systems, which combines speaker identification, speech recognition, and information retrieval algorithms. Information retrieval systems encompass system structure and a way to query the system for information retrieval. This article illustrates both, including how it is deployed on top of existing systems. The input to the system is a speaker voice sample and a text query. Based on the speaker's voice, the size of the corpus is reduced, and based on the text query, documents are retrieved and ranked. For the speaker identification, we used the LPC coefficient, for voice recognition, we used a Python speech recognition library, and for ranking, we used cosine similarity and TF-IDF. Other algorithms can replace any intermediate modules depending on the system, like crime investigation, news analysis, and lecture retrieval. We demonstrated the proposed method on simulated data generated from online websites.

Keywords: audio retrieval; information retrieval; speaker identification; TF-IDF; voice recognition.

DOI: 10.1504/IJKEDM.2022.126069

International Journal of Knowledge Engineering and Data Mining, 2022 Vol.7 No.3/4, pp.234 - 251

Received: 15 Dec 2021
Accepted: 04 Feb 2022
Published online: 10 Oct 2022 *

Full-text access for editors Full-text access for subscribers Free access Comment on this article

Title: Method to integrate speaker identification, speech recognition, and information retrieval algorithms for speaker-based information retrieval

Keep up-to-date