
Title: A novel machine extraction algorithm for implicit and explicit keywords based on dynamic web metadata of scientific scholars' corpus

Authors: Mawloud Mosbah

Addresses: LRES Laboratory, Informatics Department, Faculty of Sciences, University 20 Août 1955, Skikda, Algeria

Abstract: Keyword extraction, as a way of constructing metadata, is an important pre-processing task in many natural language processing applications, such as text summarisation, information retrieval, and document clustering. In this paper, we introduce a novel machine extraction algorithm for implicit and explicit keywords. The algorithm relies on a dynamic corpus of similar documents built by information retrieval engines. In addition to directly reusing the keywords of similar documents, the algorithm combines several basic techniques. The results, compared with those of some basic methods from the literature, appear very promising and support the efficiency of our solution.

Keywords: natural language processing; keywords extraction; automatic construction of metadata; implicit keywords; explicit keywords.

DOI: 10.1504/IJWET.2023.131136

International Journal of Web Engineering and Technology, 2023 Vol.18 No.1, pp.29-44

Received: 17 Mar 2022
Received in revised form: 22 Aug 2022
Accepted: 08 Jan 2023

Published online: 31 May 2023
