Title: Web documents semantic similarity by extending document ontology using current trends

Authors: Poonam Chahal; Manjeet Singh; Suresh Kumar

Addresses: Department of Computer Science and Engineering, Manav Rachna International University, Delhi Suraj kund Road, Sector 43, Faridabad, Haryana 121004, India; Department of Computer Science and Engineering, YMCA University of Science and Technology, Sector-6, Faridabad, Haryana, India; Department of Computer Science and Engineering, Manav Rachna International University, Delhi Suraj kund Road, Sector 43, Faridabad, Haryana 121004, India

Abstract: Semantic similarity evaluation is the computation of relatedness between terms, concepts, or documents. In this paper, we present a novel semantic similarity approach to overcome the limitations that exist in calculating a semantic similarity score. Our approach extracts words/terms from a set of documents and replaces each extracted word/term with its set of probable concepts stored in a dictionary. The concepts retrieved from the dictionary are connected using relationships from a base ontology to construct a document ontology for each document. The ontology constructed this way is then extended using trend relationships stored in a separate database. Finally, the extended document ontologies are compared to find the relatedness between the documents. Empirical results show that the proposed approach gives better semantic similarity results than conventional approaches.
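
The pipeline described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy contents of CONCEPT_DICTIONARY, BASE_ONTOLOGY and TREND_RELATIONS, the function names, the representation of an ontology as a set of undirected concept pairs, and the use of Jaccard overlap as the comparison step. The abstract does not state the paper's actual data sources or similarity measure, so this is a sketch of the general idea rather than the authors' method.

```python
from itertools import combinations

# Hypothetical toy data; the real dictionary, base ontology and trend
# database used in the paper are not given in the abstract.
CONCEPT_DICTIONARY = {
    "jaguar": {"animal", "car"},
    "engine": {"car_part"},
    "forest": {"habitat"},
    "speed": {"performance"},
}
BASE_ONTOLOGY = {
    frozenset({"car", "car_part"}),
    frozenset({"animal", "habitat"}),
    frozenset({"car", "performance"}),
}
TREND_RELATIONS = {
    frozenset({"animal", "performance"}),
}


def extract_terms(text):
    """Split a document into lowercase terms, stripping basic punctuation."""
    return [word.strip(".,;:!?").lower() for word in text.split()]


def terms_to_concepts(terms):
    """Replace each term by its set of probable concepts from the dictionary."""
    concepts = set()
    for term in terms:
        concepts |= CONCEPT_DICTIONARY.get(term, set())
    return concepts


def connect(concepts, relations):
    """Keep every concept pair that is linked in the given relation set."""
    return {
        frozenset(pair)
        for pair in combinations(sorted(concepts), 2)
        if frozenset(pair) in relations
    }


def extended_ontology(text):
    """Build the document ontology, then extend it with trend relationships."""
    concepts = terms_to_concepts(extract_terms(text))
    return connect(concepts, BASE_ONTOLOGY) | connect(concepts, TREND_RELATIONS)


def similarity(doc_a, doc_b):
    """Jaccard overlap of the two extended ontologies (an assumed measure)."""
    edges_a, edges_b = extended_ontology(doc_a), extended_ontology(doc_b)
    if not edges_a and not edges_b:
        return 0.0
    return len(edges_a & edges_b) / len(edges_a | edges_b)


doc_a = "The jaguar engine has speed."
doc_b = "Jaguar speed in the forest."
print(similarity(doc_a, doc_b))
```

In this toy example the two documents share no words beyond "jaguar" and "speed", yet the trend relationship between "animal" and "performance" adds a shared edge to both extended ontologies, which raises their overlap compared with using the base ontology alone.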

Keywords: Semantic Web; concepts; ontology; semantic; similarity; document ontology; words; syntactic analysis; parsing; web page.

DOI: 10.1504/IJWS.2017.088673

International Journal of Web Science, 2017 Vol.3 No.1, pp.1 - 15

Accepted: 25 Nov 2016
Published online: 14 Dec 2017
