Title: An algorithm for finding document concepts using semantic similarities from WordNet ontology

Authors: Aditi Sharan, Manju L. Joshi

Addresses: School of Computer and Systems Sciences, Jawaharlal Nehru University, New Delhi 110067, India; School of Computer and Systems Sciences, Jawaharlal Nehru University, New Delhi 110067, India

Abstract: Semantic similarity is becoming a common issue in a variety of applications in the area of information retrieval (IR). Most researchers use ontologies as a tool for finding semantic similarities. Using an ontology allows terms in documents to be replaced by concepts. The concepts are generally selected by identifying semantically related terms and finding a suitable term (concept) to replace them. Several approaches have been proposed for finding concepts by selecting semantically related terms; however, no attempt has been made to automate the process. This paper suggests an automatic method of identifying concepts from documents using the hypernym relationship in ontologies, and proposes an algorithm for doing so. The WordNet ontology has been used to implement the algorithm. The algorithm can be used for finding document concepts and for clustering documents based on these concepts.
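
The abstract does not spell out the algorithm's steps, so the following is only a minimal sketch of the general idea it describes: replacing semantically related terms with a shared hypernym (concept). Everything here is an assumption made for illustration: the toy `HYPERNYM` map stands in for WordNet, and the function names, the `max_steps` limit, and the "covers at least two terms" rule are hypothetical choices, not the authors' method.

```python
# Toy hypernym map (child -> parent). This is a hypothetical stand-in for
# WordNet's hypernym hierarchy, kept tiny so the example is self-contained.
HYPERNYM = {
    "dog": "canine",
    "wolf": "canine",
    "canine": "carnivore",
    "cat": "feline",
    "feline": "carnivore",
    "carnivore": "animal",
    "car": "vehicle",
    "truck": "vehicle",
    "vehicle": "artifact",
}


def hypernym_path(term):
    """Return [term, parent, grandparent, ...] up to the top of the toy map."""
    path = [term]
    while path[-1] in HYPERNYM:
        path.append(HYPERNYM[path[-1]])
    return path


def common_concept(a, b):
    """Return the lowest hypernym shared by a and b, or None if there is none."""
    ancestors_a = set(hypernym_path(a))
    for node in hypernym_path(b):  # walks upward from b, so the first hit is the lowest
        if node in ancestors_a:
            return node
    return None


def document_concepts(terms, max_steps=2):
    """Map each term to its nearest hypernym (within max_steps) that covers
    at least two distinct document terms; otherwise keep the term itself."""
    unique_terms = set(terms)

    # For every candidate concept, record which document terms it covers.
    coverage = {}
    for term in unique_terms:
        for node in hypernym_path(term)[: max_steps + 1]:
            coverage.setdefault(node, set()).add(term)

    mapping = {}
    for term in unique_terms:
        mapping[term] = term  # default: no related term found
        for node in hypernym_path(term)[: max_steps + 1]:
            if len(coverage[node]) >= 2:
                mapping[term] = node
                break
    return mapping


doc_terms = ["dog", "wolf", "cat", "car", "truck", "banana"]
concepts = document_concepts(doc_terms)
```

With a mapping like this, documents could then be compared or clustered on the concepts they share rather than on raw terms. A real implementation would replace the toy map with lookups against WordNet and would need to handle words with several senses, which this sketch ignores.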

Keywords: semantic similarity; WordNet ontology; document clustering; document concepts; information retrieval.

DOI: 10.1504/IJCVR.2010.036078

International Journal of Computational Vision and Robotics, 2010 Vol.1 No.2, pp.147 - 157

Published online: 17 Oct 2010
