Ontology-based context dependent document clustering method
by Soumen Swarnakar
International Journal of Knowledge Engineering and Data Mining (IJKEDM), Vol. 2, No. 1, 2012

Abstract: Document clustering has become an increasingly important task in analysing huge documents. The challenging aspect to analyse the enormous documents is to organise them in such a way that facilitates better search and knowledge extraction without introducing extra cost and complexity. In this paper, first ontology-based document clustering method has been proposed using hierarchical clustering technique. The approach is purely based on the frequency count of the terms present in the documents where context of the terms are totally ignored. Therefore, the method is modified by incorporating belief to measure the degree of relatedness of the terms with respect to the concepts present in the documents. An efficient searching algorithm has been developed to create the ontology tree of the documents, through consultation with different dictionaries. Davis-Bouldin's (DB) index is the well-known metric for measuring quality of clusters exhibits that the proposed approach can efficiently produce higher quality document clusters as compared with several well-known document-clustering algorithms, including our previous one.

Online publication date: Tue, 02-Sep-2014

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Knowledge Engineering and Data Mining (IJKEDM):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com