Distributed document clustering algorithms: a recent survey
by J.E. Judith; J. Jayakumari
International Journal of Enterprise Network Management (IJENM), Vol. 6, No. 3, 2015

Abstract: Distributed data mining paradigm is an active research area due to the enormous volume of data that are to be processed from across a wide cluster of data nodes. Document clustering algorithms are widely applied in a variety of distributed environments like peer-to-peer networks, wireless sensor networks, etc. This paper entails a comprehensive review on most of the recent distributed document clustering algorithms that is ultimately making massive impacts on the technological realm. These algorithms are analysed based on few pivotal elements such as clustering quality, scale-up, speed-up and accuracy. Recent advances in technology have developed MapReduce-based distributed document clustering algorithms, which show dramatic improvements in the aforementioned analytical elements. Based on the review, intelligent discussions are presented for algorithm development and implementation.

Online publication date: Thu, 13-Aug-2015

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Enterprise Network Management (IJENM):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com