Title: An incremental approach for solution of text clustering problem

Authors: Burak Ordin; Duygu Selin Ballı; Nur Uylaş Satı

Addresses: Department of Mathematics, Ege University Faculty of Science, 35100, Bornova, Izmir, Turkey; Bodrum Maritime Vocational School, Muğla Sıtkı Koçman University, Bodrum, Muğla, Turkey ' Department of Mathematics, Ege University Faculty of Science, 35100, Bornova, Izmir, Turkey; Bodrum Maritime Vocational School, Muğla Sıtkı Koçman University, Bodrum, Muğla, Turkey ' Department of Mathematics, Ege University Faculty of Science, 35100, Bornova, Izmir, Turkey; Bodrum Maritime Vocational School, Muğla Sıtkı Koçman University, Bodrum, Muğla, Turkey

Abstract: Text clustering is a significant study field in data mining. In text clustering, hidden-information is revealed by analysing text datasets with data mining techniques. Text clustering is used in many current application areas such as document recognition, document organisation, indexing, and visualisation. Since it is used in important applications it is in need of efficient algorithms. In this paper, k-means based algorithms in literature are investigated and a novel k-means based text clustering algorithm is defined. Proposed algorithm is compared with existing k-means algorithm in literature on five real-world datasets. Obtained implications show that recommended algorithm is efficient and useful.

Keywords: text mining; clustering problem; k-means algorithm; incremental algorithm.

DOI: 10.1504/IJKEDM.2019.102488

International Journal of Knowledge Engineering and Data Mining, 2019 Vol.6 No.3, pp.273 - 288

Received: 15 Feb 2019
Accepted: 27 Apr 2019

Published online: 27 Sep 2019 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article