The FRCK clustering algorithm for determining cluster number and removing outliers automatically Online publication date: Tue, 12-Oct-2021
by Yubin Guo; Yuhang Wu; Xiaopeng Zhang; Aofeng Bo; Ximing Li
International Journal of Computational Science and Engineering (IJCSE), Vol. 24, No. 5, 2021
Abstract: Clustering algorithm is one of the most popular unsupervised algorithms for data grouping. The K-means algorithm is a popular clustering algorithm for its simplicity, ease of implementation and efficiency. But for K-means algorithm, the optical cluster number is difficult to predict, while it is sensitive to outliers. In this paper, we divide outliers into two types, and then prompt a clustering algorithm to remove the two-type outliers and calculate the optimal cluster number in each clustering iteration. The algorithm is a fusion of rough clustering and K-means, abbreviated as FRCK algorithm. In the FRCK algorithm, outliers are removed precisely, therefore the optical cluster number can be more accurate, and the quality of clustering result can be improved accordingly. And this algorithm is proven effective by experiment.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Computational Science and Engineering (IJCSE):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com