Title: An efficient hybrid clustering algorithm for segmentation: Autocluster

Authors: Seyed Behnam Khakbaz; Marziyeh Pourestarabadi; Nastaran Hajiheydari

Addresses: Faculty of Management, Strategy Management, University of Tehran, Jalal Al-e-Ahmad Ave., Nasr Bridge, Tehran 14155-6311, Iran ' Faculty of Management, University of Tehran, Jalal Al-e-Ahmad Ave., Nasr Bridge, Tehran 14155-6311, Iran ' Faculty of Management, University of Tehran, Jalal Al-e-Ahmad Ave., Nasr Bridge, Tehran 14155-6311, Iran

Abstract: A new automatic clustering algorithm has been proposed in this paper, which does not need clustering information, such as number of clusters and radius of density. Moreover, this algorithm generates robust results, and named Autocluster. Autocluster is a suitable algorithm for customer segmentation, and as it is known, clustering information is not available properly in customer segmentation. Autocluster applies concepts of partitioning clustering algorithms, hierarchical clustering algorithms and density-based clustering algorithm. Consequently, a new, automatic and high-precision algorithm has been proposed. Autocluster consists of four steps: developing 'distance matrix', identifying 'best point (data record)', developing 'point matrix' and 'clustering'. These steps have been explained comprehensively in this paper. Furthermore, iris database and a synthetic dataset has been analysed by Autocluster to verify its capabilities vs. K-means algorithm. Moreover, an Iranian insurance dataset has been clustered by Autocluster, which has shown satisfying results, compared to the results from K-means.

Keywords: clustering; Autocluster; hybrid clustering algorithm; K-means algorithm; segmentation.

DOI: 10.1504/IJDS.2017.086258

International Journal of Data Science, 2017 Vol.2 No.3, pp.205 - 220

Accepted: 06 Apr 2015
Published online: 04 Sep 2017 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article