Title: Using graphical techniques from discriminant analysis to understand and interpret cluster solutions

Authors: Courtney McKim; James A. Bovaird; Chaorong Wu

Addresses: University of Wyoming, Professional Studies, 1000 E. University Ave. Dept. 3374, Laramie, WY 82071, USA ' University of Nebraska – Lincoln, Educational Psychology, 114 Teachers College Hall, Lincoln, NE 68588, USA ' University of Nebraska – Lincoln, Educational Psychology, 114 Teachers College Hall, Lincoln, NE 68588, USA

Abstract: Clustering is a common form of exploratory analysis in the social and behavioural sciences and education. There are many clustering algorithms available to researchers and each algorithm assigns membership slightly different. This paper compares five classification algorithms (SPSS' TwoStep, k-means, hierarchical (nearest and furthest neighbour), and finite mixture model. The results show the highest agreement among the finite mixture model and the two-step clustering algorithm, as well as k-means and two-step. Hierarchical (nearest neighbour) does not have high agreement with k-means and the mixture model. Once a research decides on a clustering algorithm they often have a hard time interpreting clusters once a solution is reached. This study suggests using discriminant analysis as a method of interpreting cluster solutions which also allows researchers to visually see the interpretation and also provides the number of functions and which measures load on which function allowing more information about the clusters.

Keywords: clustering; discriminant analysis; k-means; two-step; finite mixture model.

DOI: 10.1504/IJDATS.2017.086633

International Journal of Data Analysis Techniques and Strategies, 2017 Vol.9 No.3, pp.189 - 206

Received: 17 Aug 2015
Accepted: 10 May 2016

Published online: 15 Sep 2017 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article