Title: Statistical validation of multiple classifiers over multiple datasets in the field of pattern recognition

Authors: Pawan Kumar Singh; Ram Sarkar; Mita Nasipuri

Addresses: Department of Computer Science and Engineering, Jadavpur University, 188, Raja S.C. Mullick Road, Kolkata-700032, West Bengal, India ' Department of Computer Science and Engineering, Jadavpur University, 188, Raja S.C. Mullick Road, Kolkata-700032, West Bengal, India ' Department of Computer Science and Engineering, Jadavpur University, 188, Raja S.C. Mullick Road, Kolkata-700032, West Bengal, India

Abstract: Classification is a machine learning technique which is used to categorise the different input patterns into different classes. To select the best classifier for a given dataset is one of the critical issues in pattern classification. Using cross-validation approach, it is possible to apply candidate algorithms on a given dataset and best classifier is selected by considering various evaluation measures of classification. But computational cost associated with this approach is trivial in nature. There are a lot of tests described in the literature but they are specifically used on different and non-related datasets. But, in pattern recognition domain the datasets are taken from the same testing samples and hence are related in nature. In this paper, we propose the use of two non-parametric tests, namely Mann-Whitney U test for comparison of two classifiers and Kruskal-Wallis H test with the corresponding post-hoc tests for comparison of multiple classifiers on multiple datasets. The tests have been fruitfully applied on the datasets of handwritten digit recognition problem taken from UCI Machine Learning Database Repository (http://www.ics.uci.edu/~mlearn). The results show that both the tests provide a better solution than previously proposed tests.

Keywords: non-parametric tests; Mann-Whitney U test; Kruskal-Wallis H test; Conover test; Schaich and Hamerle test; multiple classifiers; multiple datasets; pattern recognition; machine learning; pattern classification; handwritten digit recognition.

DOI: 10.1504/IJAPR.2015.068929

International Journal of Applied Pattern Recognition, 2015 Vol.2 No.1, pp.1 - 23

Received: 09 Apr 2014
Accepted: 02 May 2014

Published online: 21 Apr 2015 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article