Title: Accuracy-based learning classification system

Authors: Bikash Kanti Sarkar, Shib Sankar Sana, Kripasindhu Chaudhuri

Addresses: Department of Information Technology, Birla Institute of Technology, Deemed University, Mesra, Ranchi, India. ' Department of Mathematics, Bhangar Mahavidyalaya, University of Calcutta, Bhangar 743502, 24PGS (South), West Bengal, India. ' Department of Mathematics, Jadavpur University, Kolkata 32, India

Abstract: In order to implement a multi-category classification system, an efficient rule set is imperative for its investigation. In this paper, such a system is being introduced. In the first phase of its kind, the C4.5 rule induction algorithm is adopted to obtain useful rule set from classification problem, following a new data set partitioning approach. Next, the presented genetic algorithm (GA) is implemented to refine the learned rules in more efficient way. The resultant system has been compared with UCS (GA-based classification system) and C4.5 (non GA-based rule induction algorithm) on a number of benchmark data sets collected from UCI (University of California at Irvine) machine learning repository. Results demonstrate that the proposed genetic approach provides marked improvement in a number of cases.

Keywords: C4.5 algorithm; decision trees; genetic algorithms; accuracy-based learning classification systems; rule induction; data set partitioning; benchmarking; University of California at Irvine; UCI; machine learning; UCS classifier system; universities; higher education; USA; United States; information science.

DOI: 10.1504/IJIDS.2010.029904

International Journal of Information and Decision Sciences, 2010 Vol.2 No.1, pp.68 - 86

Published online: 02 Dec 2009 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article