Title: Combining models in discrete discriminant analysis

Authors: Anabela Marques; Ana Sousa Ferreira; Margarida G.M.S. Cardoso

Addresses: Barreiro College of Technology, Setúbal Polytechnic, Rua Américo da Silva Marinho – Lavradio, 2839-001 Barreiro, Portugal ' Faculty of Psychology, University of Lisbon and UNIDE, Alameda da Universidade, 1649-013 LISBOA, Portugal ' UNIDE and Department of Quantitative Methods of ISCTE, Lisbon University Institute, Avenida das Forças Armadas, 1649-026 Lisboa, Portugal

Abstract: When conducting discrete discriminant analysis, alternative models provide different levels of predictive accuracy which has encouraged the research in combined models. This research seems to be specially promising when small or moderate sized samples are considered, which often occurs in practice. In this work we evaluate the performance of a linear combination of two discrete discriminant analysis models: the first-order independence model and the dependence trees model. The proposed methodology also uses a hierarchical coupling model when addressing multi-class classification problems, decomposing the multi-class problems into several bi-class problems, using a binary tree structure. The analysis is based both on simulated and real datasets. Results of the proposed approach are compared with those obtained by random forests, being generally more accurate. Measures of precision regarding a training set, a test set and cross-validation are presented. The R software is used for the algorithms' implementation.

Keywords: discrete discriminant analysis; DDA; combining models; dependence trees model; DTM; first-order independence model; FOIM; hierarchical coupling model; HIERM; random forest; modelling.

DOI: 10.1504/IJDATS.2016.077483

International Journal of Data Analysis Techniques and Strategies, 2016 Vol.8 No.2, pp.143 - 160

Published online: 04 Jul 2016 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article