Title: Bayesian feature construction for the improvement of classification performance

Authors: Manolis Maragoudakis

Addresses: AI Lab, University of the Aegean, Samos, 83200, Greece

Abstract: In this paper we are going to talk about the problem of the increase in validity, concerning the process of classification, but not through approaches having to do with the improvement of the ability to construct a precise classification model using any algorithm of machine learning. On the contrary, we approach this important matter by the view of a wider encoding of the training data and more specifically under the perspective of the creation of more features so that the hidden angles of the subject areas, which model the available data, are revealed to a higher degree. We suggest the use of a novel feature construction algorithm, which is based on the ability of the Bayesian networks to re-enact the conditional independence assumptions of features, bringing forth properties concerning their interrelation that are not clear when a classifier provides the data in their initial form. The results from the increase of the features are shown through the experimental measurement in a wide domain area and after the use of a large number of classification algorithms, where the improvement of the performance of classification is evident.

Keywords: machine learning; knowledge engineering methodologies; pattern analysis; statistical pattern recognition.

DOI: 10.1504/IJDATS.2020.105152

International Journal of Data Analysis Techniques and Strategies, 2020 Vol.12 No.1, pp.43 - 75

Received: 06 Jan 2017
Accepted: 30 Dec 2017

Published online: 14 Feb 2020 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article