Title: An integrated approach of feature selection and parameter optimisation of kernel to enhance the performance of support vector machine

Authors: Balakrishnan Sarojini

Addresses: Department of Computer Science, Avinashilingam Institute for Home Science and Higher Education for Women University, Coimbatore-641043, India

Abstract: Mining big data is a challenging task due to the size of the databases and the complexity of maintaining precise and non-redundant data. Classification algorithms need to analyse hundreds of independent features in these high-dimensional databases for effective prediction. The performance of classification algorithms could be enhanced if irrelevant and redundant data are removed. Feature selection algorithms help in identifying prominent features that could enhance the performance of the classifier. Additionally, the classification performance of a support vector machine (SVM) could be enhanced by setting appropriate kernel parameters. The kernel parameters of the SVM are tuned for each feature subset generated by feature selection and the performance is analysed. The feature subset that enhances the classification performance of the SVM is the optimal feature subset of the dataset. Experiments are done on three medical datasets. The empirical results prove that integrating feature selection with optimisation of the kernel parameters enhances the performance of the SVM classifier. The approach is validated in terms of the increase in accuracy and in the area under the receiver operating characteristic curve (AUC) of the classifier.
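
The integrated search described in the abstract (tune the kernel parameters for each candidate feature subset, then keep the subset whose tuned classifier performs best) can be sketched roughly as follows. This is a minimal illustration, not the author's implementation: the abstract does not state which feature selection algorithm, kernel, or tuning procedure was used, so the exhaustive grid search, the function name `select_subset_and_params`, and the caller-supplied `score` function are all assumptions made for the example.

```python
from itertools import product
from typing import Callable, Dict, Iterable, Sequence, Tuple


def select_subset_and_params(
    feature_subsets: Iterable[Sequence[int]],
    param_grid: Dict[str, Sequence[float]],
    score: Callable[[Sequence[int], Dict[str, float]], float],
) -> Tuple[Tuple[int, ...], Dict[str, float], float]:
    """Return the (subset, params, score) triple with the highest score.

    `score` is a hypothetical callback supplied by the caller. It is
    expected to train and evaluate a classifier restricted to the given
    feature indices with the given kernel parameters, and to return a
    number where higher is better (for example accuracy or AUC).
    """
    names = list(param_grid)
    best = None
    for subset in feature_subsets:
        # Tune the kernel parameters separately for this candidate subset
        # (assumption: a plain exhaustive grid over the supplied values).
        for values in product(*(param_grid[name] for name in names)):
            params = dict(zip(names, values))
            current = score(subset, params)
            # Keep the best (subset, params) pair seen so far.
            if best is None or current > best[2]:
                best = (tuple(subset), params, current)
    if best is None:
        raise ValueError("no feature subsets or parameter values were supplied")
    return best
```

In practice `score` might wrap a cross-validated SVM evaluation on the columns named by the subset, for instance with scikit-learn's `SVC` and `cross_val_score`; that pairing is a suggestion for experimentation and is not taken from the paper.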

Keywords: big data; feature selection; support vector machines; SVM; kernel parameters; parameter optimisation; classification performance.

DOI: 10.1504/IJCNDS.2015.070982

International Journal of Communication Networks and Distributed Systems, 2015 Vol.15 No.2/3, pp.265 - 278

Received: 17 Jul 2014
Accepted: 07 Feb 2015

Published online: 04 Aug 2015
