Title: Feature selection on educational data using Boruta algorithm

Authors: Neeyati Anand; Riya Sehgal; Sanchit Anand; Ajay Kaushik

Addresses: Maharaja Agrasen Institute of Technology, Rohini, Delhi, India ' Maharaja Agrasen Institute of Technology, Rohini, Delhi, India ' Maharaja Agrasen Institute of Technology, Rohini, Delhi, India ' Maharaja Agrasen Institute of Technology, Rohini, Delhi, India

Abstract: Data mining in education deals with formulating strategies for students with the aim to increase the parameters affecting the learning and employability. It also helps the educational institutes in maintaining their reputation as it is directly linked to the student's grades. We need to identify the parameters involved in learning and the relationship among those parameters. In EDM, feature selection (FS) is one of the most important and needed method in EDM, as it removes the features which have no direct link with the student's performance. For example, the date of birth of a student does not impact his/her performance. In this paper, an attempt has been made to improve the performance of the classifiers for undergraduate students at Maharaja Agrasen Institute of Technology. We have applied several techniques of data mining to make some rules that increase the learning and employability of students. The results of our study have shown a significant increase in accuracy, recall, precision and F-measure for naïve Bayes and decision tree classifiers.

Keywords: classifiers; educational data mining; EDM; feature selection; performance.

DOI: 10.1504/IJCISTUDIES.2021.113826

International Journal of Computational Intelligence Studies, 2021 Vol.10 No.1, pp.27 - 35

Received: 09 Oct 2019
Accepted: 03 Apr 2020

Published online: 18 Mar 2021 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article