A survey on effects of class imbalance in data pre-processing stage of classification problem
by Nitin Malave; Anant V. Nimkar
International Journal of Computational Systems Engineering (IJCSYSE), Vol. 6, No. 2, 2020

Abstract: Learning a classifier from a dataset with an imbalanced class distribution is a challenging task, and the imbalance hinders the performance of machine learning algorithms. Imbalance occurs when one class is heavily outnumbered by another. This kind of data distribution in real-world applications has caught the attention of many researchers. This paper reviews various state-of-the-art sampling and ensemble techniques for resolving class imbalance. It also investigates other factors, such as the threshold of distribution and inter-class or within-class imbalance, that make class imbalance a more complex issue. Approaches that alleviate the class imbalance problem, namely data sampling, cost-sensitive methods, bagging, and boosting, are compared in detail for their effects on it. Different parameters for measuring and evaluating the performance of the model are also reviewed. Accuracy is the most widely used evaluation parameter in machine learning problems, but the review finds that other parameters, such as precision, recall, and AU-ROC, provide statistical measures for evaluating the model. The paper gives research directions in the domain of class imbalance problems.
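
The abstract's point about evaluation parameters can be illustrated with a minimal sketch. It is not taken from the paper: the 990-to-10 class split, both sets of predictions, and all function names below are hypothetical, chosen only to show how accuracy can look strong on imbalanced data while precision and recall tell a different story.

```python
# Illustrative only: data and helper names are assumptions, not from the paper.

def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary problem."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p != positive)
    return tp, fp, fn, tn

def accuracy(y_true, y_pred):
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    return (tp + tn) / len(y_true)

def precision(y_true, y_pred):
    tp, fp, _, _ = confusion_counts(y_true, y_pred)
    # Defined as 0.0 here when nothing is predicted positive.
    return tp / (tp + fp) if (tp + fp) else 0.0

def recall(y_true, y_pred):
    tp, _, fn, _ = confusion_counts(y_true, y_pred)
    return tp / (tp + fn) if (tp + fn) else 0.0

# Hypothetical imbalanced labels: 990 majority (0) and 10 minority (1) samples.
y_true = [0] * 990 + [1] * 10

# Baseline that always predicts the majority class.
y_majority = [0] * 1000

# Hypothetical classifier: 20 false alarms, finds 8 of the 10 minority samples.
y_model = [1] * 20 + [0] * 970 + [1] * 8 + [0] * 2

for name, y_pred in [("majority baseline", y_majority), ("classifier", y_model)]:
    print(
        f"{name}: accuracy={accuracy(y_true, y_pred):.3f} "
        f"precision={precision(y_true, y_pred):.3f} "
        f"recall={recall(y_true, y_pred):.3f}"
    )
```

In this made-up setting the always-majority baseline has the higher accuracy even though it never detects a minority sample, which is the kind of behaviour that motivates reporting precision and recall alongside accuracy.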

Online publication date: Fri, 13-Nov-2020
