Winsorize tree algorithm for handling outlier in classification problem
by Chee Keong Ch'ng; Nor Idayu Mahat
International Journal of Operational Research (IJOR), Vol. 38, No. 2, 2020

Abstract: Classification and regression trees (CART) are widely used to support users in classification and prediction. However, outliers in a database are inevitable and can affect both the size and the accuracy of the tree. Neglecting to handle outliers can distort the splitting points, which yields a biased and inaccurate tree. In this paper, we propose a winsorize tree algorithm that detects and handles outliers before the Gini index is calculated in every non-terminal node. As a result, the constructed tree grows without needing to be pruned. For evaluation, the proposed approach was compared with the classical tree and the pruned tree. The results obtained from seven real datasets indicate that the proposed winsorize tree performs as well as, or even better than, the other investigated trees.
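
The idea of winsorizing a feature before scoring splits with the Gini index can be illustrated with a minimal Python sketch. The abstract does not state the paper's outlier detection rule, so the sketch assumes Tukey fences (Q1 - 1.5 IQR, Q3 + 1.5 IQR) and clamps values to those fences; the function names `winsorize`, `gini`, and `best_split` and the parameter `k` are hypothetical and are not taken from the article.

```python
import numpy as np

def winsorize(x, k=1.5):
    """Clamp values to the Tukey fences [Q1 - k*IQR, Q3 + k*IQR].

    Assumption: the fence rule and k=1.5 are illustrative choices,
    not the detection rule used in the paper.
    """
    x = np.asarray(x, dtype=float)
    q1, q3 = np.percentile(x, [25, 75])
    iqr = q3 - q1
    return np.clip(x, q1 - k * iqr, q3 + k * iqr)

def gini(labels):
    """Gini impurity of a label vector: 1 - sum of squared class proportions."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def best_split(x, y):
    """Winsorize one numeric feature, then return the threshold with the
    lowest weighted Gini impurity and that impurity.

    Returns (None, inf) if the feature has fewer than two distinct values.
    """
    x = winsorize(x)
    y = np.asarray(y)
    values = np.unique(x)  # sorted distinct values after clamping
    best_t, best_g = None, np.inf
    # Candidate thresholds are midpoints between consecutive distinct values.
    for lo, hi in zip(values[:-1], values[1:]):
        t = (lo + hi) / 2.0
        left, right = y[x <= t], y[x > t]
        g = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
        if g < best_g:
            best_t, best_g = t, g
    return best_t, best_g
```

In a full tree, a step like `best_split` would be repeated for each feature at each non-terminal node, so the clamping is recomputed on the observations that reach that node rather than once on the whole dataset.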

Online publication date: Mon, 04-May-2020
