Title: Rare events and imbalanced datasets: an overview

Authors: Maher Maalouf, Theodore B. Trafalis

Addresses: 212 West Boyd Street, Room 124, Norman, OK, 73019, USA. ' 212 West Boyd Street, Room 124, Norman, OK, 73019, USA

Abstract: Accurate prediction is important in data mining and data classification. Rare events data, imbalanced or skewed datasets are very important in data mining and classification. However, These types of data are difficult to predict and to explain as has been demonstrated in the literature. The problems arise from various sources. This paper surveys the latest research on such data in the hope of adding further contribution to this important field of data mining.

Keywords: data mining; data classification; rare events; imbalanced data; skewed data.

DOI: 10.1504/IJDMMM.2011.042935

International Journal of Data Mining, Modelling and Management, 2011 Vol.3 No.4, pp.375 - 388

Published online: 08 Oct 2011 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article