Title: Rare events and imbalanced datasets: an overview
Authors: Maher Maalouf, Theodore B. Trafalis
Addresses: 212 West Boyd Street, Room 124, Norman, OK, 73019, USA; 212 West Boyd Street, Room 124, Norman, OK, 73019, USA
Abstract: Accurate prediction is important in data mining and data classification. Rare events data and imbalanced or skewed datasets are very important in data mining and classification. However, these types of data are difficult to predict and to explain, as has been demonstrated in the literature. The problems arise from various sources. This paper surveys the latest research on such data in the hope of contributing further to this important field of data mining.
Keywords: data mining; data classification; rare events; imbalanced data; skewed data.
DOI: 10.1504/IJDMMM.2011.042935
International Journal of Data Mining, Modelling and Management, 2011 Vol.3 No.4, pp.375 - 388
Published online: 26 Feb 2015