Title: Outlier detection techniques for big data streams: focus on cyber security

Authors: Fatima-Zahra Benjelloun; Ayoub Ait Lahcen; Samir Belfkih

Addresses: LGS, ENSA, Ibn Tofail University, Kenitra, Morocco ' LGS, ENSA, Ibn Tofail University, Kenitra, Morocco; LRIT, Unité associée au CNRST URAC 29, Mohammed V University, Rabat, Morocco ' LGS, ENSA, Ibn Tofail University, Kenitra, Morocco

Abstract: In recent years, detecting outliers in big data streams has become a main challenge in several domains (e.g., medical monitoring, government security, information security, natural disasters, and online financial frauds). In fact, unlike regular static data, streams raise many issues like high multidimensionality, dynamic data distribution, unpredictable relationships, data sequences, uncertainty and transiency. Most of the proposed approaches can handle some of these issues but not all. In addition, they provide limited considerations with regard to scalability and performance. Real-world applications require high performance, resources optimisation and real-time responsiveness when detecting outliers. This is useful to extract knowledge, detect incidents and predict patterns changes. In this paper, we review and compare recent studies in detecting outliers for data streams. We investigate how researchers improved the outcome of different models and monitoring systems, especially in the context of cyber security.

Keywords: outlier detection; data streams; streaming; big data; high dimension; cyber security.

DOI: 10.1504/IJITST.2019.102799

International Journal of Internet Technology and Secured Transactions, 2019 Vol.9 No.4, pp.446 - 474

Received: 02 May 2017
Accepted: 17 Aug 2017

Published online: 08 Oct 2019 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article