Title: Multi-agent approach for data mining-based bagging ensembles to improve the decision process for big data

Authors: Ahmed Ghenabzia; Okba Kazar; Abdelhak Merizig; Zaoui Sayah; Merouane Zoubeidi

Addresses: LINATI Laboratory, University of Kasdi Merbah, 30000, Ouargla, Algeria ' LINFI Laboratory, Computer Science Department, University of Biskra, 07000, Biskra, Algeria ' LINFI Laboratory, Computer Science Department, University of Biskra, 07000, Biskra, Algeria ' LINATI Laboratory, University of Kasdi Merbah, 30000, Ouargla, Algeria ' LINFI Laboratory, Computer Science Department, University of Biskra, 07000, Biskra, Algeria; National Well Services Company, A Sonatrach Company, Hassi-Messaoud, Algeria

Abstract: Today, data growth is accelerating to create a big data in various fields, such as social media, websites, e-mails, finance, and medicine. It needs analysis and knowledge extraction. In addition, data mining is a technology whose purpose is to promote information and knowledge extraction from a big data. In this paper, a multi-layered approach based on agents is proposed to extract knowledge from big dataset with bagging algorithm. To achieve this, we call the paradigm of a multi-agent system in Hadoop to distribute the complexity and processing of large datasets across several autonomous entities called agents. The goal is to predict the target class or value for each case in the data using the bagging technique that is dedicated to the task of classification or regression. This proposition will help decision-makers to take right decisions and provide a perfect response time by the use of the multi-agent system in Hadoop. Therefore, to implement the proposed architecture, it is more convenient to use the Apache Hadoop framework, Apache Spark MLlib framework for building scalable machine learning algorithms and JADE platform which provides a complete set of services and agents.

Keywords: big data; Apache Hadoop; Apache Spark MLlib; multi-agent system; MAS; bagging; JADE.

DOI: 10.1504/IJICT.2020.110794

International Journal of Information and Communication Technology, 2020 Vol.17 No.4, pp.380 - 402

Received: 14 Nov 2018
Accepted: 30 Aug 2019

Published online: 29 Oct 2020 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article