Title: Multiagent-based processing and integration of system data

Authors: Khamisi Kalegele; Johan Sveholm; Hideyuki Takahashi; Kazuto Sasai; Gen Kitagata; Tetsuo Kinoshita

Addresses: Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan

Abstract: This paper presents a multiagent-based ETL (Extract, Transform, Load) unit for the processing and integration of system operational data in order to improve its value. Operational data plays a vital role in managing and optimising systems. Although KDD (Knowledge Discovery and Data Mining) techniques and concepts have long existed, it is only now that we are seeing real applications being extended onto network and systems management. However, the massive data pre-processing (e.g. feature extraction and data integration) which is needed prior to putting KDD tools in action, is still limiting the extent of exploitation. We propose and design the multiagent-based ETL unit which uses Support Vector Machine and Natural Language Processing techniques to efficiently extract information features from operational data. The unit uses an mSPIDER algorithm to discover INclusion Dependencies (INDs) which are used to integrate data across its peers within the system. We demonstrate efficiency of the unit and the used approaches using operational data from a mailing system.

Keywords: multi-agent systems; MAS; data integration; network management; systems management; agent-based systems; data processing; system data; feature extraction; support vector machines; SVM; natural language processing; NLP; knowledge discovery; data mining; mailing systems.

DOI: 10.1504/IJISTA.2013.056207

International Journal of Intelligent Systems Technologies and Applications, 2013 Vol.12 No.2, pp.128 - 155

Received: 05 Nov 2012
Accepted: 06 May 2013

Published online: 31 Aug 2013 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article