Title: Multiagent-based processing and integration of system data
Authors: Khamisi Kalegele; Johan Sveholm; Hideyuki Takahashi; Kazuto Sasai; Gen Kitagata; Tetsuo Kinoshita
Addresses: Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan ' Research Institute of Electrical Communication, Tohoku University, Katahira 2-1-1, Sendai-shi, Miyagi-ken, Japan
Abstract: This paper presents a multiagent-based ETL (Extract, Transform, Load) unit for the processing and integration of system operational data in order to improve its value. Operational data plays a vital role in managing and optimising systems. Although KDD (Knowledge Discovery and Data Mining) techniques and concepts have long existed, it is only now that we are seeing real applications being extended onto network and systems management. However, the massive data pre-processing (e.g. feature extraction and data integration) which is needed prior to putting KDD tools in action, is still limiting the extent of exploitation. We propose and design the multiagent-based ETL unit which uses Support Vector Machine and Natural Language Processing techniques to efficiently extract information features from operational data. The unit uses an mSPIDER algorithm to discover INclusion Dependencies (INDs) which are used to integrate data across its peers within the system. We demonstrate efficiency of the unit and the used approaches using operational data from a mailing system.
Keywords: multi-agent systems; MAS; data integration; network management; systems management; agent-based systems; data processing; system data; feature extraction; support vector machines; SVM; natural language processing; NLP; knowledge discovery; data mining; mailing systems.
DOI: 10.1504/IJISTA.2013.056207
International Journal of Intelligent Systems Technologies and Applications, 2013 Vol.12 No.2, pp.128 - 155
Received: 05 Nov 2012
Accepted: 06 May 2013
Published online: 31 Aug 2013 *