Int. J. of Knowledge Management Studies   »   2018 Vol.9, No.3

 

 

Title: Using knowledge management to create a Data Hub and leverage the usage of a Data Lake

 

Authors: Matheus Correia Ferreira; Fernanda Bruno Dos Santos; Carlos Eduardo Barbosa; Jano Moreira De Souza

 

Addresses:
COPPE – Graduate School and Research in Engineering, Universidade Federal do Rio de Janeiro (UFRJ), P.O. Box 68511, Rio de Janeiro, 21941-972, Brazil
Postgraduate Program in Computer Science (PPGI), Universidade Federal do Rio de Janeiro (UFRJ), P.O. Box 68511, Rio de Janeiro, 21941-972, Brazil
COPPE – Graduate School and Research in Engineering, Universidade Federal do Rio de Janeiro (UFRJ), P.O. Box 68511, Rio de Janeiro, 21941-972, Brazil
COPPE – Graduate School and Research in Engineering, Universidade Federal do Rio de Janeiro (UFRJ), P.O. Box 68511, Rio de Janeiro, 21941-972, Brazil

 

Abstract: As repositories into which different types of data without pre-processing and modelling can be added, Data Lakes have attracted much interest. They speed up the delivery of data to users and preserve its highest granularity level. That same flexibility can be dangerous. If not managed, it is easy to lose control of the repository because of the volume it holds and its growth. As Data Lakes do not carry the semantics of a regular database, understanding its contents can be cumbersome, which undermines its widespread use within a company, dampening the perception that it has helped data science efforts. This work uses knowledge management models as a basis to solve these issues by enriching the data in a Data Lake with information that enhances its usability. Concomitantly, with the use of a data portal platform and suggested metadata, we propose a portal that provides easy access to the Data Lake.

 

Keywords: Data Lake; data science; metadata; knowledge management; CKAN; comprehensive knowledge archive network.

 

DOI: 10.1504/IJKMS.2018.10015483

 

Int. J. of Knowledge Management Studies, 2018 Vol.9, No.3, pp.260 - 277

 

Submission date: 22 Mar 2017
Date of acceptance: 10 Nov 2017
Available online: 17 Aug 2018

 

 

Editors Full text accessPurchase this articleComment on this article