Title: Empirical investigation of dimension hierarchy sharing-based metrics for multidimensional schema understandability

Authors: Anjana Gosain; Jaspreeti Singh

Addresses: University School of Information, Communication and Technology, Guru Gobind Singh Indraprastha University, Sector – 16C, Dwarka, New Delhi, 110078, India; University School of Information, Communication and Technology, Guru Gobind Singh Indraprastha University, Sector – 16C, Dwarka, New Delhi, 110078, India

Abstract: In recent years, quality has gained a lot of importance in the development of data warehouse systems. Predicting the understandability of multidimensional schemas could play a key role in controlling data warehouse quality at early stages of development. In this area, some effort has been spent to define structural metrics and identify models for assessing the quality of these systems. Of the structural properties used to define metrics, dimension hierarchies and their sharing play a primary role in enhancing the analytical capabilities of multidimensional schemas, thereby affecting their quality. The authors have previously proposed structural metrics based on these aspects. The objective of this study is to apply principal component analysis (PCA) to determine whether our metrics are improvements over other existing metrics, and to apply logistic regression to study whether the metrics (selected as relevant in the extracted principal components), combined together, are indicators of multidimensional schema understandability. The results of PCA confirm that our structural metrics based on the concept of sharing are different from other such metrics existing in the literature. Further, the metrics selected as principal components can be used in combination to predict the understandability of data warehouse multidimensional schemas.
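
The two-step analysis named in the abstract (PCA to pick out distinct metrics, then logistic regression on the selected metrics) can be illustrated with a minimal sketch. This is not the authors' procedure or data: the four metric columns, the synthetic values, the rule of keeping the highest-loading metric per component, and the helper names (`standardize`, `top_components`, `fit_logistic`) are all assumptions made for illustration. PCA is approximated here by power iteration with deflation so that the example needs only the Python standard library.

```python
import math
import random

def standardize(rows):
    """Scale each column to zero mean and unit variance."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(r, means, stds)] for r in rows]

def covariance(rows):
    """Covariance matrix of already-centred rows."""
    n, d = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / n for j in range(d)]
            for i in range(d)]

def top_components(cov, k, iters=300):
    """Leading k eigenvectors via power iteration with deflation."""
    d = len(cov)
    cov = [row[:] for row in cov]
    comps = []
    for _ in range(k):
        v = [1.0 / math.sqrt(d)] * d
        for _ in range(iters):
            w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w)) or 1.0
            v = [x / norm for x in w]
        lam = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d))
                  for i in range(d))
        comps.append(v)
        # Remove the component just found before searching for the next one.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)]
               for i in range(d)]
    return comps

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-max(-50.0, min(50.0, z))))

def fit_logistic(xs, ys, lr=0.1, epochs=500):
    """Plain batch gradient descent on the logistic loss."""
    w, b, n = [0.0] * len(xs[0]), 0.0, len(xs)
    for _ in range(epochs):
        gw, gb = [0.0] * len(w), 0.0
        for x, y in zip(xs, ys):
            err = sigmoid(b + sum(wi * xi for wi, xi in zip(w, x))) - y
            gb += err
            for i, xi in enumerate(x):
                gw[i] += err * xi
        w = [wi - lr * g / n for wi, g in zip(w, gw)]
        b -= lr * gb / n
    return w, b

# Synthetic, hypothetical data: columns 0-2 stand for three overlapping
# metrics, column 3 for a metric capturing a different property.
rng = random.Random(0)
rows, labels = [], []
for _ in range(200):
    a, s = rng.gauss(0, 1), rng.gauss(0, 1)
    rows.append([a + 0.3 * rng.gauss(0, 1) for _ in range(3)] + [s])
    # Assumed ground truth: 1 = schema judged understandable.
    labels.append(1 if a + s + 0.5 * rng.gauss(0, 1) < 0 else 0)

z = standardize(rows)
components = top_components(covariance(z), k=2)
# Keep, per component, the metric with the largest absolute loading.
selected = [max(range(len(c)), key=lambda i: abs(c[i])) for c in components]
xs = [[r[i] for i in selected] for r in z]
weights, bias = fit_logistic(xs, labels)
preds = [1 if sigmoid(bias + sum(w * x for w, x in zip(weights, r))) >= 0.5
         else 0 for r in xs]
accuracy = sum(p == y for p, y in zip(preds, labels)) / len(labels)
print("selected metric columns:", selected, "training accuracy:", round(accuracy, 2))
```

In this toy setup the independent fourth column loads on its own component, which mirrors the kind of evidence the abstract describes for metrics that measure something different from existing ones. A real analysis would use measured schema metrics, understandability ratings from subjects, and an established statistics package.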

Keywords: data warehouse; quality metrics; principal component analysis; PCA; logistic regression; understandability; multidimensional schemas.

DOI: 10.1504/IJIEI.2019.099086

International Journal of Intelligent Engineering Informatics, 2019 Vol.7 No.2/3, pp.141 - 163

Received: 04 Jan 2017
Accepted: 17 Sep 2017

Published online: 15 Apr 2019
