Title: Data quality-based view selection in big data integration system
Authors: Samir Anter
Addresses: Faculty of Science and Technology of Mohammedia, Hassan II University of Casablanca, Morocco
Abstract: An integration system is an intermediate tool between a user and a set of distributed sources. It provides transparent access to information through an interface using a unique query language. This provides an illusion to the end user as if it is accessing a homogeneous central repository. In a hybrid system, one part of the data is queried on demand whereas another part is extracted, filtered and stored in a local database. This approach is very much promising for data access in the big data context. However, obtaining satisfactory results depends on the correct choice of data to materialise. Further this task is even more difficult in the big data context. In this article, a novel approach has been proposed to overcome the above problem which uses data quality to select views that will be materialised.
Keywords: data integration? materialised views? big data? data quality? view selection.
DOI: 10.1504/IJBIDM.2023.133139
International Journal of Business Intelligence and Data Mining, 2023 Vol.23 No.3, pp.264 - 276
Received: 11 Jan 2022
Accepted: 30 May 2022
Published online: 01 Sep 2023 *