Title: Open data integration model using a polystore system for large scale scientific data archives in astronomy

Authors: Shashank Shrestha; Manoj Poudel; Rashmi P. Sarode; Wanming Chu; Subhash Bhalla

Addresses: Department of Computer and Information Systems, University of Aizu, Tsuruga, Ikki-machi, Aizu-Wakamatsu City, Fukushima, 965-8580, Japan (all authors)

Abstract: Polystore systems have recently been proposed as a new data integration model that provides integrated access to heterogeneous data stores through a single unified query language. There is growing interest in the database community in managing large-scale unstructured data from multiple heterogeneous data stores. The problem draws special attention because of the growing size of the data, the rate at which the data grows, and the emergence of various data types in different scientific data archives. Astronomy, as a scientific domain, produces a huge amount of data that is stored in the data archives provided by NASA and its subsidiaries. The data consists mostly of images, unstructured text, and structured data (relations, key-value pairs). This paper articulates the problems of integrating multiple data stores to manage heterogeneous data and presents a polystore architecture as a solution. It defines a method for managing a local data store and communicating with a remote cloud data store through a web-based query system.

Keywords: astronomical data; heterogeneous data; data integration; workflow system.

DOI: 10.1504/IJCSE.2021.115096

International Journal of Computational Science and Engineering, 2021 Vol.24 No.2, pp.116 - 127

Received: 15 May 2020
Accepted: 24 Aug 2020

Published online: 18 May 2021
