Title: A data model-independent approach to big research data integration
Authors: Valentina Bartalesi; Carlo Meghini; Costantino Thanos
Addresses: Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" (ISTI) - CNR, via G. Moruzzi 1, 56127, Pisa, Italy; Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" (ISTI) - CNR, via G. Moruzzi 1, 56127, Pisa, Italy; Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" (ISTI) - CNR, via G. Moruzzi 1, 56127, Pisa, Italy
Abstract: This paper addresses the data integration problem in the scientific domain. It presents the main characteristics of big research data that make the traditional approach to data integration infeasible, and discusses two emerging practices: an exploratory approach to data seeking and an empiricist epistemological approach to knowledge creation. Based on these considerations, we present a new paradigm of data integration and an application ontology that supports it. The ontology is based on five types of events, and every event is extensionally modelled as an input/output operation on the data entity involved. The strength of the ontology, and of the whole approach to data integration, is that no assumption is made about the data models in which the databases or views are expressed. This provides a level of generality that successfully deals with the heterogeneity of the domain.
Keywords: data integration; big research data; ontology; semantic web.
DOI: 10.1504/IJMSO.2019.102680
International Journal of Metadata, Semantics and Ontologies, 2019 Vol.13 No.4, pp.330 - 345
Received: 08 Jan 2019
Accepted: 19 Jul 2019
Published online: 01 Oct 2019