Title: A flexible architecture for the pre-processing of solar satellite image time series data - the SETL architecture

Authors: Carlos Roberto Silveira Junior; Marilde Terezinha Prado Santos; Marcela Xavier Ribeiro

Addresses: Department of Computing, Federal University of São Carlos, São Carlos, SP, Brazil ' Department of Computing, Federal University of São Carlos, São Carlos, SP, Brazil ' Department of Computing, Federal University of São Carlos, São Carlos, SP, Brazil

Abstract: Satellite image time series (SITS) is a challenging domain for knowledge discovery database due to their characteristics: each image has several sunspots and each sunspot is associated with sensor data composed of the radiation level and the sunspot classifications. Each image has time parameters and sunspots' coordinates, spatiotemporal data. Several challenges of SITS domain are faced during the extract, transform, and load (ETL) process. In this paper, we proposed an architecture called SITS's extract, transform, and load (SETL) that extracts the visual characteristics of each sunspot and associates it with sunspot's sensor data considering the spatiotemporal relations. SETL brings flexibility and extensibility to working with challenging domains such as SITS because it integrates textual, visual and spatiotemporal characteristics at sunspot-record level. Furthermore, we obtained acceptable performance results according to a domain expert and increased the possibility of using different data mining algorithms comparing to the art state.

Keywords: satellite image time series; SITS; spatiotemporal ETL process; solar STIS process.

DOI: 10.1504/IJDMMM.2019.098970

International Journal of Data Mining, Modelling and Management, 2019 Vol.11 No.2, pp.129 - 143

Received: 07 Oct 2017
Accepted: 19 Jul 2018

Published online: 22 Feb 2019 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article