Title: Towards distributed acceleration of image processing applications using reconfigurable active SSD clusters: a case study of seismic data analysis
Authors: Mageda Sharafeddin; Hmayag Partamian; Mariette Awad; Mazen A.R. Saghir; Haitham Akkary; Hassan Artail; Hazem Hajj; Mohammed Baydoun
Addresses: Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon ' Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon ' Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon ' Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon ' Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon ' Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon ' Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon ' Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon
Abstract: In this work, we propose a high performance distributed system that consists of several middleware servers each connected to a number of FPGAs with extended solid state storage that we call reconfigurable active solid state device (RASSD) nodes. A full data communication solution between middleware and RASSD nodes is presented. We use seismic data analysis as a case study to quantify how and by how much RASSD nodes can accelerate computational throughput. Speedup of seismic data prediction time when both GLCM and Haralick features are accelerated is examined. The distributed system achieves 102× speedup compared to 4-thread openMP implementation and 265× speedup compared to single thread modern CPU performance. Performance is 5× better than previous work reporting speedup on GLCM and Haralick feature analysis when data is local to the FPGA and 20× better than an identical CUDA implementation using modern GPU.
Keywords: active solid state devices; data analysis; distributed systems; field programmable gate array; FPGA; grey level cooccurrence matrix; GLCM; Haralick attributes; image processing applications; intelligent systems; machine learning; parallel architectures; reconfigurable computing; seismic data analysis.
DOI: 10.1504/IJHPCN.2018.093228
International Journal of High Performance Computing and Networking, 2018 Vol.11 No.4, pp.261 - 278
Received: 01 Aug 2015
Accepted: 23 Feb 2016
Published online: 24 Jul 2018 *