Title: Single-port and multi-port collective communication operations on single and dual Cell BE processor systems

Authors: Farshad Khunjush, David Gong, Nikitas J. Dimopoulos

Addresses: Department of Computer Science and Engineering, School of Electrical and Computer Engineering, Shiraz University, P.O. Box 71348-51154, Shiraz, Iran. ' Department of Electrical and Computer Engineering, University of Victoria, Victoria, B.C. V8W 3P6, Canada. ' Department of Electrical and Computer Engineering, University of Victoria, Victoria, B.C. V8W 3P6, Canada

Abstract: Recently, a set of factors has been leading high-performance processor architectures toward designs that feature multiple processing cores on a single chip (a.k.a. CMP). The cell broadband engine (BE) shows potential to provide high-performance to parallel applications (e.g., MPI applications). An efficient implementation of collective communication operations is one of the key issues to reach high-performance and scalability in parallel applications. In this work, we implement several collective communications and investigate their performance in terms of latency and the associated components. For this, broadcast, all-gather and total-exchange functions are implemented on the Cell BE processor.

Keywords: collective communications; MPI; cell broadband engines; latency.

DOI: 10.1504/IJCNDS.2011.040559

International Journal of Communication Networks and Distributed Systems, 2011 Vol.6 No.4, pp.373 - 391

Published online: 26 Feb 2015 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article