Title: Single-port and multi-port collective communication operations on single and dual Cell BE processor systems
Authors: Farshad Khunjush, David Gong, Nikitas J. Dimopoulos
Addresses: Department of Computer Science and Engineering, School of Electrical and Computer Engineering, Shiraz University, P.O. Box 71348-51154, Shiraz, Iran. ' Department of Electrical and Computer Engineering, University of Victoria, Victoria, B.C. V8W 3P6, Canada. ' Department of Electrical and Computer Engineering, University of Victoria, Victoria, B.C. V8W 3P6, Canada
Abstract: Recently, a set of factors has been leading high-performance processor architectures toward designs that feature multiple processing cores on a single chip (a.k.a. CMP). The cell broadband engine (BE) shows potential to provide high-performance to parallel applications (e.g., MPI applications). An efficient implementation of collective communication operations is one of the key issues to reach high-performance and scalability in parallel applications. In this work, we implement several collective communications and investigate their performance in terms of latency and the associated components. For this, broadcast, all-gather and total-exchange functions are implemented on the Cell BE processor.
Keywords: collective communications; MPI; cell broadband engines; latency.
International Journal of Communication Networks and Distributed Systems, 2011 Vol.6 No.4, pp.373 - 391
Published online: 05 Jun 2011 *Full-text access for editors Access for subscribers Purchase this article Comment on this article