Acceleration of unstructured implicit low-order finite-element earthquake simulation using OpenACC on Pascal GPUs Online publication date: Tue, 11-Dec-2018
by Takuma Yamaguchi; Kohei Fujita; Tsuyoshi Ichimura; Muneo Hori; Lalith Maddegedara
International Journal of High Performance Computing and Networking (IJHPCN), Vol. 13, No. 1, 2019
Abstract: We accelerate CPU-based unstructured implicit low-order finite-element simulations by porting to a GPU-CPU heterogeneous compute environment using OpenACC. We modified performance-sensitive parts of the code, such as sparse matrix-vector multiplication and MPI communication, so that computations would be suitable for GPUs. Other parts of the earthquake simulation code are ported by directly inserting OpenACC directives into the CPU code. This porting approach enables high performance with relatively low development costs. When comparing eight K computer nodes and eight NVIDIA Pascal P100 GPUs, we achieve 20.8 times speedup for the 3 × 3 block Jacobi preconditioned conjugate gradient finite-element solver. We show the effectiveness of the proposed method through many-case crust-deformation simulations and a large-scale computation using finite element model with 109 degrees-of-freedom on a GPU cluster.
Online publication date: Tue, 11-Dec-2018
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of High Performance Computing and Networking (IJHPCN):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email email@example.com