A Step towards Energy Efficient Computing: Redesigning A Hydrodynamic Application on CPU-GPU

Tingxing Dong; Veselin Dobrev; Tzanio Kolev; Robert Rieben; Stanimire Tomov; Jack Dongarra

Submitted by webmaster on Tue, 06/17/2014 - 11:51

Title	A Step towards Energy Efficient Computing: Redesigning A Hydrodynamic Application on CPU-GPU
Publication Type	Conference Paper
Year of Publication	2014
Authors	Dong, T., V. Dobrev, T. Kolev, R. Rieben, S. Tomov, and J. Dongarra
Conference Name	IPDPS 2014
Date Published	2014-05
Publisher	IEEE
Conference Location	Phoenix, AZ
Keywords	Computer science, CUDA, FEM, Finite element method, linear algebra, nVidia, Tesla K20
Abstract	Power and energy consumption are becoming an increasing concern in high performance computing. Compared to multi-core CPUs, GPUs have a much better performance per watt. In this paper we discuss efforts to redesign the most computation intensive parts of BLAST, an application that solves the equations for compressible hydrodynamics with high order finite elements, using GPUs [10, 1]. In order to exploit the hardware parallelism of GPUs and achieve high performance, we implemented custom linear algebra kernels. We intensively optimized our CUDA kernels by exploiting the memory hierarchy, which exceed the vendor’s library routines substantially in performance. We proposed an autotuning technique to adapt our CUDA kernels to the orders of the finite element method. Compared to a previous base implementation, our redesign and optimization lowered the energy consumption of the GPU in two aspects: 60% less time to solution and 10% less power required. Compared to the CPU-only solution, our GPU accelerated BLAST obtained a 2:5x overall speedup and 1:42x energy efficiency (greenup) using 4th order (Q4) finite elements, and a 1:9x speedup and 1:27x greenup using 2nd order (Q2) finite elements.

Project Tags:

magma

File:

icl-utk-663-2014.pdf