Batched Matrix Computations on Hardware Accelerators Based on GPUs

Title: Batched Matrix Computations on Hardware Accelerators Based on GPUs
Publication Type: Conference Paper
Year of Publication: 2015
Authors: Haidar, A., A. Abdelfattah, S. Tomov, and J. Dongarra
Conference Name: 2015 SIAM Conference on Applied Linear Algebra (SIAM LA)
Date Published: 2015-10
Publisher: SIAM
Conference Location: Atlanta, GA
Abstract: We will present techniques for small matrix computations on GPUs and their use for energy-efficient, high-performance solvers. Work on small problems delivers high performance through improved data reuse. Many numerical libraries and applications need this functionality further developed. We describe how the main factorizations (LU, QR, and Cholesky) are performed in parallel on a set of small dense matrices. We achieve significant acceleration and reduced energy consumption compared with other solutions. Our techniques are of interest to GPU application developers in general.