Harnessing GPU&#039;s Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers and Achieve 74 Gflops/Watt on Nvidia V100

Submitted by scrawford on Tue, 07/23/2019 - 12:32

Title	Harnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers and Achieve 74 Gflops/Watt on Nvidia V100
Publication Type	Poster
Year of Publication	2018
Authors	Haidar, A., A. Abdelfattah, S. Tomov, and J. Dongarra
Date Published	2018-03
Event	GPU Technology Conference (GTC), Poster
Event Location	San Jose, CA

Project Tags:

File:

External Publication Flag: