CUDA Accelerated Linpack on Clusters - Nvidia
CUDA Accelerated Linpack on Clusters - Nvidia
CUDA Accelerated Linpack on Clusters - Nvidia
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
DTRSM<br />
• GPU implimentati<strong>on</strong> based <strong>on</strong> blocked DGEMM update<br />
• First compute diag<strong>on</strong>al block of A / Row block of B<br />
• DGEMM update next row B with n<strong>on</strong>-diag<strong>on</strong>al row of A<br />
A’<br />
B’<br />
C’