A Bibliography of Publications of Jack J. Dongarra - University of Utah
REFERENCES 122<br />
USA, August 5, 2011. URL http://www.netlib.org/lapack/lawnspdf/lawn254.pdf.<br />
2011.<br />
UT-CS-11-677 Aug 5<br />
Haidar:2011:PRCb<br />
[758] Azzam Haidar, Hatem Ltaief, and Jack<br />
Dongarra. Parallel reduction to condensed<br />
forms for symmetric eigenvalue<br />
problems using aggregated fine-grained<br />
and memory-aware kernels. In Lathrop<br />
et al. [985], pages 8:1–8:11.<br />
ISBN 1-4503-0771-X. LCCN ????
Jagode:2011:TBP<br />
[759] Heike Jagode, Andreas Knüpfer, Jack<br />
Dongarra, Matthias Jurenz, Matthias S.<br />
Müller, and Wolfgang E. Nagel. Trace-based<br />
performance analysis for the<br />
petascale simulation code FLASH. The<br />
International Journal of High Performance<br />
Computing Applications, 25(4):428–439,<br />
November 2011. CODEN<br />
IHPCFL. ISSN 1094-3420 (print),<br />
1741-2846 (electronic). URL<br />
http://hpc.sagepub.com/content/25/4/428.full.pdf+html.
Kurzak:2011:AGF<br />
[760] Jakub Kurzak, Stanimire Tomov, and<br />
Jack Dongarra. Autotuning GEMMs<br />
for Fermi. LAPACK Working Note<br />
245, Department of Computer Science,<br />
University of Tennessee, Knoxville,<br />
Knoxville, TN 37996, USA, April 18,<br />
2011. URL http://www.netlib.org/lapack/lawnspdf/lawn245.pdf.<br />
UT-CS-11-671. Submitted at SC11, November<br />
12-18, 2011, Seattle, Washington,<br />
USA.
Ltaief:2011:HPB<br />
[761] Hatem Ltaief, Piotr Luszczek, and<br />
Jack Dongarra. High performance<br />
bidiagonal reduction using tile algorithms<br />
on homogeneous multicore architectures.<br />
LAPACK Working Note<br />
247, Department of Computer Science,<br />
University of Tennessee, Knoxville,<br />
Knoxville, TN 37996, USA, May 18,<br />
2011. URL http://www.netlib.org/lapack/lawnspdf/lawn247.pdf.<br />
UT-CS-11-673. Submitted at TOMS.
Ltaief:2011:PHP<br />
[762] Hatem Ltaief, Piotr Luszczek, and Jack<br />
Dongarra. Profiling high performance<br />
dense linear algebra algorithms on multicore<br />
architectures for power and energy<br />
efficiency. LAPACK Working<br />
Note 251, Department of Computer Science,<br />
University of Tennessee, Knoxville,<br />
Knoxville, TN 37996, USA, June 21,<br />
2011. URL http://www.netlib.org/lapack/lawnspdf/lawn251.pdf.<br />
UT-CS-11-674.
Luszczek:2011:TST<br />
[763] Piotr Luszczek, Hatem Ltaief, and<br />
Jack Dongarra. Two-stage tridiagonal<br />
reduction for dense symmetric matrices<br />
using tile algorithms on multicore<br />
architectures. LAPACK Working<br />
Note 244, Department of Computer Science,<br />
University of Tennessee, Knoxville,<br />
Knoxville, TN 37996, USA, April 18,<br />
2011. URL http://www.netlib.org/lapack/lawnspdf/lawn244.pdf.<br />
UT-CS-11-670.
Nath:2011:OSD<br />
[764] Rajib Nath, Stanimire Tomov, Tingxing<br />
“Tim” Dong, and Jack Dongarra.<br />
Optimizing symmetric dense matrix-vector<br />
multiplication on GPUs. In Lathrop<br />
et al. [985], pages 6:1–6:10. ISBN<br />
1-4503-0771-X. LCCN ????