Par4all: Auto-Parallelizing C and Fortran for the CUDA Architecture

More documents

Recommendations

Info

•Results ◮ (II) Speed up 100 10 �Par4All in <strong>CUDA</strong> — GPU conference 10/1/2009 1 Tesla 1060 240 streams GTX 200 192 streams 8c Intel X5472 3 GHz (OpenMP) 2c Intel Core2 6600 2,4 GHz (OpenMP) 1c Intel X5472 3 GHz DOUBLE PRECISION 200 300 Matrix size (Kbytes) Reference 1c Intel 6600 2,4 GHz HPC Project, Mines ParisTech, TÉLÉCOM Bretagne, RPI Ronan KERYELL et al. 35 / 46
•Results ◮ (I) Average time in s (5 samples) 100 10 1 0.1 �Par4All in <strong>CUDA</strong> — GPU conference 10/1/2009 2c Intel 6600 2,4 GHz (OpenMP) 2c Intel T9400 2,5 GHz (OpenMP) 8c Intel X5472 3 GHz (OpenMP) 8c X5570 2,9 GHz (OpenMP) 0.01 Quadro FX 3700M (G92GL) 128 streams GTX 200 192 streams SIMPLE PRECISION Tesla 1060 240 streams 0.001 50 100 200 300 /2 Matrix size (Kbytes) 1c Intel 6600 2,4 GHz 1c Intel T9400 2,5 GHz 1c Intel X5472 3 GHz or X5570 2,9 GHz HPC Project, Mines ParisTech, TÉLÉCOM Bretagne, RPI Ronan KERYELL et al. 36 / 46 /8 /35 /140 /175
Page 1 and 2: • ◮ Par4All: Auto-Parallelizing
Page 3 and 4: •Introduction ◮ • Parallelize
Page 5 and 6: •Par4All ◮ 1 Par4All 2 CUDA gen
Page 7 and 8: •Par4All ◮ (I) • PIPS (Interp
Page 9 and 10: •Par4All ◮ • Automatic parall
Page 11 and 12: •CUDA generation ◮ 1 Par4All 2
Page 13 and 14: •CUDA generation ◮ • Find par
Page 15 and 16: •CUDA generation ◮ (I) Parallel
Page 17 and 18: •CUDA generation ◮ (I) • Memo
Page 19 and 20: •CUDA generation ◮ (I) Conserva
Page 21 and 22: •CUDA generation ◮ • Hardware
Page 23 and 24: •CUDA generation ◮ (II) • So
Page 25 and 26: •CUDA generation ◮ • Reductio
Page 27 and 28: •CUDA generation ◮ (II) • For
Page 29 and 30: •CUDA generation ◮ (I) • CUDA
Page 31 and 32: •CUDA generation ◮ (III) 21 [..
Page 33 and 34: •Results ◮ 1 Par4All 2 CUDA gen
Page 35: •Results ◮ (I) Average time in
Page 39 and 40: •Conclusion ◮ 1 Par4All 2 CUDA
Page 41 and 42: •Conclusion ◮ Why open source?
Page 43 and 44: •Conclusion ◮ (II) • We have
Page 45 and 46: •Conclusion ◮ (II) • Capitali
Page 47 and 48: •Conclusion ◮ • HPC Project

Par4all: Auto-Parallelizing C and Fortran for the CUDA Architecture

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?