19.11.2014 Views

Tutorial: Introduction to CUDA Fortran | GTC 2013

Tutorial: Introduction to CUDA Fortran | GTC 2013

Tutorial: Introduction to CUDA Fortran | GTC 2013

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Instruction-Level Parallelism<br />

No Shared Memory<br />

Shared Memory<br />

Thread<br />

Block Size Occupancy Bandwidth Occupancy<br />

Bandwidth<br />

No ILP<br />

Bandwidth<br />

ILP = 4<br />

32 0.25 96 0.016 8 26<br />

64 0.5 125 0.031 15 50<br />

128 1.0 136 0.063 29 90<br />

256 1.0 137 0.125 53 125<br />

512 1.0 137 0.25 91 140<br />

1024 1.0 133 0.5 123 139

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!