12.07.2015 Views

GPU Performance Analysis and Optimization - GPU Technology ...

GPU Performance Analysis and Optimization - GPU Technology ...

GPU Performance Analysis and Optimization - GPU Technology ...

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Caching Load• Scenario:– All threads in a warp request the same 4-byte word• Addresses fall within a single cache-line– Warp needs 4 bytes– 128 bytes move across the bus on a miss– Bus utilization: 3.125%addresses from a warp...032 64 96 128 160 192 224 256 288 320 352 384 416 448Memory addresses© 2012, NVIDIA42

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!