GPU Performance Analysis and Optimization - GPU Technology ...
Memory Throughput Analysis

• Two perspectives on the throughput:
  – Application’s point of view:
    • count only bytes requested by application
  – HW point of view:
    • count all bytes moved by hardware
• The two views can be different:
  – Memory is accessed at 32 or 128 byte granularity
    • Scattered/offset pattern: application doesn’t use all the hw transaction bytes
  – Broadcast: the same small transaction serves many threads in a warp
• Two aspects to inspect for performance impact:
  – Address pattern
  – Number of concurrent accesses in flight

© 2012, NVIDIA
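The gap between the two views can be illustrated with a small model of one warp's memory access. The sketch below is a simplification and not how any particular GPU counts transactions: it assumes that every segment touched by at least one thread is moved in full and exactly once, and it ignores caching. The function name `warp_traffic` and its parameters are hypothetical, introduced only for this illustration.

```python
def warp_traffic(addresses, access_bytes=4, segment_bytes=128):
    """Return (requested_bytes, moved_bytes) for one warp-wide access.

    addresses     -- byte address accessed by each thread in the warp
    access_bytes  -- bytes each thread asks for (assumed 4, e.g. a float)
    segment_bytes -- transaction granularity (32 or 128 on the slide)

    Simplified model (an assumption): every segment touched by at least
    one thread is moved once, in full.
    """
    # Application's view: every thread's request is counted.
    requested = len(addresses) * access_bytes

    # Hardware's view: count the distinct segments the warp touches.
    segments = set()
    for addr in addresses:
        first = addr // segment_bytes
        last = (addr + access_bytes - 1) // segment_bytes
        segments.update(range(first, last + 1))
    moved = len(segments) * segment_bytes

    return requested, moved


WARP = 32
patterns = {
    "aligned":   [4 * t for t in range(WARP)],        # consecutive words
    "offset":    [4 * t + 4 for t in range(WARP)],    # shifted by one word
    "scattered": [128 * t for t in range(WARP)],      # one word per 128 B
    "broadcast": [0] * WARP,                          # all threads, same word
}

for name, addrs in patterns.items():
    for seg in (128, 32):
        requested, moved = warp_traffic(addrs, segment_bytes=seg)
        print(f"{name:9s} segment={seg:3d}B requested={requested:4d}B moved={moved:4d}B")
```

Under this model the aligned pattern moves exactly what the application requests. The offset and scattered patterns move more, because part of each transaction goes unused. With 32-byte segments, the broadcast pattern moves less than the application-level count, since one small transaction serves the whole warp.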