GPU Performance Analysis and Optimization - GPU Technology ...
GPU Performance Analysis and Optimization - GPU Technology ...
GPU Performance Analysis and Optimization - GPU Technology ...
- No tags were found...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Read-only Loads• Go through the read-only cache– Not coherent with writes– Thus, addresses must not be written by the same kernel• Two ways to enable:– Decorating pointer arguments as hints to compiler:• Pointer of interest: __restrict__ const• All other pointer arguments: __restrict__– Conveys to compiler that no aliasing will occur– Using __ldg() intrinsic• Requires no pointer decoration– Requires GK110 hardware• On prior hardware you can get similar functionality with textures© 2012, NVIDIA32