12.07.2015 Views

GPU Performance Analysis and Optimization - GPU Technology ...

Read-only Loads

• Go through the read-only cache
  – Not coherent with writes
  – Thus, addresses must not be written by the same kernel
• Two ways to enable:
  – Decorating pointer arguments as hints to compiler:
    • Pointer of interest: __restrict__ const
    • All other pointer arguments: __restrict__
      – Conveys to compiler that no aliasing will occur
  – Using __ldg() intrinsic
    • Requires no pointer decoration
• Requires GK110 hardware
  – On prior hardware you can get similar functionality with textures

© 2012, NVIDIA
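The two ways of enabling read-only loads described on this slide can be sketched as follows. This is a minimal illustration, not code from the original deck: the kernel names scale_restrict and scale_ldg and the host helper run_read_only_load_demo are hypothetical, and the sketch assumes a device of compute capability 3.5 (GK110) or newer. Note that with pointer decoration the compiler is only given a hint and may or may not route the loads through the read-only cache, whereas __ldg() requests it explicitly.

```cuda
#include <cassert>
#include <vector>
#include <cuda_runtime.h>

// Way 1: pointer decoration. The read-only input is const __restrict__ and
// every other pointer argument is __restrict__, telling the compiler that
// the pointers do not alias. This is a hint only.
__global__ void scale_restrict(const float* __restrict__ in,
                               float* __restrict__ out,
                               float factor, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = factor * in[i];
}

// Way 2: the __ldg() intrinsic. No pointer decoration is needed, but the
// kernel must still never write to the addresses it reads through __ldg().
__global__ void scale_ldg(const float* in, float* out, float factor, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = factor * __ldg(&in[i]);
}

// Hypothetical host helper: runs both kernels on the same input and returns
// true if both produce factor * in[i] for every element.
bool run_read_only_load_demo(int n)
{
    assert(n > 0);
    const float factor = 2.0f;
    const size_t bytes = static_cast<size_t>(n) * sizeof(float);

    std::vector<float> h_in(n), h_a(n), h_b(n);
    for (int i = 0; i < n; ++i) h_in[i] = static_cast<float>(i);

    // Input and outputs live in separate buffers, so the input is never
    // written by either kernel and nothing aliases.
    float *d_in = nullptr, *d_a = nullptr, *d_b = nullptr;
    bool ok = cudaMalloc(&d_in, bytes) == cudaSuccess
           && cudaMalloc(&d_a, bytes) == cudaSuccess
           && cudaMalloc(&d_b, bytes) == cudaSuccess
           && cudaMemcpy(d_in, h_in.data(), bytes,
                         cudaMemcpyHostToDevice) == cudaSuccess;

    if (ok) {
        const int threads = 256;
        const int blocks = (n + threads - 1) / threads;
        scale_restrict<<<blocks, threads>>>(d_in, d_a, factor, n);
        scale_ldg<<<blocks, threads>>>(d_in, d_b, factor, n);
        // The blocking copies below also wait for both kernels to finish.
        ok = cudaGetLastError() == cudaSuccess
          && cudaMemcpy(h_a.data(), d_a, bytes,
                        cudaMemcpyDeviceToHost) == cudaSuccess
          && cudaMemcpy(h_b.data(), d_b, bytes,
                        cudaMemcpyDeviceToHost) == cudaSuccess;
    }

    for (int i = 0; ok && i < n; ++i)
        ok = (h_a[i] == factor * h_in[i]) && (h_b[i] == factor * h_in[i]);

    cudaFree(d_in);
    cudaFree(d_a);
    cudaFree(d_b);
    return ok;
}
```

Calling run_read_only_load_demo(1 << 16) from a host program should return true on suitable hardware. With older toolkits the file would likely need to be built for sm_35 or later (for example with nvcc -arch=sm_35), since __ldg() is not available on earlier architectures.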
