11.07.2015 Views

Memory Bandwidth Limited Kernels - GPU Technology Conference

Memory Bandwidth Limited Kernels - GPU Technology Conference

Memory Bandwidth Limited Kernels - GPU Technology Conference

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Shared <strong>Memory</strong>• Uses:• Use it to improve global memory access patterns• Cache data to reduce redundant global memory accesses• Inter-thread communication within a block• Performance:• Program configurable: 16KB shared / 48 KB L1 OR 48KB shared / 16KB L1• Very low latency• Very high throughput: 1+ TB/s aggregate© NVIDIA Corporation 2011

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!