29.01.2013 Views

Tutorial CUDA

Tutorial CUDA

Tutorial CUDA

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Shared Memory Is Banked<br />

Bandwidth of each bank is 32 bits per 2 clock cycles<br />

Successive 32-bit words are assigned to successive<br />

banks<br />

G80 has 16 banks<br />

© NVIDIA Corporation 2008<br />

So bank = address % 16<br />

Same as the size of a half-warp<br />

No bank conflicts between different half-warps, only within a<br />

single half-warp

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!