12.07.2015 Views

GPU Performance Analysis and Optimization - GPU Technology ...

GPU Performance Analysis and Optimization - GPU Technology ...

GPU Performance Analysis and Optimization - GPU Technology ...

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Kepler 4-byte Bank Mode• Underst<strong>and</strong>ing this mapping details matters only if you’re tryingto get 8-byte throughput in 4-byte mode– For all else just think that you have 32 banks, 4-bytes wide• Mapping addresses to banks:– Successive 4-byte words go to successive banks• We have to choose between two 4-byte “half-words” for each bank– “First” 32 4-byte words go to lower half-words– “Next” 32 4-byte words go to upper half-words– Given the 8 least-significant address bits: ...HBBBBBxx• xx selects the byte with a 4-byte word• BBBBB selects the bank• H selects the half-word within the bank• Higher bits select the “column” within a bank© 2012, NVIDIA83

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!