GPU Performance Analysis and Optimization - GPU Technology ...
GPU Performance Analysis and Optimization - GPU Technology ...
GPU Performance Analysis and Optimization - GPU Technology ...
- No tags were found...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Kepler 4-byte Bank Mode• Underst<strong>and</strong>ing this mapping details matters only if you’re tryingto get 8-byte throughput in 4-byte mode– For all else just think that you have 32 banks, 4-bytes wide• Mapping addresses to banks:– Successive 4-byte words go to successive banks• We have to choose between two 4-byte “half-words” for each bank– “First” 32 4-byte words go to lower half-words– “Next” 32 4-byte words go to upper half-words– Given the 8 least-significant address bits: ...HBBBBBxx• xx selects the byte with a 4-byte word• BBBBB selects the bank• H selects the half-word within the bank• Higher bits select the “column” within a bank© 2012, NVIDIA83