26.12.2014 Views

DirectCompute Optimizations and Best Practices - Nvidia

DirectCompute Optimizations and Best Practices - Nvidia

DirectCompute Optimizations and Best Practices - Nvidia

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Shared Memory Bank Addressing<br />

• No Bank Conflicts<br />

- Linear addressing stride == 1<br />

• No Bank Conflicts<br />

– R<strong>and</strong>om 1:1 Permutation<br />

Thread 0<br />

Thread 1<br />

Thread 2<br />

Thread 3<br />

Thread 4<br />

Thread 5<br />

Thread 6<br />

Thread 7<br />

Bank 0<br />

Bank 1<br />

Bank 2<br />

Bank 3<br />

Bank 4<br />

Bank 5<br />

Bank 6<br />

Bank 7<br />

Thread 0<br />

Thread 1<br />

Thread 2<br />

Thread 3<br />

Thread 4<br />

Thread 5<br />

Thread 6<br />

Thread 7<br />

Bank 0<br />

Bank 1<br />

Bank 2<br />

Bank 3<br />

Bank 4<br />

Bank 5<br />

Bank 6<br />

Bank 7<br />

Thread 15<br />

Bank 15<br />

Thread 15<br />

Bank 15

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!