10.07.2015 Views

Kepler's SHUFFLE (SHFL): Tips and Tricks | GTC 2013

Performance Experiment

  • We run the following test on a K20:

        T x = input[tidx];
        for (int i = 0; i < 4096; ++i)
            x = get_right_neighbor(x);
        output[tidx] = x;

  • We launch 26 blocks of 1024 threads
      - On K20, we have 13 SMs
      - We need 2048 threads per SM to reach 100% occupancy, i.e. 2 blocks of 1024 threads on each of the 13 SMs
  • We time different variants of that kernel
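A minimal sketch of how two variants of this test could be written and timed follows. The slide does not show the body of get_right_neighbor, so both versions here are assumptions: one rotates values within a warp using a shuffle, the other does the same exchange through shared memory. The names right_neighbor_shfl, right_neighbor_smem, neighbor_kernel and run_variant are hypothetical, and T is fixed to int. A 2013-era Kepler build would have used __shfl; the sketch uses __shfl_sync, which replaced it in CUDA 9. Because 4096 is a multiple of the warp size (32), a rotation by one lane per iteration brings every value back to its starting lane, so the output can be checked against the input.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

constexpr int kBlocks  = 26;    // 2 blocks per SM on a 13-SM K20
constexpr int kThreads = 1024;  // 2 x 1024 = 2048 resident threads per SM
constexpr int kIters   = 4096;  // loop count taken from the slide

// Assumed variant 1: each lane reads the value held by the next lane (wrapping at 32).
__device__ int right_neighbor_shfl(int x) {
    int lane = threadIdx.x & 31;
    return __shfl_sync(0xffffffffu, x, (lane + 1) & 31);
}

// Assumed variant 2: the same exchange staged through shared memory.
__device__ int right_neighbor_smem(int x, int* smem) {
    int lane = threadIdx.x & 31;
    int warp_base = threadIdx.x - lane;
    smem[threadIdx.x] = x;
    __syncthreads();                        // make every store visible
    int y = smem[warp_base + ((lane + 1) & 31)];
    __syncthreads();                        // do not overwrite before all loads finish
    return y;
}

template <bool UseShfl>
__global__ void neighbor_kernel(const int* input, int* output) {
    __shared__ int smem[kThreads];
    int tidx = blockIdx.x * blockDim.x + threadIdx.x;
    int x = input[tidx];
    for (int i = 0; i < kIters; ++i)
        x = UseShfl ? right_neighbor_shfl(x) : right_neighbor_smem(x, smem);
    output[tidx] = x;
}

// Runs one variant, stores the kernel time in *ms, returns true if output == input.
template <bool UseShfl>
bool run_variant(float* ms) {
    const int n = kBlocks * kThreads;
    std::vector<int> h_in(n), h_out(n, -1);
    for (int i = 0; i < n; ++i) h_in[i] = i;

    int *d_in = nullptr, *d_out = nullptr;
    cudaMalloc(&d_in, n * sizeof(int));
    cudaMalloc(&d_out, n * sizeof(int));
    cudaMemcpy(d_in, h_in.data(), n * sizeof(int), cudaMemcpyHostToDevice);

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);
    cudaEventRecord(start);
    neighbor_kernel<UseShfl><<<kBlocks, kThreads>>>(d_in, d_out);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);
    cudaEventElapsedTime(ms, start, stop);

    cudaMemcpy(h_out.data(), d_out, n * sizeof(int), cudaMemcpyDeviceToHost);
    bool ok = (cudaGetLastError() == cudaSuccess) && (h_out == h_in);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(d_in);
    cudaFree(d_out);
    return ok;
}

int main() {
    float ms_shfl = 0.f, ms_smem = 0.f;
    bool ok_shfl = run_variant<true>(&ms_shfl);
    bool ok_smem = run_variant<false>(&ms_smem);
    std::printf("shuffle:       %s, %.3f ms\n", ok_shfl ? "ok" : "FAILED", ms_shfl);
    std::printf("shared memory: %s, %.3f ms\n", ok_smem ? "ok" : "FAILED", ms_smem);
    return (ok_shfl && ok_smem) ? 0 : 1;
}
```

The launch configuration mirrors the slide (26 blocks of 1024 threads). On a GPU with a different SM count or a lower per-SM thread limit, the grid would need adjusting to keep the same occupancy, and a single untimed warm-up launch before the measured one would give steadier numbers.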
