12.07.2015 Views

GPU Performance Analysis and Optimization - GPU Technology ...

GPU Performance Analysis and Optimization - GPU Technology ...

GPU Performance Analysis and Optimization - GPU Technology ...

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Execution• Instructions are issued/executed per warp– Warp = 32 consecutive threads• Think of it as a “vector” of 32 threads• The same instruction is issued to the entire warp• Scheduling– Warps are scheduled at run-time– Hardware picks from warps that have an instruction ready to execute• Ready = all arguments are ready– Instruction latency is hidden by executing other warps98

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!