30.07.2015 Views

Slides - Par Lab

Slides - Par Lab

Slides - Par Lab

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

SIMD & Control Flow ¡ Nvidia GPU hardware handles control flow divergence and reconvergence § Write scalar SIMD code, the hardware schedules the SIMD execution § One caveat: __syncthreads() can’t appear in a divergent path ▪ This may cause programs to hang § Good performing code will try to keep the execution convergent within a warp ▪ Warp divergence only costs because of a finite instruction cache 35/73

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!