30.07.2015 Views

Slides - Par Lab

Slides - Par Lab

Slides - Par Lab

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Using per-­‐block shared memory ¡ Variables shared across block __shared__ int *begin, *end; ¡ Scratchpad memory __shared__ int scratch[BLOCKSIZE]; scratch[threadIdx.x] = begin[threadIdx.x]; // … compute on scratch values … begin[threadIdx.x] = scratch[threadIdx.x]; ¡ Communicating values between threads Block scratch[threadIdx.x] = begin[threadIdx.x]; __syncthreads(); int left = scratch[threadIdx.x -­‐ 1]; ¡ Per-­‐block shared memory is faster than L1 cache, slower than register file ¡ It is relatively small: register file is 2-­‐4x larger Shared 24/73

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!