30.04.2014 Views

Segmented Scan - Nvidia

Segmented Scan - Nvidia

Segmented Scan - Nvidia

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Binary <strong>Scan</strong><br />

• Often need to scan 1-bit values<br />

— Stream compact: scan true/false flags<br />

— Split / Radix Sort: scan 1-bit flags<br />

• Fermi GPU architecture provides efficient 1-bit warp scan<br />

— int __ballot(int p): 32-bit ―ballot‖ of t/f p from whole warp<br />

— int __popc(int x): count number of 1 bits in x<br />

• Combine with a per-thread mask to get 1-bit warp scan<br />

— Or with no mask for 1-bit warp count

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!