PCS - Part 2: Multiprocessor Architectures
PCS - Part 2: Multiprocessor Architectures
PCS - Part 2: Multiprocessor Architectures
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Cuda - Programming (1)<br />
NVidia - Compute Unified<br />
Device Architecture<br />
Programming and execution<br />
model - oriented on, but not<br />
fixed to - GPU structures<br />
A kernel function is<br />
executed by many threads<br />
that commonly operate on<br />
different portions of the<br />
input data<br />
Blocks of threads - run<br />
parallel and may use shared<br />
memory<br />
Grid of blocks - blocks are<br />
executed either in a batch<br />
mode or parallel, depending<br />
on the device capabilities<br />
Peter Sobe<br />
<strong>PCS</strong> - <strong>Part</strong> 2: <strong>Multiprocessor</strong> <strong>Architectures</strong>