Advanced_GPU_talk
Advanced_GPU_talk
Advanced_GPU_talk
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
“Usual way”<br />
cudaMemcpy(…)<br />
kernel_1(…)<br />
cudaDeviceSynchronize()<br />
timing()<br />
kernel_2(…)<br />
cudaDeviceSynchronize()<br />
timing()<br />
…<br />
Kernel launch<br />
overhead<br />
~10-50us<br />
launch<br />
copy<br />
launch<br />
kernel<br />
synchronize<br />
launch<br />
kernel<br />
H2D<br />
Kernel 1<br />
Kernel 2