KernelGen - GPU Technology Conference
KernelGen - GPU Technology Conference
KernelGen - GPU Technology Conference
- No tags were found...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>KernelGen</strong> execution workflow.Scenario 1: main kernel approached a point where another <strong>GPU</strong> kernel can be launched (for parallelloop). In this case, another kernel is launched via host callback (could be replaced withdynamic parallelism on Kepler sm_35). All <strong>GPU</strong> data is shared (.local data is disabled).CPUversionswitchcompilemain kernel<strong>GPU</strong>stream #2(many threads)<strong>GPU</strong>stream #1(single thread)42 / 75