13.07.2015

KernelGen - GPU Technology Conference


KernelGen execution workflow.

Scenario 1: the main kernel reaches a point where another GPU kernel can be launched (for a parallel loop). In this case, the other kernel is launched via a host callback (this could be replaced with dynamic parallelism on Kepler sm_35). All GPU data is shared (.local data is disabled).

[Diagram labels: CPU, version switch, compile, main kernel, GPU stream #2 (many threads), GPU stream #1 (single thread)]
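The handshake in Scenario 1 can be sketched on the CPU alone. The following is a minimal analogy, not KernelGen's actual implementation: plain C++ threads stand in for the GPU streams, a shared atomic flag stands in for the host callback, and every name (`Shared`, `State`, `main_kernel`, `loop_kernel`, `run_workflow`) is hypothetical. The compile step shown in the diagram is omitted.

```cpp
#include <atomic>
#include <cstddef>
#include <functional>
#include <thread>
#include <vector>

// Hypothetical handshake states shared by the "main kernel" and the host.
enum State : int { IDLE = 0, LAUNCH_REQUESTED = 1, LAUNCH_DONE = 2, FINISHED = 3 };

struct Shared {
    std::atomic<int> state{IDLE};
    std::vector<float> data;  // stands in for GPU memory visible to both kernels
};

// Stand-in for the parallel loop kernel (stream #2, many threads):
// each worker doubles a strided slice of the shared data.
void loop_kernel(std::vector<float>& data, std::size_t tid, std::size_t nthreads) {
    for (std::size_t i = tid; i < data.size(); i += nthreads) data[i] *= 2.0f;
}

// Stand-in for the main kernel (stream #1, single thread).
void main_kernel(Shared& s, float& result) {
    // Serial code before the parallel loop.
    for (std::size_t i = 0; i < s.data.size(); ++i) s.data[i] = static_cast<float>(i);

    // A parallel loop is reached: ask the host to launch it, then wait.
    s.state.store(LAUNCH_REQUESTED);
    while (s.state.load() != LAUNCH_DONE) std::this_thread::yield();

    // Serial code after the loop sees the shared data updated by the loop kernel.
    float sum = 0.0f;
    for (float v : s.data) sum += v;
    result = sum;
    s.state.store(FINISHED);
}

// Host side: start the main kernel, then serve its launch requests.
float run_workflow(std::size_t n, std::size_t nthreads) {
    Shared s;
    s.data.resize(n);
    float result = 0.0f;
    std::thread stream1(main_kernel, std::ref(s), std::ref(result));

    for (;;) {
        int st = s.state.load();
        if (st == LAUNCH_REQUESTED) {
            // "Host callback": launch the loop kernel with many threads.
            std::vector<std::thread> stream2;
            for (std::size_t t = 0; t < nthreads; ++t)
                stream2.emplace_back(loop_kernel, std::ref(s.data), t, nthreads);
            for (auto& w : stream2) w.join();
            s.state.store(LAUNCH_DONE);  // let the main kernel continue
        } else if (st == FINISHED) {
            break;
        } else {
            std::this_thread::yield();
        }
    }
    stream1.join();
    return result;
}
```

Calling `run_workflow(8, 4)` fills the shared array with 0..7, doubles each element in the loop kernel, and returns the sum. On a real GPU the flag would have to live in memory that both host and device can see, and the two kernels would need to run concurrently in separate streams; those details are outside this sketch.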
