KernelGen - GPU Technology Conference
KernelGen: can handle external calls

- Dependency resolution during linking
- Kernel generation at runtime

⇒ Support for external calls defined in other objects or static libraries

sincos.f90:

    !$acc parallel
    do k = 1, nz
      do j = 1, ny
        do i = 1, nx
          xy(i, j, k) = sincos_ijk(x(i, j, k), y(i, j, k))
        enddo
      enddo
    enddo
    !$acc end parallel

function.f90:

    function sincos_ijk(x, y)
      implicit none
      real, intent(in) :: x, y
      real :: sincos_ijk
      sincos_ijk = sin(x) + cos(y)
    end function sincos_ijk

Runtime output:

    Launching kernel __kernelgen_sincos__loop_3
        blockDim = { 32, 4, 4 }
        gridDim = { 16, 128, 16 }
    Finishing kernel __kernelgen_sincos__loop_3
    __kernelgen_sincos__loop_3 time = 4.986428e-03 sec
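
The launch log above reports blockDim = { 32, 4, 4 } and gridDim = { 16, 128, 16 }. The sketch below shows one way such a grid could follow from the loop extents. It is only an illustration: the slide does not state nx, ny, nz, so the values 512, 512, 64 are assumptions chosen to match the log, and ceiling division is a common convention that may not be how KernelGen picks its launch configuration. The module `launch_geometry` and the function `blocks_needed` are hypothetical names, not part of KernelGen.

```fortran
! Hypothetical helper, not part of KernelGen: relates loop extents to a
! CUDA-style grid size under the assumption of ceiling division.
module launch_geometry
  implicit none
contains
  ! Number of blocks needed to cover `extent` iterations when each block
  ! handles `block` iterations along that dimension.
  elemental integer function blocks_needed(extent, block) result(n)
    integer, intent(in) :: extent, block
    n = (extent + block - 1) / block   ! ceiling division for positive integers
  end function blocks_needed
end module launch_geometry

program show_geometry
  use launch_geometry
  implicit none
  integer, parameter :: block_dim(3) = [32, 4, 4]     ! blockDim from the log
  integer, parameter :: extent(3)    = [512, 512, 64] ! ASSUMED nx, ny, nz
  integer :: grid_dim(3)

  ! Apply the elemental function to all three dimensions at once.
  grid_dim = blocks_needed(extent, block_dim)

  print '(a, 3(1x, i0))', 'gridDim =', grid_dim
  print '(a, 1x, i0)', 'threads per block =', product(block_dim)
end program show_geometry
```

Under these assumed extents the computed grid is 16, 128, 16, which agrees with the log, and each block holds 32 * 4 * 4 = 512 threads. Mapping the x dimension to the innermost loop index i is likewise an assumption.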