CUDA and MPI
CUDA and MPI
CUDA and MPI
- No tags were found...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
No-copy Pinning of System MemoryReduce system memory usage <strong>and</strong> CPU memcpy() overheadEasier to add <strong>CUDA</strong> acceleration to existing applicationsJust register malloc’d system memory for async operations<strong>and</strong> then call cudaMemcpy() as usualBefore No-copy PinningExtra allocation <strong>and</strong> extra copy requiredAll <strong>CUDA</strong>-capable GPUs on Linux or WindowsRequires Linux kernel 2.6.15+ (RHEL 5)With No-copy PinningJust register <strong>and</strong> go!cudaMallocHost(b)memcpy(b, a)cudaHostRegister(a)cudaMemcpy() to GPU, launch kernels, cudaMemcpy() from GPUmemcpy(a, b)cudaFreeHost(b)malloc(a)cudaHostUnregister(a)