11.07.2015 Views

CUDA and MPI

CUDA and MPI

CUDA and MPI

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

No-copy Pinning of System MemoryReduce system memory usage <strong>and</strong> CPU memcpy() overheadEasier to add <strong>CUDA</strong> acceleration to existing applicationsJust register malloc’d system memory for async operations<strong>and</strong> then call cudaMemcpy() as usualBefore No-copy PinningExtra allocation <strong>and</strong> extra copy requiredAll <strong>CUDA</strong>-capable GPUs on Linux or WindowsRequires Linux kernel 2.6.15+ (RHEL 5)With No-copy PinningJust register <strong>and</strong> go!cudaMallocHost(b)memcpy(b, a)cudaHostRegister(a)cudaMemcpy() to GPU, launch kernels, cudaMemcpy() from GPUmemcpy(a, b)cudaFreeHost(b)malloc(a)cudaHostUnregister(a)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!