29.01.2013 Views

Tutorial CUDA

Tutorial CUDA

Tutorial CUDA

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Conclusion<br />

<strong>CUDA</strong> is a powerful parallel programming model<br />

© NVIDIA Corporation 2008<br />

Heterogeneous - mixed serial-parallel programming<br />

Scalable - hierarchical thread execution model<br />

Accessible - minimal but expressive changes to C<br />

<strong>CUDA</strong> on GPUs can achieve great results on dataparallel<br />

computations with a few simple<br />

performance optimization strategies:<br />

Structure your application and select execution configurations<br />

to maximize exploitation of the GPU’s parallel capabilities<br />

Minimize CPU ↔ GPU data transfers<br />

Coalesce global memory accesses<br />

Take advantage of shared memory<br />

Minimize divergent warps<br />

Minimize use of low-throughput instructions<br />

Avoid shared memory accesses with high degree of bank<br />

conflicts

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!