13.07.2015

Culises: A Library for Accelerated CFD on Hybrid GPU-CPU Systems

Library Culises: Parallel approach

• 1-1 link between MPI-process/rank and GPU
  -> CPU partitioning equals GPU partitioning
  -> peak performance CPU under-utilization of GPUs
• Bunching of MPI-ranks required: n-1 linkage option
• GPUDirect
  – Peer-to-peer data exchange (CUDA 4.1 IPC)
  – Directly hidden in MPI-implementation (release candidates: OpenMPI, MVAPICH2)

MPI_Comm_size (comm,&size)

[Figure: CPU0, CPU1, CPU2 and GPU0, GPU1, GPU2, showing the 1-1 and 3-1 linkage options]

Slide 9. Culises - A Library for Accelerated CFD on Hybrid GPU-CPU Systems. B. Landmann
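
The 1-1 and n-1 linkage options can be illustrated with a small rank-to-device mapping. This is only a sketch under stated assumptions, not Culises code: the helper names `gpu_for_rank` and `ranks_per_gpu_for` are hypothetical, and the slide does not say how Culises actually assigns ranks to GPUs. Bunching consecutive ranks onto one device is an assumed policy.

```c
#include <assert.h>

/* Hypothetical helper (not part of Culises): number of MPI ranks that
 * share one GPU when `size` ranks are spread evenly over `num_gpus`
 * devices, rounded up. */
int ranks_per_gpu_for(int size, int num_gpus)
{
    assert(size > 0 && num_gpus > 0);
    return (size + num_gpus - 1) / num_gpus;
}

/* Hypothetical helper (not part of Culises): map an MPI rank to a GPU index.
 * ranks_per_gpu == 1 gives the 1-1 link; ranks_per_gpu == n gives the
 * n-1 linkage, where n consecutive ranks are bunched onto one device. */
int gpu_for_rank(int rank, int ranks_per_gpu, int num_gpus)
{
    assert(rank >= 0 && ranks_per_gpu > 0 && num_gpus > 0);
    /* Integer division bunches consecutive ranks; the modulo wraps
     * around if there are more bunches than devices. */
    return (rank / ranks_per_gpu) % num_gpus;
}
```

In an MPI program, `rank` and `size` would typically come from `MPI_Comm_rank` and `MPI_Comm_size`, and the returned index would be passed to `cudaSetDevice`. With three ranks and three GPUs the mapping reduces to the 1-1 case (rank i uses GPU i); with three ranks and one GPU it gives the 3-1 case from the figure, where all ranks share GPU 0.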
