13.07.2015 Views

CUDA Libraries and MPI+OpenMP+CUDA - Prace Training Portal

CUDA Libraries and MPI+OpenMP+CUDA - Prace Training Portal

CUDA Libraries and MPI+OpenMP+CUDA - Prace Training Portal

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Time-consuming steps in PWSCF• Calculation of Charge Density– FFT + matrix-matrix multiplication• Calculation of Potential– FFT + operations on real-space grid• Davidson Iterative Diagonalization (SCF)– FFT + eigenvalues/eigenvectors problem + matrix-matrix multiplicationBasically most CPU time spent in linear-algebra operations,implemented in BLAS <strong>and</strong> LAPACK libraries, <strong>and</strong> in FFT!February 10, 2012 PRACE Winter School 201240

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!