CUDA Libraries and MPI+OpenMP+CUDA - Prace Training Portal
CUDA Libraries and MPI+OpenMP+CUDA - Prace Training Portal
CUDA Libraries and MPI+OpenMP+CUDA - Prace Training Portal
- No tags were found...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Time-consuming steps in PWSCF• Calculation of Charge Density– FFT + matrix-matrix multiplication• Calculation of Potential– FFT + operations on real-space grid• Davidson Iterative Diagonalization (SCF)– FFT + eigenvalues/eigenvectors problem + matrix-matrix multiplicationBasically most CPU time spent in linear-algebra operations,implemented in BLAS <strong>and</strong> LAPACK libraries, <strong>and</strong> in FFT!February 10, 2012 PRACE Winter School 201240