efficient parallel computation to simulate blood flow - CiteSeerX
SuperLU contains a collection of three related subroutine libraries: sequential<br />
SuperLU for uniprocessors, the multithreaded version SuperLU_MT for medium-size<br />
SMPs, and the MPI version SuperLU_DIST for large distributed-memory<br />
machines.<br />
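As a rough illustration of the sequential library only, SciPy's `scipy.sparse.linalg.splu` is a wrapper around sequential SuperLU. The sketch below is not the setup used in this work: it does not use SuperLU_MT or the AS solver, and the five-point Laplacian test matrix, the tiny grid, the right-hand side and the helper name `laplacian_2d` are all assumptions chosen for illustration. It solves one sparse system with a SuperLU factorization and once more with unpreconditioned GMRES, then prints both relative residuals.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import gmres, splu


def laplacian_2d(nx, ny):
    """Five-point Laplacian on an nx-by-ny grid (hypothetical test matrix)."""
    tx = sp.diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(nx, nx))
    ty = sp.diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(ny, ny))
    # Kronecker sum builds the 2D operator; splu expects CSC storage.
    return (sp.kron(sp.identity(ny), tx) + sp.kron(ty, sp.identity(nx))).tocsc()


nx, ny = 8, 16  # assumed toy size, far smaller than the grids in the text
A = laplacian_2d(nx, ny)
n = A.shape[0]
b = np.ones(n)  # assumed right-hand side

# Direct solve: sparse LU factorization through sequential SuperLU.
lu = splu(A)
x_lu = lu.solve(b)

# Iterative solve: GMRES without a preconditioner, default tolerance.
# restart=n keeps the full Krylov basis for this small problem.
x_gmres, info = gmres(A, b, restart=n, maxiter=n)

res_lu = np.linalg.norm(A @ x_lu - b) / np.linalg.norm(b)
res_gmres = np.linalg.norm(A @ x_gmres - b) / np.linalg.norm(b)
print(f"unknowns={n}  LU residual={res_lu:.1e}  "
      f"GMRES residual={res_gmres:.1e}  info={info}")
```

Wrapping the `splu` and `gmres` calls in `time.perf_counter()` measurements would give elapsed times for this toy problem, but such timings say nothing about the multithreaded or distributed libraries described above.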
[Figure 2.25 plots: four panels titled "Elapsed time with 2 processors", "Elapsed time with 4 processors", "Elapsed time with 6 processors" and "Elapsed time with 8 processors"; each panel compares AS with LU, AS with GMRES and SuperLU for the small, medium and large problems.]<br />
Figure 2.25: Comparison of the elapsed time between SuperLU and AS with either<br />
LU or GMRES, depending on the number of processors, for the small, medium and<br />
large problem sizes
For Figure 2.25, we used SuperLU_MT, the implementation for shared-memory parallel machines,<br />
with three different test cases. The small case discretizes the domain<br />
into a 100×800 grid, the medium case into 200×800 and the large case into 300×800<br />
points. We compared the elapsed time of SuperLU with that of AS using either LU or<br />
GMRES, depending on the number of processors, for small, medium and large