link to my thesis
link to my thesis
link to my thesis
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
7.4. PARALLEL COMPUTATIONS IN THE ND-SYSTEMS APPROACH 117<br />
apart from the first test case, the results presented here can serve as an initial proof<br />
of concept that projection <strong>to</strong> Stetter structure can be used <strong>to</strong> increase the convergence<br />
speed of the eigenvalue solvers, at least if the initial estimate is accurate enough.<br />
7.4 Parallel computations in the nD-systems approach<br />
Although a parallel implementation of the least-increments method is not very promising<br />
as explained in Section 5.3.4, we carried this out nevertheless. The results of computing<br />
the global minimum of the test set of 22 polynomials using iterative solvers<br />
and a parallel nD-system implementation are given in Table 7.9 (see also [53]). The<br />
least-increments method is implemented here <strong>to</strong> compute each path in the nD-system<br />
on a separate processor. The number of required paths are shown in the last column<br />
of Table 7.4 as the number of terms of each polynomial. The number of processors<br />
used here is 4. Table 7.9 shows clearly that the computing time is longer but the<br />
required number of opera<strong>to</strong>r actions is roughly identical in this parallel environment<br />
in comparison with the results in Table 7.6. However, it is also clear <strong>to</strong> see that<br />
the increase in computation time for both the parallel JD and JDQZ method is not<br />
that high when the number of paths and the size of the involved matrix/opera<strong>to</strong>r<br />
increases: for example for the last 2 cases where the increase in computation time is<br />
roughly 2 for both the iterative methods.<br />
Table 7.9: Results for the parallel implementation of the nD-system for JD and JDQZ:<br />
time (mean±stdev), # MVs (mean±stdev) and the error<br />
JD<br />
JDQZ<br />
# Time (s) #MVs Error Time (s) #MVs Error<br />
1 4.73 ± 0.289 120.3 ± 8.982 5.50 × 10 −7 14.8 ± 6.85 1991 ± 1314 5.04 × 10 −2<br />
2 26 ± 0.131 36 5.52 × 10 −7 27.2 ± 0.557 41 1.90 × 10 −6<br />
3 40 ± 0.522 88.67 ± 6.351 2.16 × 10 −7 37.8 ± 1.08 76.67 ± 6.429 3.79 × 10 −4<br />
4 34.9 ± 1.38 59.33 ± 6.351 1.17 × 10 −7 31.6 ± 0.737 43 ± 3.464 4.65 × 10 −1<br />
5 70.3 ± 1.39 77.67 ± 6.351 5.48 × 10 −8 64.8 ± 0.918 47.67 ± 0.577 8.61 × 10 −5<br />
6 76.7 ± 3.08 52 ± 11 1.40 × 10 −14 73.6 ± 3.29 44.67 ± 6.351 1.40 × 10 −14<br />
7 45.2 ± 0.824 61 7.29 × 10 −12 45.7 ± 2.17 66.33 ± 9.238 4.85 × 10 −11<br />
8 47.5 ± 2.01 68.33 ± 6.351 9.10 × 10 −14 47.9 ± 0.674 71 7.90 × 10 −14<br />
9 67.8 ± 1.89 70.33 ± 6.351 7.00 × 10 −15 68.6 ± 1.28 66.67 ± 3.512 3.40 × 10 −14<br />
10 70.1 ± 7 74 ± 11 4.00 × 10 −14 68.5 ± 4.28 75 ± 6.928 1.55 × 10 −13<br />
11 78.3 ± 9.35 136.3 ± 38.63 9.10 × 10 −12 68.2 ± 1.87 95.67 ± 9.238 4.25 × 10 −12<br />
12 69.3 ± 3.84 99.67 ± 16.80 4.75 × 10 −12 67.3 ± 0.601 95.67 ± 9.238 7.21 × 10 −12<br />
13 118 ± 13.8 131.3 ± 31.75 6.98 × 10 −9 114 ± 1.46 129 1.86 × 10 −9<br />
14 109 ± 2.08 116.7 ± 6.351 1.82 × 10 −11 106 ± 0.780 113 3.21 × 10 −12<br />
15 245 ± 5.95 175.7 ± 5.680 2.19 × 10 −11 246 ± 3.85 177.3 ± 8.262 7.35 × 10 −12<br />
16 252 ± 5.16 175.7 ± 8.981 1.74 × 10 −12 253 ± 4.83 180 ± 8.764 2.00 × 10 −12<br />
17 296 ± 0.901 265 4.36 × 10 −7 292 ± 0.739 265 1.96 × 10 −7<br />
18 307 ± 1.40 265 9.65 × 10 −8 304 ± 2.56 265 2.28 × 10 −8<br />
19 614 ± 15.4 365 9.38 × 10 −9 614 ± 10.2 365 4.35 × 10 −9<br />
20 615 ± 10.1 365 2.39 × 10 −9 613 ± 10 365 2.66 × 10 −10<br />
21 2580 ± 636 702.7 ± 63.51 2.10 × 10 −5 2160 ± 29.5 666 5.64 × 10 −6<br />
22 2150 ± 44.3 666 2.19 × 10 −6 2130 ± 33.2 666 4.96 × 10 −7<br />
7.5 Numerical experiments with JDCOMM software<br />
In this section a Matlab implementation of the JDCOMM method, based on the pseudocode<br />
of Algorithm 3, is used <strong>to</strong> compute the global minimum of several polynomials.