20.05.2014 Views

link to my thesis

link to my thesis

link to my thesis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

7.4. PARALLEL COMPUTATIONS IN THE ND-SYSTEMS APPROACH 117<br />

apart from the first test case, the results presented here can serve as an initial proof<br />

of concept that projection <strong>to</strong> Stetter structure can be used <strong>to</strong> increase the convergence<br />

speed of the eigenvalue solvers, at least if the initial estimate is accurate enough.<br />

7.4 Parallel computations in the nD-systems approach<br />

Although a parallel implementation of the least-increments method is not very promising<br />

as explained in Section 5.3.4, we carried this out nevertheless. The results of computing<br />

the global minimum of the test set of 22 polynomials using iterative solvers<br />

and a parallel nD-system implementation are given in Table 7.9 (see also [53]). The<br />

least-increments method is implemented here <strong>to</strong> compute each path in the nD-system<br />

on a separate processor. The number of required paths are shown in the last column<br />

of Table 7.4 as the number of terms of each polynomial. The number of processors<br />

used here is 4. Table 7.9 shows clearly that the computing time is longer but the<br />

required number of opera<strong>to</strong>r actions is roughly identical in this parallel environment<br />

in comparison with the results in Table 7.6. However, it is also clear <strong>to</strong> see that<br />

the increase in computation time for both the parallel JD and JDQZ method is not<br />

that high when the number of paths and the size of the involved matrix/opera<strong>to</strong>r<br />

increases: for example for the last 2 cases where the increase in computation time is<br />

roughly 2 for both the iterative methods.<br />

Table 7.9: Results for the parallel implementation of the nD-system for JD and JDQZ:<br />

time (mean±stdev), # MVs (mean±stdev) and the error<br />

JD<br />

JDQZ<br />

# Time (s) #MVs Error Time (s) #MVs Error<br />

1 4.73 ± 0.289 120.3 ± 8.982 5.50 × 10 −7 14.8 ± 6.85 1991 ± 1314 5.04 × 10 −2<br />

2 26 ± 0.131 36 5.52 × 10 −7 27.2 ± 0.557 41 1.90 × 10 −6<br />

3 40 ± 0.522 88.67 ± 6.351 2.16 × 10 −7 37.8 ± 1.08 76.67 ± 6.429 3.79 × 10 −4<br />

4 34.9 ± 1.38 59.33 ± 6.351 1.17 × 10 −7 31.6 ± 0.737 43 ± 3.464 4.65 × 10 −1<br />

5 70.3 ± 1.39 77.67 ± 6.351 5.48 × 10 −8 64.8 ± 0.918 47.67 ± 0.577 8.61 × 10 −5<br />

6 76.7 ± 3.08 52 ± 11 1.40 × 10 −14 73.6 ± 3.29 44.67 ± 6.351 1.40 × 10 −14<br />

7 45.2 ± 0.824 61 7.29 × 10 −12 45.7 ± 2.17 66.33 ± 9.238 4.85 × 10 −11<br />

8 47.5 ± 2.01 68.33 ± 6.351 9.10 × 10 −14 47.9 ± 0.674 71 7.90 × 10 −14<br />

9 67.8 ± 1.89 70.33 ± 6.351 7.00 × 10 −15 68.6 ± 1.28 66.67 ± 3.512 3.40 × 10 −14<br />

10 70.1 ± 7 74 ± 11 4.00 × 10 −14 68.5 ± 4.28 75 ± 6.928 1.55 × 10 −13<br />

11 78.3 ± 9.35 136.3 ± 38.63 9.10 × 10 −12 68.2 ± 1.87 95.67 ± 9.238 4.25 × 10 −12<br />

12 69.3 ± 3.84 99.67 ± 16.80 4.75 × 10 −12 67.3 ± 0.601 95.67 ± 9.238 7.21 × 10 −12<br />

13 118 ± 13.8 131.3 ± 31.75 6.98 × 10 −9 114 ± 1.46 129 1.86 × 10 −9<br />

14 109 ± 2.08 116.7 ± 6.351 1.82 × 10 −11 106 ± 0.780 113 3.21 × 10 −12<br />

15 245 ± 5.95 175.7 ± 5.680 2.19 × 10 −11 246 ± 3.85 177.3 ± 8.262 7.35 × 10 −12<br />

16 252 ± 5.16 175.7 ± 8.981 1.74 × 10 −12 253 ± 4.83 180 ± 8.764 2.00 × 10 −12<br />

17 296 ± 0.901 265 4.36 × 10 −7 292 ± 0.739 265 1.96 × 10 −7<br />

18 307 ± 1.40 265 9.65 × 10 −8 304 ± 2.56 265 2.28 × 10 −8<br />

19 614 ± 15.4 365 9.38 × 10 −9 614 ± 10.2 365 4.35 × 10 −9<br />

20 615 ± 10.1 365 2.39 × 10 −9 613 ± 10 365 2.66 × 10 −10<br />

21 2580 ± 636 702.7 ± 63.51 2.10 × 10 −5 2160 ± 29.5 666 5.64 × 10 −6<br />

22 2150 ± 44.3 666 2.19 × 10 −6 2130 ± 33.2 666 4.96 × 10 −7<br />

7.5 Numerical experiments with JDCOMM software<br />

In this section a Matlab implementation of the JDCOMM method, based on the pseudocode<br />

of Algorithm 3, is used <strong>to</strong> compute the global minimum of several polynomials.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!