20.11.2014 Views

PPKE ITK PhD and MPhil Thesis Classes

PPKE ITK PhD and MPhil Thesis Classes

PPKE ITK PhD and MPhil Thesis Classes

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

38<br />

2. MAPPING THE NUMERICAL SIMULATIONS OF PARTIAL<br />

DIFFERENTIAL EQUATIONS<br />

s1 s2 s3 s4<br />

Central cells<br />

s5 s6 s7 s8<br />

Rotate<br />

s4<br />

s1 s2 s3<br />

s8<br />

s5 s6 s7<br />

Select<br />

Left<br />

neighborhood<br />

s4 s5 s6 s7<br />

Figure 2.1: Generation of the left neighborhood<br />

State values<br />

s1 s2 s3 s10 s11 s12 s13 s20 s21 s22 s23 s30 s31 s32 s33 s40<br />

Rearranged<br />

state values<br />

s1 s11 s21 s31 s2 s12 s22 s32 s3 s13 s23 s33<br />

Left<br />

neighborhood<br />

Central cells<br />

Right<br />

neighborhood<br />

Figure 2.2: Rearrangement of the state values<br />

solution improves the performance of the simulation data, dependency between<br />

the successive MACs still cause floating-point pipeline stalls. In order to eliminate<br />

this dependency the inner loop of the computation must be rolled out. Instead of<br />

waiting for a result of the first MAC, the computation of the next group of cells<br />

is started. The level of unrolling is limited by the size of the register file.<br />

To measure the performance of the simulation a 256×256 sized cell array was<br />

used <strong>and</strong> 10 forward Euler iterations were computed, using a diffusion template.<br />

By using the IBM Systemsim simulator detailed statistics can be obtained about<br />

the operation of the SPEs while executing a program. Additionally, a static tim-

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!