20.11.2014 Views

PPKE ITK PhD and MPhil Thesis Classes

PPKE ITK PhD and MPhil Thesis Classes

PPKE ITK PhD and MPhil Thesis Classes

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

62<br />

2. MAPPING THE NUMERICAL SIMULATIONS OF PARTIAL<br />

DIFFERENTIAL EQUATIONS<br />

Cell<br />

array<br />

t n<br />

SPE 0 SPE 1<br />

t n+1<br />

t n+2<br />

t n+3<br />

SPE 2<br />

SPE 4<br />

SPE 6<br />

SPE 3<br />

SPE 5<br />

SPE 7<br />

SPE 0 SPE 1<br />

SPE 2 SPE 3<br />

SPE 4<br />

SPE 6<br />

SPE 5<br />

SPE 7<br />

QS22<br />

Blade<br />

CELL 0 CELL 1<br />

Figure 2.14: Data distribution between SPEs<br />

SPE is 50.5 million update/s. The estimated memory b<strong>and</strong>width requirement<br />

is 3.23GByte/s, which is slightly more than the 1/8 th of the available memory<br />

b<strong>and</strong>width. Therefore, SPEs should be arranged in a 4×2 logical array where the<br />

4 columns work on the 4 slices of the cell array <strong>and</strong> each row computes different<br />

iteration.<br />

2.3.2.6 Implementation on Falcon CNN-UM Architecture<br />

The Falcon architecture [30] is an emulated digital implementation of CNN-UM<br />

array processor which uses the full signal range model (Section 1.1). On this<br />

architecture the flexibility of simulators <strong>and</strong> computational power of analog architectures<br />

are mixed. Not only the size of templates <strong>and</strong> the computational<br />

precision can be configured but space-variant <strong>and</strong> non-linear templates can also<br />

be used.<br />

The Euler equations are solved by a modified Falcon processor array in which<br />

the arithmetic unit was redesigned according to the discretized governing equations.<br />

Since each CNN cell has only one real output value, four layers are required<br />

to represent the variables ρ, ρu, ρv <strong>and</strong> E in case of Lax-Friedrichs approximation.<br />

In the first-order case the non-linear CNN templates acting on the ρu layer can<br />

easily be taken from (2.13b). Equations (2.18)-(2.20) show templates, in which

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!