21.01.2013 Views

Lecture Notes in Computer Science 4917

Lecture Notes in Computer Science 4917

Lecture Notes in Computer Science 4917

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

70 F. Bouwens et al.<br />

4 Architectural Explorations<br />

The architecture explorations <strong>in</strong> the follow<strong>in</strong>g sections start from the base architecture<br />

presented <strong>in</strong> Figure 1 that was analyzed for energy efficiency <strong>in</strong> [8]. The power distribution<br />

of the base architecture for IDCT are depicted <strong>in</strong> Figure 3. The components<br />

with the highest consumption (primary targets for improvement) are the configuration<br />

memories (CMs: 37.22%), the FUs (19.94%) and the DRFs (14.80%) of the CGA.<br />

Intercon. REG<br />

2.66%<br />

Intercon. MUX<br />

2.30%<br />

drf_vliw<br />

10.07%<br />

prf_vliw<br />

0.31%<br />

fu_vliw<br />

6.64%<br />

cu_vliw<br />

0.41%<br />

drf_cga<br />

14.80%<br />

prf_cga<br />

0.44%<br />

Intercon. Logic<br />

5.21%<br />

fu_cga<br />

19.94%<br />

CM<br />

37.22%<br />

Fig. 3. Power distribution of our base architecture for IDCT: 80.45mW<br />

We optimize these three components us<strong>in</strong>g the follow<strong>in</strong>g methods:<br />

CM: Create an array with less configuration bits by reduc<strong>in</strong>g the number of architecture<br />

components or by us<strong>in</strong>g simpler <strong>in</strong>terconnections;<br />

FU CGA: Improve the FU design from non-pipel<strong>in</strong>ed to pipel<strong>in</strong>ed (VHDL modifications)<br />

and optimize the rout<strong>in</strong>g of the CGA array (XML architecture description<br />

update);<br />

DRF CGA: Reduce the register file sizes, apply clock gat<strong>in</strong>g and use register file<br />

shar<strong>in</strong>g.<br />

Shar<strong>in</strong>g data between the FUs and the local DRFs and PRFs is important for the<br />

power consumption and performance of the architecture as these <strong>in</strong>fluence the CMs,<br />

FUs and the local DRFs <strong>in</strong> power and performance. Therefore we will focus the explorations<br />

on rout<strong>in</strong>g and the register files. We only utilize IDCT and FFT kernels for<br />

architecture explorations due to the fact that simulat<strong>in</strong>g complete applications such as<br />

MPEG2 would result <strong>in</strong> prohibitively long simulation times. We will perform the follow<strong>in</strong>g<br />

experiments for the explorations:<br />

Local DRF distribution: Determ<strong>in</strong>e the <strong>in</strong>fluences of the RFs <strong>in</strong> the array by explor<strong>in</strong>g<br />

the distribution of the local data register files;<br />

Interconnection Topology: Determ<strong>in</strong>e the <strong>in</strong>fluence of additional <strong>in</strong>terconnections.<br />

More <strong>in</strong>terconnections improve rout<strong>in</strong>g, but <strong>in</strong>creases the power consumption and<br />

vice-versa;

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!