29.01.2015 Views

Embedded Software for SoC - Grupo de Mecatrônica EESC/USP

Embedded Software for SoC - Grupo de Mecatrônica EESC/USP

Embedded Software for SoC - Grupo de Mecatrônica EESC/USP

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

352 Chapter 26<br />

cache <strong>de</strong>lay. Figure 26-4 shows that as technology scales down, the gap<br />

between clock frequency and cache access frequency wi<strong>de</strong>ns. Multiple clock<br />

cycles are required to access the cache (Table 26-1). Hence, cache cannot be<br />

accessed in every clock cycle. This reduces the bandwidth of cache and leads<br />

to processor stalls and loss in overall per<strong>for</strong>mance.<br />

The third column of Table 26-1 shows the number of clock cycles required<br />

to access the cache with pipelined <strong>de</strong>co<strong>de</strong>r [1]. For the <strong>de</strong>sign un<strong>de</strong>r consi<strong>de</strong>ration,<br />

the cache has two pipeline stages. First stage (<strong>de</strong>co<strong>de</strong>r) is accessed<br />

in one clock cycle while second stage is accessed in the number of cycles<br />

given in Table 26-1. Putting <strong>de</strong>co<strong>de</strong>r in pipeline reduces the clock cycle time<br />

requirement and enhances the bandwidth. However, the cache is still not<br />

capable of limiting the overall cache <strong>de</strong>lay in pipeline stages to single clock<br />

cycle (Figure 26-4).<br />

The proposed pipeline technique is implemented in 64 K cache <strong>for</strong> different<br />

technology generations. To scale the bit-lines, the cache is divi<strong>de</strong>d into<br />

multiple banks. The overall cache <strong>de</strong>lay is divi<strong>de</strong>d into three parts: <strong>de</strong>co<strong>de</strong>r<br />

<strong>de</strong>lay (DD), word-line to sense amplifier <strong>de</strong>lay (WD + BSD) and multiplexer<br />

Table 26-1. The number of clock cycle vs. technology.<br />

Technology<br />

0.25<br />

0.18<br />

0.13<br />

0.10<br />

0.07<br />

Unpipelined conventional cache<br />

1<br />

2<br />

3<br />

3<br />

4<br />

Cache with pipelined <strong>de</strong>co<strong>de</strong>r<br />

1<br />

2<br />

2<br />

3<br />

3

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!