29.01.2015 Views

Embedded Software for SoC - Grupo de Mecatrônica EESC/USP

Embedded Software for SoC - Grupo de Mecatrônica EESC/USP

Embedded Software for SoC - Grupo de Mecatrônica EESC/USP

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Exploring High Bandwidth Pipelined Cache Architecture 347<br />

indicates the number of times the array has been split with vertical cut lines<br />

(creating more, but shorter word-lines), while Ndbl indicates the number of<br />

times the array is split with horizontal cut lines (creating shorter bit-lines).<br />

The total number of banks is Ndwl × Ndbl. However, reduced hit time by<br />

increasing these parameters comes with extra area, energy and <strong>de</strong>lay overhead.<br />

Increasing Ndbl increases the number of sense amplifiers, while increasing<br />

Ndwl translates into more word-line drivers and bigger <strong>de</strong>co<strong>de</strong>r due to increase<br />

in the number of word-lines [4]. Most importantly, except in the case of a<br />

direct mapped cache (where Ndbl and Ndwl are both equal to one), a multiplexer<br />

is required to select the data from the appropriate bank. Increasing Ndbl<br />

increases the size of multiplexer, which in turn increases the critical hit time<br />

<strong>de</strong>lay and energy consumption.<br />

Figure 26-2 shows the variation of different components of hit time with<br />

respect to Ndbl and Ndwl. A 64 K and a 128 K, 2-way caches were simulated<br />

using TSMC technology. Increasing Ndbl <strong>de</strong>creases BSD due to<br />

shorter bit-lines but it also increases MDD due to larger number of banks.<br />

More banking requires larger column multiplexer to select proper data. There<br />

is an optimum partitioning beyond which the increase in <strong>de</strong>lay due to column<br />

multiplexer superce<strong>de</strong>s the <strong>de</strong>lay gain due to reduced bit-lines. Increasing<br />

Ndwl does not make any difference in <strong>de</strong>lay <strong>for</strong> 64 K cache <strong>for</strong> current<br />

technology generations. In a 64 K cache, a single word-line needs to have 2<br />

cache blocks (64 byte) to maintain a good aspect ratio of the cache. Because

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!