29.01.2015 Views

Embedded Software for SoC - Grupo de Mecatrônica EESC/USP

Embedded Software for SoC - Grupo de Mecatrônica EESC/USP

Embedded Software for SoC - Grupo de Mecatrônica EESC/USP

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

524 Chapter 37<br />

ferent sizes WDUs and the MRU way predictor. For a 2-way the data cache<br />

power savings are smaller <strong>for</strong> the MRU predictor that <strong>for</strong> the WDU. Although<br />

the MRU predictor has a higher prediction rate, the energy consumption<br />

overhead of the predictor MRU reduces the total data cache power savings.<br />

For higher associativities the predictor overhead <strong>de</strong>creases, but so does the<br />

prediction rate so, except <strong>for</strong> small WDU sizes, the WDU has better energy<br />

savings.<br />

6. CONCLUSIONS<br />

This paper addresses the problem of the increased energy consumption of<br />

associative data caches in mo<strong>de</strong>rn embed<strong>de</strong>d processors. A <strong>de</strong>sign <strong>for</strong> a Way<br />

Determination Unit (WDU) that reduces the D-cache energy consumption by<br />

allowing the cache controller to only access one cache way <strong>for</strong> a load/store<br />

operation was presented. Reducing the number of way accesses greatly reduces<br />

the energy consumption of the data cache.<br />

Unlike previous work, our <strong>de</strong>sign is not a predictor. It does not incur<br />

mis-prediction penalties and it does not require changes in the ISA or in the<br />

compiler. Not having mis-predictions is an important feature <strong>for</strong> an embed<strong>de</strong>d<br />

system <strong>de</strong>signer, as the WDU does not introduce any new non-<strong>de</strong>terministic<br />

behavior in program execution. The energy consumption reduction is achieved<br />

with no per<strong>for</strong>mance penalty and it grows with the increase in the associativity<br />

of the cache.<br />

The WDU components, a small fully associative cache and a modulo<br />

counter, are well un<strong>de</strong>rstood, simple <strong>de</strong>vices that can be easily synthesized.<br />

It was shown that very a small (8–16 entries) WDU adds very little to the<br />

<strong>de</strong>sign gate count, but can still provi<strong>de</strong> significant energy consumption<br />

savings.<br />

The WDU evaluation was done on a 32-bit processor with virtually in<strong>de</strong>xed<br />

L1 cache. For a machine with a physically in<strong>de</strong>xed cache the WDU overhead<br />

would be even smaller resulting in higher energy consumption savings.<br />

REFERENCES<br />

1.<br />

2.<br />

3.<br />

4.<br />

D. Brooks, V. Tiwari, and M. Martonosi. “Wattch: A Framework <strong>for</strong> Architectural-Level<br />

Power Analysis and Optimizations.” In ISCA, pp. 83–94, 2000.<br />

D. Burger and T. M. Austin. “The Simplescalar Tool Set, Version 2.0.” Technical Report<br />

TR-97-1342, University of Wisconsin-Madison, 1997.<br />

M. R. Guthaus, J. S. Ringenberg, D. Ernst, T. M. Austin, T. Mudge, and R. B. Brown.<br />

“Mibench: A Free, Commercially Representative <strong>Embed<strong>de</strong>d</strong> Benchmark Suite.” In IEEE<br />

4th Annual Workshop on Workload Characterization, pp. 83–94, 2001.<br />

K. Inoue, T. Ishihara, and K. Murakami. “Way-Predicting Set-Associative Cache <strong>for</strong> High<br />

Per<strong>for</strong>mance and Low Energy Consumption.” In ACM/IEEE International Symposium on<br />

Low Power Electronics and Design, pp. 273–275, 1999.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!