03.08.2013 Views

Copyright by William Lloyd Bircher 2010 - The Laboratory for ...

Copyright by William Lloyd Bircher 2010 - The Laboratory for ...

Copyright by William Lloyd Bircher 2010 - The Laboratory for ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

memory. Since the number of main memory accesses is directly proportional to the<br />

number of cache misses, it is possible to approximate the number of accesses using only<br />

cache misses. Since these memory accesses must go off-chip, power is consumed<br />

proportionally in the memory controller and DRAM. In reality the relation is not so<br />

simple, but there is still a strong causal relationship between cache misses and main<br />

memory accesses.<br />

5.2 Complete-System Server Power Model<br />

Though the initial selection of per<strong>for</strong>mance events <strong>for</strong> modeling is dictated <strong>by</strong> an<br />

understanding of subsystem interactions (as in the previous example), the final selection<br />

of which event type(s) to use is determined <strong>by</strong> the average error rate and a qualitative<br />

comparison of the measured and modeled power traces. <strong>The</strong> dominant, power-related<br />

per<strong>for</strong>mance events are described below.<br />

Cycles – Execution time in terms of CPU clock cycles. <strong>The</strong> cycles metric is combined<br />

with most other metrics to create per cycle metrics. This corrects <strong>for</strong> slight differences in<br />

sampling rate. Though sampling is periodic, the actual sampling rate varies slightly due<br />

to cache effects and interrupt latency.<br />

Halted Cycles – Cycles in which clock gating is active. When the Pentium IV processor<br />

is idle, it saves power <strong>by</strong> gating the clock signal to portions of itself. Idle phases of<br />

execution are “detected” <strong>by</strong> the processor through the use of the HLT (halt) instruction.<br />

67

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!