03.08.2013 Views

Copyright by William Lloyd Bircher 2010 - The Laboratory for ...

Copyright by William Lloyd Bircher 2010 - The Laboratory for ...

Copyright by William Lloyd Bircher 2010 - The Laboratory for ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

power model that was used. Compare to the server power model which averaged over<br />

6% error using only three inputs. More importantly, this low error rate suggests that<br />

per<strong>for</strong>mance counter power models are effective across multiple microprocessor<br />

architecture generations, plat<strong>for</strong>ms, and manufacturers (Intel and AMD).<br />

<strong>The</strong> chipset power model is also improved compared to the server chipset model with<br />

average error of 3.3%. Like the server model, the desktop model contained a large<br />

workload-independent component: although in this case it contributed less than half the<br />

total chipset power rather than the 100% seen in the server model.<br />

<strong>The</strong> memory and memory controller power models had the highest average error with<br />

5.3% and 6.0% respectively. <strong>The</strong> high error is largely due to the CPU portion of the<br />

3DMark06 workload. This workload generates memory transactions at an interval that<br />

prevents effective utilization of precharge power down modes. <strong>The</strong>re<strong>for</strong>e, the model<br />

tends to underestimate memory power consumption. To resolve this error, a metric of<br />

typical memory bus idle duration or power down residency would be needed.<br />

<strong>The</strong> GPU power model had the lowest error rate at slightly less than 1%. This illustrates<br />

the effectiveness of the non-gated GPU clocks as a proxy <strong>for</strong> GPU power. In most<br />

workloads the GPU power has a clear bimodal characteristic. Active regions have a<br />

power level that is consistent. Idle regions also have a consistent power level due to the<br />

presence of idle clock gating. It is expected that as finer grain power management is<br />

100

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!