Copyright by William Lloyd Bircher 2010 - The Laboratory for ...
Copyright by William Lloyd Bircher 2010 - The Laboratory for ...
Copyright by William Lloyd Bircher 2010 - The Laboratory for ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
While this model is accurate <strong>for</strong> most current generation processors, future processors<br />
may require additional metrics to maintain comparable accuracy. Power savings<br />
techniques such as power gating and on-die voltage regulation will require new methods<br />
<strong>for</strong> power accounting. <strong>The</strong>se techniques extend the sensitivity of leakage power<br />
consumption to functional unit activity levels. Currently, leakage power is dictated<br />
almost completely <strong>by</strong> architecturally visible, core-level, idle and DVFS states. Future<br />
power gating implementations will likely be applied within subsets of a core, such as<br />
floating point units or caches. Similarly, on-die regulation allows DVFS to be applied<br />
independently to particular functional units. This increases the complexity of<br />
per<strong>for</strong>mance counter power modeling which normally only accounts <strong>for</strong> switching power.<br />
To accounting <strong>for</strong> these local power adaptations, models will need either detailed<br />
knowledge of the power gating and DVFS implementations or statistical characterizations<br />
of their application. Given the effectiveness of per<strong>for</strong>mance counter power models at<br />
accounting <strong>for</strong> fine-grain switching power, it is likely that power gating and on-die<br />
regulation can also be accounted <strong>for</strong>.<br />
5.3.5 GPU<br />
To estimate GPU power consumption a technique similar to that typically used <strong>for</strong> CPUs<br />
is employed: count the number of ungated clocks. In CPUs this is done <strong>by</strong> subtracting<br />
the number of halted clocks from all clocks [BiJo06-1]. In the case of the RS780 the<br />
ungated clocks can be measured directly. This approach only accounts directly <strong>for</strong> power<br />
saved due to clock gating. Power reductions due to DVFS are not explicitly represented.<br />
94