03.08.2013 Views

Copyright by William Lloyd Bircher 2010 - The Laboratory for ...

Copyright by William Lloyd Bircher 2010 - The Laboratory for ...

Copyright by William Lloyd Bircher 2010 - The Laboratory for ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

While this model is accurate <strong>for</strong> most current generation processors, future processors<br />

may require additional metrics to maintain comparable accuracy. Power savings<br />

techniques such as power gating and on-die voltage regulation will require new methods<br />

<strong>for</strong> power accounting. <strong>The</strong>se techniques extend the sensitivity of leakage power<br />

consumption to functional unit activity levels. Currently, leakage power is dictated<br />

almost completely <strong>by</strong> architecturally visible, core-level, idle and DVFS states. Future<br />

power gating implementations will likely be applied within subsets of a core, such as<br />

floating point units or caches. Similarly, on-die regulation allows DVFS to be applied<br />

independently to particular functional units. This increases the complexity of<br />

per<strong>for</strong>mance counter power modeling which normally only accounts <strong>for</strong> switching power.<br />

To accounting <strong>for</strong> these local power adaptations, models will need either detailed<br />

knowledge of the power gating and DVFS implementations or statistical characterizations<br />

of their application. Given the effectiveness of per<strong>for</strong>mance counter power models at<br />

accounting <strong>for</strong> fine-grain switching power, it is likely that power gating and on-die<br />

regulation can also be accounted <strong>for</strong>.<br />

5.3.5 GPU<br />

To estimate GPU power consumption a technique similar to that typically used <strong>for</strong> CPUs<br />

is employed: count the number of ungated clocks. In CPUs this is done <strong>by</strong> subtracting<br />

the number of halted clocks from all clocks [BiJo06-1]. In the case of the RS780 the<br />

ungated clocks can be measured directly. This approach only accounts directly <strong>for</strong> power<br />

saved due to clock gating. Power reductions due to DVFS are not explicitly represented.<br />

94

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!