05.08.2014 Views

here - Stefan-Marr.de

here - Stefan-Marr.de

here - Stefan-Marr.de

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

2.2. The Multicore Revolution<br />

2.2. The Multicore Revolution<br />

Performance improvements for sequential processors are tailing off. Before<br />

2005, processor <strong>de</strong>signers could use more transistors and higher clock<br />

frequencies to continuously increase the performance of new processor generations.<br />

Moore’s Law, which states that with improved manufacturing techniques<br />

and the resulting miniaturization the number of transistors doubles<br />

approximately every two years without extra cost, enabled them to increase<br />

the amount of logic that processors could contain. They used the additional<br />

logic to improve techniques such as out-of-or<strong>de</strong>r execution, branch prediction,<br />

memory caching schemes, and cache hierarchies. Unfortunately, these<br />

techniques are optimizations for specific usage patterns and eventually, their<br />

returns diminish [Hennessy and Patterson, 2007; Michaud et al., 2001]. 8 With<br />

the shrinking transistor sizes, it was possible to steadily increase the clock<br />

frequency as well.<br />

Higher clock speeds become impractical. Around 2005, processor <strong>de</strong>signers<br />

reached a point were it was no longer feasible to keep increasing the clock<br />

speed with the same pace as in previous <strong>de</strong>ca<strong>de</strong>s. An increase in clock speed<br />

corresponds directly to higher energy consumption and more heat dissipation<br />

[Hennessy and Patterson, 2007, p. 18]. However, handling the lost heat<br />

beyond a certain limit requires cooling techniques that are impractical for<br />

commodity <strong>de</strong>vices. For mobile <strong>de</strong>vices, the necessary increase in power consumption<br />

and the resulting heat make clock speed increases beyond a certain<br />

limit prohibitive. Processor <strong>de</strong>signers worked around the issue by applying<br />

various engineering techniques to handle the heat. This led to processors that<br />

can over-clock themselves when the thermal budget permits it. However, none<br />

of these techniques could <strong>de</strong>liver the performance improvements software <strong>de</strong>velopers<br />

have gotten accustomed to over the <strong>de</strong>ca<strong>de</strong>s.<br />

Increasing transistor budgets led to multicore processors. Still, the transistor<br />

budget for processors keeps increasing. In an attempt to satisfy the<br />

software industry’s need for constant performance improvement, processor<br />

<strong>de</strong>signers started to explore the <strong>de</strong>sign space of parallel processors. The increasing<br />

transistor budget can be spent on a wi<strong>de</strong> range of different features<br />

of a mo<strong>de</strong>rn processor [Hill and Marty, 2008]. Duplicating functional units,<br />

8 Welcome to the Jungle, Herb Sutter, access date: 27 June 2012<br />

http://herbsutter.com/welcome-to-the-jungle/<br />

17

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!