The IBM eServer BladeCenter JS20 - IBM Redbooks
The IBM eServer BladeCenter JS20 - IBM Redbooks
The IBM eServer BladeCenter JS20 - IBM Redbooks
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
<strong>The</strong> speculative superscalar techniques used in the PowerPC 970 include:<br />
► Aggressive branch prediction<br />
– Prediction for up to two branches per cycle<br />
– Support for up to 16 predicted branches in flight<br />
– Prediction support for both branch direction and branch addresses<br />
► In order dispatch of up to five operations into a distributed issue queue<br />
structure per cycle<br />
► Out of order issue of up to ten operations into ten execution pipelines per<br />
cycle, comprised of:<br />
– Two load or store operations<br />
– Two fixed-point register-to-register operations<br />
– Two floating-point operations<br />
– One branch operation<br />
– One condition register operation<br />
– One VMX permute operation<br />
– One VMX arithmetic operation<br />
► Register renaming on most architected PowerPC registers including the<br />
GPRs, FPRs, VRFs, condition register fields, FPSCR, VSCR, the link register<br />
(LR), and the count register (CTR)<br />
<strong>The</strong> total number of in-flight instructions within the processor core at any point in<br />
time can be as high as 215.<br />
<strong>The</strong> VMX execution units operate on multiple data elements concurrently.<br />
<strong>The</strong>refore, they provide a performance equivalent to multiple scalar execution<br />
units.<br />
Bus Interface Unit<br />
<strong>The</strong> interface between the PowerPC 970 and external devices is provided by the<br />
Bus Interface Unit (BIU). <strong>The</strong> interface is comprised of two unidirectional paths<br />
(one in, one out) that are each four bytes wide.<br />
<strong>The</strong> internal processor clock operates at an integral multiple of the interface<br />
speed. On the <strong>BladeCenter</strong> <strong>JS20</strong>, the processor clock operates at double the<br />
interface speed.<br />
<strong>The</strong> total aggregate transfer rate across the interface in bytes per second is four<br />
times of the processor clock speed (1/2 x 2 x 4). This equates to 6.4 GBps for<br />
processors operating at 1.6 GHz, and 8.8 GBps for processors operating at<br />
2.2 GHz.<br />
<strong>The</strong> interface is multiplexed, with both addresses and data being transferred<br />
across the same wires. <strong>The</strong>refore the peak theoretical data throughput across<br />
Chapter 2. Hardware components 45