13.07.2013 Views

The IBM eServer BladeCenter JS20 - IBM Redbooks

The IBM eServer BladeCenter JS20 - IBM Redbooks

The IBM eServer BladeCenter JS20 - IBM Redbooks

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>The</strong> speculative superscalar techniques used in the PowerPC 970 include:<br />

► Aggressive branch prediction<br />

– Prediction for up to two branches per cycle<br />

– Support for up to 16 predicted branches in flight<br />

– Prediction support for both branch direction and branch addresses<br />

► In order dispatch of up to five operations into a distributed issue queue<br />

structure per cycle<br />

► Out of order issue of up to ten operations into ten execution pipelines per<br />

cycle, comprised of:<br />

– Two load or store operations<br />

– Two fixed-point register-to-register operations<br />

– Two floating-point operations<br />

– One branch operation<br />

– One condition register operation<br />

– One VMX permute operation<br />

– One VMX arithmetic operation<br />

► Register renaming on most architected PowerPC registers including the<br />

GPRs, FPRs, VRFs, condition register fields, FPSCR, VSCR, the link register<br />

(LR), and the count register (CTR)<br />

<strong>The</strong> total number of in-flight instructions within the processor core at any point in<br />

time can be as high as 215.<br />

<strong>The</strong> VMX execution units operate on multiple data elements concurrently.<br />

<strong>The</strong>refore, they provide a performance equivalent to multiple scalar execution<br />

units.<br />

Bus Interface Unit<br />

<strong>The</strong> interface between the PowerPC 970 and external devices is provided by the<br />

Bus Interface Unit (BIU). <strong>The</strong> interface is comprised of two unidirectional paths<br />

(one in, one out) that are each four bytes wide.<br />

<strong>The</strong> internal processor clock operates at an integral multiple of the interface<br />

speed. On the <strong>BladeCenter</strong> <strong>JS20</strong>, the processor clock operates at double the<br />

interface speed.<br />

<strong>The</strong> total aggregate transfer rate across the interface in bytes per second is four<br />

times of the processor clock speed (1/2 x 2 x 4). This equates to 6.4 GBps for<br />

processors operating at 1.6 GHz, and 8.8 GBps for processors operating at<br />

2.2 GHz.<br />

<strong>The</strong> interface is multiplexed, with both addresses and data being transferred<br />

across the same wires. <strong>The</strong>refore the peak theoretical data throughput across<br />

Chapter 2. Hardware components 45

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!