20.11.2014 Views

PPKE ITK PhD and MPhil Thesis Classes

PPKE ITK PhD and MPhil Thesis Classes

PPKE ITK PhD and MPhil Thesis Classes

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

1.5 IBM Cell Broadb<strong>and</strong> Engine Architecture 29<br />

execute floating-point <strong>and</strong> Vector/SIMD Multimedia Extension instructions. The<br />

Fixed-point Unit executes fixed-point operations, including add, multiply, divide,<br />

compare, shift, rotate, <strong>and</strong> logical instructions. The Memory Management Unit<br />

manages address translation for all memory accesses.<br />

The EIB is not a bus as suggested by its name but a ring network which<br />

contains 4 unidirectional rings where two rings run counter to the direction of<br />

the other two. The EIB supports full memory-coherent <strong>and</strong> symmetric multiprocessor<br />

(SMP) operations. Thus, a CBE processor is designed to be ganged<br />

coherently with other CBE processors to produce a cluster. The EIB consists of<br />

four 16-byte-wide data rings. Each ring transfers 128 bytes at a time. Processor<br />

elements can drive <strong>and</strong> receive data simultaneously. The EIBŐs internal maximum<br />

b<strong>and</strong>width is 96 bytes per processor-clock cycle. Multiple transfers can be<br />

in-process concurrently on each ring, including more than 100 outst<strong>and</strong>ing DMA<br />

memory requests between main storage <strong>and</strong> the SPEs.<br />

The on-chip Memory Interface Controller (MIC) provides the interface between<br />

the EIB <strong>and</strong> physical memory. It supports one or two Rambus Extreme<br />

Data Rate (XDR) memory interfaces, which together support between 64 MB<br />

<strong>and</strong> 64 GB of XDR DRAM memory. Memory accesses on each interface are 1<br />

to 8, 16, 32, 64, or 128 bytes, with coherent memory-ordering. Up to 64 reads<br />

<strong>and</strong> 64 writes can be queued. The resource-allocation token manager provides<br />

feedback about queue levels.<br />

The dual-channel Rambus XDR memory interface provides very high 25.6GB/s<br />

memory b<strong>and</strong>width. The XDR DRAM memory is ECC-protected, with multibit<br />

error detection <strong>and</strong> optional single-bit error correction. I/O devices can be<br />

accessed via two Rambus FlexIO interfaces where one of them (the Broadb<strong>and</strong><br />

Interface (BIF)) is coherent <strong>and</strong> makes it possible to connect two Cell processors<br />

directly.<br />

The SPEs are SIMD only processors which are designed to h<strong>and</strong>le streaming<br />

data. Therefore they do not perform well in general purpose applications <strong>and</strong><br />

cannot run operating systems. Block diagram of the SPE is shown in Figure<br />

1.14.<br />

The SPE has two execution pipelines: the even pipeline is used to execute<br />

floating point <strong>and</strong> integer instructions while the odd pipeline is responsible for

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!