03.11.2015 Views

the Elbrus Architecture Series for Servers and Supercomputers

1P7sT8P

1P7sT8P

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Deep Hardware & Software Integration<br />

in <strong>the</strong> <strong>Elbrus</strong> <strong>Architecture</strong><br />

• HW architecture provides<br />

– Parallel resources by wide<br />

instruction (VLIW-like)<br />

• Up to 25 scalar operations per<br />

cycle per core<br />

• Up to 12 Flops DP (24 packed<br />

SP) per cycle per core<br />

– Doubling in <strong>the</strong> <strong>Elbrus</strong>-8CV<br />

• Multicore<br />

• Multiprocessor support<br />

(ccNUMA)<br />

– Large-scale register file<br />

– Optimization supporting features<br />

– Binary compatibility supporting<br />

features<br />

– Secure program execution<br />

supporting features<br />

General purpose MP architecture – wide range of use<br />

High MP resource utilization<br />

Power efficient on HPC applications<br />

• Compilers <strong>and</strong> OS provides<br />

– Program parallelization by<br />

optimizing compiler<br />

• Instruction level parallelism (many<br />

operations per cycle)<br />

• Packed (vector) operation<br />

parallelism<br />

• multicore, multithreading<br />

parallelism<br />

– Viable binary compatibility with Intel<br />

x86, x86-64 on <strong>the</strong> basis of<br />

transparent dynamic binary<br />

compilation technology<br />

– Programming languages<br />

implementation <strong>for</strong> secure program<br />

execution

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!