13.07.2015 Views

Intel® 64 and IA-32 Architectures Optimization Reference Manual

Intel® 64 and IA-32 Architectures Optimization Reference Manual

Intel® 64 and IA-32 Architectures Optimization Reference Manual

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

INSTRUCTION LATENCY AND THROUGHPUTTable C-5a. Streaming SIMD Extension Single-precisionFloating-point Instructions (Contd.)Instruction Latency 1 ThroughputDisplayFamily_DisplayModel06_0FH06_0EH06_0DH06_09H06_0FH06_0EH06_0DH06_09HCVTSS2SI r<strong>32</strong>, xmm 3 4 4 4 1 1 1 1CVT[T]SS2SI r<strong>64</strong>, 4 N/A N/A N/A 1 N/A N/A N/AxmmCVTTPS2PI mm, xmm 3 3 3 3 1 1 1 1CVTTSS2SI r<strong>32</strong>, xmm 3 4 4 4 1 1 1 1DIVPS xmm, xmm 18 35 35 35 17 34 34 34DIVSS xmm, xmm 18 18 18 18 17 17 17 17MAXPS xmm, xmm 3 4 4 4 1 2 2 2MAXSS xmm, xmm 3 3 3 3 1 1 1 1MINPS xmm, xmm 3 4 4 4 1 2 2 2MINSS xmm, xmm 3 3 3 3 1 1 1 1MOVAPS xmm, xmm 1 1 1 1 0.33 1 1 1MOVHLPS xmm, xmm 1 1 1 1 1 0.5 0.5 0.5MOVLHPS xmm, xmm 1 1 1 1 1 0.5 0.5 0.5MOVMSKPS r<strong>32</strong>, xmm 1 1 1 1 1 1 1 1MOVMSKPS r<strong>64</strong>, xmm 1 N/A N/A N/A 1 N/A N/A N/AMOVSS xmm, xmm 1 1 1 1 0.33 0.5 0.5 0.5MOVUPS xmm, xmm 1 1 1 1 0.5 1 1 1MULPS xmm, xmm 4 5 5 5 1 2 2 2MULSS xmm, xmm 4 4 4 4 1 1 1 1ORPS xmm, xmm 1 2 0.33 2RCPPS xmm, xmm 3 2 1 2RCPSS xmm, xmm 3 1 1 1RSQRTPS xmm, xmm 3 2 2 2RSQRTSS xmm, xmm 3 2 1SHUFPS xmm, xmm, 4 2 3 2imm8SQRTPS xmm, xmm 29 29+28 28 58SQRTSS xmm, xmm 29 30 28 29SUBPS xmm, xmm 3 4 1 2SUBSS xmm, xmm 3 3 1 1C-16

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!