03.03.2013 Views

Intel® Architecture Instruction Set Extensions Programming Reference

Intel® Architecture Instruction Set Extensions Programming Reference

Intel® Architecture Instruction Set Extensions Programming Reference

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

INSTRUCTION SET REFERENCE - FMA<br />

VFMADD132SS/VFMADD213SS/VFMADD231SS — Fused Multiply-Add of Scalar Single-Precision<br />

Floating-Point Values<br />

Opcode/<br />

<strong>Instruction</strong><br />

VEX.DDS.LIG.128.66.0F38.W0<br />

99 /r<br />

VFMADD132SS xmm0, xmm1,<br />

xmm2/m32<br />

VEX.DDS.LIG.128.66.0F38.W0<br />

A9 /r<br />

VFMADD213SS xmm0, xmm1,<br />

xmm2/m32<br />

VEX.DDS.LIG.128.66.0F38.W0<br />

B9 /r<br />

VFMADD231SS xmm0, xmm1,<br />

xmm2/m32<br />

Description<br />

Op/<br />

En<br />

64/32<br />

-bit<br />

Mode<br />

CPUID<br />

Feature<br />

Flag<br />

Description<br />

A V/V FMA Multiply scalar single-precision floating-point value from xmm0 and<br />

xmm2/mem, add to xmm1 and put result in xmm0.<br />

A V/V FMA Multiply scalar single-precision floating-point value from xmm0 and<br />

xmm1, add to xmm2/mem and put result in xmm0.<br />

A V/V FMA Multiply scalar single-precision floating-point value from xmm1 and<br />

xmm2/mem, add to xmm0 and put result in xmm0.<br />

<strong>Instruction</strong> Operand Encoding<br />

Op/En Operand 1 Operand 2 Operand 3 Operand 4<br />

A ModRM:reg (r, w) VEX.vvvv (r) ModRM:r/m (r) NA<br />

Performs a SIMD multiply-add computation on packed single-precision floating-point values using three source<br />

operands and writes the multiply-add results in the destination operand. The destination operand is also the first<br />

source operand. The second operand must be a SIMD register. The third source operand can be a SIMD register or<br />

a memory location.<br />

VFMADD132SS: Multiplies the low packed single-precision floating-point value from the first source operand to the<br />

low packed single-precision floating-point value in the third source operand, adds the infinite precision intermediate<br />

result to the low packed single-precision floating-point value in the second source operand, performs<br />

rounding and stores the resulting packed single-precision floating-point value to the destination operand (first<br />

source operand).<br />

VFMADD213SS: Multiplies the low packed single-precision floating-point value from the second source operand to<br />

the low packed single-precision floating-point value in the first source operand, adds the infinite precision intermediate<br />

result to the low packed single-precision floating-point value in the third source operand, performs rounding<br />

and stores the resulting packed single-precision floating-point value to the destination operand (first source<br />

operand).<br />

VFMADD231SS: Multiplies the low packed single-precision floating-point value from the second source operand to<br />

the low packed single-precision floating-point value in the third source operand, adds the infinite precision intermediate<br />

result to the low packed single-precision floating-point value in the first source operand, performs rounding<br />

and stores the resulting packed single-precision floating-point value to the destination operand (first source<br />

operand).<br />

VEX.128 encoded version: The destination operand (also first source operand) is a XMM register and encoded in<br />

reg_field. The second source operand is a XMM register and encoded in VEX.vvvv. The third source operand is a<br />

XMM register or a 32-bit memory location and encoded in rm_field. The upper bits ([255:128]) of the YMM destination<br />

register are zeroed.<br />

Compiler tools may optionally support a complementary mnemonic for each instruction mnemonic listed in the<br />

opcode/instruction column of the summary table. The behavior of the complementary mnemonic in situations<br />

involving NANs are governed by the definition of the instruction mnemonic defined in the opcode/instruction<br />

column. See also Section 2.3.1, “FMA <strong>Instruction</strong> Operand Order and Arithmetic Behavior”.<br />

6-10 Ref. # 319433-014

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!