09.01.2013 Views

AMD x86-64 Architecture Programmer's Manual, Volume 4, 128-Bit ...

AMD x86-64 Architecture Programmer's Manual, Volume 4, 128-Bit ...

AMD x86-64 Architecture Programmer's Manual, Volume 4, 128-Bit ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>AMD</strong><strong>64</strong> Technology 26568—Rev. 3.05—September 2003<br />

MOVNTPS Move Non-Temporal Packed<br />

Single-Precision Floating-Point<br />

Stores four single-precision floating-point XMM register values into a <strong>128</strong>-bit memory<br />

location. This instruction indicates to the processor that the data is non-temporal, and<br />

is unlikely to be used again soon. The processor treats the store as a write-combining<br />

(WC) memory write, which minimizes cache pollution. The exact method by which<br />

cache pollution is minimized depends on the hardware implementation of the<br />

instruction. For further information, see “Memory Optimization” in <strong>Volume</strong> 1.<br />

Mnemonic Opcode Description<br />

MOVNTPS mem<strong>128</strong>, xmm 0F 2B /r Stores four packed single-precision floating-point XMM register values<br />

into a <strong>128</strong>-bit memory location, minimizing cache pollution.<br />

MOVNTPD is weakly-ordered with respect to other instructions that operate on<br />

memory. Software should use an SFENCE instruction to force strong memory ordering<br />

of MOVNTPD with respect to other stores.<br />

Related Instructions<br />

MOVNTDQ, MOVNTI, MOVNTPD, MOVNTQ<br />

rFLAGS Affected<br />

None<br />

mem<strong>128</strong> xmm<br />

127 96 95 <strong>64</strong> 63 32<br />

31<br />

0<br />

174 MOVNTPS<br />

127 96 95 <strong>64</strong> 63 32 31<br />

0<br />

copy<br />

copy<br />

copy<br />

copy<br />

movntps.eps

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!