13.07.2015 Views

Cortex-A8 R2P2.pdf - ARM Information Center

Cortex-A8 R2P2.pdf - ARM Information Center

Cortex-A8 R2P2.pdf - ARM Information Center

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Instruction Cycle TimingTable 16-18 Advanced SIMD floating-point instructions (continued)InstructionRegisterformat Cycles Source1 Source2 Source3 Source4 Result1 Result2VMLA aDd,Dn,Dm 1 Dn:N2 Dm:N2 Dd:N3 - Dd:N9 -VMLS a Qd,Qn,Qm 1QnLo:N2 QmLo:N2 QdLo:N3 -QdLo:N9 -2QnHi:N2 QmHi:N2 QdHi:N3 -QdHi:N9 -Dd,Dn,Dm[x](scalar)1 Dn:N2 Dm:N1 Dd:N3 - Dd:N9 -Qd,Qn,Dm[x](scalar)12QnLo:N2QnHi:N2Dm:N1-QdLo:N3QdHi:N3--QdLo:N9QdHi:N9--VRECPS aDd,Dn,Dm 1 Dn:N2 Dm:N2 - - Dd:N9 -VRSQRTS a Qd,Qn,Qm 1QnLo:N2 QmLo:N2 --QdLo:N9 -2QnHi:N2 QmHi:N2 --QdHi:N9 -a. The VMLA.F, VMLS.F, VRECPS.F, VRSQRTS.F instructions begin execution on the floating-point multiply pipeline. Thefloating-point multiply result is then forwarded to the floating-point add pipeline to complete the accumulate portion of theinstructions. Therefore, these instructions are pipelined across ten stages, N1 through N10, where N10 is the writeback stage.16.6.6 Advanced SIMD byte permute instructionsTable 16-19 shows the operation of the Advanced SIMD byte permute instructions.Table 16-19 Advanced SIMD byte permute instructionsInstruction Register format Cycles Source Result1 2 3 4 1 2VMOVDd,Qm(narrow)1 QmLo:N1 QmHi:N1- - Dd:N2 -VMOVDd,Dm[x](scalar)Qd,Dm[x](scalar)1 Dm:N1 - - - Dd:N2 -1 Dm:N1 - - - QdLo:N2 QdHi:N2VTRNVSWPDd,Dm 1 Dd:N1 Dm:N1 - - Dd:N2 Dm:N2Qd,Qm 12QdLo:N1QdHi:N1QmLo:N1QmHi:N1----QdLo:N2QdHi:N2QmLo:N2QmHi:N216-30 Copyright © 2006-2008 <strong>ARM</strong> Limited. All rights reserved. <strong>ARM</strong> DDI 0344E

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!