RealView Compilation Tools Compiler Reference Guide - ARM ...

Using NEON Support

E.3 Intrinsics

The intrinsics described in this section map closely to NEON instructions. Each section begins with a list of function prototypes, with a comment specifying an equivalent assembler instruction. The compiler selects an instruction that has the required semantics, but there is no guarantee that the compiler produces the listed instruction.

The intrinsics use a naming scheme that is similar to the NEON unified assembler syntax. That is, each intrinsic has the form:

    <opname><flags>_<type>

An additional q flag is provided to specify that the intrinsic operates on 128-bit vectors.

For example:

• vmul_s16 multiplies two vectors of signed 16-bit values. This typically compiles to VMUL.I16 d2, d0, d1.
• vaddl_u8 is a long add of two 64-bit vectors containing unsigned 8-bit values, resulting in a 128-bit vector of unsigned 16-bit values. This typically compiles to VADDL.U8 q1, d0, d1.

Note
The intrinsic function prototypes in this section use the following type annotations:

    __const(n)                the argument n must be a compile-time constant
    __constrange(min, max)    the argument must be a compile-time constant in the range min to max
    __transfersize(n)         the intrinsic loads n bytes from this pointer.

E.3.1 Addition

These intrinsics add vectors. Each lane of the result is obtained by adding the corresponding lanes of the operand vectors. The operations performed are as follows:

• Vector add: vadd -> Vr[i]:=Va[i]+Vb[i]
• Vector long add: vaddl -> Vr[i]:=Va[i]+Vb[i]
• Vector wide add: vaddw -> Vr[i]:=Va[i]+Vb[i]
• Vector halving add: vhadd -> Vr[i]:=(Va[i]+Vb[i])>>1
• Vector rounding halving add: vrhadd -> Vr[i]:=(Va[i]+Vb[i]+1)>>1
• Vector saturating add: vqadd -> Vr[i]:=sat(Va[i]+Vb[i])
• Vector add high half -> Vr[i]:=Va[i]+Vb[i]
• Vector rounding add high half.

Vector add: vadd -> Vr[i]:=Va[i]+Vb[i]

Vr, Va, Vb have equal lane sizes.

    int8x8_t    vadd_s8(int8x8_t a, int8x8_t b);          // VADD.I8 d0,d0,d0
    int16x4_t   vadd_s16(int16x4_t a, int16x4_t b);       // VADD.I16 d0,d0,d0
    int32x2_t   vadd_s32(int32x2_t a, int32x2_t b);       // VADD.I32 d0,d0,d0
    int64x1_t   vadd_s64(int64x1_t a, int64x1_t b);       // VADD.I64 d0,d0,d0
    float32x2_t vadd_f32(float32x2_t a, float32x2_t b);   // VADD.F32 d0,d0,d0
    uint8x8_t   vadd_u8(uint8x8_t a, uint8x8_t b);        // VADD.I8 d0,d0,d0
    uint16x4_t  vadd_u16(uint16x4_t a, uint16x4_t b);     // VADD.I16 d0,d0,d0
    uint32x2_t  vadd_u32(uint32x2_t a, uint32x2_t b);     // VADD.I32 d0,d0,d0
    uint64x1_t  vadd_u64(uint64x1_t a, uint64x1_t b);     // VADD.I64 d0,d0,d0
    int8x16_t   vaddq_s8(int8x16_t a, int8x16_t b);       // VADD.I8 q0,q0,q0
    int16x8_t   vaddq_s16(int16x8_t a, int16x8_t b);      // VADD.I16 q0,q0,q0
    int32x4_t   vaddq_s32(int32x4_t a, int32x4_t b);      // VADD.I32 q0,q0,q0
    int64x2_t   vaddq_s64(int64x2_t a, int64x2_t b);      // VADD.I64 q0,q0,q0
    float32x4_t vaddq_f32(float32x4_t a, float32x4_t b);  // VADD.F32 q0,q0,q0
    uint8x16_t  vaddq_u8(uint8x16_t a, uint8x16_t b);     // VADD.I8 q0,q0,q0
    uint16x8_t  vaddq_u16(uint16x8_t a, uint16x8_t b);    // VADD.I16 q0,q0,q0
    uint32x4_t  vaddq_u32(uint32x4_t a, uint32x4_t b);    // VADD.I32 q0,q0,q0
    uint64x2_t  vaddq_u64(uint64x2_t a, uint64x2_t b);    // VADD.I64 q0,q0,q0

Vector long add: vaddl -> Vr[i]:=Va[i]+Vb[i]

Va, Vb have equal lane sizes, result is a 128-bit vector of lanes that are twice the width.

    int16x8_t  vaddl_s8(int8x8_t a, int8x8_t b);       // VADDL.S8 q0,d0,d0
    int32x4_t  vaddl_s16(int16x4_t a, int16x4_t b);    // VADDL.S16 q0,d0,d0
    int64x2_t  vaddl_s32(int32x2_t a, int32x2_t b);    // VADDL.S32 q0,d0,d0
    uint16x8_t vaddl_u8(uint8x8_t a, uint8x8_t b);     // VADDL.U8 q0,d0,d0
    uint32x4_t vaddl_u16(uint16x4_t a, uint16x4_t b);  // VADDL.U16 q0,d0,d0
    uint64x2_t vaddl_u32(uint32x2_t a, uint32x2_t b);  // VADDL.U32 q0,d0,d0

Vector wide add: vaddw -> Vr[i]:=Va[i]+Vb[i]

    int16x8_t  vaddw_s8(int16x8_t a, int8x8_t b);      // VADDW.S8 q0,q0,d0
    int32x4_t  vaddw_s16(int32x4_t a, int16x4_t b);    // VADDW.S16 q0,q0,d0
    int64x2_t  vaddw_s32(int64x2_t a, int32x2_t b);    // VADDW.S32 q0,q0,d0
    uint16x8_t vaddw_u8(uint16x8_t a, uint8x8_t b);    // VADDW.U8 q0,q0,d0
    uint32x4_t vaddw_u16(uint32x4_t a, uint16x4_t b);  // VADDW.U16 q0,q0,d0
    uint64x2_t vaddw_u32(uint64x2_t a, uint32x2_t b);  // VADDW.U32 q0,q0,d0

Copyright © 2007, 2010 ARM Limited. All rights reserved. ARM DUI 0348A. Non-Confidential.
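The add, long add, and wide add operations share the same lane formula, Vr[i]:=Va[i]+Vb[i], and differ only in the lane widths of their operands and result, as the prototypes show. The following is a minimal scalar sketch of that difference in portable C for the unsigned 8-bit variants. It is not the intrinsics themselves: the names model_vadd_u8, model_vaddl_u8, model_vaddw_u8, and addition_models_demo are hypothetical, and plain arrays stand in for the vector types so that the sketch builds without NEON support. Code targeting NEON would normally include arm_neon.h and call vadd_u8, vaddl_u8, and vaddw_u8 directly.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical scalar models of the lane-wise semantics. Each array stands
   in for one vector; these are not the NEON intrinsics themselves. */

/* Models vadd_u8: operands and result all have 8-bit lanes, so each sum
   is truncated to 8 bits (it wraps modulo 256). */
void model_vadd_u8(uint8_t r[8], const uint8_t a[8], const uint8_t b[8])
{
    for (int i = 0; i < 8; i++)
        r[i] = (uint8_t)(a[i] + b[i]);
}

/* Models vaddl_u8 (long add): both operands have 8-bit lanes, the result
   has 16-bit lanes, so the full sum of two 8-bit values always fits. */
void model_vaddl_u8(uint16_t r[8], const uint8_t a[8], const uint8_t b[8])
{
    for (int i = 0; i < 8; i++)
        r[i] = (uint16_t)(a[i] + b[i]);
}

/* Models vaddw_u8 (wide add): the first operand already has 16-bit lanes,
   the second has 8-bit lanes, and the result has 16-bit lanes. The sum is
   truncated to 16 bits. */
void model_vaddw_u8(uint16_t r[8], const uint16_t a[8], const uint8_t b[8])
{
    for (int i = 0; i < 8; i++)
        r[i] = (uint16_t)(a[i] + b[i]);
}

/* Small self-check showing where the three operations differ. */
void addition_models_demo(void)
{
    const uint8_t  a[8] = {250, 1, 2, 3, 4, 5, 6, 7};
    const uint8_t  b[8] = { 10, 1, 2, 3, 4, 5, 6, 7};
    const uint16_t w[8] = {1000, 0, 0, 0, 0, 0, 0, 0};
    uint8_t  narrow[8];
    uint16_t wide[8];

    model_vadd_u8(narrow, a, b);
    assert(narrow[0] == 4);    /* 250 + 10 = 260 wraps to 4 in 8 bits */

    model_vaddl_u8(wide, a, b);
    assert(wide[0] == 260);    /* widened lanes hold the full sum */

    model_vaddw_u8(wide, w, a);
    assert(wide[0] == 1250);   /* 1000 + 250 in a 16-bit lane */
}
```

Calling addition_models_demo() from a test harness exercises all three models. The wide form is the natural choice for accumulating many narrow vectors into a running wide total, while the long form suits a single widening sum of two narrow vectors.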