"TMS320C55x DSP Library DSPLIB Programmer's Reference"

More documents

Recommendations

Info

mmulDescriptionAlgorithmThis function finds the index for vector element with minimum value. In caseof multiple minimum elements, r_idx contains the index of the first minimumelement found. r_val contains the minimum value.Not applicableOverflow Handling MethodologyNot applicableSpecial Requirements noneImplementation Notes noneExampleBenchmarksSee examples/minvec subdirectory(preliminary)CyclesCode size(bytes)Core: nx*3Overhead: 826mmulFunctionArgumentsMatrix Multiplicationushort oflag = mmul (DATA *x1,short row1,short col1,DATA *x2,shortrow2,short col2,DATA *r)(defined in mmul.asm)x1[row1*col1]:Pointer to input vector of size nxPointer to input matrix of size row1*col1; row1 :; :; :; r[row1*col2] : Pointer to output data vector of sizerow1*col2row1 number of rows in matrix 1col1 number of columns in matrix 1x2[row2*col2]:Pointer to input matrix of size row2*col2row2 number of rows in matrix 2col2 number of columns in matrix 2r[row1*col2]Pointer to output matrix of size row1*col24-74
mmulDescriptionAlgorithmThis function multiplies two matricesMultiply input matrix A (M by N) by input matrix B (N by P) using 2 nested loops:for i = 1 to Mfor k = 1 to P{temp = 0for j = 1 to Ntemp = temp + A(i,j) * B(j,k)C(i,k) = temp}Overflow Handling MethodologyNot applicableSpecial Requirements Verify that the dimensions of input matrices are legal, i.e. col1 == row2Implementation Notes In order to take advantage of the dual MAC architecture of the C55x, this implementationchecks the size of the matrix x1. For small matrices x1 (row1 < 4 orcol1 < 2), single MAC loops are used. For larger matrices x1 (row1 ≥ 4 andcol1 ≥ 2), Dual MAC loops are more efficient and quickly make up for the additionalinitialization overhead.ExampleBenchmarksSee examples/mmul subdirectory(preliminary)Cycles †Code size(in bytes)Core: if(row1 < 4 || col1 < 2), use single MAC((col1 + 2)*row1 + 4)*col2if((row1==even)&&(row1 ≥ 4)&&(col1 ≥ 2)), use dual MAC((col1 + 4)*0.5*row1 + 10)col2 if((row1==odd)&&(row1 ≥ 4)&&(col1 ≥ 2), use dual MAC((col1 + 4)*0.5*(row1 – 1) + col1 + 12)col2Overhead: 30215† Assumes all data is in on-chip dual-access RAM and that there is no bus conflict due to twiddletable reads and instruction fetches (provided linker command file reflects those conditions).Function Descriptions4-75
Page 1 and 2:
TMS320C55x DSP LibraryProgrammer’
Page 3 and 4:
PrefaceRead This FirstAbout This Ma
Page 5:
Contentsatan16 . . . . . . . . . .
Page 9 and 10:
DSP Routines1.1 DSP RoutinesThe TI
Page 11 and 12:
DSPLIB Content / How to Install DSP
Page 13 and 14:
Chapter 3Using DSPLIBThis chapter d
Page 15 and 16:
Calling a DSPLIB Function from C3.2
Page 17 and 18:
Calling a DSPLIB Function from Asse
Page 19 and 20:
How DSPLIB Deals with Overflow and
Page 21 and 22:
Chapter 4Function DescriptionsThis
Page 23 and 24:
DSPLIB Functions4.2 DSPLIB Function
Page 25 and 26:
DSPLIB FunctionsTable 4-2. Summary
Page 27 and 28:
acorracorrFunctionArgumentsAutocorr
Page 29 and 30:
addCycles †Code size(in bytes)Aub
Page 31 and 32:
atan16Description This function cal
Page 33 and 34:
expbexpFunctionBlock Exponent Imple
Page 35 and 36:
cfftBenchmarks(preliminary)Cycles
Page 37 and 38:
cfirCFFT - NOSCALEFFT Size Cycles
Page 39 and 40:
cfirImplementation Notes The first
Page 41 and 42:
cifftcifftFunctionInverse Complex F
Page 43 and 44: convolCIFFT - NOSCALEFFT Size Cycle
Page 45 and 46: convol1Figure 4-6. h Array in Memor
Page 47 and 48: convol2Figure 4-8. r Array in Memor
Page 49 and 50: convol2Figure 4-10. x Array in Memo
Page 51 and 52: dlmsImplementation NotesSpecial deb
Page 53 and 54: expnImplementation NotesDelayed ver
Page 55 and 56: firr[nx]Pointer to output vector of
Page 57 and 58: fir2Figure 4-14. x Array in Memoryo
Page 59 and 60: fir2Special Requirementsnh must be
Page 61 and 62: firdecr[nx/D]dbuffer[nh+1]nxnhDofla
Page 63 and 64: firinterpIoflagInterpolation factor
Page 65 and 66: firlatpbuffer [nh]nxnhoflagDelay bu
Page 67 and 68: firsnh2oflagHalf the number of coef
Page 69 and 70: fltoq15Figure 4-21. r Array in Memo
Page 71 and 72: hilb16dbuffer[nh + 2] Pointer to de
Page 73 and 74: iir32Figure 4-23. x Array in Memory
Page 75 and 76: iir32In the case of multiple-buffer
Page 77 and 78: iircas4nxoflagNumber of elements of
Page 79 and 80: iircas51This function retains the a
Page 81 and 82: iirlatBenchmarks(preliminary)Cycles
Page 83 and 84: ldiv16ldiv16Function32-bit by 16-bi
Page 85 and 86: log_10Special Requirements noneImpl
Page 87 and 88: lognThe coefficients Bi used in the
Page 89 and 90: maxidxmaxidxFunctionIndex of the Ma
Page 91 and 92: maxvecDescription Returns the maxim
Page 93: minvecminvalFunctionMinimum Value o
Page 97 and 98: negr[nx]Pointer to output data vect
Page 99 and 100: powernxoflagNumber of elements of i
Page 101 and 102: and16AlgorithmNot applicableOverflo
Page 103 and 104: and16initBenchmarksCycles Core: 13
Page 105 and 106: fftThe initial estimate can be obta
Page 107 and 108: ifftrifftFunctionInverse Real FFT (
Page 109 and 110: sqrt_16DescriptionComputes the sine
Page 111 and 112: subr[nx]nxscaleoflagPointer to outp
Page 113 and 114: What DSPLIB Benchmarks are Provided
Page 115 and 116: DSPLIB Software Updates6.1 DSPLIB S
Page 117 and 118: Q3.12 Format / Q.15 Format / Q.31 F
Page 119 and 120: Calculating the Reciprocal of a Q15
Page 121 and 122: IndexIndex16-bit reciprocal functio
Page 123: Indexmmul 4-74mtrans 4-76mul32 4-76
show all

"TMS320C55x DSP Library DSPLIB Programmer's Reference"

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?