1 Montgomery Modular Multiplication in Hardware


FEI KEMT

embedded processor with a set of dedicated coprocessors. For such a system, a highly flexible (although typically slower) scalable MMM coprocessor could be more attractive than a dedicated fixed-length one. We chose this direction in our research: our goal is to analyse and implement a solution that allows quick prototyping of special-purpose hardware designs and uses features of the target platform to accelerate execution of the MMM operation.

The radix-2 MMM algorithm ($b = 2$) is well suited to hardware implementation because its operations are easy to implement: a word-by-bit multiplication, a bit shift (division by two), and an addition. Higher-radix implementations have also been published [30, 110] and offer a viable alternative, but they require a more complex arithmetic unit.

Radix-2 Montgomery Multiplication Algorithm

The simplified version of the MMM algorithm (Algorithm 1 – 2) for radix $b = 2$ and $k$-bit operands $X = (x_{k-1}, \ldots, x_1, x_0)$, $Y$, and $M$ is given as Algorithm 1 – 3.

Algorithm 1 – 3 The basic radix-2 Montgomery multiplication algorithm for $k$-bit operands $X = (x_{k-1}, \ldots, x_1, x_0)$, $Y$, and $M$
Require: $M = (m_{k-1} \ldots m_0)_2$, $X = (x_{k-1} \ldots x_0)_2$, $Y = (y_{k-1} \ldots y_0)_2$, $M' = -M^{-1} \bmod 2$, $E = (e_t \ldots e_0)_2$ with $e_t = 1$, $R = 2^k$, and an integer $X$, $1 \le X < M$. The values $R^2 \bmod M$ and $R \bmod M$ may also be provided as precomputed inputs.
Ensure: $S = X Y R^{-1} \bmod M$.
 1: $S_0 \Leftarrow 0$
 2: for $i = 0$ to $k - 1$ do
 3:    $q_i \Leftarrow (S_i + x_i Y) \bmod 2$
 4:    $S_{i+1} \Leftarrow (S_i + x_i Y + q_i M)/2$
 5: end for
 6: if $S_k \ge M$ then
 7:    $S_k \Leftarrow S_k - M$
 8: end if
 9: $S \Leftarrow S_k$
10: return $S$

Comparing Algorithms 1 – 2 and 1 – 3 shows how the choice of $b = 2$ simplifies the operations inside the MMM. The modular reduction
by the radix $b$ changes to a check of the LSB. In Step 4, the division is replaced by a simple right-shift operation. The formulation describing the radix-2 algorithm was used as the starting point for deriving the scalable MMM design presented in [108, 109]. We discuss the features of this scalable architecture later. First, we take a closer look at the operations of the algorithm and consider modifications that make them better suited to efficient execution on the chosen FPGA hardware platform.

The decision whether to add the modulus $M$ to the partial sum is based on the value of the variable $q_i$, which is simple to implement. The test checks the LSB of the partial sum $S_i + x_i Y$ and stores it as $q_i$ once the addition of $x_i Y$ is finished (see step 3 of Algorithm 1 – 3). The stored value then decides whether $M$ is added in the following step of the loop (step 4).

However, the second condition (see step 6 of Algorithm 1 – 3) causes a problem for pipelined execution of the computations. After the loop of additions, multiplications, and shifts, the comparison and the subsequent conditional subtraction are required. Without this final reduction step, the result of the multiplication loop can be an improper input for the subsequent multiplication. That may happen when the final value of $S$ is not smaller than $M$ ($S \ge M$), which is the case tested in step 6. We intend to use the MMM in a series of multiplications, where the transformation into the Montgomery domain pays off compared with an expensive reduction, as was shown in Algorithm 1 – 1. Therefore, we analyse ways to omit the final conditional step by modifying Algorithm 1 – 3, which makes it possible to use pipelined multipliers.

Algorithm Modifications

The MMM algorithm (Algorithm 1 – 2) introduced earlier is further extended.
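As a reference point for the discussion of the final subtraction, the basic radix-2 loop of Algorithm 1 – 3 can be modelled in a few lines of Python. This is only a software sketch under stated assumptions, not the hardware design: the function name `mont_mul_radix2` and the flag `final_subtraction` are illustrative names that do not appear in the thesis, operands are ordinary Python integers rather than words, and $M$ is assumed odd with $R = 2^k$.

```python
def mont_mul_radix2(x, y, m, k, final_subtraction=True):
    """Software model of the radix-2 Montgomery multiplication (Algorithm 1-3).

    Returns x * y * 2^(-k) mod m when final_subtraction is True.
    Assumes m is odd, 0 <= x < 2^k and 0 <= y < m (illustrative sketch only).
    """
    if m % 2 == 0:
        raise ValueError("the modulus must be odd")
    s = 0                                # step 1: S_0 = 0
    for i in range(k):                   # step 2: i = 0 .. k-1
        x_i = (x >> i) & 1               # i-th bit of X
        t = s + x_i * y                  # partial sum S_i + x_i * Y
        q_i = t & 1                      # step 3: LSB of the partial sum
        s = (t + q_i * m) >> 1           # step 4: exact division by 2 as a right shift
    if final_subtraction and s >= m:     # steps 6-8: conditional subtraction
        s -= m
    return s


if __name__ == "__main__":
    # Small example: M = 13 is a 4-bit modulus, so k = 4 and R = 16.
    print(mont_mul_radix2(7, 11, 13, 4))                           # 4
    print(mont_mul_radix2(7, 11, 13, 4, final_subtraction=False))  # 17
```

In this small example the loop alone leaves 17, which is congruent to the correct result 4 but not smaller than $M = 13$. This is the situation described above in which the output, without step 6, is not a properly reduced input for the next multiplication.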
Two variants of the algorithm are discussed and implemented. Both support a scalable, multiple-word-oriented implementation, but they handle carry processing in different ways. In the modified Algorithm 1 – 4 we use the following input operands:

\[ X = \sum_{i=0}^{k} x_i 2^{i} = (0, 0, x_k, x_{k-1}, \ldots, x_1, x_0) < 2M, \tag{1.14} \]

\[ \tilde{Y} = \sum_{i=0}^{k} y_i 2^{i+1} = (y_k, \ldots, y_1, y_0, 0) < 4M, \tag{1.15} \]

where $R = 2^{k+3}$, $Y < 2M$, and $2^{k-1} < M < 2^{k}$ is a $k$-bit number (the same as in Algorithm 1 – 3). Note that $\tilde{Y}$ in Equation 1.15 is a left-shifted version of
