1 Montgomery Modular Multiplication in Hard- ware

More documents

Recommendations

Info

FEI KEMT the sequence, altogether ϕ(D) + 6 numbers. The computational costs consist of the generation of T and the calculation of mDQ which amounts to at most D 4 + B2 D elliptic curve operations (mostly additions) and at most 3(π(B2) − π(B1)) modular multiplications, π(x) being the number of primes up to x. The last term can be lowered if D contains many small prime factors since this will increase the number of pairs (m, k) for which both mD − k and mD + k are prime. Neglecting space considerations a good choice for D is a number around √ B2 which is divisible by many small primes. 4 Elliptic Curve Method in Hardware We present the first published hardware implementation of the ECM for integer factoring. The ECM implementation includes a complete hardware logic that supports the ECM factoring of numbers up to approximately 200 bits. The proposed solution applies parameters best suited to find factors of up to about 42 bits. The ECM design features a supporting logic for computation of the modular operations as addition, subtraction, multiplication and squaring. The multiplication and squaring is computed in the MMM unit analysed in the Chapter 2. The circuit has a good scalability also to larger and smaller bit lengths. For a proof-of-concept purpose, the ECM architecture has been implemented as a software-hardware co-design on a FPGA and an embedded micro-controller in a SOC. Such a design perfectly fits the needs of recent proposals for hardware architectures for the GNFS (see, e.g. [64]) and can reduce the overall costs of a GNFS device considerably. Parts of this section were published in papers [65,94,120]. The research achieve- ments described in this chapter include the following: • ECM algorithm for hardware – algorithm adaptation and parametrisation, • ECM implementation – unit design, parallelisation, case study for GNFS. The ECM implementation was done as a joint work, mainly with Jan Pelzl from Ruhr University Bochum (in SHARK project that includes the ECM design, have cooperated also Christine Priplata and Colin Stahlke (Edizone GmbH, Germany), and Jens Franke and Thorsten Kleinjung (University of Bonn, Germany)). The Section 4.1 describes the details on selection of the parameters in the ECM. The architecture of the implementation and discussion on the chosen algorithms for the modular operations is presented in the Section 4.2. Implementation details 55 + 7
FEI KEMT and case study with GNFS based on ECM units are summarised in the Section 4.3. Finally, we conclude the chapter with discussion on obtained results. 4.1 Parameterisation of the ECM Algorithm Our implementation focuses on the factorisation of numbers up to 200 bits with factors of up to around 42 bits. Thus, the most optimal parameters need to be found for the smoothness bounds B1, B2, and in the improved standard continuation used parameter D (see the description of the ECM second phase in Section 3.3.2). We find the values that yield a high probability of success and a relatively small running time and area consumption. With the running time depending on the size of the (unknown) factors to be found, optimal parameters cannot be known beforehand. Hence, good parameters can be found by experiments with different prime bounds. 4.1.1 Phase 1 Deduced from software experiments, we choose B1 = 960 and B2 = 57 000 as prime bounds. The value of k has 1 375 bits, hence, assuming the binary method (Algo- rithm 3 – 2), 1 374 point additions and 1 374 point duplications for the execution of phase 1 are required. Due to the use of Montgomery coordinates, the coordinate zP of the starting point P can be set to 1, then the addition takes only 5 multiplications instead of 6. The improved phase 1 (with optimal addition chains) has to use the general case, where zP �= 1. For the sake of simplicity and a preferably simple control logic, we choose the binary method for the time being. For the chosen parameters, the computational complexity of phase 1 is 13 740 modular multiplications and squarings 3 . With optimised addition chains this number can be reduced to approximately 12 000 modular multiplications and squarings. According to Equation 3.10, duplicating a point 2PA = PC involves the input values xA, zA, A24 and n, where A24 = (A + 2)/4 is computed from the curve parameter A (see Equation 3.8) in advance and should be stored in a fixed register. A point addition PC = PA + PB handles the input values xA, zA, xB, zB, xA−B, zA−B and n (see Equation 3.9). Notice that the values n, A24, xA−B and zA−B do not change during phase 1. Furthermore, zA−B = z1 can be chosen to be 1. Thus, no register is required for zA−B. The output values xC and zC can be written to certain input registers to 3 Squarings and multiplications are considered to have an identical complexity in our case since the hardware unit is the same for both, the multiplication and squaring. 56
Page 1 and 2:
Technical University of Koˇsice Fa
Page 3 and 4:
Metadata Sheet Author: Martin ˇ Si
Page 5 and 6:
FEI KEMT ciel’ovej platformy a vl
Page 7 and 8:
Acknowledgement There are several p
Page 9 and 10:
Contents Introduction 1 1 Montgomer
Page 11 and 12:
FEI KEMT 6.2.3 Analysis of TRNG in
Page 13 and 14:
FEI KEMT 6 - 2 Block diagram of dig
Page 15 and 16:
List of Algorithms 1 - 1 Montgomery
Page 17 and 18:
FEI KEMT π(p) prime counting funct
Page 19 and 20:
FEI KEMT MWR2MM Multiple Word Radix
Page 21 and 22:
FEI KEMT and Elliptic Curve Cryptog
Page 23 and 24: FEI KEMT The hardware implementatio
Page 25 and 26: FEI KEMT data inputs clock Look-up
Page 27 and 28: FEI KEMT A (X) request for B’s pr
Page 29 and 30: FEI KEMT Algorithm 1 - 1 Montgomery
Page 31 and 32: FEI KEMT In the Algorithm 1 - 1 the
Page 33 and 34: FEI KEMT by the radix b changes to
Page 35 and 36: FEI KEMT the ECC instead of the RSA
Page 37 and 38: FEI KEMT Algorithm 1 - 5 Key genera
Page 39 and 40: FEI KEMT 2 Montgomery Modular Multi
Page 41 and 42: FEI KEMT not significant. On the ot
Page 43 and 44: FEI KEMT Algorithm 2 - 2 The multip
Page 45 and 46: FEI KEMT Beside the internal struct
Page 47 and 48: FEI KEMT q x i S (j) 2 w-1 S (j) 1
Page 49 and 50: FEI KEMT x i x i-1 xi-n+1 Y (j) M (
Page 51 and 52: FEI KEMT especially if the access t
Page 53 and 54: FEI KEMT 2.2.3 Interface to Control
Page 55 and 56: FEI KEMT Clock Signal Distribution
Page 57 and 58: FEI KEMT 2.3.2 Montgomery Multiplic
Page 59 and 60: FEI KEMT 1. Fully software solution
Page 61 and 62: FEI KEMT 2.3.4 Implementation Resul
Page 63 and 64: FEI KEMT 3 Elliptic Curve Method in
Page 65 and 66: FEI KEMT improving the area-time pr
Page 67 and 68: FEI KEMT 3.3.1 Pollard’s (p − 1
Page 69 and 70: FEI KEMT If the order of P ∈ E(Fq
Page 71 and 72: FEI KEMT handicap of the Montgomery
Page 73: FEI KEMT Two major improvements hav
Page 77 and 78: FEI KEMT Table 4 - 1 Computational
Page 79 and 80: FEI KEMT from or writing to a singl
Page 81 and 82: FEI KEMT Algorithm 4 - 1 Modified M
Page 83 and 84: FEI KEMT Algorithm 4 - 2 Modular ad
Page 85 and 86: FEI KEMT microprocessor, e.g. Alter
Page 87 and 88: FEI KEMT timings of ECM implementat
Page 89 and 90: FEI KEMT hardware was implemented o
Page 91 and 92: FEI KEMT [68] what can be expressed
Page 93 and 94: FEI KEMT and a harvesting mechanism
Page 95 and 96: FEI KEMT control also all the syste
Page 97 and 98: FEI KEMT we can mention a generator
Page 99 and 100: FEI KEMT noise to binary signal a c
Page 101 and 102: FEI KEMT over time. For this reason
Page 103 and 104: FEI KEMT The Bucci and Luzzi Testab
Page 105 and 106: FEI KEMT CLI PLL PLL 1 2 CLJ CLK D
Page 107 and 108: FEI KEMT For R it holds that the in
Page 109 and 110: FEI KEMT extraction. In other words
Page 111 and 112: FEI KEMT and effort needed for repr
Page 113 and 114: FEI KEMT 6 True Random Number Gener
Page 115 and 116: FEI KEMT clock input F IN F FB Phas
Page 117 and 118: FEI KEMT Table 6 - 2 Parameters of
Page 119 and 120: FEI KEMT Since the size of the jitt
Page 121 and 122: FEI KEMT Table 6 - 3 Parameters set
Page 123 and 124: FEI KEMT where N1 is the number of
Page 125 and 126:
FEI KEMT oscillator with frequency
Page 127 and 128:
FEI KEMT Table 6 - 7 Area occupatio
Page 129 and 130:
FEI KEMT 0,75 0,5 0,25 1 0 1 30 59
Page 131 and 132:
FEI KEMT Stochastic Model The clock
Page 133 and 134:
FEI KEMT Table 6 - 8 Mean values me
Page 135 and 136:
FEI KEMT Figure 6 - 8 Sampled wavef
Page 137 and 138:
FEI KEMT Table 6 - 9 Results of sta
Page 139 and 140:
FEI KEMT Figure 6 - 11 Amount of sa
Page 141 and 142:
FEI KEMT Figure 6 - 13 Amount of sa
Page 143 and 144:
FEI KEMT 7 Research Contribution Wi
Page 145 and 146:
FEI KEMT Curriculum vitae Professio
Page 147 and 148:
FEI KEMT [16] Altera Corporation. S
Page 149 and 150:
FEI KEMT [37] Bundesamt für Sicher
Page 151 and 152:
FEI KEMT [54] Federal Information P
Page 153 and 154:
FEI KEMT [71] Gura, N., Chang, S.,
Page 155 and 156:
FEI KEMT of Commerce, month = aug,
Page 157 and 158:
FEI KEMT and C. Paar, Eds., no. 216
Page 159:
FEI KEMT [128] Zimmermann, P. ECMNE
show all

1 Montgomery Modular Multiplication in Hard- ware

Create successful ePaper yourself

Delete template?

Save as template?