1 Montgomery Modular Multiplication in Hardware

FEI KEMT

units are available, the total execution time T_MMM will increase. On the other hand, the area occupied by the coprocessor can be adjusted to the area constraints of the target device. Implementing n < n_max stages also means that more operations are needed for reading from and storing to the memory. Shifting the processed data between stages is faster than storing intermediate results in the memory block and reading them back repeatedly to finish the computation. Therefore, the best performance is achieved in a design with the maximal number of stages (n = n_max).

Parametrisation

The MMM coprocessor has three variable parameters (w, e, and n) that can be chosen for any implementation. The number of pipelined stages n and the word width w are chosen according to the required area of the coprocessor and the required timing of the MMM computation. The security level of the public-key algorithm defines the operand length of the multiplier (k = we). This approach gives high flexibility to the processor and coprocessor design.

In general, there are two ways to increase the speed of the MMM computation in the proposed designs (see Equation 2.4 for the relation between the design parameters and the computation time T_MMM):

1. Increase the word length w. This reduces the number of iterations given by e, which yields a shorter computation time. While older FPGAs provide memory blocks with a dual-port feature and configurable word lengths only up to 16 bits (Altera Apex [8]), in high-performance models the word length can be up to 32 bits for middle-sized blocks or 128 bits for large memory blocks (Altera Stratix II [20]). Since the capacity of a block is sufficient for typical RSA operands, it makes sense to use only one block per operand. With an older technology that has smaller memory blocks and a larger chosen word width (16 < w ≤ 32), two memory blocks per variable are required.
Depending on the memory configuration, several variables may share one memory block. The mapping of operands to memory is especially important for constrained SoC designs with a limited number of memory blocks.

2. Increase the number of pipelined stages n. The hardware structure of the PE for both solutions (CSA PE and CPA PE) is relatively simple and fast, and it is independent of the number of stages, which was a condition for a scalable design. Adding several pipelined stages can increase the overall speed,
especially if the access to the embedded memory is a bottleneck (as is the case for FPGAs with limited routing resources for large w). From the previous analysis we can conclude that the word width w is chosen according to the target platform architecture, the organisation of its memory blocks, and its support for fast carry operations. The number of pipelined stages n is adapted to the available chip size.

2.2.2 Memory Block

The operands are stored in the memory block, which is included in the data path. Optimising the memory organisation and its connection to the ALU helps to achieve better performance. Because of the intensive exchange of data between the memory and the ALU, this connection is often part of the longest (critical) path of the logic and limits the maximal clock frequency of the circuit. Depending on the number of pipelined stages (n) and the number of iterations given by the number of words (e), the operand data are read out of the memory, processed by the PEs, and stored back several times. The memory block may contain input data loaded by a control unit, the intermediate results, and the final results ready to be sent back to a host processor after the computation has finished. Note that different words of an operand are loaded and stored at the same time. Therefore, the memory has to support a dual-port configuration, which makes it possible to address reads and writes at separate memory locations. The organisation of the dual-port memory register inside the MMM coprocessor for one of the variables is shown schematically in Figure 2 – 6.

Figure 2 – 6 Organisation of the dual-port memory register inside the MMM coprocessor for one variable with e words of width w bits (memory unit of e × w bits, words 0 to e−1; port A and port B each have their own data and address lines)
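The dual-port organisation described above can be illustrated with a small behavioural model. This is only a sketch under stated assumptions, not the coprocessor's actual design: the names `DualPortMemory`, `split_into_words`, `join_words`, and `stream_through_pe` are hypothetical, the read-before-write behaviour within one cycle is an assumption, and the processing element is replaced by a stand-in function (a bitwise complement) rather than the MMM PE. It shows an operand of k = we bits stored as e words of w bits, with one word read on port A while the previously processed word is written back on port B.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


def split_into_words(value: int, w: int, e: int) -> List[int]:
    """Split a non-negative integer into e words of w bits, least significant first."""
    if value < 0 or value >= 1 << (w * e):
        raise ValueError("operand does not fit into e words of w bits")
    mask = (1 << w) - 1
    return [(value >> (w * j)) & mask for j in range(e)]


def join_words(words: List[int], w: int) -> int:
    """Inverse of split_into_words."""
    return sum(word << (w * j) for j, word in enumerate(words))


@dataclass
class DualPortMemory:
    """Hypothetical model of one operand register: e words of w bits, two ports."""
    w: int
    e: int
    words: List[int] = field(init=False)

    def __post_init__(self) -> None:
        self.words = [0] * self.e

    def cycle(self, read_addr: Optional[int] = None,
              write_addr: Optional[int] = None,
              write_data: int = 0) -> Optional[int]:
        """One clock cycle: port A reads, port B writes (read-before-write is assumed)."""
        read_data = self.words[read_addr] if read_addr is not None else None
        if write_addr is not None:
            self.words[write_addr] = write_data & ((1 << self.w) - 1)
        return read_data


def stream_through_pe(mem: DualPortMemory, pe: Callable[[int], int]) -> int:
    """Read every word, process it, and write it back; returns the cycle count."""
    pending = None  # (address, data) produced from the word read in the previous cycle
    cycles = 0
    for j in range(mem.e + 1):
        read_addr = j if j < mem.e else None  # the last cycle only drains the write
        if pending is None:
            data = mem.cycle(read_addr)
        else:
            data = mem.cycle(read_addr, pending[0], pending[1])
        pending = (j, pe(data)) if data is not None else None
        cycles += 1
    return cycles


# Example parameters: a 1024-bit operand with 32-bit words gives e = 32 words.
k, w = 1024, 32
e = k // w
operand = (1 << (k - 1)) | 0xDEADBEEF
mem = DualPortMemory(w=w, e=e)
mem.words = split_into_words(operand, w, e)
cycles = stream_through_pe(mem, lambda word: word ^ ((1 << w) - 1))  # stand-in PE
result = join_words(mem.words, w)
```

In this model one pass over the operand takes e + 1 cycles, because each write overlaps with the next read. A single-port memory would need separate cycles for the e reads and the e writes.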
