CHAPTER 3. BIFURCATION ANALYSIS

bordering method. This is because the bordering method’s coefficient matrix is the Jacobian matrix W′(f), which becomes singular at turning points.

3.7 Parallelization of Simulator

Since LOCA has been developed to run with both serial and parallel applications, and adding grid points to enhance the accuracy of the simulation would substantially increase computational time, we were motivated to parallelize the simulator. Therefore, we had to determine how to split the f vector and the W(f) evaluation across the different processors.

We decided to divide the f vector across the processors by the spatial variable x. Each processor therefore owned a contiguous block of x-space, along with all of the k values for each x grid point in that block. This makes computing integrals in k-space a completely parallel process, with no communication between processors required, but the integral and derivative terms in x-space require communication between the processors.

One could have done the opposite, where each processor gets a contiguous block of k-space and owns all of the x values for each k grid point in that block. In this case, computing the derivative and integral in x-space would be trivial, while computing the k-space integrals would require communication between processors. We chose to distribute the data by x because there are more k-space integrals to compute, and they are the most computationally intensive part of the simulation.
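The x-space decomposition described above can be illustrated with a small serial sketch. It is not the thesis code: the helper names `block_ranges` and `local_k_integral`, the tiny grid sizes, and the rectangle-rule quadrature are all assumptions made for illustration. It only shows that when each rank owns full rows of k values, a k-space integral computed block by block matches the serial result without any data from other ranks.

```python
import numpy as np


def block_ranges(nx, nprocs):
    """Split nx grid points into nprocs contiguous [start, stop) blocks.

    Hypothetical helper: the first (nx % nprocs) ranks get one extra point.
    """
    base, extra = divmod(nx, nprocs)
    ranges = []
    start = 0
    for rank in range(nprocs):
        stop = start + base + (1 if rank < extra else 0)
        ranges.append((start, stop))
        start = stop
    return ranges


def local_k_integral(f_local, dk):
    """Integrate over k for every locally owned x-point.

    Assumes a plain rectangle rule; the simulator's actual weights may differ.
    """
    return f_local.sum(axis=1) * dk


# Small illustrative sizes (the study in the text uses Nx = 512, Nk = 2048).
nx, nk, nprocs, dk = 16, 8, 3, 0.1
rng = np.random.default_rng(0)
f = rng.random((nx, nk))  # f[i, j] ~ f(x_i, k_j)

serial = f.sum(axis=1) * dk

# Each "rank" works only on its own rows of f; no other rows are needed.
pieces = [local_k_integral(f[a:b, :], dk) for a, b in block_ranges(nx, nprocs)]
parallel = np.concatenate(pieces)
```

Under the opposite layout (blocks of k), the same sum over `axis=1` would span several ranks and would need a reduction across them.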
To handle the nonlinearity in computing W(f), each processor computed the electron density n(x) (1.11) for each x-point it owned. The processors then sent their part of n(x) to one processor, designated the main processor. The main processor took all of n(x), performed the Poisson solve to compute the potential energy U(x) (1.13), and sent U(x) out to all the processors. Once each processor had U(x), it could compute P(f) for each of its x-points.

To compute the derivative term in K(f), each processor needed the f values on the 2 x grid points before its smallest x-point and the 2 x grid points ahead of its largest x-point, since a second-order upwind differencing scheme was used. These values were passed between the processors. Finally, each processor added these terms to get the W(f) evaluation on the part of the domain it owned.

To demonstrate the parallel efficiency of our program, the simulation with Nx = 512 and Nk = 2048 was run on 2 to 80 processors. The runs reported in this section were performed on processors of a Linux cluster at Sandia National Laboratories. This cluster has a total of 236 compute nodes. The nodes are dual 3.06 GHz Xeon processors, each with 2 GB of RAM. The table below compares the run times for taking 5 continuation steps, from V = 0.2093 to V = 0.2293.

Since the nodes used to perform the efficiency study are dual-processor, we decided that a fair evaluation of the efficiency required a base case of 2 processors instead of the usual 1 processor. The communication between 2 processors on the same node would be more efficient than the communication between processors on distinct nodes.
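The exchange of 2 grid points on each side for the derivative term in K(f), described earlier in this section, can be emulated serially as a rough sketch. The function names `upwind_dx` and `exchange_ghosts` are hypothetical. The stencil is the standard second-order upwind formula, assumed here to be the one used, and values outside the domain are set to zero as a stand-in for the actual boundary conditions. A real implementation would replace the list slicing with message passing between neighboring ranks.

```python
import numpy as np

GHOST = 2  # points needed on each side by a second-order upwind stencil


def upwind_dx(f_padded, h, k_positive):
    """Second-order upwind df/dx on the interior of a ghost-padded 1-D array.

    k > 0 uses the two points behind; k < 0 uses the two points ahead.
    """
    n = f_padded.size - 2 * GHOST
    i = np.arange(GHOST, GHOST + n)
    if k_positive:
        return (3 * f_padded[i] - 4 * f_padded[i - 1] + f_padded[i - 2]) / (2 * h)
    return (-3 * f_padded[i] + 4 * f_padded[i + 1] - f_padded[i + 2]) / (2 * h)


def exchange_ghosts(blocks):
    """Emulate the neighbor exchange: pad each block with 2 values per side.

    Domain edges are padded with zeros (an assumed boundary value).
    Each block must hold at least GHOST points.
    """
    padded = []
    for r, blk in enumerate(blocks):
        left = blocks[r - 1][-GHOST:] if r > 0 else np.zeros(GHOST)
        right = blocks[r + 1][:GHOST] if r < len(blocks) - 1 else np.zeros(GHOST)
        padded.append(np.concatenate([left, blk, right]))
    return padded


# One row of f at fixed k, split over 4 emulated ranks (illustrative sizes).
nx, nprocs, h = 16, 4, 0.5
f = np.sin(np.linspace(0.0, 3.0, nx))
blocks = np.array_split(f, nprocs)
padded_global = np.concatenate([np.zeros(GHOST), f, np.zeros(GHOST)])

results = {}
for k_positive in (True, False):
    serial = upwind_dx(padded_global, h, k_positive)
    parallel = np.concatenate(
        [upwind_dx(p, h, k_positive) for p in exchange_ghosts(blocks)]
    )
    results[k_positive] = (serial, parallel)
```

Because only the sign of k decides which side of the stencil is used, each rank needs ghost values from both neighbors, which matches the two-sided exchange described in the text.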