CATS Proceedings Printout - Graz University of Technology
CATS Proceedings Printout - Graz University of Technology
CATS Proceedings Printout - Graz University of Technology
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
th<br />
The 15 International IGTE Symposium<br />
on Numerical Field Calculation in Electrical Engineering<br />
Institute for Fundamentals and Theory<br />
in Electrical Engineering - IGTE<br />
<strong>Proceedings</strong><br />
Sept. 17 - 19, 2012<br />
Hotel Novapark, <strong>Graz</strong>, Austria<br />
ISBN: 978-3-85125-258-3<br />
Verlag der Technischen Universität <strong>Graz</strong><br />
www.ub.tugraz.at/Verlag<br />
<strong>Graz</strong> <strong>University</strong><br />
<strong>of</strong> <strong>Technology</strong>
The 15th International IGTE Symposium on Numerical Field Calculation in Electrical<br />
Engineering is sponsored and supported by:
.<br />
- I - 15th IGTE Symposium 2012<br />
Table <strong>of</strong> Contents<br />
Multi Domain Multi Scale Problems in the High Frequency Finite 1<br />
Element Method (FEM)<br />
Istvan Bardi, Kezhong Zhao, Rickard Petersson, John Silvestro, Nancy Lambert<br />
A parallel-TLM algorithm. Modelling the Earth-ionosphere waveguide 7<br />
Sergio Toledo-Redondo, Alfonso Salinas, Jesús Fornieles, Jorge Portí,<br />
Bruno Besser, Herbert I. M. Lichtenegger<br />
A Novel Parametric Model Order Reduction Approach with Applications 13<br />
to Geometrically Parameterized Microwave Devices<br />
Stefan Burgard, Ortwin Farle, Romanus Dyczij-Edlinger<br />
Efficient Finite-Element Computation <strong>of</strong> Far-Fields <strong>of</strong> Phased Arrays by 19<br />
Order Reduction<br />
Alexander Sommer, Ortwin Farle, Romanus Dyczij-Edlinger<br />
Nanoparticle device for biomedical and optoelectronics applications 25<br />
Renato Iovine, Luigi La Spada, Lucio Vegni<br />
Validation <strong>of</strong> measurements with conjugate heat transfer models 31<br />
Maximilian Schrittwieser, Oszkár Bíró, Ernst Farnleitner, Gebhard Kastner<br />
Computing the shielding effectiveness <strong>of</strong> waveguides using FE-mesh 37<br />
truncation by surface operator implementation<br />
Christian Tuerk, Werner Renhart, Christian Magele<br />
Heat Transfer Analysis on End Windings <strong>of</strong> a Hydro Generator using a 41<br />
Stator-Slot-Sector Model<br />
Stephan Klomberg, Ernst Farnleitner, Gebhard Kastner, Oszkár Bíró<br />
Numerical Investigation <strong>of</strong> Linear Systems Obtained by Extended 47<br />
Element-Free Galerkin Method<br />
Taku Itoh, Soichiro Ikuno, Atsushi Kamitani<br />
Electromagnetic Wave Propagation Simulation in Corrugated 53<br />
Waveguide using Meshless Time Domain Method<br />
Soichiro Ikuno, Yoshihisa Fujita, Taku Itoh, Susumu Nakata, Atsushi Kamitani<br />
Optimization <strong>of</strong> Permanent Magnet Linear Actuator for Braille Screen 59<br />
Ivan Yatchev, Iosko Balabozov, Krastio Hinov, Vultchan Gueorgiev,<br />
Dimitar Karastoyanov<br />
3D Finite Element Analysis <strong>of</strong> Induction Heating System for High 63<br />
Frequency Welding<br />
Ilona Iatcheva, Georgi Gigov, Georgi Kunov, Rumena Stancheva<br />
Optimization Algorithms in the View <strong>of</strong> State Space Concepts 67<br />
Markus Neumayer, Daniel Watzenig, Gerald Steiner, Bernhard Brandstätter<br />
Quasi TEM Analysis <strong>of</strong> 2D Symmetrically Coupled Strip Lines with Finite 73<br />
Grounded Plane using HBEM<br />
Saša Ilić, Mirjana Perić, Slavoljub Aleksić, Nebojsa Raicevic
- II - 15th IGTE Symposium 2012<br />
Design Approach for a Line-Start Internal Permanent Magnet 78<br />
Synchronous Motor<br />
Vera Elistratova, Michel Hecquet, Pascal Brochet, Darius Vizireanu,<br />
Maxime Dessoude<br />
Speed-up <strong>of</strong> Nonlinear Electromagnetic Field Analysis using Fixed-Point 84<br />
Method<br />
Norio Takahashi, Kouske Shimoyama, Daisuke Miyagi, Hiroyuki Kaimori<br />
S<strong>of</strong>tware agent based domain decomposition method 89<br />
Matthias Jüttner, André Buchau, Michael Rauscher, Wolfgang M. Rucker,<br />
Peter Göhner<br />
Stochastic Jiles-Atherton model accounting for s<strong>of</strong>t magnetic material 95<br />
variability<br />
Rindra Ramarotafika, Abdelkader Benabou, Stéphane Clénet<br />
Human exposure to the magnetic field produced by MFDC spot welding 101<br />
systems<br />
Davide Bavastro, Aldo Canova, Luca Giaccone, Michele Manca, Marco Simioli<br />
A Circuital Approach for Eddy Currents Fast Evaluation in Beam-like 108<br />
Structures<br />
Alessandro Formisano, Raffaele Martone<br />
Convergence Characteristics <strong>of</strong> Preconditioned MRTR Method with 113<br />
Eisenstat’s Technique in Real Symmetric Sparse Matrix<br />
Yoshifumi Okamoto, Tomonori Tsuburaya, Koji Fujiwara, Shuji Sato<br />
High Frequency Mixing Rule Based Effective Medium Theory <strong>of</strong> 119<br />
Metamaterials<br />
Zsolt Szabo<br />
Enhancement <strong>of</strong> Maximum Starting Torque and Efficiency in Permanent 125<br />
Magnet Synchronous Motors<br />
Jawad Faiz, Vahid Ghorbanian, Bashir Mahdi Ebrahimi<br />
Core Losses Estimation Techniques in Electrical Machines with 131<br />
Different Supplies-A Review<br />
Jawad Faiz, Amir Masoud Takbash, Bashir Mahdi Ebrahimi<br />
Fast Computation <strong>of</strong> Inductances and Capacitances <strong>of</strong> High Voltage 137<br />
Power Transformer Windings<br />
Tomislav Župan, Željko Štih, Bojan Trkulja<br />
Numerical and Experimental Investigations <strong>of</strong> the Structural 144<br />
Characteristics <strong>of</strong> Stator Core Stacks<br />
Mathias Mair, Bernhard Weilharter, Siegfried Rainer, Katrin Ellermann,<br />
Oszkár Bíró<br />
Proper Location <strong>of</strong> the Regulating Coil in Transformers from 154<br />
Short-circuit Forces Point <strong>of</strong> View<br />
Oluş Sonmez, Bilal Düzgün, Güven Kömürgöz<br />
Robust Design <strong>of</strong> IPM motors using Co-Evolutionary Algorithms 160<br />
Min Li, Andre Ruela, Frederico Guimaraes, Jaime Ramirez, David Lowther
- III - 15th IGTE Symposium 2012<br />
Free-form optimization for magnetic design 167<br />
Zoran Andjelic, Salih Sadovic<br />
Optimization for ECT treatment planning 171<br />
Paolo Di Barba, Luca Giovanni Campana, Fabrizio Dughiero, Carlo Riccardo Rossi,<br />
Elisabetta Sieni<br />
Investigation <strong>of</strong> the Electroporation Effect in a Singel Cell 175<br />
Jaime Ramirez, William Figueiredo, Joao Francisco Vale, Isabela Metzker,<br />
Rafael Santos, Elizabeth Silva, David Lowther<br />
Anisotropic Model for the Numerical Computation <strong>of</strong> Magnetostriction 181<br />
in Steel Sheets<br />
Manfred Kaltenbacher, Adrian Volk, Michael Ertl<br />
Analytic Approximation Solution for the Schwarz-Christ<strong>of</strong>fel Parameter 186<br />
Problem<br />
Norbert Eidenberger, Bernhard G. Zagar<br />
Additional Eddy Current Losses in Induction Machines Due to 190<br />
Interlaminar Short Circuits<br />
Paul Handgruber, Andrej Stermecki, Oszkár Bíró, Georg Ofner<br />
Evaluating the influence <strong>of</strong> manufacturing tolerances in permanent 198<br />
magnet synchronous machines<br />
Isabel Coenen, Thomas Herold, Christelle Piantsop Mboo, Kay Hameyer<br />
Eddy current analysis <strong>of</strong> a PWM controlled induction machine 204<br />
Hai Van Jorks, Erion Gjonaj, Thomas Weiland<br />
Computation <strong>of</strong> end-winding inductances <strong>of</strong> rotating electrical 208<br />
machinery through three-dimensional magnetostatic integral FEM formulation<br />
Flavio Calvano, Giorgio Dal Mut, Fabrizio Ferraioli, Alessandro Formisano,<br />
Fabrizio Marignetti, Raffaele Martone, Guglielmo Rubinacci,<br />
Antonello Tamburrino, Salvatore Ventre<br />
Magnetomechanical Coupled FE Simulations <strong>of</strong> Rotating Electrical 214<br />
Machines<br />
Anouar Belahcen, Katarzyna Fonteyn, Reijo Kouhia, Paavo Rasilo, Antero Arkkio<br />
Saturable Model <strong>of</strong> Squirrel-cage Induction Motors under Stator 220<br />
Inter-turn Fault<br />
Jawad Faiz, Mansour Ojaghi, Mahdi Sabouri<br />
Accurate Magnetostatic Simulation <strong>of</strong> Step-Lap Joints in Transformer 226<br />
Cores Using Anisotropic Higher Order FEM<br />
Andreas Hauck, Michael Ertl, Joachim Schöberl, Manfred Kaltenbacher<br />
Finite Element Based Modeling <strong>of</strong> Wound Rotor Induction Machines 232<br />
Martin Mohr, Oszkár Bíró, Andrej Stermecki, Franz Diwoky<br />
Post Insulator Optimization Based on Dynamic Population Size 238<br />
Peter Kitak, Arnel Glotic, Igor Ticar<br />
Simulation <strong>of</strong> the Absorbing Clamp Method for Optimizing the 242<br />
Shielding <strong>of</strong> Power Cables<br />
Szabolcs Gyimóthy, József Pávó, Péter Kiss, Tomoaki Toratani, Ryuichi Katsumi,<br />
Gábor Varga
- IV - 15th IGTE Symposium 2012<br />
A Neural Network Approach to Sizing an Electrical Machine 248<br />
Steven Bielby, David Lowther<br />
Exploring and Exploiting Parallelism in the Finite Element Method on 254<br />
Multi-core Processors: an Overview<br />
Hussein Moghnieh, David Lowther<br />
Diagnosis <strong>of</strong> real cracks from the three spatial components <strong>of</strong> the eddy 262<br />
current testing signals<br />
Milan Smetana, Ladislav Janousek, Mihai Rebican, Tatiana Strapacova,<br />
Anton Duca, Gabriel Preda<br />
Adaptive Galaxy-Based Search Approach Applied to Loney’s Solenoid 267<br />
Benchmark Problem<br />
Leandro dos Santos Coelho, Teodoro Cardoso Bora, Piergiorgio Alotto<br />
Implementation <strong>of</strong> a 3D magnetic circuit model for automotive 271<br />
applications<br />
Ioannis Anastasiadis, Andreas Buchinger, Tobias Werth, Lukas Bellwald,<br />
Kurt Preis<br />
Robust Optimization <strong>of</strong> Passive RFID Antennas Loaded by Non-linear 276<br />
Circuits<br />
Yuta Watanabe, Hajime Igarashi<br />
Mixed Order Edge-based Finite Element Analysis by Means <strong>of</strong> 282<br />
Nonconforming Connection<br />
Yoshifumi Okamoto, Shuji Sato<br />
Topology Optimization Using Parallel Search Strategy for Magnetic 288<br />
Devices<br />
Takumi Nagano, Shogo Yasukawa, Shinji Wakao, Yoshifumi Okamoto<br />
Modeling <strong>of</strong> the Road Influence on the Grounding System in its Vicinity 294<br />
Dragan Vuckovic, Nenad Cvetkovic, Dejan Krstic, Miodrag Stojanovic<br />
Interaction Magnetic Force Calculation <strong>of</strong> Axial Passive Magnetic 300<br />
Bearing Using Magnetization Charges and Discretization Technique<br />
Saša Ilić, Ana Vuckovic, Slavoljub Aleksić<br />
Consideration <strong>of</strong> erroneous magnets in the electromagnetic field 305<br />
simulation<br />
Peter Offermann, Isabel Coenen, Kay Hameyer<br />
Potential <strong>of</strong> Spheroids in a Homogeneous Magnetic Field in Cartesian 310<br />
Coordinates<br />
Markus Kraiger, Bernhard Schnizer<br />
Application <strong>of</strong> Signal Processing Tools for Fault Diagnosis in Induction 315<br />
Motors-A Review<br />
Jawad Faiz, Amir Masoud Takbash, Bashir Mahdi Ebrahimi, Subhasis Nandi<br />
Experimental Calibration <strong>of</strong> Numerical Model <strong>of</strong> Thermoelastic Actuator 321<br />
Lukas Voracek, Vaclav Kotlan, Bohus Ulrych
- V - 15th IGTE Symposium 2012<br />
Scattering Calculations <strong>of</strong> Passive UHF-RFID Transponders 327<br />
Thomas Bauernfeind, Gergely Koczka, Kurt Preis, Oszkár Bíró<br />
Simulation <strong>of</strong> a high speed Reluctance Machine including hysteresis 331<br />
and eddy current losses<br />
Bernhard Schweigh<strong>of</strong>er, Hannes Wegleiter, Manes Recheis, Paul Fulmek<br />
An Iterative Domain Decomposition Method for Solving Wave 337<br />
Propagation Problems<br />
Koczka Gergely, Thomas Bauernfeind, Kurt Preis, Oszkár Bíró<br />
On Effectiveness <strong>of</strong> Model Reduction for Computational 340<br />
Electromagnetism<br />
Yuki Sato, Hajime Igarashi<br />
Calculation <strong>of</strong> eddy-current probe signal for a volumetric defect using 346<br />
global series expansion<br />
Sandor Bilicz, József Pávó, Szabolcs Gyimóthy<br />
Bodies motion computation using eddy-current integral equation 352<br />
Mihai Maricaru, Ioan R. Ciric, Horia Gavrila, George-Marian Vasilescu,<br />
Florea I. Hantila<br />
Adaptive Inductance Computation on GPU’s 357<br />
Andrea Gaetano Chiariello, Alessandro Formisano, Raffaele Martone<br />
The reduced-basis method applied to transport equations <strong>of</strong> a 362<br />
lithium-ion battery<br />
Stefan Volkwein, Andrea Wesche<br />
Surrogate Parameter Optimization based on Space Mapping for 368<br />
Lithium-Ion Cell Models<br />
Matthias Scharrer, Bettina Suhr, Daniel Watzenig<br />
Large Scale Energy Storage with Redox Flow Batteries 374<br />
Piergiorgio Alotto, Massimo Guarnieri, Federico Moro, Andrea Stella<br />
Model Order Reduction for a Lithium-Ion Cell 380<br />
Bettina Suhr, Jelena Rubesa<br />
Automatic domain detection for a meshfree post-processing in 386<br />
boundary element methods<br />
André Buchau, Matthias Jüttner, Wolfgang M. Rucker<br />
Efficient modeling <strong>of</strong> coil filament losses in 2D 392<br />
Leena Lehti, Janne Keränen, Saku Suuriniemi, Timo Tarhasaari, Lauri Kettunen<br />
Optimization <strong>of</strong> Energy Storage Usage 398<br />
Arnel Glotic, Peter Kitak, Igor Ticar, Adnan Glotic<br />
Adaptive Surrogate Approach for Bayesian Inference in Inverse 403<br />
Problems<br />
Markus Neumayer, Helcio R.B. Orlande, Marcello J. Colaco, Daniel Watzenig,<br />
Gerald Steiner, Bernhard Brandstätter, George S. Dulikravich
- 1 - 15th IGTE Symposium 2012<br />
Multi-Domain Multi-Scale Problems in High<br />
Frequency Finite Element Methods<br />
Istvan Bardi, Kezhong Zhao, Rickard Petersson, John Silvestro and Nancy Lambert<br />
ANSYS Inc., 225 W Station Square Drive, Pittsburgh, PA 15219, U.S.A.<br />
E-mail: steve.bardi@ansys.com<br />
Abstract—This paper presents Domain Decomposition Methods to overcome the challenges posed by multi-domain, multi-scale<br />
high frequency problems. By decomposing large electromagnetic regions into smaller domains, the Finite Element Method can<br />
cope with the simulation <strong>of</strong> electrically large problems. A hybrid Finite Element and Boundary Integral procedure is also<br />
presented that allows for domains to employ different solution methods in different subdomains. The Robin Transmission<br />
Condition (RTC) is applied to link the domains and preserve field continuity on interfaces. Real life examples demonstrate the<br />
accuracy and efficiency <strong>of</strong> the new method.<br />
Index Terms—FEM, hybrid FEM and boundary element method, multi scale problems, Robin Transmission Condition<br />
I. INTRODUCTION<br />
The finite element method (FEM) is a powerful tool<br />
for simulating high frequency structures. There are<br />
several features <strong>of</strong> the method that have become expected<br />
elements <strong>of</strong> a successful commercial simulator. These<br />
elements include spurious mode free hierarchical, higher<br />
order vector basis functions, curvilinear elements,<br />
automatic/adaptive meshing, transfinite elements, mesh<br />
truncation methods, broad band frequency sweeps,<br />
parameterization and preconditioned iterative solvers.<br />
However, new challenges have emerged in recent years:<br />
the simulation tools need to cope with multi-scale<br />
problems that start on the chip level, couple to the<br />
package and board levels, and encompass the platform<br />
and antenna levels. Each component in a multi-scale<br />
problem can require millions <strong>of</strong> unknowns to simulate.<br />
Chip complexity rises to billions <strong>of</strong> circuit elements;<br />
packages involve large numbers <strong>of</strong> ports; printed circuit<br />
boards (PCBs) <strong>of</strong>ten contain thousands <strong>of</strong> traces on many<br />
layers; and platforms and antennas <strong>of</strong>ten involve<br />
dimensions <strong>of</strong> hundreds <strong>of</strong> wavelengths.<br />
While recent advances in High Performance<br />
Computing (HPC) hardware greatly accelerate numerical<br />
computations, new algorithms are needed to exploit the<br />
new HPC environment. In particular, efficient and<br />
effective physics-based parallelization is required to<br />
address the challenges <strong>of</strong> multi-scale and multi-domain<br />
simulation. This paper presents an overview <strong>of</strong> domain<br />
decomposition methods that exploits the physical nature<br />
<strong>of</strong> multi-scale, multi-domain problems to tackle<br />
heret<strong>of</strong>ore impossibly complex high frequency problems.<br />
II. BASICS OF DOMAIN DECOMPOSITION METHOD<br />
A basic characteristic <strong>of</strong> HPC is the use <strong>of</strong> multiple<br />
processors to perform computations in parallel. An<br />
algebraic approach to using HPC is to partition large<br />
matrices into smaller sub-matrices. In some cases, this is<br />
inefficient even when iterative solvers are employed.<br />
Physics-based domain decomposition is typically more<br />
efficient because the subdomains exploit the geometry<br />
and the field. In this case, both the solution domain and<br />
the mesh are partitioned into smaller subdomains and<br />
meshes. The mesh <strong>of</strong> the sub-domains can be<br />
overlapping, conformal touching, non-conformal<br />
touching or even non-touching [1-3].<br />
Another important advantage for DDM is the ability to<br />
link differing solution methods and physics. For instance,<br />
the finite element method is better at simulating complex<br />
geometries, while the boundary element method copes<br />
better with electrically large but simple, smooth<br />
structures. This paper presents hybrid finite element–<br />
boundary integral (FE-BI) methods that allows nonconformal<br />
touching domains and disjoint regions.<br />
Figure 1: Incident surface electric and magnetic current<br />
densities impinging on an FEM domain<br />
A. Single FE domain with surface electric and<br />
magnetic current excitations<br />
Consider a computational domain where an incident<br />
field impinges on a section <strong>of</strong> a boundary as illustrated in<br />
Figure 1. The wave equation to be solved is<br />
2<br />
imp<br />
1 r<br />
E1<br />
ko1rE1 jkoJ<br />
1 (1)<br />
imp<br />
where J1 is the impressed current density. The total<br />
field description is used inside the domain, while the<br />
scattered field description is used outside<br />
sc inc<br />
sc inc<br />
E1 E1<br />
E1<br />
; H1 H1<br />
H1<br />
(2)<br />
For the sake <strong>of</strong> simplicity, a Dirichlet boundary condition<br />
is used on 1 \ 12<br />
<br />
n 1 E 0<br />
(3)<br />
, both the electric and the magnetic field jump<br />
On 12
inc<br />
inc<br />
with n1 E1<br />
and n2 H2<br />
, respectively [4].<br />
Consequently, an absorbing boundary condition is<br />
sc<br />
sc<br />
required for E and H . Assume we know an operator<br />
that provides perfect absorption:<br />
sc<br />
sc<br />
n1 H1<br />
ABC ( n1<br />
n1<br />
E1<br />
)<br />
(4)<br />
Since, the total field description is used in the<br />
computational domain, the scattered field variables can<br />
be eliminated using (2). Then, the Neumann boundary<br />
condition for the total magnetic field is<br />
n1<br />
1r<br />
E1<br />
<br />
(5)<br />
inc inc<br />
jo ( ABC ( e1)<br />
ABC ( e1<br />
) J1<br />
/ )<br />
where the J electric and the magnetic current densities e<br />
is introduced as<br />
J n<br />
H and e n<br />
n<br />
E<br />
(6)<br />
where is the wave impedance. Introducing the first<br />
order ABC generates the simple form<br />
inc inc<br />
n1 1r<br />
E1<br />
jko( e1<br />
e1<br />
J1<br />
)<br />
(7)<br />
Equations (5) and (7) are Robin transmission boundary<br />
conditions. They generalize the Neumann boundary<br />
condition to include the incident fields. Thus, to excite<br />
the computational domain by an external incident field,<br />
the electric and magnetic surface current densities inc<br />
J<br />
inc<br />
and e need to be specified. The transmission<br />
conditions are first order when the first order ABC is<br />
used and higher order when higher order ABC’s are used.<br />
B. Multiple FE domains with surface electric and<br />
magnetic current coupling<br />
Now consider a computational domain that is<br />
subdivided into two subdomains (Fig. 2).<br />
Figure 2: Decomposition into two non-overlapping<br />
subdomains<br />
The boundary value problem (BVP) for the first domain<br />
is<br />
2<br />
imp<br />
1 r<br />
E1<br />
ko1rE1 jkoJ<br />
1 in 1 (8)<br />
inc inc<br />
n1 1r<br />
E1<br />
jkoJ1 jko(<br />
e1<br />
e1<br />
J1<br />
) on 12 <br />
(9)<br />
and similarly for the second domain<br />
2<br />
imp<br />
<br />
2r<br />
E2 ko 2rE2<br />
jkoJ<br />
2 in 2 (10)<br />
inc inc<br />
n2 2r<br />
E2<br />
jkoJ 2 jko(<br />
e2<br />
e2<br />
J2<br />
) on 12 <br />
(11)<br />
The incident field for the first domain is the field <strong>of</strong> the<br />
second domain and vice versa<br />
inc<br />
e1 e2<br />
; e2 e1<br />
inc (12)<br />
inc<br />
J J<br />
J J<br />
inc<br />
<br />
(13)<br />
1<br />
2 ; 2 1<br />
- 2 - 15th IGTE Symposium 2012<br />
Applying this to Equations (9) and (11), we get<br />
n1 1r<br />
E1<br />
jkoJ1<br />
jko(<br />
e1<br />
e2<br />
J2<br />
) (14)<br />
n2 2r<br />
E2<br />
jkoJ<br />
2 jko(<br />
e2<br />
e1<br />
J1)<br />
(15)<br />
The right hand sides <strong>of</strong> Eqs. (14) and (15) are the<br />
Neumann Boundary conditions for domain 1 and 2 ,<br />
respectively. They will be included into the finite element<br />
formulation as natural boundary conditions. Since J1 and<br />
J 2 were introduced, Eqs. (14), (15) must be prescribed<br />
explicitly as well<br />
J1 e1<br />
e2<br />
J2<br />
(16)<br />
J2 e2<br />
e1<br />
J1<br />
(17)<br />
It can be proved ([1]) that solution <strong>of</strong> the differential<br />
equations (8) and (10) are unique applying natural and<br />
essential interface conditions (14), (15) and (16), 17),<br />
respectively. Applying Galerkin’s method, the bilinear<br />
form and the essential boundary condition for domain 1 <br />
is the following:<br />
b(<br />
, E ) jk v , e jk v , e jk v , J<br />
v1 1 o 1 1 o 1 2<br />
12<br />
o 1 2<br />
12<br />
12<br />
imp v1J11 jk <br />
(18)<br />
o<br />
jk o(<br />
w1, e1<br />
<br />
12<br />
w1,<br />
J1<br />
<br />
12<br />
w1,<br />
e2<br />
<br />
12<br />
w1,<br />
J2<br />
) 0<br />
12<br />
(19)<br />
The same applies to 2 . Note, that the testing functions<br />
w should be orthogonal to those <strong>of</strong> v. Discretizing the<br />
scalar products, yields the matrix equation [1]<br />
K1<br />
<br />
<br />
G 21<br />
where<br />
G12<br />
u1<br />
y1<br />
<br />
<br />
<br />
K<br />
<br />
2 u<br />
2<br />
y1<br />
<br />
(20)<br />
Ek<br />
bk<br />
<br />
u <br />
<br />
k <br />
ek<br />
<br />
; y<br />
<br />
k <br />
<br />
0<br />
<br />
; k=1,2<br />
<br />
J k <br />
0 <br />
(21)<br />
A<br />
k<br />
T<br />
K k <br />
<br />
Ck<br />
<br />
0<br />
Ck<br />
vv<br />
Bk<br />
jkoTkk<br />
wv<br />
jkoTkk<br />
0 <br />
0<br />
<br />
<br />
ww<br />
jk oTkk<br />
<br />
(22)<br />
0<br />
G<br />
<br />
12 G 21 <br />
<br />
0<br />
<br />
0<br />
0<br />
vv<br />
jkoT12<br />
wv<br />
jkoT12<br />
0 <br />
vw<br />
jk<br />
<br />
oT12<br />
<br />
ww<br />
jk oT12<br />
<br />
(23)<br />
, v n v n<br />
vv<br />
;<br />
ww<br />
n<br />
w , n<br />
w<br />
j<br />
12<br />
Tij i Tij i j<br />
12<br />
(24)<br />
vw<br />
Tij n<br />
vi<br />
, n<br />
w j<br />
<br />
(25)<br />
12<br />
Matrices A, B, C and b are the same as in the case <strong>of</strong><br />
standard FE discretization and can be found in [1] along<br />
with the definitions <strong>of</strong> the scalar products. The structure<br />
<strong>of</strong> Eq. (18) shows, that the variables <strong>of</strong> the FE domains<br />
are coupled just via the surface electric and magnetic<br />
current variables, which are called cement variables.<br />
C. Hybrid FE - BI domains with surface electric and<br />
magnetic current coupling<br />
Fig. 3 shows two separated domains 1 and 2 . The<br />
fields in these domains are coupled via the free space<br />
domain ext . The FEM is used in 1 and 2 , while<br />
Boundary Integral Method (BI) is used in ext .
Figure 3: Decomposition into two FEM and one BI<br />
subdomains<br />
The boundary value problem for the finite element<br />
domains is similar to that in section B<br />
2<br />
imp<br />
<br />
ir<br />
Ei ko irEi<br />
jkoJ<br />
i in i (26)<br />
<br />
ni 1i<br />
i o i o i i i<br />
<br />
<br />
<br />
<br />
E jk J jk ( e e J ) on i (27)<br />
<br />
Ji ei<br />
ei<br />
Ji<br />
on i (28)<br />
Eqs. (27) and (28) are the Neumann and the Robin<br />
transmission boundary conditions, respectively.<br />
For the unbounded subdomain ext , the boundary<br />
integral equation representation is used, based on<br />
Stratton-Chu [2]. The boundary integral equation for the<br />
electric and the magnetic current densities are<br />
1<br />
2<br />
inc <br />
<br />
ei ei<br />
{ nk<br />
( C(<br />
nk<br />
ek<br />
)) jk onk<br />
( A(<br />
Jk<br />
)) <br />
2<br />
k1<br />
1 <br />
<br />
( jk o)<br />
<br />
(<br />
Jk<br />
)} on i (29)<br />
jk<br />
2<br />
o inc<br />
<br />
J i Ji<br />
{ jkonk<br />
nk<br />
( C(<br />
Jk<br />
)) <br />
2<br />
k1<br />
2 <br />
<br />
<br />
jkonk nk (<br />
A(<br />
nk<br />
ek<br />
)) <br />
(<br />
nk<br />
ek<br />
)} on i (30)<br />
and the Robin transmission boundary condition is:<br />
<br />
<br />
Ji ei<br />
ei<br />
Ji<br />
on i (31)<br />
where<br />
'<br />
'<br />
'<br />
' '<br />
A ( x)<br />
xgds ; (<br />
x ) ( x)<br />
gds ; C (x)<br />
x <br />
gds<br />
<br />
<br />
<br />
(32)<br />
Applying Galerkin’s method again, the matrix equation to<br />
be solved is<br />
K<br />
1 N12x<br />
1<br />
y1<br />
<br />
(33)<br />
T<br />
N12<br />
K 2 x<br />
2<br />
y1<br />
where<br />
AII<br />
A 0 0<br />
0<br />
I<br />
<br />
<br />
<br />
<br />
A A T D T D <br />
I <br />
<br />
<br />
T<br />
T<br />
K 0 D T D T <br />
i<br />
<br />
<br />
<br />
<br />
<br />
T<br />
T<br />
<br />
0 T D Q T P<br />
<br />
ii <br />
ii <br />
<br />
T<br />
T<br />
T<br />
<br />
0 D T P Q T <br />
<br />
<br />
ii<br />
ii <br />
I<br />
0<br />
0 0 0 0 Ei<br />
y i<br />
<br />
<br />
0 0 0 0 0 e<br />
<br />
i 0 <br />
;<br />
<br />
N12<br />
0<br />
0 0 0 0 x i J<br />
;<br />
i y (34)<br />
i 0<br />
<br />
inc<br />
E <br />
0<br />
0 0 Q12<br />
P12<br />
ei<br />
y<br />
i <br />
T<br />
<br />
inc<br />
<br />
0<br />
0 0 P <br />
12 Q12<br />
<br />
J<br />
i <br />
H <br />
y<br />
i <br />
Further details are provided in [1].<br />
This general hybrid finite element-boundary integral<br />
equation method (hybrid FE-BI) is very flexible. The<br />
subdomains can be FEM, BEM or any other numerical<br />
method. If just one FEM subdomain exists, it provides a<br />
perfect absorbing boundary condition (FE-BI).<br />
- 3 - 15th IGTE Symposium 2012<br />
III. SOLVING THE MATRIX EQUATIONS<br />
In this section, the solution <strong>of</strong> the matrix equations is<br />
presented via a stationary alternating Schwartz algorithm<br />
based on Jacobi Splitting. The idea is to eliminate the<br />
internal variables and solve for the surface current<br />
densities also called cement variables. Performing this<br />
process iteratively is called domain iteration. Partitioning<br />
the variables accordingly<br />
e<br />
k Ek<br />
<br />
c k ; u k <br />
J<br />
; (35)<br />
k c<br />
k <br />
Eq. (20) for the k-th domain is<br />
Ek<br />
bk<br />
0 <br />
Kk <br />
c j<br />
c<br />
<br />
g<br />
<br />
k 0<br />
<br />
kj<br />
(36)<br />
g kj can be read from Eq.(23) and k and j are the domain<br />
indices. Note, that the internal variables are expressed by<br />
the cement variables. Supposing, the inverse matrix <strong>of</strong> the<br />
k-th domain is known and also partitioned to internal and<br />
cement variable blocks, we get:<br />
E E<br />
E c<br />
k Ek<br />
k k<br />
P b P<br />
g <br />
k k k k kj<br />
c<br />
j<br />
c<br />
<br />
(37)<br />
c<br />
c c<br />
k E<br />
k k<br />
k <br />
P b <br />
<br />
Pk<br />
g<br />
k k<br />
kj <br />
where<br />
Ek<br />
Ek<br />
Ek<br />
ck<br />
P<br />
<br />
k Pk<br />
K k c <br />
k Ek<br />
ck<br />
ck<br />
Pk<br />
Pk<br />
<br />
1<br />
(38)<br />
These equations allow domain iterations to be applied<br />
k E P b P g c<br />
n1<br />
k<br />
E Ek<br />
k<br />
c E<br />
k<br />
k<br />
Ek<br />
ck<br />
k<br />
n<br />
kj j<br />
(39)<br />
n1<br />
k k ck<br />
ck<br />
n<br />
ck P bk<br />
Pk<br />
gkjc<br />
j<br />
(40)<br />
where the superscript n provides the iteration number.<br />
cc<br />
The matrix P k is called the numerical Green’s function<br />
and quantities c k and c j in Eq. (40) are called the<br />
cement variables. The domain iteration works with the<br />
cement variables only, but it needs blocks <strong>of</strong> the inverse<br />
<strong>of</strong> the system matrix <strong>of</strong> the internal variables. Eq. (39)<br />
provides the update for the internal variables, which are<br />
not included in the domain iteration because they do not<br />
needed to be updated unless the right-hand-side changes.<br />
Other, popular methods also can be applied, such as<br />
GMRES, a Krylov Subspace Method. The domain<br />
iteration needs to invert the subdomain matrices in each<br />
iteration. For this purpose, either a multifrontal direction<br />
solver can be used or a p-type multiplicative Schwarz<br />
preconditioner (pMUS) iterative solver. The iteration<br />
matrix<br />
cc<br />
Akj Pk<br />
(41)<br />
is dense but it can be replaced by a sparse matrix using<br />
Adaptive Cross Approximation (ACA)<br />
~ mn<br />
kj<br />
mn<br />
mr<br />
rn<br />
Akj<br />
A Ukj<br />
Vkj<br />
(42)<br />
where m and n are the row and column numbers and r is<br />
the rank <strong>of</strong> the matrix.<br />
The domain iteration method presented above was<br />
derived for the case when the solution domain is<br />
partitioned into two sub-domains with one coupling<br />
surface interface. If the solution domain is partitioned<br />
into multiple domains with multiple coupling surfaces,<br />
the number <strong>of</strong> the cement variables increases but the
essence remains the same: the subdomain variables are<br />
expressed in terms <strong>of</strong> cement variables and the domain<br />
iteration is set up for the cement variables. The same<br />
applies when the subdomains are coupled via BI domains.<br />
The convergence <strong>of</strong> the domain iteration depends on<br />
the order <strong>of</strong> the Robin transmission boundary conditions.<br />
For simplicity, first order Robin boundary conditions<br />
were used in the above derivations. Higher order<br />
conditions are also available in [5]. Higher order<br />
transmission condition enforce the requirement that the<br />
eigenvalues <strong>of</strong> the system matrix be inside the unit circle.<br />
This is a necessary condition for a good domain iteration<br />
convergence. A second order transmission boundary<br />
condition can be realized as in Eqs. (16) and (17)<br />
J j Aje j Bj<br />
<br />
e<br />
j Jk<br />
Ake<br />
k Bk<br />
<br />
ek<br />
(43)<br />
J A e B <br />
e<br />
J<br />
A e B <br />
e<br />
k k k k k j j j j <br />
(44)<br />
where k and j are the indices <strong>of</strong> the neighboring domains<br />
and denotes the surface operator. Constants k A , k B , Aj<br />
and B j can be optimized for convergence. Figure 4<br />
shows the improvement in convergence provided by the<br />
second order Robin Transmission Condition (RTC).<br />
Figure 4: Convergence with first and second order RTC<br />
III. REPETITIVE STRUCTURES<br />
If identical substructures exist in the computational<br />
domain, the computational effort <strong>of</strong> storing and solving<br />
the equations is dramatically reduced. Repeated<br />
identical substructures are called unit cells and have<br />
the same mesh. Only one unit cell is stored physically<br />
in the computer; the other unit cells are virtual. The<br />
physically-stored unit cell is called the parent, while<br />
the virtual ones are called children. A structure can<br />
have multiple parents. In the case <strong>of</strong> non-conformal<br />
domain decomposition, no constraints are applied to<br />
the mesh. In the case <strong>of</strong> conformal DDM, the parent<br />
mesh must be constrained so that it matches with the<br />
surface meshes <strong>of</strong> the children.<br />
For the sake <strong>of</strong> simplicity, assume that the entire<br />
computational domain consists <strong>of</strong> just one repeated<br />
structure. This is usually the case when finite antenna<br />
arrays are simulated. Figure 5 shows a single parent<br />
case with an internal block and matrices <strong>of</strong> repetitive<br />
unit cells. Here there is one system matrix and two<br />
coupling matrices. Thus, only three <strong>of</strong> the sixteen<br />
- 4 - 15th IGTE Symposium 2012<br />
j<br />
matrix blocks need to be stored and matrix block A<br />
must be factored once instead <strong>of</strong> 4 times.<br />
Figure 5: Internal blocks and corresponding matrices<br />
<strong>of</strong> repetitive unit cells<br />
IV. MULTI DOMAIN DDM WITH FE-BI<br />
As it has been shown, DDM is based on a divide-andconquer<br />
philosophy. Instead <strong>of</strong> tackling a large and<br />
complex problem directly as a whole, the original<br />
problem is partitioned into smaller, possibly repetitive,<br />
and easier to solve sub-domains. In this paper, DDM is<br />
used as an effective FEM preconditioner, where a higher<br />
order Robin’s transmission condition (RTC) is devised to<br />
enforce the continuity <strong>of</strong> electromagnetic fields between<br />
adjacent sub-domains and accelerates the convergence <strong>of</strong><br />
the iterative process. DDM is also employed to provide a<br />
hybrid FEM-BEM approach where the treatment <strong>of</strong> the<br />
radiation condition is exact. The hybrid finite elementboundary<br />
integral (FE-BI) method allows FEM-domains<br />
to be disconnected with the coupling between disjoint<br />
domains provided via Green’s functions. The advantages<br />
<strong>of</strong> DDM-based FE-BI compared to traditional FE-BI<br />
include modularity <strong>of</strong> FEM and BI domains in terms <strong>of</strong><br />
mesh and basis functions. This “non-conformal” ability<br />
significantly simplifies the integration <strong>of</strong> existing state<strong>of</strong>-art<br />
FEM and BEM solvers. The continuity<br />
enforcement through Robin’s RTC naturally renders<br />
present FE-BI free <strong>of</strong> internal resonance issue. Since<br />
domains are allowed to be disjoint, if one or more subdomains<br />
are purely metallic or highly conducting, DDM<br />
can allow the integral equation method to be applied to<br />
these sub-domains directly to reduce memory<br />
consumption.<br />
V. APPLICATIONS<br />
To illustrate the effectiveness and accuracy <strong>of</strong> DDM,<br />
an array <strong>of</strong> tapered slot antennas is considered. The<br />
antenna element is <strong>of</strong> the Vivaldi type. The antenna is<br />
similar to the one described in [8]. The rectangular array<br />
spacing is 34 mm along y and 36mm along x. The εr = 6<br />
substrate is 0.02 λ0 thick and the height and opening size<br />
<strong>of</strong> the slot is ≈0.5 and 0.45λ0 respectively. To show the<br />
accuracy <strong>of</strong> the simulation an array <strong>of</strong> 81 elements (9x9)<br />
was analyzed using DDM. For comparison, a full array<br />
model <strong>of</strong> the 81 elements with a slightly different edge<br />
treatment was also created and simulated using FEM in a<br />
single domain. The model simulated without using DDM<br />
will be referred to as the explicit model. The two patterns<br />
for the φ=0° cut (perpendicular to the slot faces) for all<br />
elements excited with equal amplitude and 0° phase shift<br />
are shown in Figure 6. Excellent agreement is obtained.<br />
In addition to being able to compute the field patterns, the<br />
full scattering matrix can also be extracted from the DDM<br />
simulation. To verify the accuracy <strong>of</strong> this computation,<br />
consider the data shown in Figure 7. This plot compares
the refection coefficient <strong>of</strong> the center element (element<br />
#41) in the array and also the coupling terms (S41,-- dB)<br />
for the coupling between the center element and the next<br />
4 elements along the same row <strong>of</strong> slots. Again agreement<br />
between the two sets <strong>of</strong> data is excellent.<br />
Another infinite array simulation was performed using<br />
linked boundary conditions and the active element pattern<br />
was computed [9]. The active element pattern is the<br />
radiation pattern for an infinite array <strong>of</strong> elements where<br />
only a single element is excited. Finite arrays <strong>of</strong> 9x9 and<br />
21x21 elements were simulated using DDM. The<br />
radiation pattern with only the center element excited was<br />
computed for each <strong>of</strong> these arrays. For comparison a<br />
single antenna element on a finite ground plane was also<br />
analyzed. The normalized φ=90° patterns for these 4<br />
antennas is shown in Figure 8. The agreement with the<br />
infinite array active element pattern improves as the array<br />
size increases from 1 to 9x9 to 21x21 elements. This<br />
demonstrates the accuracy <strong>of</strong> the DDM simulation<br />
procedure for large arrays. As a final test, a 15x15 array<br />
<strong>of</strong> Vivaldi elements was simulated using DDM. In this<br />
case, the radiation pattern for 0° scan angle was<br />
calculated. The 3D polar <strong>of</strong> this pattern is shown in<br />
Figure 9.<br />
All simulations were run on a Linux cluster. Each<br />
machine in the cluster had 12 CPUs and 96 GB memory.<br />
The explicit 81 element model was run on a single<br />
machine. For the DDM simulation, the domain<br />
simulations were distributed over several machines and<br />
CPUs. For the 9x9 array, 62 domains were used and<br />
21GB Ram was required; for the 15x15 array, 68<br />
domains were used and the total memory required was<br />
≈28GB. The latter simulation shows the power <strong>of</strong> this<br />
approach – even though the number <strong>of</strong> tetrahedra<br />
increased significantly, the memory usage was still less<br />
than 30GB.<br />
Figure 6: Comparison <strong>of</strong> the φ=0° patterns for all<br />
elements excited with equal phase and magnitude for 9x9<br />
array. The DDM data is the solid black line and the<br />
explicit model data is the dashed red line.<br />
Table I shows a comparison <strong>of</strong> solver statistics <strong>of</strong><br />
different element sizes and methods.<br />
TABLE I<br />
COMPARISON OF MEMORY AND SOLUTION TIME OF<br />
DIFFEREN METHODS/ARRAYS<br />
Time Number<br />
<strong>of</strong> tets<br />
Memory<br />
Explicit (9 x 9) 190 min 1.7 m 50 GB<br />
DDM (9 x 9) 90 min 1.6 m 21 GB<br />
DDM (15 x 15) 300 min 4.3 m 28 GB<br />
- 5 - 15th IGTE Symposium 2012<br />
Figure 7. S41,-where element 41 is the center element <strong>of</strong><br />
the array and elements 42-45 are the remaining elements<br />
along the middle row computed using two different<br />
approaches.<br />
Figure 8: Phi =90 °element patterns calculated using the<br />
infinite array approximation (Element_pattern) and from<br />
a 9x9 and 21x21 element array compared to the pattern<br />
for a single isolated element (iso).<br />
Figure 9: 3D polar plot <strong>of</strong> the radiation pattern for the<br />
15x15 array where all elements are excited with equal<br />
amplitude and phase<br />
The next example demonstrates the efficiency <strong>of</strong> FE-<br />
BI. Figure 10 shows an Apache helicopter with a<br />
conformal FE-BI surface and it has been simulated at 900<br />
MHz. Table II shows a comparison with pure FEM and<br />
IE methods. The results show the superiority <strong>of</strong> FE-BI,<br />
due to its conformal mesh truncation capability.
Figure 10: Apache helicopter with conformal FE-BI<br />
boundary<br />
TABLE II<br />
COMPARISON OF MEMORY AND SOLUTION TIME OF<br />
DIFFEREN METHODS<br />
Number<br />
<strong>of</strong> cores<br />
Memory Time<br />
FEM (PML box) 12 300 GB 330 min<br />
IE 12 83 GB 328 min<br />
FE-BI<br />
(conformal)<br />
12 21 GB 63 min<br />
VI. CONCLUSION<br />
The proliferation <strong>of</strong> High Performance Computing<br />
(HPC) has made parallelization a basic requirement for<br />
simulation codes today. Computational tasks can be<br />
distributed on different machines (nodes) or cores<br />
(distributed or shared memory). DDM is an ideal<br />
procedure for achieving high HPC efficiency. The<br />
subdomain solutions are fully independent <strong>of</strong> each other,<br />
so they can be evaluated in parallel, either by using<br />
distributed or shared memory. Subdomain solvers also<br />
exploit multi-processing and iterative solution methods.<br />
Both the Schwartz or Krylov domain iteration methods<br />
distribute tasks with high parallelism. The standard<br />
Message Passing Interface (MPI) can be used to control<br />
the data exchange between the nodes and cores. As<br />
demonstrated in the examples, the hybridized FE and BI<br />
DDM procedure provides a flexible and efficient tool to<br />
solve multi scale multi domain problems.<br />
- 6 - 15th IGTE Symposium 2012<br />
[1]<br />
REFERENCES<br />
K. Zhao, V. Rawat, S. Lee and J.F Lee, "A Domain Decomposition<br />
Method with Nonconformal Meshes for Finite Periodic and<br />
Semi-Periodic Structures," IEEE Transactions on Antennas and<br />
Propagation, vol. 55, pp. 2559 - 2570, September, 2007.<br />
[2] K. Zhao, V. Rawat and J.F Lee, "A Domain Decomposition<br />
Method for Electromagnetic Radiation and Scattering Analysis <strong>of</strong><br />
Multi-Target Problems," IEEE Transactions on Antennas and<br />
Propagation, vol. 56, pp. 2211 - 2221, August 2008.<br />
[3] I. Bardi, Zs. Badics and Z. Cendes, "Total and Scattered Field<br />
Formulations in the Transfinite Element Method," IEEE<br />
Transactions on Magnetics, vol. 44, pp. 778-781, June, 2008.<br />
[4] R. F. Harrington, Time–Harmonic Electromagnetic Fields, John<br />
Wiley & Sons, Inc. New York, 2000.<br />
[5] Y. Shao, Z. Peng and J.F Lee, "Full-Wave Real-Life 3-D Package<br />
Signal Integrity Analysis Using Nonconformal Domain<br />
[6]<br />
Decomposition Method," IEEE Transactions on Nicrowave<br />
Theory and Techniques, vol. 59, pp. 230 - 241, February 2011.<br />
W. C. Chew and C.C. Lu, "The use <strong>of</strong> Huygens’ equivalence<br />
principle for solving the volume integral equation for scattering,"<br />
IEEE Transactions on Antennas and Propagation, vol. 41, pp. 897<br />
- 904, July 1993.<br />
[7] Y.J Li and J.M. Jin, "A New Dual–Primal Domain Decomposition<br />
Approach for Finite Element Simulation <strong>of</strong> 3-D Large – Scale<br />
Electromagnetic Problems," IEEE Transactions on Antennas and<br />
Propagation, vol. 55, pp. 2803 – 2810, October 2007.<br />
[8] L.E. R. Petersson and J-M Jin, “Analysis <strong>of</strong> periodic structures via<br />
a time-domain finite-element formualiton with a Floquet abc,”<br />
IEEE Trans. AP, pp. 933-944, Mar. 2009.<br />
[9] J. Manges, J. Silvestro and R. Petersson, “Accurate and Efficient<br />
Extraction <strong>of</strong> Antenna Array Performance from Numerical Unit-<br />
Cell Data,” 2011 European Microwave Conference
- 7 - 15th IGTE Symposium 2012<br />
Parallelization <strong>of</strong> the Transmission Line Matrix<br />
method. Modelling Schumann Resonances and<br />
Atmospherics<br />
S. Toledo-Redondo∗ ,A.Salinas∗ , J. Fornieles∗ ,J.Portí † ,B.Besser ‡ , and H.I.M. Lichtenegger ‡<br />
∗Department <strong>of</strong> Electromagnetism and Matter Physics, <strong>University</strong> <strong>of</strong> Granada, Spain.<br />
† Department <strong>of</strong> Applied Physics, <strong>University</strong> <strong>of</strong> Granada, Spain.<br />
‡ Space Research Institute, Austrian Academy <strong>of</strong> Sciences, <strong>Graz</strong>, Austria<br />
E-mail: sergiotr@ugr.es<br />
Abstract—In this paper, a parallelization <strong>of</strong> the Transmission-Line Modelling (TLM) method is presented. It is intended to<br />
work efficiently regardless <strong>of</strong> the spatial topology <strong>of</strong> the problem, by transforming the initial topology into a one-dimensional<br />
structure. It is designed for shared memory environments, and its implementation is carried out using OpenMP directives.<br />
The algorithm is applied to find the first cut-<strong>of</strong>f frequency <strong>of</strong> the Earth-ionosphere waveguide by solving two models <strong>of</strong><br />
the real system. The performance <strong>of</strong> the algorithm for the mentioned problem is studied in terms <strong>of</strong> speedup over two<br />
different platforms. Relative speedups <strong>of</strong> up to 16 are achieved with the use <strong>of</strong> 32 CPUs. Finally, the whole Earth-ionosphere<br />
cavity is simulated, with an accuracy <strong>of</strong> 5 km grid size, leading to an error <strong>of</strong> less than 1.5% in the Schumann Resonance<br />
frequencies. The spatial resolution achieved also enables for the first time the possibility <strong>of</strong> using this model to study the<br />
global effects generated by local phenomena in the Earth-ionosphere cavity.<br />
Index Terms—Earth-ionosphere waveguide, Schumann resonances, shared memory, speedup, TLM.<br />
I. INTRODUCTION<br />
Numerical methods are a tool for embracing scientific<br />
and technological problems which are difficult or even<br />
impossible to solve analytically. In addition, simulation is<br />
<strong>of</strong>ten an intermediary step between design and construction<br />
<strong>of</strong> prototypes in industry. High performance computers<br />
are one <strong>of</strong> the keys <strong>of</strong> the present importance <strong>of</strong> these<br />
methods, because they allow simulating more and more<br />
complex situations as the technology evolves. However,<br />
the top speed <strong>of</strong> processors seems to have reached its top<br />
[1], and the tendency <strong>of</strong> CPU manufacturers is to ship<br />
multi-core processors instead <strong>of</strong> building faster single<br />
CPUs [2].<br />
The Transmission-Line Modelling method [3] is employed<br />
for the simulation <strong>of</strong> electromagnetic problems<br />
since 1971 [4], although it can simulate other problems<br />
as well, such as heat or particle diffusion, acoustic propagation,<br />
deformation in electric solids, waves in fluids,<br />
etc. [5] [6]. It has been used previously, in 2D form,<br />
for the study <strong>of</strong> atmospheric phenomena, e.g., Schumann<br />
Resonances [7], which is a problem similar to the one<br />
that will be addressed in this paper.<br />
The propagation <strong>of</strong> atmospherics in the Earthionosphere<br />
waveguide is a complex problem which involves<br />
several natural media (ground, oceans, atmosphere,<br />
ionospheric plasma) as well as the phenomena <strong>of</strong><br />
lightning [8], [9]. A parallel-TLM algorithm is employed<br />
to model the propagation <strong>of</strong> these natural signals and<br />
allows finding the first cut-<strong>of</strong>f frequency under two<br />
different approximated models <strong>of</strong> the natural waveguide<br />
formed by the ground and the ionosphere.<br />
Programming efficient algorithms with these relatively<br />
new hardware solutions is not straightforward. Different<br />
approaches must be taken into account according to<br />
the kind <strong>of</strong> hardware used. For instance, the way <strong>of</strong><br />
designing a parallel code for a Graphical Processing Unit<br />
(GPU) [10] will be different than for a multi-core system<br />
with shared memory access [11]. Programming shared<br />
memory environments is probably the most similar to<br />
traditional computing, i.e., not parallel, but still there is<br />
a great difference in the way we should conceive the<br />
algorithms [12].<br />
In Section 2, the TLM method is briefly introduced, as<br />
well as the Symmetric Condensed Node (SCN). Section<br />
3 describes the approach employed to parallelize the<br />
method. In Section 4, the Earth-ionosphere waveguide is<br />
introduced, and it is solved by means <strong>of</strong> the proposed<br />
algorithm. The model is benchmarked and its performance<br />
in terms <strong>of</strong> speedup is presented. In Section 5,<br />
the model is employed to solve the whole lossless Earthionosphere<br />
cavity, obtaining its Schumann Resonances.<br />
A brief summary as well as the main conclusions <strong>of</strong> the<br />
paper are detailed in Section 6.<br />
II. THE TLM METHOD<br />
TLM is a numerical method intended for simulation <strong>of</strong><br />
propagation problems which are governed by differential<br />
equations. Problems which have to deal with electromagnetics,<br />
heat diffusion, gravity waves, acoustic waves,<br />
etc. are suitable to be modelled with this technique. The
Fig. 1. Scheme <strong>of</strong> the Symmetric Condensed Node with 12 link lines.<br />
idea behind the method is to build a circuit based on<br />
transmission lines which behaves in analogous form as<br />
the problem we want to implement, i.e., the governing<br />
equations are the same for the circuit and for the physical<br />
problem. In this work, TLM will be used to solve<br />
Maxwell equations and the study <strong>of</strong> the Earth-ionosphere<br />
waveguide.<br />
The TLM method discretizes both time and space and<br />
sets up an iterative process in which the six components<br />
<strong>of</strong> the electromagnetic field evolve in time from a known<br />
initial situation. Therefore, the fields radiated and/or<br />
propagated in the space are simulated with arbitrary<br />
accuracy, which is constrained by the size <strong>of</strong> the space<br />
discretization. A thumbnail rule is that the minimum<br />
wavelength (λ) <strong>of</strong> interest must be ten times larger than<br />
the size <strong>of</strong> the cell (Δl), i.e., Δl ≤ λ/10 [13].<br />
Depending on the problem we want to model, each<br />
independent cell will be simulated by a different set up<br />
<strong>of</strong> transmission lines and node. For 3D electromagnetic<br />
problems, the most used circuit since it was formulated is<br />
the Symmetrical Condensed Node (SCN) [14], together<br />
with its variations. In this paper, the SCN with stubs for<br />
conductivity [15] will be used. In Fig. 1, a scheme <strong>of</strong> the<br />
transmission lines arrangement is shown. With its 12 link<br />
lines, the node is capable <strong>of</strong> modelling the behavior <strong>of</strong><br />
Maxwell’s equations for a differential <strong>of</strong> volume, ΔV .<br />
Regardless <strong>of</strong> the node employed for modelling each<br />
ΔV , the process iteration <strong>of</strong> TLM is always the same.<br />
For each node and time step n, t = nΔt (where Δt ≤<br />
Δl/2c for a cubic SCN, being c the speed <strong>of</strong> light in the<br />
medium), there is a set <strong>of</strong> incident pulses or voltages Vi,<br />
one for each transmission line <strong>of</strong> the node. During the<br />
time step they travel along the line, and they are either<br />
reflected and or transmitted to other lines, depending on<br />
the node structure. The transmitted/reflected pulses, or<br />
simply the scattered pulses Vr, are related to Vi by the<br />
scattering matrix S:<br />
Vr = S · Vi . (1)<br />
At the next time step, the scattered voltages from<br />
each node are converted into incident voltages <strong>of</strong> the<br />
- 8 - 15th IGTE Symposium 2012<br />
nearby node, thus propagating the pulses along the entire<br />
network. It is important to fix time synchronism in a<br />
manner that all pulses in the mesh are simultaneously<br />
incident at the center <strong>of</strong> their respective node at each<br />
time nΔt.<br />
III. PARALLELIZATION OF TLM<br />
TLM method is a time and memory consuming application,<br />
when applied to large problems. In its most<br />
basic form, at least 12 floating point numbers (floats)<br />
are needed in order to store the 12 voltages <strong>of</strong> each<br />
node. Six more floats become necessary if either nodes<br />
<strong>of</strong> variable size, permittivity (εr), or permeability (μr) <strong>of</strong><br />
the medium are required. Finally, three more floats must<br />
be used for adding electric conductivity. The problems<br />
solved in this work make use <strong>of</strong> 15 different transmission<br />
lines per node, 12 required by the basic SCN configuration,<br />
plus 3 for modelling the electric conductivity.<br />
The operation which consumes most <strong>of</strong> the computational<br />
time is reflected in Eq. 1, since this matrix<br />
multiplication must be performed at each node for each<br />
time step. For the case involved in this paper, the matrix<br />
multiplication has been reduced to 18 floating point<br />
multiplications and 36 additions, at the cost <strong>of</strong> executing<br />
different portion <strong>of</strong> code for the nodes which are at the<br />
border <strong>of</strong> the spatial distribution. The number <strong>of</strong> nodes<br />
for the problems involved in this work are on the order <strong>of</strong><br />
10 6 , and all these computations can be done concurrently<br />
for different nodes.<br />
Parallelizing the independent matrix multiplications<br />
<strong>of</strong> Eq. 1 requires doubling the minimum memory size<br />
required to hold the problem. Since the input voltage<br />
for one node is the output voltage for another node,<br />
Vi and Vr must be stored in different variables. On a<br />
sequential implementation <strong>of</strong> the model, i.e., not parallel,<br />
the order <strong>of</strong> execution is known, and the calculated Vr can<br />
overwrite Vi, if its value is previously stored on a local<br />
variable which can be erased when all related Vr have<br />
been calculated. Since the order in which the matrix multiplications<br />
will be performed is not known for a parallel<br />
version, doubling the minimum required memory size is<br />
a non-avoidable penalty <strong>of</strong> engaging parallelization. In<br />
the described TLM implementation, each node has the<br />
requirement <strong>of</strong> 30 floating-point variables (120 bytes) for<br />
storing the line voltages.<br />
The algorithm has been designed to be independent<br />
<strong>of</strong> its spatial topology, and it should provide the same<br />
performance regardless <strong>of</strong> the arrangement <strong>of</strong> the nodes<br />
in the space. The motivation for this constraint is to have<br />
a very flexible tool for solving different problems. In<br />
order to work in the same way regardless <strong>of</strong> the spatial<br />
geometry, the algorithm is split into two different steps:<br />
the pre-processing and the TLM computation itself.<br />
The pre-processing is a fast (when compared to TLM<br />
computation) operation which is in charge <strong>of</strong> transforming<br />
an arbitrary topology to a common one which can<br />
accommodate any kind <strong>of</strong> spatial distribution. The idea
is to assign a unique identifier to each node <strong>of</strong> the initial<br />
topology and to store a vector with the unique identifiers<br />
<strong>of</strong> the adjacent nodes. In this way, the result can be seen<br />
as a one-dimensional vector <strong>of</strong> nodes where each one<br />
knows which others are their neighbors. Any complex<br />
distribution <strong>of</strong> nodes can be simplified to this unified<br />
arrangement, regardless <strong>of</strong> the arbitrary initial geometry.<br />
On the other hand, abstracting the initial topology to<br />
this new paradigm brings a penalty <strong>of</strong> 6 integers per<br />
node to store the neighbor identifier at each direction<br />
(for 3D topologies) plus another integer to mark the kind<br />
<strong>of</strong> medium that the node belongs to. Therefore, the total<br />
amount <strong>of</strong> RAM memory required for each node is 152<br />
bytes.<br />
The TLM computation loop employed is shown in<br />
high-level pseudo-code below, where the OpenMP directives<br />
have been included:<br />
#pragma omp parallel private(private variables)<br />
{<br />
for(t=0..TotalTime)<br />
{<br />
#pragma omp single<br />
{<br />
//The reflected pulses become the incident at new time step<br />
Vi = Vr;<br />
//system feeding<br />
for(i=0..NumberOfFeeds) V[i]= feeding;<br />
//store the relevant output<br />
for(i=0..NumberOfOutputs) output=V[i];<br />
}<br />
#pragma omp for schedule (static)<br />
for(i=0..NumberOfNodes) Vr[i]=S*Vi[i];<br />
}end for(t)<br />
}end pragma parallel<br />
The main loop <strong>of</strong> the code is inside a #pragma omp. In<br />
this way, the overhead <strong>of</strong> creating (and destroying) new<br />
threads needs to be computed only once for all the execution.<br />
It mainly consists <strong>of</strong> iteration over time steps, which<br />
are not parallelizable, and which need synchronization<br />
<strong>of</strong> the threads which work inside each iteration. Each<br />
time step iteration is divided into two blocks; a sequential<br />
block and a parallel block. The sequential block performs<br />
three different operations:<br />
• Swap Vi by Vr. As we mentioned before, Vi and<br />
Vr must be stored in separate memory addresses in<br />
order to enable parallelization. At the beginning <strong>of</strong><br />
a time step, the reflected pulses from the previous<br />
iteration become the incident pulses on the neighbor<br />
nodes. In this implementation, the pointers <strong>of</strong> the<br />
vectors Vi and Vr are only exchanged, and the<br />
complexity <strong>of</strong> neighbor swapping the pulses is done<br />
implicitly in the matrix calculations, avoiding extra<br />
reading and writing to memory, although adding a<br />
small penalty in processing and complexity to the<br />
code.<br />
• System feeding. The initial electromagnetic problem<br />
may have sources on its initial definition. These<br />
sources bring external voltage pulses to the system,<br />
which are added in this part <strong>of</strong> the code.<br />
• Output storage. Some key nodes are marked as<br />
output and, therefore, the temporal evolution <strong>of</strong><br />
their voltages is necessary to reconstruct the fields’<br />
- 9 - 15th IGTE Symposium 2012<br />
evolution afterwards. All the line voltages at each<br />
time step from these output nodes are stored in<br />
memory.<br />
The parallel block is in charge for the matrix multiplication<br />
<strong>of</strong> each node. It is composed <strong>of</strong> a parallel for.<br />
Since the Vr calculation can be performed independently<br />
for each node, the OpenMP directive is in charge to<br />
distribute the computations between the available number<br />
<strong>of</strong> threads. Therefore, each thread will compute a portion<br />
<strong>of</strong> the total range <strong>of</strong> i. Since the clause schedule<br />
(static) is present, all the available threads will<br />
iterate the same portion <strong>of</strong> the i range.<br />
In order to reduce the total time <strong>of</strong> computation,<br />
several optimizations have been included here, which<br />
make the real code hard to read. One <strong>of</strong> them reduces<br />
the number <strong>of</strong> multiplications, by identifying operations<br />
which are performed several times for the calculation <strong>of</strong><br />
different reflected pulses. Another, the most complex one,<br />
deals with the neighboring <strong>of</strong> the nodes. It is implemented<br />
in such a way that the nodes on the edges <strong>of</strong> the initial<br />
geometry are treated in a different manner than the<br />
internal nodes are. The code is different but the amount<br />
<strong>of</strong> computation remains similar, except for the nodes<br />
which are edge in more than one <strong>of</strong> its sides. In this case<br />
the computations are larger. The number <strong>of</strong> nodes being<br />
edge for more than one side is usually small on most<br />
geometries. This is true for the models considered in this<br />
work due to the fact that scheduling the parallel for<br />
as static improves the performance <strong>of</strong> the algorithm,<br />
although few nodes require a bit more computation than<br />
others.<br />
IV. MODELLING THE EARTH-IONOSPHERE<br />
WAVEGUIDE<br />
The algorithm described above has been employed to<br />
simulate the Earth-ionosphere waveguide. The surface <strong>of</strong><br />
Earth behaves like a good conductor in the Very Low<br />
Frequency range (VLF, i.e., in the order <strong>of</strong> kHz), with<br />
conductivity ∼10−2 S/m for ground and ∼3.2 S/m for<br />
sea water [16]. There is air above the ground, which is<br />
<strong>of</strong> dielectric nature. As the altitude increases, the number<br />
<strong>of</strong> free electrons increases too, the density <strong>of</strong> neutral<br />
decreases, and the air starts behaving like a conductor.<br />
A typical conductivity pr<strong>of</strong>ile with altitude dependence<br />
is shown in Fig. 2 (top) [17].<br />
The excitation sources <strong>of</strong> the waveguide are lightning<br />
strokes [18]. They generate a broadband signal which<br />
differs in orientation, strength and duration depending <strong>of</strong><br />
its nature (cloud to ground, cloud to cloud, Q-bursts, etc.).<br />
A typical stroke in positive cloud to ground lightning<br />
is depicted in Fig. 2 (bottom) [19], which has been<br />
employed as excitation in the problem considered.<br />
The signal originated by the stroke travels a certain distance,<br />
guided between the two plates before extinguishing<br />
due to losses. On a first approximation, the system<br />
can be regarded as the infinite parallel-plate waveguide.<br />
According to [20], the cut-<strong>of</strong>f frequencies for a lossless<br />
waveguide <strong>of</strong> this geometry are located at:
Fig. 2. Conductivity pr<strong>of</strong>ile <strong>of</strong> the atmosphere with altitude (top),<br />
extracted from [17], and typical current for cloud to ground lightning<br />
(bottom).<br />
Amplitude [a.u.]<br />
6<br />
5<br />
4<br />
3<br />
2<br />
1<br />
Vertical electric field as a function <strong>of</strong> frequency<br />
lossy waveguide<br />
lossless waveguide<br />
0<br />
1000 2000 3000 4000 5000 6000 7000 8000<br />
Frequency (Hz)<br />
Fig. 3. Detail <strong>of</strong> the first and second cut-<strong>of</strong>f frequencies for the lossless<br />
and the lossy Earth-ionosphere waveguide.<br />
fn = nc<br />
(2)<br />
2h<br />
where c is the speed <strong>of</strong> light in vacuum, h is the<br />
distance between the parallel plates, n is the mode<br />
number, and fn the associated cut-<strong>of</strong>f frequency <strong>of</strong> the<br />
mode.<br />
The problem has been modelled with the algorithm,<br />
both for a lossless and for a lossy waveguide. For the<br />
lossless waveguide, the conductivity is supposed to be<br />
zero in the dielectric. For the lossy waveguide, the night<br />
conductivity pr<strong>of</strong>ile from Fig. 2 (top) is applied. The<br />
parallel plates are taken as perfect conductors in both<br />
cases. Details <strong>of</strong> the first and second cut-<strong>of</strong>f frequencies<br />
are depicted in Fig. 3, which correspond to electric<br />
field in the z-direction, at a distance <strong>of</strong> 45 km in ydirection<br />
from the source (see Fig. 4 for definition <strong>of</strong><br />
the directions). It is interesting to note the effect <strong>of</strong><br />
the conductivity, which increases the value <strong>of</strong> the cut<strong>of</strong>f<br />
frequencies, being equivalent to having a narrower<br />
waveguide.<br />
A. Algorithm Benchmarking<br />
The total execution time <strong>of</strong> the waveguide model over<br />
different computers has been measured, using a different<br />
- 10 - 15th IGTE Symposium 2012<br />
Fig. 4. Spatial arrangement <strong>of</strong> the waveguide model.<br />
total time <strong>of</strong> execution (s)<br />
2000<br />
1000<br />
500<br />
200<br />
100<br />
Scalability <strong>of</strong> TLM algorithm<br />
Absolute time <strong>of</strong> execution Relative speedups<br />
SM32<br />
SM32 round-robin<br />
SM8<br />
SM8 round-robin<br />
0 5 10 15 20 25 30<br />
Number <strong>of</strong> CPUs<br />
speedup = time 1core / time nCores<br />
18<br />
16<br />
14<br />
12<br />
10<br />
8<br />
6<br />
4<br />
2<br />
SM32<br />
SM32 round-robin<br />
SM8<br />
SM8 round-robin<br />
0<br />
0 5 10 15 20<br />
Number <strong>of</strong> CPUs<br />
25 30<br />
Fig. 5. Total time <strong>of</strong> execution (left) and relative speedups (right) for<br />
the different platforms.<br />
number <strong>of</strong> CPUs, in order to determine the scalability <strong>of</strong><br />
our algorithm. Two different computers have been used<br />
in the benchmarking process:<br />
• SuperMicro8 (SM8). Server with 2 AMD opteron<br />
quad-core processors 2.0 GHz and 32 GB RAM, in<br />
Not Uniform Memory Access (NUMA) configuration.<br />
The OS is OpenSUSE 11.4 and the compiler<br />
employed is opencc 4.2.4 (level 2 <strong>of</strong> optimization).<br />
• SuperMicro32 (SM32). Server with 4 AMD opteron<br />
eight-core processors 2.0 GHz and 96 GB RAM, in<br />
Not Uniform Memory Access (NUMA) configuration.<br />
The OS is OpenSUSE 11.4 and the compiler<br />
employed is opencc 4.2.4 (level 2 <strong>of</strong> optimization).<br />
The problem benchmarked makes use <strong>of</strong> symmetry<br />
and the initial grid is two-dimensional. According to<br />
Fig. 4, the symmetry is applied in the x-direction. The<br />
conductivity pr<strong>of</strong>ile is extended along z, and the output<br />
measured at a certain distance along the y-direction. The<br />
node size is 1.5 km, the time step is 2.5 μs, the number<br />
<strong>of</strong> time steps is 7,500, and the total number <strong>of</strong> nodes is<br />
∼106 (67 nodes in z, 15,000 nodes in y). The excitation is<br />
placed next to the ground, at the center <strong>of</strong> the waveguide.<br />
With this configuration, a total <strong>of</strong> 7.5·109 step-node<br />
computations must be performed to solve the problem.<br />
The total execution time and relative speedups are shown<br />
in Fig. 5, for both platforms. The relative speedup is<br />
defined as the execution time using n cores divided by<br />
the execution time using 1 core. A maximum speedup <strong>of</strong><br />
6 is achieved with SM8, when making use <strong>of</strong> its 8 CPUs.<br />
On SM32, a maximum speedup <strong>of</strong> 16 is obtained when<br />
using 30 CPUs.<br />
As it can be observed in Fig. 5, two benchmarks<br />
have been measured for each computer. The aim is to<br />
compare the performance when using or not Round-<br />
Robin memory allocation policy. This policy consists in<br />
requesting the operative system to balance the memory<br />
reservation equally among the different portions <strong>of</strong> RAM.
This can be accomplished via the numactl tool [21].<br />
If this policy is not enabled, the memory reservation<br />
will be performed sequentially, and only some memory<br />
blocks will be used. Since the computers employed have<br />
NUMA architecture, each processor can access faster to<br />
a certain RAM circuit, while the access to the others is<br />
slower. Moreover, if the Round-Robin policy is not set,<br />
the different CPUs will have to compete to gain access<br />
to the particular RAM circuit, slowing down the overall<br />
computation. This technique is especially effective for a<br />
large number <strong>of</strong> CPUs (see Fig. 5).<br />
V. MODELLING THE WHOLE EARTH-IONOSPHERE<br />
CAVITY<br />
In this section, the whole cavity is considered in the<br />
simulation, leading to a much more time and RAM memory<br />
consuming model. The cavity has been considered as<br />
the space between two concentric spheres <strong>of</strong> 6,370 and<br />
6,470 km, with perfect conducting walls at the borders<br />
and no conductivity at the interior. The spherical shell<br />
has been modeled by cubic nodes, in this case <strong>of</strong> Δl=5<br />
km <strong>of</strong> size. The total number <strong>of</strong> nodes is ∼4.14·108 ,and<br />
the amount <strong>of</strong> RAM required is ∼61.5 GBytes. Around<br />
1.1 GBytes are employed for storing the outputs. For a<br />
spatial grid with a 5 km resolution, the time step required<br />
is 8.34 μs. The number <strong>of</strong> time iterations calculated was<br />
2.4·105 , and therefore the simulated time length is ∼2<br />
s. A frequency resolution <strong>of</strong> 0.5 Hz is achieved when<br />
the FFT is computed with these parameters. The total<br />
execution time required when using 32 cores on SM32<br />
(see Section IV-A) is roughly 6.0·105 s, i.e., around seven<br />
days.<br />
The excitation source <strong>of</strong> the cavity has been located at<br />
θ=0 and r=6,372 km, i.e., the North Pole. The excitation<br />
corresponds to a vertical positive Cloud to Ground (+CG)<br />
lightning, and its current is shown in Fig. 2 (bottom). This<br />
stroke starts at t=0, and lasts for 500 μs.<br />
With this spatial arrangement, the problem has symmetry<br />
over the φ coordinate, and therefore the outputs had<br />
been located all φ=0. A total <strong>of</strong> 101 nodes are marked as<br />
output, and they are equally spaced along the coordinate<br />
θ, from 0 to π, for r=6,370 km, i.e., at the surface,<br />
because it is the common location for SR measurements.<br />
As SR analytical models state [22] [23], the two<br />
relevant components <strong>of</strong> the electromagnetic field are Er<br />
and Hφ. In Figure 6, the six components <strong>of</strong> the output<br />
corresponding to θ=π/4 have been plotted, in order to<br />
show this fact. The other output nodes show similar<br />
results, where the two components mentioned are much<br />
greater than the rest.<br />
In order to corroborate the results from the simulations,<br />
the relationship between the modal amplitude <strong>of</strong> the six<br />
first SR and the angular distance to the source for the<br />
101 nodes marked as output (θ=0, θ=π/100,..., θ=π) has<br />
been plotted in Figure 7. This result is in agreement<br />
with analytical model results [22] [23], which show the<br />
amplitude dependence <strong>of</strong> SR modes with the distance to<br />
the source.<br />
- 11 - 15th IGTE Symposium 2012<br />
Fig. 6. Spectra <strong>of</strong> Electric (left) and Magnetic (right) field components.<br />
The relevance <strong>of</strong> Er and Hφ can be observed.<br />
Hphi [T/sqrt(Hz)]<br />
6e-09<br />
5e-09<br />
4e-09<br />
3e-09<br />
2e-09<br />
1e-09<br />
Dependence <strong>of</strong> SR amplitude with distance to the source (θ), lossless cavity<br />
SR1<br />
SR2<br />
SR3<br />
0<br />
0 π/4 π/2<br />
θ [rad]<br />
3π/4 π<br />
Hphi [T/sqrt(Hz)]<br />
1.2e-08<br />
1e-08<br />
8e-09<br />
6e-09<br />
4e-09<br />
2e-09<br />
SR4<br />
SR5<br />
SR6<br />
0<br />
0 π/4 π/2<br />
θ [rad]<br />
3π/4 π<br />
Fig. 7. Dependence <strong>of</strong> SR modal amplitude with θ, for the lossless<br />
cavity.<br />
The simulation has been repeated changing only the<br />
size <strong>of</strong> the spatial grid to Δl=10 km. Doubling the size<br />
<strong>of</strong> the nodes reduces by a factor <strong>of</strong> eight the number<br />
<strong>of</strong> nodes, at the cost <strong>of</strong> a poorer fitting <strong>of</strong> the spherical<br />
geometry and worse spatial resolution. The maximum<br />
valid frequency is also reduced by a factor <strong>of</strong> two, but<br />
this is not important for the study <strong>of</strong> SR, because the<br />
top frequency is still 3 kHz (the condition is λ ≥ 10Δl).<br />
The amount <strong>of</strong> memory required is reduced to 9.1 GBytes<br />
(with 1.1 GBytes for storing the results). The execution<br />
time, again with 32 cores in SM32, is reduced to 7.6·10 4<br />
s, i.e., roughly 21 hours. The magnetic fields in φ direction<br />
at an angular distance <strong>of</strong> π/4 <strong>of</strong> the two simulations<br />
are compared in Figure 8.<br />
The six maxima from each spectra <strong>of</strong> Hφ have been<br />
extracted and averaged, with the aim <strong>of</strong> using them as a<br />
proxy <strong>of</strong> the resonance position. The results are shown<br />
in Table I, for both simulations.<br />
It can be observed that the results for the central frequencies<br />
<strong>of</strong> the six SR are similar in the two simulations<br />
and with the results from the analytical solution. For the<br />
case <strong>of</strong> the 10 km size simulation, the errors for the<br />
central frequencies are always under 3%. This error is
Amplitude [a.u.]<br />
1<br />
0.9<br />
0.8<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
0.2<br />
0.1<br />
10 km<br />
5 km<br />
Comparison <strong>of</strong> H φ at θ=π/4<br />
0<br />
0 10 20 30 40 50<br />
Frequency (Hz)<br />
Fig. 8. Comparison <strong>of</strong> Hφ at θ=π/4, for the two simulations (5 km<br />
and 10 km).<br />
TABLE I<br />
SR CENTRAL VALUES IN HZ, LOSSLESS CAVITY.<br />
1st SR 2nd SR 3rd SR 4th SR 5th SR 6th SR<br />
10 km 10.24 17.74 24.98 32.35 39.63 46.93<br />
5km 10.47 17.99 25.48 32.96 39.98 47.47<br />
Analytical 10.51 18.20 25.75 33.24 40.71 48.17<br />
reduced to less than 1.5% for the 5 km simulation.<br />
VI. CONCLUSION<br />
The TLM is briefly described and the inherent parallel<br />
areas <strong>of</strong> the algorithm are pointed out. It has been<br />
parallelized for shared memory architectures, by using<br />
OpenMP. The solution obtained has been employed to<br />
simulate the Earth-ionosphere waveguide, and to observe<br />
the changes produced by the conductivity pr<strong>of</strong>ile over<br />
the cut-<strong>of</strong>f frequency, by comparing the results with<br />
the lossless waveguide. The effect <strong>of</strong> this conductivity<br />
is to increase the value <strong>of</strong> the cut-<strong>of</strong>f frequencies,<br />
being equivalent to having a narrower waveguide. The<br />
algorithm has been run on two different platforms and<br />
benchmarked. The algorithm scales up to a speedup <strong>of</strong><br />
16 by using 30 CPUs. In order to obtain the maxima<br />
speedups, it is necessary to set a policy <strong>of</strong> Round-Robin<br />
memory allocation, in order to minimize the effects <strong>of</strong><br />
the NUMA architecture. Finally, a huge simulation (the<br />
whole Earth-ionosphere cavity with a 5 km resolution)<br />
has been performed for validation <strong>of</strong> the parallel algorithm.<br />
The lossless version <strong>of</strong> the cavity is solved, and<br />
the electromagnetic fields obtained are consistent with<br />
the analytical solution <strong>of</strong> the cavity. Since a Cartesian<br />
grid <strong>of</strong> only 5 km size per cell was employed, the<br />
errors are lower than 1.5% for the Schumann resonance<br />
frequencies. The spatial resolution achieved enables the<br />
possibility <strong>of</strong> using this model to study the global effects<br />
generated by local phenomena in the Earth-ionosphere<br />
cavity.<br />
Acknowledgments: This work was supported by the<br />
Consejería de Innovación, Ciencia y Empresa <strong>of</strong> Andalusian<br />
Government and Ministerio de Ciencia e Innovación<br />
- 12 - 15th IGTE Symposium 2012<br />
<strong>of</strong> Spain under projects with references PO7-FQM-03280<br />
and FIS2010-15170, co-financed with FEDER funds <strong>of</strong><br />
the European Union.<br />
REFERENCES<br />
[1] Flynn, L.J.: Intel halts development <strong>of</strong> 2 new microprocessors, The<br />
New York Times, May 8, 2004, retrieved on March 2, 2011 (2004)<br />
[2] Yu, W., Yang X., Liu, Y., Ma, L.-C., Su, T., Huang, N.-T., Mittra,<br />
R., Maaskant, R., Lu, Y., Che, Q., Lu, R., Su, Z.: A new direction<br />
in computational electromagnetics: solving large problems using<br />
the parallel FDTD on the BlueGene/L Supercomputer providing<br />
teraflop performance, IEEE antennas and Propag. Mag., 50(2), 26–<br />
44 (2008).<br />
[3] Christopoulos, C.: The Transmission-Line Modelling Method,<br />
TLM, IEEE Press, Piscataway, N.J. (1995)<br />
[4] Johns, P.B, Beurle, R.L.: Numerical solution <strong>of</strong> 2-dimensional<br />
scattering problems using a transmission-line matrix, Proc. Inst.<br />
Elec. Eng., 118(9): 1203–1208, 1971.<br />
[5] De Cogan, D., Pulko, S.H., O’Connor, W.J.: Transmission-Line<br />
Matrix in computational mechanics, CRC Press, Boca Raton, Fla.<br />
(1995)<br />
[6] Enders, P., Pulko, S.H., Stubbs, D.M.: TLM for diffusion: consistent<br />
first time step. Two-dimensional case, International Journal <strong>of</strong><br />
numerical modelling-electronic networks devices and fields, 15(3),<br />
251–259 (2002)<br />
[7] Morente, J.A., Molina-Cuberos, G.J., Portí, J., Besser, B.P., Salinas,<br />
A., Schwingenschuch, K., Lichtenegger, H.: A numerical simulation<br />
<strong>of</strong> Earth’s electromagnetic cavity with the Transmission Line<br />
Matrix method: Schumann resonances, J. Geophys, Res., 108(A5),<br />
1195–1205 (2003)<br />
[8] S. Toledo-Redondo, Parrot, M., and Salinas, A., “Variation <strong>of</strong> the<br />
first cut-<strong>of</strong>f frequency <strong>of</strong> the Earth-ionosphere waveguide observed<br />
by DEMETER”, J. Geophys. Res., vol. 117, pp. A04321, 2012.<br />
[9] Cummer, S.A.:Modeling electromagnetic propagation in the Earthionosphere<br />
waveguide, IEEE Trans. ant. Propag., 48(9), 1420–1432<br />
(2000)<br />
[10] Kirk, D.B., Hwu, W.W.: Programming massively parallel processors,<br />
a hands-on approach, Morgan Kaufmann, Burlington, M.A.<br />
(2010)<br />
[11] Chapman, B., Jost, G., van der Pas, R.: Using OpenMP: Portable<br />
Shared Memory Parallel Programming, The MIT Press, Cambridge,<br />
Massachussets (2007)<br />
[12] Breshears, C.: The art <strong>of</strong> concurrency, a thread monkey’s guide<br />
to writing parallel applications, O’Reilly, Sebastopol, CA (2009)<br />
[13] Morente, J.A., Jiménez, G., Portí, J., Khalladi, M.: Dispersion<br />
analysis for TLM mesh <strong>of</strong> symmetrical condensed nodes with<br />
stubs, IEEE Trans. Microwave Theory Tech. 43(2), 452–456 (1995)<br />
[14] Johns, P.B.: A symmetrical condensed node for the TLM method,<br />
IEEE Trans. Microwave Theory Tech. 35(4), 370–377 (1987)<br />
[15] Naylor, P., Desai, R.A.: New three dimensional symmetrical condensed<br />
lossy node for solution <strong>of</strong> electromagnetic wave problems<br />
by TLM, Electron. Lett., 26(7), 492-494 (1990)<br />
[16] Rycr<strong>of</strong>t, M.J., Harrison, R.G., Nicoll, K.A., Mareev, E.A.: An<br />
overview <strong>of</strong> Earth’s global electric circuit and atmospheric conductivity,<br />
Space Sci. Rev., 137, 83–105 (2008)<br />
[17] Pechony, O., Price, C.: Schumann resonance parameters calculated<br />
with a partially uniform model on Earth, Venus, Mars, and<br />
Titan, Radio Sci., 39, RS5007 (2004)<br />
[18] Storey, L.R.O.: An investigation <strong>of</strong> whistling atmospherics, Philosophical<br />
transactions <strong>of</strong> the royal society <strong>of</strong> London series A -<br />
Mathematical and physical sciences, 246(908), 113–141 (1953)<br />
[19] Baba, Y., Rakov, V.A.: Present understanding <strong>of</strong> the lightning return<br />
stroke, in Lightning: Principles, instruments and applications,<br />
Springer (2009)<br />
[20] Cheng, D.K.: Field and wave electromagnetics, Addison-Wesley<br />
(1989)<br />
[21] Linux Manual pages, numactl(8),<br />
http://linuxmanpages.com/man8/numactl.8.php<br />
[22] Sentman, D.D., Schumann Resonances, in Handbook <strong>of</strong> atmospheric electrodynamics,<br />
CRC Press, Boca Raton, Fla, (1995)<br />
[23] Toledo-Redondo, S., Salinas, A., Portí, J., Morente, J.A., Fornieles, J.,<br />
Méndez, A., Galindo-Zaldívar, J., Pedrera, A., Ruiz-Constán, A., and Anahnah,<br />
F., Study <strong>of</strong> Schumann resonances based on magnetotelluric records<br />
from the western Mediterranean and Antarctica, J. Geophys. Res., 115, D22,<br />
114, (2010).
- 13 - 15th IGTE Symposium 2012<br />
A Novel Parametric Model Order Reduction<br />
Approach with Applications to Geometrically<br />
Parameterized Microwave Devices<br />
Stefan Burgard∗ , Ortwin Farle∗ , and Romanus Dyczij-Edlinger∗ ∗Chair for Electromagnetic Theory, Saarland <strong>University</strong>, D-66123 Saarbrücken, Germany<br />
E-mail: edlinger@lte.uni-saarland.de<br />
Abstract—Methods <strong>of</strong> model-order reduction approximate the transfer behavior <strong>of</strong> a given high-dimensional system by that<br />
<strong>of</strong> a low-order one, which is much faster to evaluate. In the parametric case, the system features additional parameters,<br />
such as material properties or geometric design variables. The parametric order-reduction methods available today still<br />
exhibit a number <strong>of</strong> limitations, particularly with respect to convergence rates and the size <strong>of</strong> the reduced-order model.<br />
This contribution presents a novel technique based on affine parameter reconstruction and parameter-dependent projection<br />
matrices. It features high rates <strong>of</strong> convergence, supports local adaptation, and yields reduced-order models that are <strong>of</strong> very<br />
low dimension and thus fast to evaluate.<br />
Index Terms—Computer-aided engineering, geometric parameters, parametric model order reduction, parametric models.<br />
I. INTRODUCTION<br />
This paper addresses microwave components with linear<br />
time-invariant system properties. Since most practical<br />
structures possess complicated shape and inhomogeneous<br />
material properties, their fields-level analysis requires numerical<br />
methods, such as the finite-element (FE) method.<br />
FE discretization in the frequency domain results in<br />
systems <strong>of</strong> linear equations which are characterized by<br />
sparse matrices <strong>of</strong> high dimension. While solving a FE<br />
system at one single operating frequency may not be particularly<br />
demanding on modern computers, the analysis<br />
<strong>of</strong> broad frequency bands still tends to be very timeconsuming.<br />
The situation gets even worse when multiple<br />
parameters, such as material properties or geometric<br />
design variables, are present, and entire response surfaces<br />
are to be computed.<br />
Methods <strong>of</strong> model-order reduction (MOR) address this<br />
issue by approximating the behavior <strong>of</strong> the original system<br />
by a reduced-order model (ROM) that is very cheap<br />
to solve. As long as the frequency is the sole parameter,<br />
powerful single-point [1], [2] or multi-point [3], [4]<br />
algorithms are readily available. The incorporation <strong>of</strong><br />
additional parameters, especially those <strong>of</strong> geometric type,<br />
still poses challenges, with respect to convergence rates,<br />
computing times, and model dimension. One particular<br />
difficulty with geometric parameters is that they enter the<br />
FE matrices in the form <strong>of</strong> multivariate rational functions<br />
<strong>of</strong> complicated structure. The authors use the technique<br />
<strong>of</strong> [5] and [6] for affine geometry approximation.<br />
Present parametric model-order reduction (PMOR)<br />
techniques fall under two categories: The one class comprises<br />
methods [7], [5] that employ one global projection<br />
space for all parameters, including the frequency. It is<br />
characteristic <strong>of</strong> such entire-domain methods that the<br />
ROM dimension is large and rises quickly with increasing<br />
size <strong>of</strong> the parameter domain. The other class includes<br />
methods that instantiate frequency-domain ROMs for<br />
a set <strong>of</strong> sampling points in the domain <strong>of</strong> geometric<br />
parameters and employ interpolation over sub-domains<br />
to account for geometry variations. Thanks to their local<br />
nature, the resulting sub-domain ROMs are <strong>of</strong> low dimension.<br />
While existing techniques [8] - [11] interpolate<br />
the frequency-domain ROMs directly, the method proposed<br />
in this paper interpolates projection matrices. One<br />
particular advantage <strong>of</strong> this approach is to decouple the<br />
approximation <strong>of</strong> the effects <strong>of</strong> geometric parameters on<br />
the FE matrices, which may be the dominant source <strong>of</strong><br />
error, from the actual PMOR process.<br />
The remainder <strong>of</strong> the paper is organized as follows:<br />
Section II presents the underlying parameter-dependent<br />
FE system. The treatment <strong>of</strong> geometric parameters is<br />
reviewed in Section III. The new PMOR approach is<br />
developed in Section IV. This constitutes the main contribution<br />
<strong>of</strong> the paper. Section V gives numerical examples<br />
that demonstrate the benefits <strong>of</strong> the suggested approach.<br />
A brief summery in Section VI closes the paper.<br />
II. ORIGINAL SYSTEM<br />
We consider a time-harmonic electromagnetic FE system<br />
<strong>of</strong> dimension N which possesses Q inputs and outputs,<br />
respectively, and depends on the frequency f ∈ R<br />
and a vector p ∈P⊂RP <strong>of</strong> P geometric parameters.<br />
The input vector is denoted by u, the generalized state<br />
by x(f,p), and the output by y(f,p). The system is<br />
assumed to be <strong>of</strong> the form<br />
I<br />
J<br />
<br />
φi(f)Ai(p) x(f,p) = θj(f)Bj u, (1a)<br />
i=1<br />
j=1<br />
J<br />
y(f,p) = ηj(f)B<br />
j=1<br />
T <br />
j x(f,p), (1b)<br />
wherein Bj ∈ RN×Q , and the functions φi,θj,ηj : R →<br />
C and Ai : RP → RN×N are continuous. Eq. (1) implies
that the topology <strong>of</strong> the FE mesh, i.e. the number and<br />
connectivity <strong>of</strong> the FE nodes, must remain the same over<br />
the whole parameter domain P. Meshes <strong>of</strong> this kind<br />
can be constructed for a wide class <strong>of</strong> parameterized<br />
geometries by, e.g., the morphing method <strong>of</strong> [12].<br />
III. GEOMETRY INTERPOLATION<br />
In many cases, the parameter-dependent matrices<br />
Ai(p) are just multivariate rational functions. Nevertheless,<br />
their explicit representation [13] is quite complex<br />
in practice, because it requires tracking the effects <strong>of</strong><br />
all geometric parameters from the solid model through<br />
the mesh generation process to the FE matrix generation<br />
stage. Therefore, the present paper follows the suggestion<br />
<strong>of</strong> [5] and [6] to approximate Ai(p) by a function <strong>of</strong><br />
simpler structure. We set<br />
Ai(p) ≈ <br />
Γβ(p)A<br />
β<br />
β<br />
i for p ∈P, (2)<br />
wherein β =[β1,...,βP ] is a multi-index, A β<br />
i ∈ CN×N ,<br />
and Γβ : R P ↦→ R is a suitable interpolation function.<br />
Thus, the approximate system reads:<br />
<br />
φi(f)<br />
i<br />
<br />
Γβ(p)A<br />
β<br />
m <br />
i x ′ (f,p)= <br />
θj(f)Bju, (3a)<br />
y<br />
j<br />
′ <br />
(f,p) =<br />
<br />
x ′ (f,p), (3b)<br />
ηj(f)B<br />
j<br />
T j<br />
The interpolation functions Γβ are obtained as follows:<br />
For each parameter p, choose a set Np <strong>of</strong> interpolation<br />
points ψ k p,<br />
Np = ψ k p ∈ R <br />
k =1,...,Kp , (4)<br />
and associated interpolation functions γ k p : R → R with<br />
γ k p (ψ l p)=δkl for ψ l p ∈Np. (5)<br />
Next construct a tensor-grid G = N1 × ... ×NP for<br />
the domain P. Then the interpolation point pβ and<br />
interpolation function Γβ are given by<br />
pβ =[ψ β1<br />
1 ,...,ψβP P ], (6a)<br />
Γβ(p) =γ β1<br />
1 (ψ1) · ...· γ βP<br />
P (ψP ). (6b)<br />
By (6) and (2), the interpolation matrices A β<br />
i<br />
are given<br />
by the system matrices Ai <strong>of</strong> the original FE system (1)<br />
at the interpolation point pβ:<br />
A β<br />
i = Ai(pβ). (7)<br />
IV. PARAMETRIC ORDER REDUCTION<br />
We construct the parametric ROM by replacing the<br />
test and trial space, respectively, <strong>of</strong> the interpolated FE<br />
system (3) by an n dimensional subspace S(p) which<br />
depends continuously on the parameter vector p. For<br />
this purpose, a Galerkin procedure based on a parameterdependent<br />
projection matrix V(p) :RP → RN×n , with<br />
S(p) =range {V(p)} , (8)<br />
- 14 - 15th IGTE Symposium 2012<br />
1 1<br />
1 2<br />
[ , ]<br />
1 2<br />
1 2<br />
[ , ]<br />
1 3<br />
1 2<br />
1,1<br />
1,2<br />
2 1<br />
1 2<br />
[ , ]<br />
2 2<br />
1 2<br />
2,1<br />
2,2<br />
3 1<br />
1 2<br />
[ , ]<br />
3 2<br />
[ , ] [ , ]<br />
1 2<br />
2 3 3 3<br />
[ , ] [ , ] [ , ]<br />
Fig. 1. Hypercube topology<br />
1 2<br />
1 2<br />
is applied to (3). The resulting ROM is <strong>of</strong> the form<br />
<br />
φi(f)<br />
i<br />
<br />
Γβ(p)<br />
β<br />
Ãβi<br />
(p)<br />
<br />
˜x = <br />
θj(f)<br />
j<br />
˜ Bj(p)u, (9a)<br />
<br />
˜y(f,p) = ηj(f) ˜ B T <br />
j (p) ˜x(f,p), (9b)<br />
with<br />
j<br />
à β<br />
i (p) =VT (p)A β<br />
i V(p), (10a)<br />
˜Bj(p) =V T (p)Bj. (10b)<br />
As long as n ≪ N, the frequency response <strong>of</strong> the ROM<br />
can be computed much more efficiently than the original<br />
one.<br />
A. Parameter dependent projection matrix<br />
We start by computing n dimensional one-parameter<br />
ROMs with respect to frequency at all interpolation<br />
points pβ ∈ G. The resulting projection matrices are<br />
denoted by ˆ Vβ ∈ C N×n .<br />
The interpolation points pβ ∈Gsubdivide the param-<br />
eter domain into hypercubes H β ⊂ R P . Based on the<br />
line segments L k p = ψ k p,ψ k+1<br />
p<br />
H β = L β1<br />
1<br />
,wehave<br />
× ...×LβP P . (11)<br />
Fig. 1 illustrates the setting in the two-dimensional case.<br />
Starting from one-dimensional hat functions ξ k p : R → R,<br />
ξ k p (ψ) =<br />
⎧<br />
ψ k−1<br />
p −ψ<br />
for ψ ∈Lk−1 p ,<br />
⎪⎨ ψ<br />
⎪⎩<br />
k−1<br />
p −ψk p<br />
ψ k+1<br />
p −ψ<br />
ψ k+1<br />
p −ψk for ψ ∈L<br />
p<br />
k p,<br />
0 else,<br />
(12)<br />
we construct piecewise multi-linear interpolation functions<br />
Ξβ : R P → R <strong>of</strong> compact support:<br />
Ξβ(p) =ξ β1<br />
1 (ψ1) · ...· ξ βP<br />
P (ψP ). (13)<br />
Within a given hypercube Hα , the parameterdependent<br />
projection matrix V(p) is defined by<br />
V(p) = <br />
Ξβ(p) ˆ VβT α β for p ∈H α . (14)<br />
pβ∈H α
Herein, the matrices T α β ∈ Rn×n are provided in order<br />
to conduct state transformations. They are constructed<br />
as follows: Following [8] and [9], a singular value<br />
decomposition [14] is performed,<br />
<br />
= U diag σW H , (15)<br />
<br />
ˆVβ1 ,..., ˆ Vβ (2P )<br />
to determine a basis Rα ∈ RN×n for the n dimensional<br />
subspace <strong>of</strong> highest energy over the hypercube Hα , i.e.,<br />
the subspace corresponding to the n largest singular<br />
values:<br />
Rα = U(:, 1:n). (16)<br />
For any relevant state Rα˜x, we require the ROM state<br />
at the interpolation point pβ, ˆ VβTα β ˜x, to be as close as<br />
possible:<br />
!<br />
=min ∀˜x ∈ C n , (17a)<br />
Rα˜x − ˆ VβT α β ˜x2<br />
⇒ ˜x − R H α ˆ VβT α β ˜x = 0 ∀˜x ∈ Cn . (17b)<br />
Thus,<br />
T α β =<br />
<br />
R T α ˆ −1 Vβ . (18)<br />
Eq. (18) underlines that interpolating the bases ˆ Vβ directly,<br />
which is equivalent to taking Tα β = I, may cause<br />
gross error.<br />
B. Assembly<br />
Plugging (14) into (10) leads to the following representation<br />
<strong>of</strong> the reduced matrices within the hypercube Hα :<br />
à β<br />
<br />
i (p) =<br />
pγ∈Hα <br />
pδ∈Hα Ξγ(p)Ξδ(p) A β<br />
i,γ,δ , (19a)<br />
˜Bj(p) = <br />
Ξγ(p)Bj,γ, (19b)<br />
wherein<br />
pγ∈H α<br />
A β<br />
i,γ,δ =(Tα γ ) T V T γ A β<br />
i VδT α δ , (20a)<br />
Bj,γ =(T α γ ) T V T γ Bj. (20b)<br />
Note that all the coefficient matrices in (20) are <strong>of</strong><br />
reduced size and can be computed in advance. No O(N)<br />
operations are required during the solution process.<br />
V. NUMERICAL EXAMPLES<br />
In the examples below, the single-parameter ROMs<br />
with respect to frequency at the interpolation points are<br />
computed by means <strong>of</strong> the single-point algorithm <strong>of</strong> [1].<br />
A. Dielectric Post<br />
Fig. 2 shows the H plane filter <strong>of</strong> [16]. It consists <strong>of</strong><br />
a dielectric post at the center <strong>of</strong> an air-filled rectangular<br />
waveguide. The model features two parameters: the operating<br />
frequency f ∈ [15, 25] GHz and the geometric<br />
parameter p ∈ [−1.5, 1.5] mm which defines the radius r<br />
<strong>of</strong> the post according to<br />
r =2.5 mm + p. (21)<br />
Fig. 3 shows instantiations <strong>of</strong> the parametric mesh [12]<br />
for p ∈{−1.5, 0, 1.5} mm.<br />
- 15 - 15th IGTE Symposium 2012<br />
10<br />
Γ (1)<br />
WG<br />
Ω<br />
10<br />
ɛd<br />
20<br />
μd<br />
Γ (2)<br />
d<br />
ɛr = μr =1<br />
Γ (1)<br />
d<br />
2r<br />
Γ (2)<br />
WG<br />
5<br />
Fig. 2. Structure <strong>of</strong> rectangular waveguide filter [16]. All dimensions<br />
are in mm. Material properties <strong>of</strong> rod: relative electric permittivity ɛd =<br />
4, relative magnetic permeability μd =1. Waveguide ports are denoted<br />
by Γ (1)<br />
WG and Γ(2)<br />
WG , respectively.<br />
Fig. 3. Instantiations <strong>of</strong> the parametric FE mesh for different values<br />
<strong>of</strong> the geometry parameter: p ∈ {−1.5, 0, 1.5} mm. Note that the<br />
meshes share the same topology.<br />
1) Response Surface and Errors: Fig. 4 presents the<br />
response surface <strong>of</strong> the magnitude <strong>of</strong> the reflection coefficient<br />
|S11|. The parametric ROM is based on M =5<br />
interpolation points placed at the locations <strong>of</strong> the zeros<br />
<strong>of</strong> the fifth-order Chebyshev polynomial <strong>of</strong> the first kind.<br />
The expansion frequency for the single-parameter ROMs<br />
is set at the center <strong>of</strong> the frequency band, f exp =20GHz.<br />
We define the error in S11 by<br />
ES11 (f,p) = S11(f,p) − S11(f,p), (22)<br />
wherein ˜ S11 denotes the PMOR result, and S11 is the<br />
reference solution, which is computed by conventional<br />
FE analysis, using the same mesh. The complete error<br />
surface is given in Fig. 5. It can be seen that errors are<br />
in the order <strong>of</strong> 10 −3 , which is below the typical level <strong>of</strong><br />
the FE discretization error. Note that calculating the error<br />
surface is only possible for very simple structures, like<br />
the present filter, because each <strong>of</strong> the 101×101 sampling<br />
points in f − p space requires a separate FE run.
Fig. 4. Dielectric post: Response surface <strong>of</strong> the magnitude <strong>of</strong> the<br />
reflection coefficient S11 as a function <strong>of</strong> operating frequency f and<br />
radius variation p.<br />
Fig. 5. Dielectric post: Error surface <strong>of</strong> |S11|.<br />
Computational data for conventional FE analysis and<br />
two different PMOR models, ROM3 and ROM5, using<br />
M =3and M =5interpolation points, respectively,<br />
are given in Table I. It can be seen that, even though the<br />
dimension <strong>of</strong> the original FE system is very small, the<br />
larger <strong>of</strong> the two models, ROM5, is still 150 times faster<br />
to evaluate.<br />
2) Analysis <strong>of</strong> Suggested Procedure: Our first goal<br />
is to compare the proposed method, which employs<br />
TABLE I<br />
COMPUTATIONAL DATA FOR DIELECTRIC POST.<br />
Model ROM5 ROM3 FE<br />
Number <strong>of</strong> grid points 5 3 -<br />
Moment-matching order 10 7 -<br />
Dimension 22 16 5616<br />
Model generation (s) ∗ 365.8 177.7 -<br />
Evaluations per s∗ 481.0 757.3 3.2<br />
|Average error in S11| 4.99 · 10−4 5.93 · 10−3 0<br />
∗ MATLAB code on Intel Pentium 4 (3GHz), one thread used.<br />
- 16 - 15th IGTE Symposium 2012<br />
|E |<br />
S11<br />
10 0<br />
10 −1<br />
10 −2<br />
10 −3<br />
10 −4<br />
Direct interpolation <strong>of</strong> ROM bases<br />
Present method<br />
10<br />
−1.5 −1 −0.5 0 0.5 1 1.5<br />
−5<br />
Radius variation (mm)<br />
Fig. 6. Dielectric post: Magnitude <strong>of</strong> error in reflection coefficient S11<br />
as a function <strong>of</strong> radius variation p at f =25GHz. Note the positive<br />
effects <strong>of</strong> state transformations in the present method.<br />
state transformations, to direct interpolation <strong>of</strong> the ROM<br />
bases ˆ Vβ. For this purpose, we consider a ROM based<br />
on M =5equidistant sampling points. Fig. 6 presents<br />
the error in S11 (22) as a function <strong>of</strong> radius variation p<br />
at f =25GHz: The necessity <strong>of</strong> proper state transformations<br />
is evident.<br />
The next test addresses the rate <strong>of</strong> convergence <strong>of</strong> the<br />
proposed method. We start from 3 equidistant interpolation<br />
points at refinement level r =1, and refine the grid<br />
recursively. Thus, the total number <strong>of</strong> points at refinement<br />
level r is<br />
|G| =2 r +1. (23)<br />
Our measure <strong>of</strong> error is ĒS11 , the average error in S11<br />
at f =25GHz,<br />
ĒS11<br />
= 1<br />
Ns<br />
Ns <br />
n=1<br />
| S11(pn) − S11(pn)|, (24)<br />
based on Ns = 257 equally spaced sampling points,<br />
pn ∈ [−1.5, 1.5] mm. Fig. 7 presents the magnitude <strong>of</strong><br />
the average error as a function <strong>of</strong> refinement level for<br />
direct ROM interpolation, a variant <strong>of</strong> the present method<br />
which uses piecewise linear geometry interpolation, and<br />
the suggested approach, employing global polynomial<br />
interpolation for the geometry. The results <strong>of</strong> Fig. 7<br />
show that the total error is dominated by the effects<br />
<strong>of</strong> geometry interpolation: The suggested method clearly<br />
outperforms competing approaches.<br />
B. Mitered Microstrip Bend<br />
Our second example, the mitered microstrip bend<br />
shown in Fig. 8, is a truly three-dimensional structure<br />
with more than one million FE unknowns. The<br />
model features two parameters, the operating frequency<br />
f ∈ [1, 10] GHz and a geometric parameter p ∈<br />
[−0.7, 0.7] mm which controls the width t <strong>of</strong> the mitered
|Average error in S 11 |<br />
10 0<br />
10 −1<br />
10 −2<br />
10 −3<br />
10 −4<br />
10 −5<br />
10 −6<br />
Proposed approach − global polynomial<br />
Proposed approach − piecewise linear<br />
ROM interpolation − piecewise linear<br />
0 1 2<br />
Refinement level<br />
3 4<br />
Fig. 7. Dielectric post: Magnitude <strong>of</strong> average error versus grid<br />
refinement level r at f = 25 GHz. The proposed method benefits<br />
from interpolating the FE matrices by polynomials <strong>of</strong> higher-order.<br />
μr,ɛr<br />
2.413<br />
t<br />
60<br />
60 0.794<br />
Fig. 8. Structure <strong>of</strong> a mitered microstrip bend. Dimensions are in mm.<br />
Material properties <strong>of</strong> substrate: ɛr =2.2, μr =1.<br />
bend. We have:<br />
t =1.7062 mm + p. (25)<br />
Again, the parametric ROM is based on M = 5<br />
interpolation points placed at the locations <strong>of</strong> the zeros<br />
<strong>of</strong> the fifth-order Chebyshev polynomial <strong>of</strong> the first kind.<br />
The expansion frequency for the single-parameter ROMs<br />
is set to f exp =5GHz.<br />
Fig. 9 shows the response surface <strong>of</strong> the magnitude <strong>of</strong><br />
the reflection coefficient S11, calculated by the proposed<br />
method. Fig. 10 presents |S11| and the corresponding<br />
error |ES11 | (22) with respect to conventional FE simulations<br />
versus frequency for the case p =0.2 mm. The<br />
fact that the error is always more than 25 dB below the<br />
signal level underlines the high quality <strong>of</strong> the ROM.<br />
Table II provides computational data for conventional<br />
FE simulation and the ROM. It can be seen that it<br />
takes more than 2 hours to build the parametric ROM.<br />
However, once the ROM is available, it can be evaluated<br />
more than 2200 times per second. For comparison, one<br />
- 17 - 15th IGTE Symposium 2012<br />
Fig. 9. Mitered microstrip bend: Response surface <strong>of</strong> the magnitude<br />
<strong>of</strong> the reflection coefficient |S11| as a function <strong>of</strong> operating frequency<br />
f and miter parameter p.<br />
|S 11 | (dB)<br />
−30<br />
−40<br />
−50<br />
−60<br />
−70<br />
−80<br />
−90<br />
−100<br />
Proposed approach<br />
Error <strong>of</strong> proposed approach<br />
2 4 6<br />
Frequency (GHz)<br />
8 10<br />
Fig. 10. Mitered microstrip bend: |S11| and error |ES11 | versus<br />
frequency. Parameter: p =0.2 mm.<br />
conventional FE solution takes 180 s, which is more than<br />
400 000 times longer!<br />
VI. CONCLUSIONS<br />
This paper has presented a PMOR methodology for<br />
FE models with geometric parameters. It is characteristic<br />
<strong>of</strong> the new approach that geometry approximation<br />
is separated from the actual ROM generation process.<br />
Moreover, the suggested method incorporates state transformations<br />
that improve the quality <strong>of</strong> the interpolated<br />
projection matrices. In consequence, the present PMOR<br />
method reaches higher rates <strong>of</strong> convergence than previous<br />
approaches. Since the resulting parametric models are <strong>of</strong><br />
small dimension, they are very fast to evaluate.<br />
TABLE II<br />
COMPUTATIONAL DATA FOR MICROSTRIP BEND.<br />
Model ROM FE<br />
Number <strong>of</strong> grid points 5 -<br />
Moment-matching order 20 -<br />
Dimension 42 1,175,382<br />
Model generation (s) ∗ 7513.2 -<br />
Evaluations per s ∗ 2267.9 5.56 · 10 −3<br />
|Avr. error in S11| at p =0.2 mm 1.06 · 10 −4 0<br />
∗ MATLAB code on Intel Xeon E5620, one thread used.
REFERENCES<br />
[1] R. D. Slone, R. Lee, and J. F. Lee, “Broadband Model Order<br />
Reduction <strong>of</strong> Polynomial Matrix Equation using Single-Point<br />
Well-Conditioned Asymptotic Waveform Evaluation: Dreivations<br />
and Theory,” Int. J. Numer. Meth. Eng., vol. 58, pp. 2325 – 2342,<br />
Dec. 2003.<br />
[2] Y. Zuh, A. C. Cangellaris, “Finite Element-Based Model Order<br />
Reduction <strong>of</strong> Electromagnetic Devices,” Int. J. Numer. Model.,<br />
vol. 15, pp. 73 – 92, 2002.<br />
[3] R. D. Slone, J.-F. Lee, R. Lee, “Automating Multipoint Galerkin<br />
AWE for a FEM Fast Frequency Sweep,” IEEE Trans. Magn.,<br />
vol. 38, no. 3, pp. 637 – 640, March 2002.<br />
[4] A. Schultschik, O. Farle, R. Dyczij-Edlinger, “An Adaptive Multi-<br />
Point Fast Frequency Sweep for Large-Scale Finite Element<br />
Models,” IEEE Trans. Magn., vol. 45, no. 3, pp. 1108 – 1111,<br />
March 2009 .<br />
[5] R. Dyczij-Edlinger and O. Farle, “Finite element analysis <strong>of</strong><br />
linear boundary value problems with geometrical parameters,”<br />
COMPEL, vol. 28, no. 4, pp. 779 – 794, 2009.<br />
[6] O. Farle, S. Burgard, and R. Dyczij-Edlinger “Passivity Preserving<br />
Parametric Model-Order Reduction for Non-affine Parameters,”<br />
Math. Comp. Model. Dyn. Sys., vol. 17, no. 3, pp. 279 – 294,<br />
2011.<br />
[7] J.R. Phillips, “Variational interconnect analysis via PMTBR,”<br />
ICCAD, pp. 872 – 879, 7-11 Nov. 2004.<br />
[8] B. Lohmann, R. Eid, “Efficient Order Reduction <strong>of</strong> Parametric and<br />
Nonlinear Models by Superposition <strong>of</strong> Locally Reduced Models,”<br />
Methoden und Anwendungen der Regelungstechnik, pp. 27 – 36,<br />
Aachen:Shaker-Verlag, 2009.<br />
[9] H. Panzer, J. Mohring, R. Eid, and B. Lohmann, “Parametric<br />
Model Order Reduction by Matrix Interpolation,” at - Automatisierungstechnik,<br />
vol. 58, no. 8, pp. 475 – 484, 2010.<br />
[10] O. Farle and R. Dyczij-Edlinger, “Numerically Stable Moment<br />
Matching for Linear Systems Parameterized by Polynomials in<br />
Multiple Variables with Applications to Finite Element Models <strong>of</strong><br />
Microwave Structures,” IEEE Trans. Antennas Propag., vol. 58,<br />
no. 11, pp. 3675 – 3684, Sep. 2010.<br />
[11] D. Amsallem, J. Cortial, K. Carlberg, C. Farhat, “A method<br />
for interpolating on manifolds structural dynamics reduced-order<br />
models,” Int. J. Numer. Meth. Eng., vol. 80, no. 9, pp. 1241 –<br />
1258, Nov. 2009.<br />
[12] S. Burgard, Morphing von Finite-Elemente-Netzen, Studienarbeit,<br />
Lehrstuhl für Theoretische Elektrotechnik, Universität des Saarlandes,<br />
2008. In German.<br />
[13] J. Pomplun and F. Schmidt, “Accelerated a Posteriori Error<br />
Estimation for the Reduced Basis Method with Application to<br />
3D Electromagnetic Scattering Problems,” SIAM J. Sci. Comput.,<br />
vol. 32, no. 2, pp. 498 – 520, 2010.<br />
[14] G. H. Golub and C. F. Van Loan, Matrix Computations, Baltimore:Johns<br />
Hopkins <strong>University</strong> Press, pp. 69 – 75, 1996.<br />
[15] J. Rubio, J. Arroyo, and J. Zapata, “SFELP - An Efficient Methodology<br />
for Microwave Circuit Analysis,” IEEE Trans. Microw.<br />
Theory Techn., vol. 49, no. 3, pp. 509 – 516, Mar. 2001.<br />
[16] J. P. Webb, “Finite Element Analysis <strong>of</strong> H-plane Rectangular<br />
Waveguide Problems,” Microw. Antennas Propag., IEE <strong>Proceedings</strong><br />
H, vol. 133, no. 2, pp. 91 – 94, April 1986.<br />
- 18 - 15th IGTE Symposium 2012
- 19 - 15th IGTE Symposium 2012<br />
Efficient Finite-Element Computation <strong>of</strong><br />
Far-Fields <strong>of</strong> Phased Arrays by Order Reduction<br />
A. Sommer∗ , O. Farle∗ , and R. Dyczij-Edlinger∗ ∗Chair for Electromagnetic Theory, Saarland <strong>University</strong>, D-66123 Saarbrücken, Germany<br />
E-mail: edlinger@lte.uni-saarland.de<br />
Abstract—This paper presents an efficient numerical method for computing the far-fields <strong>of</strong> phased antenna arrays over<br />
broad frequency bands as well as wide ranges <strong>of</strong> steering and look angles. The suggested approach combines finiteelement<br />
analysis, projection-based model-order reduction, and empirical interpolation. Numerical results demonstrate that<br />
evaluation times are reduced by orders <strong>of</strong> magnitude, compared to traditional methods.<br />
Index Terms—Empirical interpolation, far field computation, finite-element method, and model order reduction.<br />
I. INTRODUCTION<br />
In many areas <strong>of</strong> application, such as radar or wireless<br />
communications, phased antenna arrays need to be analyzed<br />
over broad frequency bands as well as wide ranges<br />
<strong>of</strong> steering and look angles. Finite-element (FE) based<br />
analysis <strong>of</strong> such structures involves two major steps:<br />
First, the near field is computed as a function <strong>of</strong> angular<br />
frequency ω and steering angles (θs,φs). Second, a<br />
discrete near-field-to-far-field (NF-FF) operator is applied<br />
to determine the far field as a function <strong>of</strong> frequency and<br />
look angles (θ,φ). Using conventional approaches, this<br />
procedure tends to be very time-consuming. The reasons<br />
are as follows: Since typical antenna arrays are electrically<br />
large and consist <strong>of</strong> high numbers <strong>of</strong> radiators, the<br />
corresponding FE systems are <strong>of</strong> very large dimension.<br />
Moreover, broadband analysis requires large numbers <strong>of</strong><br />
sampling frequencies. At each <strong>of</strong> them, the large-scale<br />
FE system has to solved, by matrix factorization or some<br />
iterative method. In addition, wide variations in steering<br />
angles imply large numbers <strong>of</strong> sampling angles (θs,φs).<br />
Each <strong>of</strong> them leads to separate excitation and right-hand<br />
side (RHS), respectively. Finally, wide variations in look<br />
angles result in large numbers <strong>of</strong> sampling points (θ,φ).<br />
Each <strong>of</strong> them requires a separate NF-FF transformation<br />
at each operating point (θs,φs; ω) <strong>of</strong> the antenna array.<br />
To reduce computational efforts, we propose a twostep<br />
approach: We first construct a reduced-order model<br />
(ROM) for the near fields which is very cheap to solve<br />
at any value <strong>of</strong> the parameter triple (θs,φs; ω). For<br />
this purpose, a multi-point model-order reduction (MOR)<br />
method with self-adaptive expansion point selection [1],<br />
[2] is applied. The second step utilizes the empirical<br />
interpolation (EI) method [3], [4] to construct an affine<br />
approximation to the NF-FF operator as a function <strong>of</strong><br />
frequency and look angles. It, too, is very fast to evaluate<br />
at any value <strong>of</strong> the parameter triple (θ,φ; ω). Combining<br />
both steps yields a highly efficient numerical model<br />
for computing the far-fields as a function <strong>of</strong> the five<br />
parameters (θ,φ; θs,φs; ω), which is ideally suited for<br />
fast online evaluation: In Section VI, computing a farfield<br />
pattern based on 4830 look angles takes only 2.4 s,<br />
on a personal computer executing plain MATLAB code.<br />
The construction <strong>of</strong> the model by MOR and EI is<br />
much more time-consuming and must be performed in<br />
advance, in an <strong>of</strong>fline step. The most expensive procedure<br />
in the MOR algorithm is the FE analysis <strong>of</strong> the array at a<br />
number <strong>of</strong> expansion points (θs,φs; ω), which are chosen<br />
adaptively. Note that, as long as a direct solver is used,<br />
changes in the three parameters are not equally expensive:<br />
Since ω affects the FE matrix, each frequency value<br />
requires a new matrix factorization, which is computationally<br />
expensive. On the other hand, the steering angles<br />
(θs,φs) enter the RHS only. Thus, changes in angle just<br />
require additional forward-back substitutions, which are<br />
much cheaper. Therefore, the adaptive point placement<br />
strategy <strong>of</strong> the MOR method ought to vary ω as rarely<br />
as possible. The new frequency-slicing greedy method<br />
presented in Section IV-B implements this strategy in<br />
a systematic fashion. For the highest-accuracy ROM <strong>of</strong><br />
Section VI, it improves computing time by a factor <strong>of</strong> 7<br />
at the cost <strong>of</strong> increasing ROM size by 5%, compared to<br />
state-<strong>of</strong>-the-art methods [1], [2].<br />
The numerical experiments <strong>of</strong> Section VI indicate that<br />
both MOR and EI feature exponential convergence, in<br />
accordance with theoretical results [5]. Our results for<br />
a real-world example [6] demonstrate that the suggested<br />
two-step approach for the broadband analysis <strong>of</strong> radiation<br />
patterns <strong>of</strong> phased arrays achieves high accuracy and reduces<br />
evaluation times by orders <strong>of</strong> magnitude, compared<br />
to conventional approaches.<br />
II. FAR-FIELD COMPUTATION<br />
By the vector Huygens principle in the frequency<br />
domain [7], the radiation vector F <strong>of</strong> an arbitrary antenna<br />
array, which is enclosed by a surface S, isgivenby<br />
<br />
ω j ˆr·r c F (ˆr,ω)=ˆr × e 0<br />
S<br />
′<br />
J s (r ′ ,ω)dS ′ × ˆr<br />
+ 1<br />
<br />
ω j c ˆr·r<br />
e 0<br />
η0 S<br />
′<br />
M s (r ′ ,ω)dS ′ × ˆr. (1)<br />
Here c0 and η0 denote the vacuum speed <strong>of</strong> light and<br />
characteristic impedance, respectively, and ˆr is the unit<br />
vector in the direction <strong>of</strong> the observer. The equivalent<br />
electric and magnetic surface current densities, J s and
M s, are given in terms <strong>of</strong> the electric and magnetic nearfields<br />
E and H by<br />
J s (r ′ ,ω)=ˆn × H (r ′ ,ω) with r ′ ∈ S, (2a)<br />
M s (r ′ ,ω)=−ˆn × E (r ′ ,ω) with r ′ ∈ S, (2b)<br />
wherein ˆn stands for the outward-pointing unit normal<br />
vector on the Huygens surface S. The far-fields EF and<br />
HF are obtained from the radiation vector (1) by<br />
ω<br />
c r<br />
e−j 0<br />
EF (r,ω)=−jμ0ω F (ˆr,ω) , (3a)<br />
4πr<br />
HF (r,ω)=−j ω<br />
ω −j c r<br />
e 0<br />
ˆr × F (ˆr,ω) (3b)<br />
c0 4πr<br />
and the directive gain [8] is given by<br />
2 μ0ω F (ˆr,ω)<br />
D (ˆr,ω)=<br />
8πc0<br />
2<br />
2 , (4)<br />
P (ω)<br />
wherein μ0 describes the vacuum permeability. In (4),<br />
the total radiated power P (ω) is determined from<br />
P (ω) = 1<br />
2 ℜ<br />
<br />
ˆn × E (r<br />
S<br />
′ ,ω) · H (r ′ ,ω)dS ′<br />
<br />
. (5)<br />
III. FE MODEL AND MULTI-POINT MOR METHOD<br />
FE analysis <strong>of</strong> the near fields <strong>of</strong> a phased array <strong>of</strong> L<br />
antennas leads to a linear system <strong>of</strong> the form<br />
(A0 + ωA1+ω 2 L<br />
A2) x (p) =ω up (p) bp, (6a)<br />
p=1<br />
y (p) = C0 + ω −1 <br />
C1 x (p) , (6b)<br />
P (p) =ω −1 x ∗ (p) Dx (p) . (6c)<br />
Herein, A0, A1, A2 ∈ CN×N are the stiffness, damping,<br />
and mass matrices, respectively, x denotes the solution<br />
vector in terms <strong>of</strong>E, p = (θs,φs,ω) ∈ R3 the parameter<br />
vector, and N the dimension <strong>of</strong> the FE system. The<br />
output vector y ∈ C6H holds the electric and the magnetic<br />
near-field values E and H, respectively, sampled<br />
at H points on the Huygens surface S. In (6b), the<br />
matrices C0 ∈ C6H×N and C1 ∈ C6H×N carry out<br />
the sampling process and magnetic field computation on<br />
S. Furthermore, the Hermitian matrix D ∈ CN×N <strong>of</strong> the<br />
bilinear form (6c) represents the computation <strong>of</strong> the total<br />
radiated power (5). It can be seen that the system matrix<br />
<strong>of</strong> (6a) depends on the angular frequency ω only, while<br />
the RHS also depends on the steering angles θs and φs.<br />
Note that the RHS is constructed by a superposition <strong>of</strong><br />
L linearly independent vectors bp ∈ CN with parameterdependent<br />
weights up (p).<br />
To obtain the near-fields vector y and the total radiated<br />
power P , the large-scale system (6a) has to be solved for<br />
each parameter vector p <strong>of</strong> interest. Our goal is to bypass<br />
this time-consuming procedure. Since the FE system (6)<br />
exhibits affine parameter dependence [1], it is well-suited<br />
for projection-based MOR. The idea is to approximate<br />
the FE solution x (p) in a low dimensional subspace<br />
according to<br />
x (p) ≈ ˆx (p) =V˜x (p) (7)<br />
- 20 - 15th IGTE Symposium 2012<br />
with ˆx ∈ CN , ˜x ∈ Cn , V ∈ CN×n , and n ≪ N for<br />
all p ∈ D. Here, D denotes the considered parameter<br />
domain. For numerical stability, the columns <strong>of</strong> the trial<br />
matrix V are chosen to be orthogonal. Substituting the<br />
approximation (7) for x (p) in (6a) and testing with V∗ leads to the ROM:<br />
2<br />
ω q L<br />
Ãq ˜x (p) =ω up (p) ˜ bp, (8a)<br />
q=0<br />
ˆy (p) =<br />
p=1<br />
1<br />
ω −r Cr˜x ˜ (p) , (8b)<br />
r=0<br />
ˆP (p) =ω −1˜x ∗ (p) ˜ D˜x (p) , (8c)<br />
wherein the reduced matrices and vectors are given by<br />
Ãq = V ∗ AqV with Ãq ∈ C n×n , (9)<br />
˜bp = V ∗ bp with bp<br />
˜ ∈ C n , (10)<br />
˜Cr = CrV with Cr<br />
˜ ∈ C 6H×n , (11)<br />
˜D = V ∗ DV with D˜ n×n<br />
∈ C . (12)<br />
Using a multi-point (MP) MOR method, the trial matrix<br />
V is constructed from FE solutions on a discrete set<br />
De ⊂D<strong>of</strong> expansion points pi ∈ De such that<br />
range V =span{x (p1) ,...,x (pn)} . (13)<br />
As long as n ≪ N, the ROM (8) can be solved much<br />
more efficiently than the original system (6). Thus, the<br />
computational costs for determining both the near-field<br />
values and the total radiated power can be kept very low.<br />
IV. SELF-ADAPTIVE EXPANSION POINT SELECTION<br />
The residual r <strong>of</strong> ˆx with respect to (6a) takes the form<br />
r (p) =<br />
2<br />
ω q L<br />
(AqV) ˜x (p) − ω up (p) bp. (14)<br />
q=0<br />
Thus, the computation <strong>of</strong> its 2-norm,<br />
r (p) 2<br />
2 =<br />
2 2<br />
p=1<br />
ω<br />
q1=0 q2=0<br />
q1+q2 ˜x (p) ∗ V ∗ A ∗ q1Aq2V ˜x (p)<br />
− 2ℜ<br />
+ ω 2<br />
L<br />
p=0 q=0<br />
L<br />
p1=0 p2=0<br />
2<br />
ω q+1 b ∗ pAqV <br />
˜x (p)<br />
L<br />
up1(p)up2(p) b ∗ p1bp2 <br />
, (15)<br />
just involves matrices and vectors <strong>of</strong> the ROM dimension<br />
n ≪ N and is therefore very fast. This motivates the use<br />
<strong>of</strong> a residual-based error indicator ρ(Dds) in the pointplacement<br />
strategy:<br />
ρ(Dds) = max r(p)2 . (16)<br />
p∈Dds<br />
Herein, Dds stands for a dense sampling <strong>of</strong> the considered<br />
domain.
Algorithm 1 Conventional Greedy Algorithm.<br />
Given: Dds, p1 ∈ Dds, and ɛ.<br />
n =0. {Initialize ROM dimension.}<br />
repeat<br />
n ← n +1.<br />
ωc = ω(pn).<br />
Compute LU factorization <strong>of</strong> A(ωc).<br />
Determine x (pn) by forward-back substitution.<br />
Construct ROM by (8a).<br />
Compute residual r(p) 2 for all p ∈ Dds.<br />
Place expansion point pn+1 using (17).<br />
until ρn(Dds)
Let P denote the number <strong>of</strong> far-field look angles for<br />
which (23) is to be evaluated. It can be seen that, although<br />
the solution ˜x (p) <strong>of</strong> the ROM (8) is used, the computational<br />
effort for merely one single operating-point p ∈ D<br />
is still <strong>of</strong> complexity O (PH + Hn), i.e., the far-field<br />
computation itself is expensive, too. Our solution to this<br />
problem is to adopt an idea from [2] and employ the<br />
EI method [3], [4] to construct an affine decomposition<br />
<strong>of</strong> the exponential function (22). The <strong>of</strong>fline part <strong>of</strong> this<br />
method uses a greedy strategy to determine a set <strong>of</strong> M<br />
basis functions {qm} M<br />
m=1 , interpolation points {r′ m} M<br />
m=1<br />
and parameter values {d ′ m} M<br />
m=1 such that the interpolant<br />
ê (d, r ′ ) defined by<br />
ê (d, r ′ M<br />
)= αm (d) qm (r ′ ) (24)<br />
m=1<br />
approximates (22) for all (r ′ , d) ∈ S ×M. Having<br />
constructed the interpolation matrix<br />
⎡<br />
⎤<br />
⎢<br />
BM = ⎣<br />
. ..<br />
⎥<br />
⎦ (25)<br />
q1 (r ′ 1)<br />
.<br />
q1 (r ′ M ) ... qM (r ′ M )<br />
<strong>of</strong>fline, the parameter-dependent coefficients<br />
{αm (d)} M<br />
m=1 are obtained online, by solving the<br />
lower triangular system<br />
⎡ ⎤ ⎡<br />
α1 (d) e (r<br />
⎢<br />
[BM ]<br />
. ⎥ ⎢<br />
⎣ . ⎦ = ⎣<br />
αM (d)<br />
′ ⎤<br />
1, d)<br />
. ⎥<br />
. ⎦ . (26)<br />
, d)<br />
e (r ′ M<br />
Substituting the empirical interpolant (24) for e (d, r ′ ) in<br />
(23) results in<br />
Ix (˜x (p) , ê (d, r ′ ) ,ω)=ω −1 M<br />
ΔS αm (d)<br />
H<br />
h=1<br />
m=1<br />
qm (r ′ h) ˆn (r ′ h) × ˜ C1 (r ′ h) ˜x (p) . (27)<br />
Under the precondition that the sampling points on the<br />
Huygens surface S remain constant for all M steps <strong>of</strong> the<br />
EI method, the online part <strong>of</strong> (24) can be implemented<br />
such that it takes only O (M) operations. Thus, the<br />
computational efforts for computing P far-field values<br />
by (27) for a given operating-point p ∈ D are only <strong>of</strong><br />
order O (PM + Mn). Since, in practice, M ≪ H, the<br />
costs <strong>of</strong> the far-field computation are greatly reduced.<br />
VI. NUMERICAL RESULTS<br />
In the following, we consider the FE model <strong>of</strong> a<br />
dual-polarized tapered slot antenna array (TSAA) [6]<br />
consisting <strong>of</strong> L = 40 antennas, whose geometry is<br />
depicted in Fig. 1. The frequency band is given by<br />
f ∈ [2, 4] GHz, and the scan angles <strong>of</strong> interest are in<br />
the range <strong>of</strong> (θs,φs) ∈ 0, π<br />
2<br />
3 × [0, 2π) rad .Weuse<br />
#Dds = 17040 training points in the <strong>of</strong>fline part <strong>of</strong><br />
the self-adaptive multi-point method <strong>of</strong> Section IV and<br />
construct the ROM (8) by both the conventional greedy<br />
algorithm and the new FSG approach <strong>of</strong> Section IV-B.<br />
- 22 - 15th IGTE Symposium 2012<br />
Fig. 1. Geometry <strong>of</strong> the TSAA [6]. Dimensions: length l =8cm,<br />
width w = 8 cm, height h = 7 cm, and displacement <strong>of</strong> adjacent<br />
antennas s =2cm.<br />
Maximum norm: local error indicator<br />
10 0<br />
10 −2<br />
10 −4<br />
10 −6<br />
10 −8<br />
Conventional greedy method<br />
FSG method<br />
50 100 150 200 250 300 350<br />
ROM dimension n<br />
Fig. 2. Normalized error indicator (16) versus ROM dimension n<br />
for the conventional and the new FSG method. Circles mark changes<br />
in expansion-point frequency in the FSG method, requiring matrix<br />
factorization.<br />
A. Properties <strong>of</strong> FSG method<br />
Fig. 2 presents the behavior <strong>of</strong> the normalized error<br />
indicator (16) as a function <strong>of</strong> ROM dimension n. It<br />
can be seen that the standard method achieves nearly<br />
constant rates <strong>of</strong> convergence, whereas the FSG approach<br />
converges rather slowly during the early stages <strong>of</strong> the<br />
iteration. This behavior is expected because, early on, the<br />
frequency sampling <strong>of</strong> the FSG method is very poor. On<br />
the other hand, the standard procedure must factorize the<br />
FE matrix at each iteration, whereas the FSG method requires<br />
factorizations only when the expansion frequency<br />
changes, i.e., at the iterations marked by circles in Fig. 2.<br />
Thus, to compare overall computational efficiency, we<br />
have measured computing times for the same threshold<br />
ɛ <strong>of</strong> the error indicator. Table I presents the results. It<br />
can be seen that, depending on the threshold level, the<br />
proposed FSG method is 5 to 7.5 times faster.
Relative error e n<br />
TABLE I<br />
TSAA: COMPUTING TIME FOR ROM CONSTRUCTION (8)<br />
Residual Time t ∗ Speed-up Dimension n<br />
threshold ɛ Alg. 1 Alg. 2 factor Alg. 1 Alg. 2<br />
2.7 e−3 92.16 h 18.34 h 5.025 153 285<br />
4.6 e−5 164.98 h 24.25 h 6.803 265 333<br />
1.3 e−6 221.32 h 29.46 h 7.513 346 363<br />
∗ MATLAB code on Intel(R) Xeon(R) E5620 CPU at 2.40 GHz.<br />
10 0<br />
10 −2<br />
10 −4<br />
10 −6<br />
10 −8<br />
Conventional greedy method<br />
FSG method<br />
50 100 150 200 250 300 350<br />
ROM dimension n<br />
Fig. 3. Relative error in near-fields (28) versus ROM dimension n<br />
for the conventional approach and the FSG method. Parameter: p =<br />
( π π<br />
rad, − rad, 3.645 GHz) /∈ Dds.<br />
4 6<br />
B. Error in near-fields<br />
Our measure for the error in the near-fields at a given<br />
parameter vector p is the relative error e(p) defined by<br />
en (p) = x (p) − ˆxn (p)2 .<br />
x (p)2 (28)<br />
To investigate the convergence behavior <strong>of</strong> the<br />
projection-based MOR method, we choose a representative<br />
parameter vector, p = π π<br />
4 rad, − 6 rad, 3.645 GHz /∈<br />
Dds, and evaluate (28) as a function <strong>of</strong> ROM dimension<br />
n. Fig. 3 shows that both the conventional approach<br />
and the FSG method exhibit exponential convergence.<br />
C. Error in far-fields<br />
The following tests are based on #Mds = 28380<br />
training points in the <strong>of</strong>fline part <strong>of</strong> the EI method. The<br />
considered look angles are in the range <strong>of</strong> (θ,φ) ∈<br />
π<br />
2<br />
0, × [0, 2π) rad .<br />
2<br />
We first investigate the error <strong>of</strong> the empirical interpolant<br />
êm (d, r ′ ) <strong>of</strong> (24) with respect to the true value<br />
<strong>of</strong> the exponential function e (d, r ′ ) <strong>of</strong> (22). For this<br />
purpose, we choose a representative parameter vector,<br />
d =(3.789 GHz, − π π<br />
4 rad, 5 rad) /∈ Mds and monitor the<br />
relative error em(d),<br />
<br />
<br />
<br />
<br />
em (d) = max <br />
<br />
<br />
, (29)<br />
r ′ ∈Sh<br />
e (d, r ′ ) − êm (d, r ′ )<br />
e (d, r ′ )<br />
as a function <strong>of</strong> the number <strong>of</strong> EI coefficients m. The<br />
results shown in Fig. 4 demonstrate that the EI method<br />
leads to exponential convergence.<br />
- 23 - 15th IGTE Symposium 2012<br />
Relative error e m<br />
10 2<br />
10 0<br />
10 −2<br />
10 −4<br />
10 −6<br />
10<br />
0 200 400 600 800<br />
−8<br />
Number <strong>of</strong> coefficients m<br />
Fig. 4. Relative error in exponential function (29) versus number <strong>of</strong><br />
EI coefficients m. Parameter: d = 3.789 GHz, − π<br />
4<br />
rad, π<br />
5 rad .<br />
TABLE II<br />
AVERAGE ERROR IN DIRECTIVE GAIN.<br />
Steering angles<br />
(θs,φs)<br />
Average error eD (30)<br />
2.57 GHz 3.30 GHz 3.95 GHz<br />
( π π<br />
rad, 4 2 rad) 1.2 × 10−3 1.7 × 10−3 2.6 × 10−3 ( π π<br />
rad, 4 4 rad) 1.1 × 10−3 1.8 × 10−3 2.8 × 10−3 ( π<br />
6 rad, 0 rad) 1.3 × 10−3 1.9 × 10−3 3.3 × 10−3 ( π π<br />
rad, − 3 3 rad) 2.2 × 10−3 2.0 × 10−3 3.1 × 10−3 In our final test, we consider the error in radiation<br />
pattern <strong>of</strong> the phased antenna array, by measuring the<br />
average error in directive gain eD,<br />
eD (p) = 1<br />
<br />
P <br />
D<br />
(p, dp) −<br />
<br />
P <br />
p=1<br />
ˆ <br />
D (p, dp)<br />
<br />
<br />
. (30)<br />
D (p, dp) <br />
Table II presents error values for 12 different parameter<br />
vectors p /∈ Dds, corresponding to the far-field plots in<br />
Fig. 5 – Fig. 7. It can be seen that the results <strong>of</strong> the<br />
suggested MOR approach are in very good agreement<br />
with reference data. Computational parameters for this<br />
test can be found in Table III and Table IV.<br />
D. Overall runtime performance<br />
Fig. 5 – Fig. 7 show three-dimensional radiation patterns<br />
<strong>of</strong> the TSAA for different operating frequencies<br />
and four steering angles per frequency. Computational<br />
data <strong>of</strong> the original FE model and the ROM are given in<br />
Table III and Table IV, respectively. Without doubt, the<br />
<strong>of</strong>fline part <strong>of</strong> the algorithm leads to some one-time costs<br />
for constructing the ROM and the affine approximation<br />
to the NF-FF operator. However, once they are available,<br />
computing time for the near-fields improves by a factor <strong>of</strong><br />
68000, compared to conventional FE analysis. Moreover,<br />
post-processing time for one radiation pattern based on<br />
P =4, 830 look angles reduces by a factor <strong>of</strong> 12. Thus,<br />
the total speed-up factor for computing one near-field<br />
solution plus the corresponding far-field pattern is 910.<br />
VII. CONCLUSIONS<br />
An efficient two-step MOR method for computing<br />
the far-field patterns <strong>of</strong> phased antenna arrays has been
(a) θs = π<br />
4<br />
rad, φs = π<br />
2<br />
π<br />
π<br />
rad. (b) θs = rad, φs = 4 4 rad.<br />
(c) θs = π<br />
π<br />
π<br />
rad, φs =0rad. (d) θs = rad, φs = − 6 3 3 rad.<br />
Fig. 5. Radiation patterns <strong>of</strong> the TSAA at f =2.57 GHz, determined<br />
by the two-step MOR method using P =4, 830 look angles.<br />
(a) θs = π<br />
4<br />
rad, φs = π<br />
2<br />
π<br />
π<br />
rad. (b) θs = rad, φs = 4 4 rad.<br />
(c) θs = π<br />
π<br />
π<br />
rad, φs =0rad. (d) θs = rad, φs = − 6 3 3 rad.<br />
Fig. 6. Radiation patterns <strong>of</strong> the TSAA at f =3.30 GHz, determined<br />
by the two-step MOR method using P =4, 830 look angles.<br />
presented. Thanks to the new FSG technique, evaluation<br />
times for a real-world example [6] improve by a factor<br />
<strong>of</strong> 5 to 7.5 over earlier MOR approaches, and by a factor<br />
<strong>of</strong> 910 compared to conventional FE analysis.<br />
REFERENCES<br />
[1] V. de la Rubia, U. Razafison, and Y. Maday, ”Reliable fast<br />
frequency sweep for microwave devices via the reduced-basis<br />
method”, IEEE Trans. Microw. Theory Techn., vol. 57, pp. 2923-<br />
2937, Dec. 2009.<br />
[2] M. Fares, J. S. Hesthaven, Y. Maday, and B. Stamm, ”The reduced<br />
basis method for the electric field integral equation”, J. Comput.<br />
Phys., vol. 230, pp. 5532-5555, 2011.<br />
- 24 - 15th IGTE Symposium 2012<br />
(a) θs = π<br />
4 rad, φs = π<br />
2 rad. (b) θs = π<br />
4 rad, φs = π<br />
4 rad.<br />
(c) θs = π<br />
π<br />
π<br />
rad, φs =0rad. (d) θs = rad, φs = − 6 3 3 rad.<br />
Fig. 7. Radiation patterns <strong>of</strong> the TSAA at f =3.95 GHz, determined<br />
by the two-step MOR method using P =4, 830 look angles.<br />
TABLE III<br />
COMPUTATIONAL DATA OF ORIGINAL FE MODEL OF TSAA.<br />
Parameters: θs = π<br />
π<br />
rad, φs = rad, f = 2.57 GHz.<br />
4 2<br />
FE dimension N 2, 553, 439<br />
Number <strong>of</strong> near-field points H 12, 800<br />
Number <strong>of</strong> look angles P 4, 830<br />
Time for solving FE system (6a) 2192.4 s∗ Time for computing radiation pattern 28.9 s∗ ∗ MATLAB code on Intel(R) Xeon(R) E5620 CPU at 2.40 GHz.<br />
TABLE IV<br />
COMPUTATIONAL DATA OF REDUCED-ORDER MODEL OF TSAA.<br />
Parameters: θs = π<br />
π<br />
rad, φs = rad, f = 2.57 GHz.<br />
4 2<br />
ROM dimension n 300<br />
Number <strong>of</strong> EI coefficients m 350<br />
Number <strong>of</strong> look angles P 4, 830<br />
Offline time for generating ROM (8) 20.36 h∗ Offline time for EI method 33.97 h∗ Online time for solving ROM (8a) 0.0321 s∗ Online time for radiation pattern 2.4087 s∗ ∗ MATLAB code on Intel(R) Xeon(R) E5620 CPU at 2.40 GHz.<br />
[3] M. Barrault, Y. Maday, N. C. Nguyen, and A. T. Patera, ”An<br />
’empirical interpolation’ method: application to efficient reducedbasis<br />
discretisation <strong>of</strong> partial differential equations”, C. R. Acad.<br />
Sci. Paris, Ser. I 339, pp. 667-672, 2004.<br />
[4] M. A. Grepl, Y. Maday, N. C. Nguyen, A. T. Patera, ”Efficient reduced<br />
basis treatment <strong>of</strong> nonaffine and nonlinear partial differential<br />
equations”, M2AN Math. Model. Numer. Anal. 41, pp. 575605,<br />
2007.<br />
[5] P. Binev, A. Cohen, W. Dahmen, R. DeVore, G. Petrova, and P.<br />
Wojtaszczyk, ”Convergence rates for greedy algorithms in reduced<br />
basis methods”, SIAM J. Math. Anal., vol. 43, pp. 1457-1472,<br />
2011.<br />
[6] T.-H. Chio, and D. H. Schaubert, ”Parameter study and design <strong>of</strong><br />
wide-band widescan dual-polarized tapered slot antenna arrays”,<br />
IEEE Trans. Antennas Propag., vol. 48, pp. 879-886, June 2000.<br />
[7] E. J. Rothwell and M. J. Cloud, ”Electromagnetics”, CRC Press,<br />
2009<br />
[8] S. J. Orfanidis, ”Electromagnetic Waves and Antennas”,<br />
http://www.ece.rutgers.edu/ orfanidi/ewa.
- 25 - 15th IGTE Symposium 2012<br />
Nanoparticle device for biomedical and<br />
optoelectronics applications<br />
R. Iovine, L. La Spada and L. Vegni<br />
Department <strong>of</strong> Applied Electronics, <strong>University</strong> <strong>of</strong> Roma Tre, Via della Vasca Navale 84, 00146 Rome, Italy<br />
E-mail: riovine@uniroma3.it<br />
Abstract—In this contribution a nanoparticle device, operating in the visible regime based on the Localized Surface Plasmon<br />
Resonance (LSPR) phenomenon, is presented. The nanoparticle electromagnetic properties are evaluated by a new analytical<br />
model and compared to the results obtained by numerical analysis. A near-field enhancement is obtained by arranging the<br />
nanoparticles in a linear array. Analytical formulas, describing such enhancement, are presented. The structure can find<br />
application for medical diagnostics and optoelectronics applications.<br />
Index Terms— LSPR, Medical diagnostics, Nanoparticle, Near-field Enhancement, Optoelectronics Applications<br />
I. INTRODUCTION<br />
In the last few years, several researches have paid<br />
attention to gold nanoparticles optical properties relate to<br />
the interaction <strong>of</strong> these structures with electromagnetic<br />
field at Visible (VIS) and Near Infrared Region (NIR)<br />
[1].<br />
When the electromagnetic field interacts with small metal<br />
particles the conduction electrons start oscillating<br />
collectively. This phenomenon is now well known and<br />
called Localized Surface Plasmon (LSP) [2]. If the<br />
frequency <strong>of</strong> the incident field matches the natural<br />
frequency oscillation <strong>of</strong> the electrons cloud the resonance<br />
condition is established with a strong dependence on the<br />
shape, size, composition <strong>of</strong> the nanoparticles as well as<br />
on the dielectric properties <strong>of</strong> the background<br />
environment [3].<br />
The analytical closed form electromagnetic solution to<br />
evaluate the electromagnetic behavior <strong>of</strong> metal<br />
nanoparticles exists only for the spherical shapes [4]. The<br />
possibility to predict the electromagnetic properties <strong>of</strong><br />
different kind <strong>of</strong> shapes is now very important due to the<br />
fact that the progress in nan<strong>of</strong>abrication technology<br />
allows to realize many shapes <strong>of</strong> particles [5] suitable for<br />
several application field such as biomedical sensing [6]<br />
and thin film solar cells [7].<br />
For example in [8] the possibility to control the<br />
enhancement <strong>of</strong> the Surface Enhanced Raman Scattering<br />
(SERS) using gold nanoparticles in the field <strong>of</strong> diagnostic<br />
oncology is reported. In [9] the possibility to use gold<br />
nanoparticles to produce in an efficient way heat energy<br />
from absorbed light energy that may be employed for<br />
selective PhotoThermal Therapy (PTT) is referred.<br />
The aim <strong>of</strong> this contribution is to propose the design <strong>of</strong> a<br />
nanostructure device consisting in a gold linear chain<br />
array <strong>of</strong> nanocubes, deposited on a silica substrate.<br />
For the cube particle a new analytical quasi static model<br />
describing its resonant behavior in terms <strong>of</strong> absorption<br />
and scattering cross section is presented. The results<br />
obtained by the analytical model are compared to the<br />
other ones performed through proper full-wave<br />
simulations [10] and by using the boundary integral<br />
method approach [11].<br />
The electromagnetic behavior <strong>of</strong> the device is evaluated<br />
for different inter-particle distance. In particular, the far<br />
field properties and the near electric field distribution are<br />
numerically obtained and the performances <strong>of</strong> the<br />
structure are analyzed for possible optoelectronics<br />
applications (design <strong>of</strong> absorbing layers) and for<br />
biosensing applications (refractive index measurements).<br />
II. QUASI STATIC ANALYTICAL MODEL FOR THE<br />
CUBE PARTICLE<br />
In general the nanoparticles have a size smaller compared<br />
to wavelength (e.g. at optical frequencies) so, it is<br />
possible to assume that all the conduction electrons in a<br />
nanoparticle see the same field at a given time (quasi –<br />
static approximation).<br />
Figure 1: Scheme <strong>of</strong> interaction between electromagnetic field and<br />
small particles compared to wavelength.<br />
The displacement <strong>of</strong> the electrons by incident<br />
electromagnetic field induces a dipolar charge separation<br />
(positive nuclei – free electrons) generating a restoring<br />
force which conflicts with incident field. The electron<br />
position is determined by the following equation:<br />
<br />
<br />
(1)<br />
<br />
where is the electron mass, is the electron damping<br />
coefficient and is the restoring coefficient.<br />
The relation (1) is a second order inhomogeneous<br />
differential equation with the following solution for<br />
harmonic excitation:<br />
<br />
<br />
(2)<br />
where <br />
is the natural frequency <strong>of</strong> the system.
This model is equivalent to a classical mechanical<br />
oscillator and represents a good physical interpretation to<br />
understand the Localized Surface Plasmon Resonance<br />
(LSPR) phenomenon. The resonance condition is<br />
established for and the denominator <strong>of</strong> (2) tends<br />
to zero and the coefficient and are very difficult to<br />
evaluate and are implicitly related to<br />
geometry/electromagnetic properties <strong>of</strong> the particles and<br />
permittivity value <strong>of</strong> the dielectric environment.<br />
However, exploiting the limit <strong>of</strong> electrically small<br />
particles it is possible to evaluate the resonant behavior <strong>of</strong><br />
the cube nanoparticle in accurate way. In order to study<br />
such electromagnetic properties, in terms <strong>of</strong> scattering<br />
and absorption cross-section, the following assumptions<br />
will be done:<br />
the particle is homogeneous and the surrounding<br />
material is a homogeneous, isotropic and nonabsorbing<br />
medium.<br />
The impinging plane wave has the electric field E<br />
parallel and the propagation vector k perpendicular<br />
to the nanoparticle principal axis, as depicted in<br />
Figure 2.<br />
Figure 2: Geometrical sketch <strong>of</strong> the gold nanocube particle.<br />
Under such conditions, we can relate the macroscopic<br />
nanoparticle properties to the polarizability <strong>of</strong> the<br />
nanoparticle.<br />
It is well known that [12], in case <strong>of</strong> an arbitrary shaped<br />
particle, its polarizability can be expressed as:<br />
(3)<br />
where is the volume <strong>of</strong> the particle, the surrounding<br />
dielectric environment permittivity, the inclusion<br />
dielectric permittivity and is the depolarization factor.<br />
The nanoparticle polarizability strongly depends on the<br />
inclusion geometry, its metallic electromagnetic<br />
properties , and the permittivity <strong>of</strong> the surrounding<br />
dielectric environment . In particular, the factor <strong>of</strong><br />
a nanoparticle plays a critical role in the polarizability<br />
resonant behaviour for the LSPR strength.<br />
Starting from [12], it is possible to develop new<br />
analytical closed-form formulas for the scattering and<br />
absorption cross-section <strong>of</strong> the aforementioned particles.<br />
The general corresponding expressions read, respectively:<br />
<br />
<br />
(4)<br />
- 26 - 15th IGTE Symposium 2012<br />
where is the wavenumber, is the<br />
wavelength and is the refractive index <strong>of</strong> the<br />
surrounding dielectric environment. Im stands for<br />
"Imaginary part".<br />
By considering the electric field polarization <strong>of</strong> the<br />
impinging plane wave, the absorption cross-section reads<br />
[13]:<br />
<br />
<br />
<br />
<br />
where is:<br />
<br />
<br />
<br />
<br />
<br />
III. BOUNDARY ELEMENT METHOD APPROACH<br />
Under quasi - static approximation the electric field can<br />
be expressed through the scalar potential as:<br />
(5)<br />
(6)<br />
(7)<br />
For homogeneous isotropic frequency-dispersive media<br />
can be determined easily from the Laplace equation:<br />
<br />
(8)<br />
In fact, by assuming an impulsive source the solution <strong>of</strong><br />
(8) is well known through the Green function <br />
as:<br />
<br />
<br />
<br />
<br />
(9)<br />
where and are the position vector and source vector,<br />
respectively.<br />
However, if we have an inhomogeneous medium such as<br />
a nanoparticle embedded in a dielectric environment<br />
(Figure 3) the solutions (9) are also valid but need to be<br />
satisfied by appropriate boundary conditions.<br />
Figure 3: Gold nanocube particle embedded in a dielectric environment.
In [11] it is possible to evaluate the scalar potential for<br />
the inhomogeneous medium:<br />
<br />
<br />
<br />
(10)<br />
by adding an artificial charge distribution at the boundary<br />
<strong>of</strong> discontinuity, determined from the continuity <strong>of</strong><br />
the tangential electric field and normal component <strong>of</strong> the<br />
dielectric displacement [14].<br />
The expression (10) can be converted from boundary<br />
integrals to bounday elements. Following the procedure<br />
reported in [15] it is possible to discretize the particle<br />
boundary into small surface by assuming that surface<br />
charges are located at the center <strong>of</strong> the surface element.<br />
In this way, it is possible to obtain numerically for a<br />
given external excitation the surface charge density <br />
and, consequently, the near electric field distribution and<br />
the far field properties in terms <strong>of</strong> absorption, scattering<br />
and extinction cross sections.<br />
IV. RESULTS FOR THE SINGLE PARTICLE<br />
The electromagnetic properties for the cube particle are<br />
evaluated using the quasi static analytical model,<br />
boundary element method (BEM) approach [15] and are<br />
compared to the results obtained with full-wave<br />
numerical simulations [10].<br />
We have assumed that the structure is excited by an<br />
impinging plane wave as shown in Figure 2. In addition:<br />
for the cube particle, experimental values [16] <strong>of</strong> the<br />
complex permittivity function have been inserted;<br />
the surrounding dielectric medium is vacuum.<br />
Far field properties in terms <strong>of</strong> absorption and scattering<br />
cross - section are shown in Figure 4 and Figure 5.<br />
Figure 4: Absorption and scattering cross section spectra obtained with<br />
the analytical model (l=50 nm).<br />
- 27 - 15th IGTE Symposium 2012<br />
Figure 5: Absorption and scattering cross section spectra obtained<br />
through full-wave simulations (l=50nm).<br />
There is a good agreement among the results obtained<br />
with the analytical model (Figure 4) and full-wave<br />
simulations (Figure 5).<br />
Full-wave simulations are also compared with the<br />
numerical results obtained with the BEM as shown in<br />
Figure 6.<br />
Figure 6: Comparison between extinction spectra obtained with BEM<br />
and full-wave simulations (l=50nm).<br />
The difference among the results shown in Figure 6 could<br />
be associated to the different discretization <strong>of</strong> the edge <strong>of</strong><br />
the particle with these two approaches.<br />
Near electric field distribution is obtained through fullwave<br />
simulation as depicted in Figure 7.<br />
Figure 7: Near electric field distribution for a single nanocube particle<br />
(l=50nm). The incident electric field amplitude is 1 V/m.<br />
In Figure 7 is clearly shown the dipolar charge repartition<br />
according to the quasi-static approach.<br />
V. LSPR DEVICE<br />
To enhance the mechanism <strong>of</strong> the LSPR it is possible the<br />
use <strong>of</strong> inter-coupling among nanoparticles. Such effect
originates from the charge induction among two or more<br />
nanoparticles which interact stronger as they get closer to<br />
each other [17].<br />
To use this enhancement mechanism we propose a<br />
structure consisting in a linear chain <strong>of</strong> gold nanocubes<br />
deposited on a silica substrate, excited by a plane wave as<br />
depicted in Figure 8.<br />
Figure 8: Linear chain <strong>of</strong> gold nanocubes on silica substrate with<br />
a=500nm, b=100nm, l=50 nm, l/8
Figure 11: Absorption and scattering cross section spectra obtained with<br />
the full-wave simulations (d=l= 50nm).<br />
VII. BIOSENSING APPLICATION OF THE DEVICE<br />
By using very small inter-particle distance among the<br />
nanoparticles it is possible to obtain high scattering and<br />
low absorption efficiencies (Figure 12, TABLE I). These<br />
properties are very important for biosensing applications.<br />
In fact high absorption efficiency could heat the<br />
biological sample invalidating medical diagnosis.<br />
Figure 12: Absorption and scattering cross section spectra obtained with<br />
the full-wave simulations (d=l/8= 6.25nm).<br />
For biosensing application we suppose that the device<br />
(grey) is in direct contact with the biological sample<br />
under test (green) as depicted in Figure 13. The sensor<br />
behavior is related to the effective refractive index<br />
variation <strong>of</strong> the overall system "LSPR device - biological<br />
compound".<br />
Once the biological compound is placed on the device,<br />
the system "sensor-biological compound" is illuminated<br />
by an optical electromagnetic field (Figure 13). The<br />
detected signal has a new frequency position and its<br />
magnitude and amplitude width are both dependent on<br />
the different characteristics <strong>of</strong> the biological compound.<br />
- 29 - 15th IGTE Symposium 2012<br />
Figure 13: The sensing system operation scheme<br />
The biological sample used to test this device is an insilico<br />
replica with values or Refractive Index (RI) taken<br />
from the literature. In particular the RI values <strong>of</strong> rat<br />
mammary adipose and tumor tissue have been considered<br />
[18]. These data (TABLE II) were acquired using an<br />
interferometric imaging system (Optical Coherence<br />
Tomography - OCT technique).<br />
TABLE II<br />
Tissue type Refractive<br />
index<br />
(mean value)<br />
Tumor 1.39<br />
Adipose 1.467<br />
The data show that a difference exists between the RI <strong>of</strong> a<br />
adipose tissue and that <strong>of</strong> tumor tissue.<br />
The electromagnetic sensor response is evaluated in terms<br />
<strong>of</strong> extinction cross-section through full-wave simulations<br />
[10] as depicted in Figure 14.<br />
Figure 14: Extinction spectra for rat mammary cancer (RI=1.39) and<br />
adipose tissue (RI=1.467).<br />
As shown in Figure 14 the resonant peak shifts from 634<br />
nm for a tumor tissue to 650 nm for a regular adipose<br />
tissue. Sensitivity is evaluated as S=Δλ/Δn expressed in<br />
nm/RIU (Refractive Index Unit). In this case sensitivity<br />
reached 207nm/RIU.
Near electric field distribution obtained for this sensing<br />
platform (Figure 15) is less concentrated compared to the<br />
other one obtained for d=l (Figure 10).<br />
Figure 15: Near electric field distribution for d=l/8= 6.25 nm. The<br />
incident electric field amplitude is 1 V/m.<br />
This result is in accord to the prevailing scattering<br />
phenomenon (TABLE I).<br />
VIII. CONCLUSION<br />
In this paper a nanostructure device operating in the<br />
visible regime was proposed. The device consisting in a<br />
gold linear chain array <strong>of</strong> nanocubes, deposited on a silica<br />
substrate. In this way a near-field enhancement is<br />
obtained and analytical formulas to describe this<br />
phenomenon are presented.<br />
For the single nanoparticle good agreement among<br />
analytical results and numerical solutions was achieved.<br />
Exploiting electromagnetic properties <strong>of</strong> the device it was<br />
shown that the proposed structure could be successfully<br />
used as a biomedical sensor or as an optoelectronic<br />
device.<br />
[1]<br />
REFERENCES<br />
A. Moores and F. Goettmann, "The plasmon band in noble metal<br />
nanoparticles: an introduction to theory and applications," New<br />
Journal <strong>of</strong> Chemistry, vol. 30, pp. 1121-1132, 2006.<br />
[2] E. Hutter and J.H. Fendler, "Exploitation <strong>of</strong> Localized Surface<br />
Plasmon Resonance," Advanced Materials, vol. 16, pp. 1685-<br />
1706, 2004.<br />
[3] L.J. Sherry, S.-H. Chang, G.C. Schatz and R.P. Van Duyne,<br />
"Localized Surface Plasmon Resonance Spectroscopy <strong>of</strong> Single<br />
Silver Nanocubes," Nano Lett., vol. 5, pp. 2034–2038, 2005.<br />
[4] G. Mie, "Contributions to the optics <strong>of</strong> turbid media, particularly<br />
<strong>of</strong> colloidal metal solutions," Ann. Phys., vol. 25, pp. 377-445,<br />
1908.<br />
[5] M. Tréguer-Delapierre, J. Majimel, S. Mornet, E. Duguet and S.<br />
Ravaine, "Synthesis <strong>of</strong> non-spherical gold nanoparticles," Gold<br />
Bulletin, vol. 41, pp. 195-207, 2008.<br />
[6] W. Cai, T. Gao, H. Hong and J. Sun, “Application <strong>of</strong> gold<br />
nanoparticles in cancer nanotechnology,” Nanotechnology,<br />
[7]<br />
Science and Application, vol. 1, pp. 17-32, 2008.<br />
K.R. Catchpole and A. Polman, “Plasmonic solar cells,” Optics<br />
Express, vol. 16, pp. 21793-21800, 2008.<br />
[8] D.S. Grubisha, R.J. Lipert, H.-Y. Park, J. Driskell and M.D.<br />
Porter, "Femtomolar Detection <strong>of</strong> Prostate-Specific Antigen: An<br />
Immunoassay Based on Surface - Enhanced Raman Scattering and<br />
Immunogold Labels," Anal. Chem., vol. 75, pp. 5936-5943, 2003.<br />
[9] S. Kessentini, D. Barchiesi, T. Grosges and M. Lamy de la<br />
Chapelle, "Selective and Collaborative Optimization Methods for<br />
Plasmonics: A Comparison," PIERS Online, vol. 7, pp. 291-295,<br />
2011.<br />
[10] CST Computer Simulation <strong>Technology</strong>, www.cst.com<br />
- 30 - 15th IGTE Symposium 2012<br />
[11] U. Hohenester and J. Krenn, "Surface plasmon resonances <strong>of</strong><br />
single and coupled metallic nanoparticles: A boundary integral<br />
method approach," Phys. Rev. B, vol. 72, pp.195429, 2005.<br />
[12] A. Sihvola, "Electromagnetic Mixing Formulas and<br />
Applications," The Instution <strong>of</strong> Engineering and <strong>Technology</strong> -<br />
London, 2008.<br />
[13] L. La Spada, R. Iovine and L. Vegni, "Nanoparticle<br />
Electromagnetic Properties for Sensing Applications," Advances<br />
in Nanoparticles, vol. 1, pp. 9-14, 2012.<br />
[14] F.J. Garcìa de Abajo, "Retarded field calculation <strong>of</strong> electron<br />
energy loss in inhomogeneous dielectrics," Physical Review B,<br />
vol. 65, pp. 115418.1-115418.17, 2002.<br />
[15] U. Hohenester and A. Trugler, "MNPBEM- A Matlab toolbox for<br />
the simulation <strong>of</strong> plasmonic nanoparticles," Computer Physics<br />
Communications, vol. 183, pp. 370-381, 2012.<br />
[16] P.B. Johnson and R.W. Christy, “Optical Constants <strong>of</strong> the Noble<br />
Metals,” Phys. Rev. B, vol. 6, pp.4370-4379, 1972.<br />
[17] T. Chung, S.-Y. Lee, E.Y. Song, H. Chun and B. Lee, "Plasmonic<br />
Nanostructures for Nano-Scale Bio - Sensing," Sensors, vol. 11,<br />
pp. 10907-10929, 2011.<br />
[18] A.M. Zisk, E.J. Chaney and S.A. Boppart, "Refractive index <strong>of</strong><br />
carciogen-induced rat mammary tumours," Phys. Med. Biol., vol.<br />
51, pp. 2165-2177, 2006.
- 31 - 15th IGTE Symposium 2012<br />
Validation <strong>of</strong> measurements with conjugate heat<br />
transfer models<br />
M. Schrittwieser 1, 2 , O. Bíró 1, 2 , E. Farnleitner 3 , and G. Kastner 3<br />
1 Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />
2 Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical Machines,<br />
Innfeldgasse 18 A-8010 <strong>Graz</strong>, Austria<br />
3 Andritz Hydro GmbH, Dr. Karl- Widdmann- Strasse 5, A-8160 Weiz, Austria<br />
E-mail: schrittwieser@tugraz.at<br />
Abstract— The paper presents a comparison <strong>of</strong> thermal measurements on three stator duct models <strong>of</strong> an electrical machine.<br />
These models differ from each other by the slot section components. The measurements show the advantages and<br />
disadvantages <strong>of</strong> different variations. In order to study the measurement results in detail, a comparison with Computational<br />
Fluid Dynamics (CFD) was conducted, where it was useful to apply the Conjugate Heat Transfer (CHT) method, because it<br />
takes the convection and conduction into account. Therefore the conditions for the numerical heat transfer model can be<br />
determined more realistically, especially for the temperature rise in the solid domains caused by losses.<br />
Index Terms— Fluid Flow, Measurement, Stators, Thermal Analysis<br />
I. INTRODUCTION<br />
Hydro generators located in water power plants<br />
produce electric power in the range <strong>of</strong> more than 10<br />
MVA. The arising losses lead to a temperature rise in the<br />
electrical machine. The temperature rise is caused by<br />
copper, hysteresis, eddy current and mechanical losses<br />
during the generator operating. The heat has to be<br />
discharged to ensure the operating characteristics and this<br />
is the purpose <strong>of</strong> the cooling scheme. For designing the<br />
cooling <strong>of</strong> a generator, thermal and air flow networks are<br />
mostly used. Therefore the used parameters have to be<br />
established theoretically, by measurements or by CFD.<br />
The temperature rise has to be handled by solving the<br />
energy equation with the focus on heat convection and<br />
heat conduction. The convective heat transfer coefficient<br />
(HTC) is one <strong>of</strong> the most important parameters <strong>of</strong> these<br />
networks and must be known accurately. Examples <strong>of</strong><br />
networks are presented in [1] and [2].<br />
In the last years several investigations have been<br />
carried out on the topic <strong>of</strong> heat transfer, especially for low<br />
power electrical machines. Two different methods have<br />
emerged to get information about the HTC. One uses<br />
thermal resistances, defined with the aid <strong>of</strong> temperatures<br />
gained by measurements [3], [4] and [5]. The other<br />
employs CFD calculations combined with measurements<br />
[6], [7] and [8].<br />
The convective HTC has been calculated for large<br />
numerical models with CFD at different parts in [9] and<br />
[10] where the numerical effort is very high due to the<br />
large number <strong>of</strong> nodes in the model. A special set-up <strong>of</strong><br />
boundary conditions has been tried to reduce the section<br />
to be analyzed for comparable results with special<br />
attention payed to the rotor stator interaction. Only the<br />
fluid material properties are significant in these CFD<br />
simulations and the temperatures have been defined at the<br />
walls as a boundary condition from measurements. The<br />
refinement <strong>of</strong> the mesh near the wall for calculating an<br />
exact heat transfer is very important. An indicator <strong>of</strong> the<br />
mesh density is the dimensionless wall distance y + which<br />
should be about y 1<br />
+ ≤ [11]. The primary reason for this<br />
is that the HTC is a function <strong>of</strong> the dimensionless wall<br />
distance [12].<br />
The heat transfer caused by conduction has been<br />
considered in several papers by the finite element method<br />
(FEM) [13], [14] and [15]. The advantage <strong>of</strong> CFD over<br />
FEM is the consideration <strong>of</strong> the actual wall heat transfer<br />
coefficient. The disadvantage <strong>of</strong> the CFD is that the<br />
losses cannot be considered, while FEM is capable <strong>of</strong><br />
this. Therefore the copper and iron losses have to be<br />
implemented differently in CFD e.g. using the conjugate<br />
heat transfer (CHT) method. The sources can be defined<br />
in the solid domains and the material properties play an<br />
important role for the CHT solution.<br />
This paper presents a mutual validation <strong>of</strong> calorimetric<br />
measurements and a numerical calculation. The CHT<br />
method (fluid and solid heat transfer) is applied to a stator<br />
duct model. The losses have been defined as sources in<br />
the solid domains. The main objective is to evaluate the<br />
slot geometries with different winding assemblies. All<br />
three models have been measured at 5 flow rate points to<br />
pinpoint their thermal characteristics.<br />
II. MEASUREMENT<br />
A simplified model <strong>of</strong> a stator section has been under<br />
experimental investigation at the ANDRITZ Hydro. The<br />
main objective <strong>of</strong> the measurements has been to find and<br />
compare the thermal characteristics <strong>of</strong> different winding<br />
assemblies.<br />
Air has been used as cooling fluid for the experimental<br />
set-up.<br />
A. Investigated model<br />
The laboratory model and the cooling scheme are<br />
shown in detail in Fig. 1. The cooling fluid streams from<br />
the Inlet through the measuring nozzle (a) to the<br />
temperature probe (b) and from there through rectangular<br />
channels <strong>of</strong> wood (c) into the stator duct model (d). After<br />
heat exchange the warm air streams through wood ducts<br />
to an outlet channel, which contains resistance thermo<br />
elements (f). The outer surface <strong>of</strong> the model has been
insulated (e) for reduction <strong>of</strong> secondary heat flux.<br />
Fig. 1: Calorimetric measurement and experimental set-up <strong>of</strong> the stator<br />
laboratory model; (a) measuring nozzle, (b) Pt-100 temperature probe,<br />
(c) wood channels; (d) stator duct model; (e) insulation <strong>of</strong> the model and<br />
(f) resistance thermo elements<br />
B. Measuring physical parameters<br />
The measuring nozzle defines the volume flow rate Vin<br />
immediately in front <strong>of</strong> the model inlet.<br />
The fluid temperature T and density has been<br />
measured at the inlet and outlet <strong>of</strong> the stator duct model.<br />
These calorimetric measurement data allow calculating<br />
the heat flux after reaching steady state. The energy<br />
exchange occurs in the stator duct model. Therefore, it is<br />
important to calculate also the solid temperature and fluid<br />
temperature in the ducts. Fig. 2 shows the positions <strong>of</strong> the<br />
temperature probes in the iron domain.<br />
Fig. 2: Position <strong>of</strong> measurement probes in the iron; (a) 1 st stator core, (b)<br />
2 nd stator core and (c) heating rod<br />
The stator model consists <strong>of</strong> a section including 5 slots<br />
in circumferential direction and 3 ventilation ducts with<br />
distance bars between the laminated iron sheets in axial<br />
direction. The temperature has been measured in two<br />
stator cores. Therefore, fifteen Pt-100 resistance<br />
thermometers with 20 mm probe length have been<br />
positioned at each stator core.<br />
The heat sources have been simulated with heating<br />
rods positioned in the winding bars made <strong>of</strong> solid copper.<br />
The source has been induced with heating rods positioned<br />
in a hole in the middle <strong>of</strong> the copper bars, see Fig. 3. The<br />
- 32 - 15th IGTE Symposium 2012<br />
length <strong>of</strong> the rod has been 100 mm, with a diameter <strong>of</strong> 6<br />
mm and a constant heat output. The upper and lower bars<br />
have been heated up to reach steady state. The heat output<br />
has been constant during the whole experiment.<br />
Fig. 3: Position <strong>of</strong> temperature measurement devices for the cooling<br />
fluid; solid temperature positions for measuring (a) copper temperature<br />
and (b) spacer temperature; position <strong>of</strong> (c) the heating rod<br />
Thereupon the temperatures have been measured in<br />
each copper bar with two NiCrNi thermocouples 60 mm<br />
in length. The temperature in the spacer has been<br />
measured by a Pt-100.<br />
C. Results <strong>of</strong> measurements<br />
Table I shows the measurement data obtained for the 5<br />
different operating points for each model under<br />
investigation. The temperature differences have been<br />
normalized by the fluid inlet temperature.<br />
Model A<br />
Model B<br />
Model C<br />
TABLE I<br />
CALORIMETRIC MEASUREMENT RESULTS<br />
Vin in Tout − TinTcopper<br />
−Tin<br />
m Tin<br />
Tin<br />
3 /s kg/m 3<br />
T − T<br />
iron in<br />
Tin<br />
0.080 1.130 0.12 1.39 0.27<br />
0.060 1.129 0.16 1.60 0.36<br />
0.040 1.138 0.26 1.99 0.55<br />
0.025 1.134 0.41 2.35 0.81<br />
0.015 1.133 0.68 2.96 1.30<br />
0.079 1.155 0.15 1.80 0.48<br />
0.061 1.154 0.21 2.04 0.60<br />
0.041 1.152 0.31 2.40 0.83<br />
0.025 1.154 0.51 2.94 1.22<br />
0.015 1.152 0.81 3.55 1.80<br />
0.078 1.155 0.15 1.80 0.51<br />
0.060 1.149 0.20 2.03 0.64<br />
0.041 1.147 0.31 2.34 0.85<br />
0.025 1.149 0.51 2.84 1.23<br />
0.015 1.151 0.81 3.39 1.77<br />
III. MODEL GEOMETRIES<br />
The measurement set-up has been implemented in<br />
ANSYS CFX [11].<br />
The whole numerical model is shown in Fig. 4. The<br />
cooling scheme is the same as during the measurements<br />
i.e. the wood channels have also been modeled. Adiabatic<br />
walls have been defined at the top and the bottom <strong>of</strong> the
numerical model in z-direction.<br />
In addition to this simulation, a pperiodic<br />
boundary<br />
condition has been defined at the surfaaces<br />
normal to the<br />
x-direction. The goal <strong>of</strong> this is to reduce<br />
the section to be<br />
analyzed (less number <strong>of</strong> nodes togeether<br />
with smaller<br />
elements).<br />
Fig. 4: CHT model<br />
Fig. 5 visualizes the numerical statoor<br />
model in detail.<br />
For the calculation, one slot section hass<br />
been investigated<br />
due to the inlet condition, which is the same for each slot<br />
section as in the measurement. The wwinding<br />
assemblies<br />
are nearly the same, i.e. copper bars (d) and (e) with<br />
insulation (f), spacer (g) between the wwinding<br />
bars and,<br />
for positioning in radial direction, the sllot<br />
wedge (h). The<br />
iron (b) and (c) is located at the top aand<br />
bottom <strong>of</strong> the<br />
fluid (a).<br />
The difference in the models is thee<br />
contact between<br />
insulation and the iron.<br />
Fig. 5: Numerical stator model consists <strong>of</strong> the (a) fluid in the stator duct,<br />
(b) iron teeth, (c) iron yoke, (d) top copper bar, (e) bottom copper bar,<br />
(f) insulation, (g) spacer between bars and (h) slott<br />
wedge<br />
The following models differ from eacch<br />
other in the type<br />
<strong>of</strong> the winding assembly. There are diffferent<br />
options for<br />
mounting the winding, which will be eexplained<br />
in detail<br />
for each model.<br />
A. Model A<br />
This is a model with an air gap (white) between<br />
insulation and iron teeth, see Fig. 6. TThis<br />
air gap has a<br />
constant length. The cooling fluid caan<br />
stream in axial<br />
direction from one duct to another due tto<br />
the air gap.<br />
Fig. 6: Model A with air gap (white)<br />
- 33 - 15th IGTE Symposium 2012<br />
B. Model B<br />
A ripple spring (white dashhed)<br />
is positioned on one<br />
side between the iron teeth annd<br />
the insulation instead <strong>of</strong><br />
the air gap, as shown in Fig. 7. This ripple spring has had<br />
a corrugation in diagonal direcction.<br />
This corrugation has<br />
been smoothed along the surfaace.<br />
The implementation <strong>of</strong><br />
this has been done with a thermmal<br />
resistance at the contact<br />
interface. This causes an asymmmetric<br />
energy transport and<br />
the fluxes are higher at the sidee<br />
without a ripple spring.<br />
Fig. 7: Model B with ripple spring (whhite<br />
dashed)<br />
C. Model C<br />
This model is similar to moodel<br />
A, with the difference<br />
that epoxy resin (white dotted) ) is present. This is shown<br />
in Fig. 8. In this case the air caan<br />
stream in axial direction<br />
through the air gap (white), likee<br />
in model A.<br />
Fig. 8: Model C with epoxy resin (whitte<br />
dotted) and air gap (white)<br />
IV. NUMERICAAL<br />
METHOD<br />
The material properties havee<br />
a significant influence on<br />
the numerical solution. In CFDD,<br />
the fluid properties play<br />
the most important role and tthe<br />
solid domains are not<br />
taken into account. Only the coonvection<br />
has an influence<br />
in such calculations and the connduction<br />
is not considered.<br />
The conjugate heat transferr<br />
method differs from the<br />
conventional CFD simulation iin<br />
the consideration <strong>of</strong> the<br />
heat conduction in the energyy<br />
equation. Therefore, the<br />
thermal conductivity has to bbe<br />
known and defined for<br />
each medium in the CFD code [16].<br />
A. Turbulence Model<br />
Computational Fluid Dynammics<br />
uses the Finite Volume<br />
Method for solving the transporrt<br />
equations:<br />
∂ ρ ∂ρ<br />
u j<br />
+<br />
∂t ∂x<br />
j<br />
= 0<br />
(1)<br />
∂ρu ∂ρuu<br />
i i j ∂ p ∂ρτ<br />
ij<br />
+ = + + ρ fi<br />
∂t ∂xj∂ x xi ∂x<br />
j<br />
(2)<br />
∂ρet ∂ρue i t<br />
+<br />
∂t ∂x ∂uip<br />
∂ ∂uiτij ∂qj<br />
=− + + ρuf<br />
i i + + Q(3)<br />
∂x<br />
∂x ∂x<br />
j j<br />
j j<br />
These equations can be solveed<br />
for laminar flows. If the<br />
velocity and all other parametters<br />
vary in a random and<br />
chaotic way, the regime is calleed<br />
turbulent [17]. For most<br />
problems, it is unnecessary to resolve the detailed<br />
turbulent fluctuations and it is sufficient to calculate the<br />
time averaged properties <strong>of</strong> the flow. Therefore, the
Reynolds Averaged Navier Stokes (RANS) equations<br />
have been used. The Reynolds Stress Tensor is another<br />
unknown variable and further equations must be defined<br />
for the solution to calculate the unknown parameters [18].<br />
In the present case the Shear Stress Transport (SST)<br />
turbulence model [19] has been used. The advantage <strong>of</strong><br />
the SST turbulence model is that it combines the<br />
advantages <strong>of</strong> the k- turbulence model in the free stream<br />
and the advantage <strong>of</strong> the k- turbulence model near the<br />
wall [20], [21].<br />
B. Model Configuration and Boundary Conditions<br />
The mass flow rate and the inlet temperature have been<br />
defined as measured, see Table I. The pressure at the<br />
outlet has been defined as ambient pressure.<br />
The heat output from the heating element has been<br />
defined on a length <strong>of</strong> 100 mm in the middle <strong>of</strong> the<br />
copper bars with a constant value gained from the<br />
measurement.<br />
C. Fluid Properties<br />
The specific heat capacity cp, the dynamic viscosity <br />
and the thermal conductivity have been defined as the<br />
following constant values in Table II.<br />
TABLE II<br />
AIR IDEAL GAS MATERIAL PARAMETERS<br />
cp <br />
J/kgK Pa s W/mK<br />
1004.4 1.831·10 -5 2.61·10 -5<br />
For an ideal gas, the density is calculated with the ideal<br />
gas equation [16]:<br />
n⋅p ρ =<br />
R ⋅T<br />
0<br />
abs<br />
- 34 - 15th IGTE Symposium 2012<br />
(4)<br />
dh = cp dT<br />
(5)<br />
Here, n is the molecular weight, pabs is the absolute<br />
pressure, T is the temperature, R0 is the universal gas<br />
constant and is the density.<br />
These material properties from Table 2 have been<br />
adapted to implement the temperature dependence <strong>of</strong> the<br />
streaming fluid. In this case the specific heat capacity cp<br />
is expressed by the zero pressure polynomial [11]<br />
cp<br />
2 3 4<br />
= a1+ a2T + aT 3 + a4T + aT 5 (6)<br />
R<br />
S<br />
with the temperature T in Kelvin and the gas constant for<br />
air Rs = 287.058 J/kgK and the following coefficients:<br />
a1 = 3.57 , a2 = -4.3·10 -4 K -1 ,<br />
a3 = -4.2·10 -8 K -2 , a4 = 3.1·10 -9 K -3 ,<br />
a5 = -2.4·10 -12 K -4 .<br />
The values for the viscosity are approximated by the<br />
Sutherlands formula<br />
nμ<br />
μ T0+ Sμ T <br />
= ,<br />
(7)<br />
μ0<br />
T + SμT0 <br />
similarly to the conductivity <br />
λ<br />
λ<br />
T + S T <br />
+ <br />
0 λ<br />
= <br />
0 T SλT0 In these formulas, S and S stand for the Sutherland<br />
constant and n and n for the appropriate exponents. For<br />
the reference viscosity and reference conductivity the<br />
following values has been chosen from a material<br />
property table [22] at the reference temperature T0=325 K<br />
which is close to the mean operating temperature <strong>of</strong> the<br />
cooling fluid.<br />
S=77.80 K , 0=1.97·10 -5 Pa s , n=1.57 ,<br />
S=60.71 K , 0=2.82·10 -3 W/mK , n=1.66 .<br />
The material properties are accurately approximated in<br />
the temperature range from about 260 K to 670 K with<br />
this approach. It is not recommended to use the same<br />
parameters outside this range <strong>of</strong> temperature [22].<br />
D. Solid Properties<br />
The CHT method solves the following transport<br />
equation in the solid domains:<br />
nλ<br />
∂ρh ∂ρu h ∂ ∂T<br />
+ = λ+ S<br />
∂t ∂x ∂x <br />
∂x<br />
<br />
t s t<br />
j E<br />
<br />
j j j<br />
The important parameter in this equation is the thermal<br />
conductivity , which has a great influence on the results<br />
<strong>of</strong> the heat conduction and have to be known exactly.<br />
These parameters have been defined as isotropic for the<br />
copper, insulation, spacer, slot wedge, ripple spring and<br />
epoxy resin and as anisotropic for the iron.<br />
V. COMPARING NUMERICAL RESULTS WITH<br />
MEASUREMENTS<br />
The following figures show a comparison <strong>of</strong><br />
measurement data (dashed line) and CHT solution data<br />
(solid line) for the three different parameters. The<br />
temperature values have been normalized in the following<br />
figures.<br />
A. Copper temperature<br />
The temperature difference in Fig. 9 is calculated with<br />
an average value <strong>of</strong> the copper temperature in the top and<br />
bottom bar and the fluid temperature at the inlet. The<br />
deviation is due to the copper temperature because the<br />
inlet temperature has been defined from the<br />
measurements for the CFD calculation and is exactly the<br />
same like in the measurement.<br />
An average deviation has been calculated with 1.98 %<br />
for model A, 0.95 % K for model B and 1.06 % for model<br />
C. The diagram shows that the differences between the<br />
models become smaller with a higher flow rate for the<br />
measurements contrary to the calculation. The difference<br />
<strong>of</strong> the results at the highest flow rate is calculated for<br />
model A with 3.54 %, for model B with 1.34 % and for<br />
model C with 1.57 %.<br />
.<br />
.<br />
(8)<br />
(9)
Normalized temperature difference<br />
1,1<br />
1,0<br />
0,9<br />
0,8<br />
0,7<br />
0,6<br />
Measurement A CHT A<br />
Measurement B CHT B<br />
Measurement C CHT C<br />
0,5<br />
0,01 0,02 0,03 0,04 0,05 0,06 0,07 0,08 0,09<br />
Volume flow rate in m 3 /s<br />
Fig. 9: Normalized temperature difference between copper temperature<br />
and fluid inlet temperature for the three stator duct models<br />
The distribution <strong>of</strong> the temperature is plotted in Fig. 10<br />
for stator duct model A and B. The surface is defined at<br />
the middle <strong>of</strong> the first stator core (see Fig. 2) and the<br />
temperature is shown at the whole solid domains.<br />
Fig. 10: Temperature distribution through the middle <strong>of</strong> stator core 1<br />
with all parts; (a) model A, (b) model B and (c) model C<br />
The highest copper temperatures are found in model A<br />
(a). The asymmetric temperature in model B (b) is also<br />
- 35 - 15th IGTE Symposium 2012<br />
recognizable in the iron; the temperature in the iron is<br />
higher on the opposite side <strong>of</strong> the ripple spring (bottom<br />
side) caused by the higher heat flux. The epoxy resin in<br />
model C (c) contributes a lower temperature in the<br />
insulation than along the air gap (see detailed in Fig. 10<br />
c). This will have a positive effect on the properties <strong>of</strong> the<br />
insulation during the aging.<br />
B. Iron temperature<br />
The iron temperature has been calculated as an average<br />
value <strong>of</strong> all measuring points (Fig. 2). The difference to<br />
the fluid inlet temperature has been plotted as before. The<br />
average deviation is 13.20 % for model A, 6.04 % for<br />
model B and 3.52 % for model C. It is worth noting that<br />
model A has the highest deviation for the iron<br />
temperature, see Fig. 11. The deviation for each winding<br />
assembly decreases with a higher volume flow rate.<br />
Normalized temperature difference<br />
1,2<br />
1,1<br />
1,0<br />
0,9<br />
0,8<br />
0,7<br />
0,6<br />
0,5<br />
0,4<br />
0,3<br />
0,2<br />
0,1<br />
Measurement A CHT A<br />
Measurement B CHT B<br />
Measurement C CHT C<br />
0,0<br />
0,01 0,02 0,03 0,04 0,05 0,06 0,07 0,08 0,09<br />
Volume flow rate in m<br />
Fig. 11: Temperature difference between iron temperature and fluid<br />
inlet temperature for the three stator duct models<br />
3 /s<br />
Normalized temperature difference<br />
C. Fluid temperatures<br />
The fluid temperature rise is shown in Fig. 12.<br />
1,2<br />
1,1<br />
1,0<br />
0,9<br />
0,8<br />
0,7<br />
0,6<br />
0,5<br />
0,4<br />
0,3<br />
0,2<br />
0,1<br />
Measurement A CHT A<br />
Measurement B CHT B<br />
Measurement C CHT C<br />
0,0<br />
0,01 0,02 0,03 0,04 0,05 0,06 0,07 0,08 0,09<br />
Volume flow rate in m<br />
Fig. 12: Difference between fluid inlet and outlet temperature<br />
3 /s
The temperature difference has been calculated with the<br />
inlet and outlet temperature <strong>of</strong> the air. The difference<br />
decreases with the volume flow rate. The average value<br />
<strong>of</strong> the difference is 0.97 % for model A, 1.09 % for model<br />
B and 0.99 % for model C. The highest deviation at the<br />
lowest flow rate is about 2.84 % for model A, 3.47 % for<br />
model B and 3.10 % for model C.<br />
VI. CONCLUSION<br />
The paper has described the conjugate heat transfer<br />
method for a stator model example. The advantage <strong>of</strong><br />
using CHT is that the heat transfer coefficient is<br />
inherently solved and needs not be defined as constant at<br />
the surfaces.<br />
The comparison <strong>of</strong> the numerical solution shows a<br />
good agreement with measurements for each stator duct<br />
model. The average deviation <strong>of</strong> the temperature<br />
difference between copper and fluid inlet temperature has<br />
been less than about 1.4 % for all models. The<br />
temperature difference has been calculated between the<br />
iron and fluid inlet temperature. Therefore, the average<br />
deviation is under 8 %. The heating up <strong>of</strong> the air has been<br />
calculated with a difference less than 1.5 % and at the<br />
lowest operating point the difference reaches the maximal<br />
deviation <strong>of</strong> 3.1 % and the slightest deviation with the<br />
highest flow rate <strong>of</strong> about 0.2 %. This can be explained<br />
by the fact that steady state is reached faster for lower<br />
flow rates than for higher ones. Based on these results,<br />
the conclusion can be drawn that the best agreement is<br />
obtained for model C and the worst for model A. These<br />
investigations provide a determination <strong>of</strong> proper model<br />
conditions in the slot region, which can be used for<br />
further CHT researches.<br />
VII. ACKNOWLEDGMENT<br />
This work has been supported by the Christian Doppler<br />
Laboratory for Multiphysical Simulation, Analysis and<br />
Design <strong>of</strong> Electrical Machines (MuSicEl) and ANDRITZ<br />
Hydro GmbH.<br />
REFERENCES<br />
[1] E. Farnleitner and G. Kastner, “Contemporary methods <strong>of</strong><br />
ventilation design for pumped storage generators,“ e&I, vol. 127,<br />
no. 1-2, pp. 24-29, 2010, DOI: 10.1007/s00502-010-0711-8.<br />
[2] G. Traxler-Samek, R. Zickermann and A. Schwery, “Cooling<br />
airflow, losses, and temperatures in large air-cooled synchronous<br />
machines,“ IEEE Transactions on Industrial Electronics, vol. 57,<br />
no. 1, pp. 172-180, Jan. 2010.<br />
[3] C. Kral, T. G. Habetler, R. G. Harley, F. Pirker, G. Pasoli, H.<br />
Oberguggenberger and C. J. M. Fenz, “Rotor temperature<br />
estimation <strong>of</strong> squirrel-cage induction motors by means <strong>of</strong> a<br />
combined scheme <strong>of</strong> parameter estimation and a thermal<br />
equivalent model,“ IEEE Transactions on Industry Applications,<br />
vol. 40, no. 4, July-Aug. 2004.<br />
[4] D. Staton, A. Boglietti, and A. Cavagnio, ”Solving the motor<br />
difficult aspects <strong>of</strong> electric motor thermal analysis in small and<br />
medium size industrial induction motors,” IEEE Transactions on<br />
Energy Conversion, vol. 20, no. 3, Sept. 2005, DOI:<br />
10.1109/TEC.2005.847979.<br />
[5] A. Boglietti and A. Cavagnino, “Analysis <strong>of</strong> the endwinding<br />
cooling effects in TEFC induction motors,” IEEE Transactions in<br />
Industry Applications, vol. 43, no. 5, pp. 1214-1222, 2007, DOI:<br />
10.1109/TIA.2007.904399.<br />
[6] B.D.J. Maynes, R.J. Kee, C.E. Tindall and R.G. Kenny,<br />
“Simulation <strong>of</strong> airflow and heat transfer in small alternators using<br />
- 36 - 15th IGTE Symposium 2012<br />
CFD,” IEE <strong>Proceedings</strong>- Electric Power Applications, vol. 150,<br />
no. 2, pp. 146-152, 2003, DOI: 10.1049/ip-epa:20020754.<br />
[7] C. Kral, A. Haumer, M. Haigis, H. Lang and H. Kapeller,<br />
“Comparison <strong>of</strong> a CFD analysis and a thermal equivalent circuit<br />
model <strong>of</strong> a TEFC induction machine with measurements,“ IEEE<br />
Transactions on Energy Conversion, vol. 24, no. 4, pp. 809-818,<br />
Dec. 2009, DOI: 10.1109/TEC.2009.2025428.<br />
[8] M. Hettegger, B. Streibl, O. Biro and H. Neudorfer,<br />
“Measurements and simulations <strong>of</strong> the convective heat transfer<br />
coefficients on the end windings <strong>of</strong> an electrical machine,“ IEEE<br />
Transactions on Industrial Electronics, vol. 59, no. 5, pp. 2299-<br />
2308, May 2012, DOI: 10.1109/TIE.2011.2161656.<br />
[9] M. Schrittwieser, A. Marn, E. Farnleitner and G. Kastner,<br />
“Numerical analysis <strong>of</strong> heat transfer and flow <strong>of</strong> stator duct<br />
models,” XX th International Conference on Electrical Machines,<br />
Sept. 2012.<br />
[10] S. Klomberg, E. Farnleitner, G. Kastner and O. Bìrò, “Heat<br />
transfer analysis on end windings <strong>of</strong> a hydro generator using a<br />
stator-slot-section model,” 15 th IGTE Symposium, Sept. 2012<br />
[11] ANSYS Inc., “ANSYS CFX- Solver Modeling Guide”, Release<br />
13.0, ANSYS Inc.<br />
[12] W. Vieser, T. Esch and F. Menter, “Heat transfer predictions<br />
using advanced two equation turbulence models,“ CFX Technical<br />
Memorandum, 2002.<br />
[13] C.C. Hwang, S. Wu and Y. Jiang, “Novel approach to the solution<br />
<strong>of</strong> temperature distribution in the stator <strong>of</strong> an induction motor,”<br />
IEEE Transactions on Energy Conversion, vol. 15, no. 4, pp. 401-<br />
406, Dec. 2000, DOI: 10.1109/60.900500.<br />
[14] S. Mezani, N. Takorabet and B. Laporte, “A combined<br />
electromagnetic and thermal analysis <strong>of</strong> induction motors,” IEEE<br />
Transactions on Magnetics, vol. 41, no. 5, pp. 1572-1575, May<br />
2005, DOI: 10.1109/TMAG.2005.845044.<br />
[15] L. Weili, C. Guan and P. Zheng, “Calculation <strong>of</strong> a Complex 3-D<br />
Model <strong>of</strong> a turbogenerator with end region regarding electrical<br />
losses, cooling, and heating,” IEEE Transactions on Energy<br />
Conversion, vol. 26, no. 4, pp. 1073-1080, Dec. 2011, DOI:<br />
10.1109/TEC.2011.2161610.<br />
[16] ANSYS Inc., “ANSYS CFX- Solver Theory Guide,” Release<br />
13.0, ANSYS Inc.<br />
[17] F. Kreith and M. S. Bohn, Principles <strong>of</strong> Heat Transfer, 6 th ed.<br />
Southbank, Australia: Thomson Learn., 2001.<br />
[18] H. K. Versteeg and W. Malalasekera, “An Introduction to<br />
Computational Fluid Dynamics – The finite volume method”,<br />
[19] F. R. Menter, “Two- equation eddy- viscosity turbulence models<br />
for Engineering Applications,” AIAA Journal, vol. 32, pp. 1598-<br />
1605, Aug. 1994.<br />
[20] D. C. Wilcox, Turbulence Modeling for CFD: Solutions Manual.<br />
2 nd Edition, La Canada, CA: DCW Industries Inc., 1994.<br />
[21] B. Launder and D. Spalding, Mathematical models <strong>of</strong> turbulence.<br />
London, U.K.: Academic Press, 1972.<br />
[22] VDI Heat Atlas, 10 th ed. Berlin, Germany: Springer- Verlag, 2006.
- 37 - 15th IGTE Symposium 2012<br />
Computing the shielding effectiveness <strong>of</strong> waveguides using FE-mesh<br />
truncation by surface operator implementation<br />
C. Tuerk∗ , W. Renhart † , and C. Magele †<br />
∗Armament and Defence <strong>Technology</strong> Agency, Ministry <strong>of</strong> Defence and Sports, Rossauer Laende 1, A-1090 Vienna<br />
† Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong><br />
E-mail: christian.tuerk@bmlvs.gv.at<br />
Abstract—A plane wave incident perpendicular to one open end <strong>of</strong> a conductive tube, as part <strong>of</strong> a honeycomb-structure, is<br />
attenuated on its way through it. In order to calculate its total attenuation for various frequencies the FE-method will be<br />
used. This requires a reflectionless truncation <strong>of</strong> the FE-mesh for which a Surface Operator Boundary Condition (SOBC)<br />
will be employed. In order to show the accuracy and applicability <strong>of</strong> the FEM with SOBC, the results will be compared<br />
to entirely analytical solutions as well as to easy-to-use engineering formulae.<br />
Index Terms—Finite Element Method (FEM), Shielding, Surface Operator Boundary Condition (SOBC), Waveguide<br />
I. INTRODUCTION<br />
Previous works i.e. [1] have shown the implementation<br />
<strong>of</strong> a surface operator boundary condition derived from<br />
an analytical model into the FE-mesh. Honeycombs can<br />
be considered waveguides-beyond-cut<strong>of</strong>f (WBC) and are<br />
therefore employed as vents for large shielded enclosures,<br />
like shielded rooms, while maintaining a certain degree<br />
<strong>of</strong> attenuation <strong>of</strong> a plane wave incident on it.<br />
The resulting attenuation imposed by a single conductive<br />
tube will be calculated under different ratios <strong>of</strong><br />
length-to-diameter <strong>of</strong> the tube and at selected frequencies.<br />
Existing literature like [2] provide engineering rules for<br />
designing waveguides-beyond-cut<strong>of</strong>f (WBC) as a shielding<br />
component whereas others like [3] present analytical<br />
details on the physics <strong>of</strong> the transmission <strong>of</strong> electromagnetic<br />
power in waveguides <strong>of</strong> various cross-sections. The<br />
results, obtained through numerical computation in the<br />
frequency range <strong>of</strong> 3GHz to 18GHz for a practical<br />
design <strong>of</strong> a real life waveguide are then compared to<br />
both approaches and subsequently discussed.<br />
II. MODELLING<br />
A. Surface Operator Boundary Conditions (SOBC)<br />
Fig. 1 shows the setup used for the computation <strong>of</strong> the<br />
plane wave field strength incident on the tube and exiting<br />
it. The right-hand-side boundary is modelled by means<br />
<strong>of</strong> SOBCs (Γtr) matching the impedance <strong>of</strong> free space<br />
between the tube and the termination <strong>of</strong> the problem area.<br />
A plane wave travelling along the x-axis will experience<br />
a certain degree <strong>of</strong> attenuation by the waveguide as long<br />
as the waveguide-beyond-cut<strong>of</strong>f (WBC) condition is met.<br />
A fraction <strong>of</strong> the initial power <strong>of</strong> the wave penetrates the<br />
waveguide and is terminated reflectionless at Γtr.<br />
Based on results obtained through [5] and [4] the implementation<br />
<strong>of</strong> this surface operator boundary condition<br />
for the truncation surface Γtr <strong>of</strong> the FE-mesh can be<br />
directly derived from the Maxwell equations<br />
∇× E = −jωμ H, ∇× H = jωɛ E. (1)<br />
After splitting the field vectors E and H as well as the<br />
∇-operator into their normal and orthogonal tangential<br />
Fig. 1. Modelling the Waveguide<br />
components as in 2<br />
E = Et + nEn, H = Ht + nHn, ∇ = ∇t + ∂<br />
n (2)<br />
∂n<br />
the Maxwell Equations can be reformulated as follows:<br />
n Hn = − 1<br />
jωμ ∇t × Et<br />
(3)<br />
n En = 1<br />
jωɛ ∇t × Ht. (4)<br />
With these relations the normal components <strong>of</strong> the field<br />
components En and Hn can be eliminated in equation 1.<br />
A couple <strong>of</strong> mathematical operations finally yield<br />
∂(n × Et)<br />
= −jωμ<br />
∂n<br />
Ht − 1<br />
jωɛ∇t × (∇t × Ht) (5)<br />
∂(n × Ht)<br />
= jωɛ<br />
∂n<br />
Et + 1<br />
jωμ∇t × (∇t × Et). (6)<br />
These equations are commonly valid, consequently on the<br />
truncation surface (see fig. 1) too. On Γtr the situation<br />
is as shown in fig. 2 in a local coordinate system.<br />
The propagation <strong>of</strong> the wave can be represented by the<br />
wave vector k. Due to the knowledge <strong>of</strong> the angle <strong>of</strong><br />
incidence on Γtr it can be decomposed into its normal<br />
and tangential components as given in the following set<br />
<strong>of</strong> equations:<br />
k = kt<br />
+ <br />
β, β = ± k2 − kt 2 , k = ω √ μɛ. (7)
Fig. 2. Wave at any point on Γtr<br />
The surface normal n is represented by the local coordinate<br />
ζ. In order to get rid <strong>of</strong> the ∂<br />
∂n term on the left-handside<br />
<strong>of</strong> equation 5 and equation 6 an integration along ζ<br />
over the half-space must be performed. Assuming a lossy<br />
media, the field components must decay to zero at infinity<br />
which allows for<br />
∞<br />
Ht0e <br />
ζ=0<br />
−jβζ dζ = 1<br />
jβ Ht0<br />
(8)<br />
∞<br />
Et0e −jβζ dζ = 1<br />
jβ Et0.<br />
ζ=0<br />
<br />
Ht0 and <br />
Et0 are the tangential field vectors at ζ =0.<br />
Together with equation 7, relations 5 and 6 can now be<br />
rewritten as<br />
n × <br />
Et0 =<br />
n × <br />
Ht0 =<br />
−ωμHt0 <br />
<br />
k2 − kt 2 + ∇t × (∇t × Ht0)<br />
ωɛ k2 − kt 2<br />
(9)<br />
ωɛEt0 <br />
<br />
k2 − kt 2 − ∇t × (∇t × Et0)<br />
ωμ k2 − kt 2<br />
. (10)<br />
Transverse components <strong>of</strong> the outgoing wave may be<br />
transformed into the Fourier domain, only to see, that its<br />
tangential derivatives can be expressed as ∇t = −j kt.<br />
Substitution in equation 9 and 10 leads to<br />
n × <br />
Et0 =<br />
n × <br />
Ht0 =<br />
−ωμHt0 <br />
<br />
k2 − kt 2 − kt × ( kt × Ht0)<br />
ωɛ k2 − kt 2<br />
(11)<br />
ωɛEt0 <br />
<br />
k2 − kt 2 + kt × ( kt × Et0)<br />
ωμ k2 .<br />
2<br />
− kt<br />
(12)<br />
These relations between the tangential components <strong>of</strong><br />
Et0<br />
and Ht0 can now be used to model the so called<br />
surface operator boundary conditions (SOBC) on Γtr.<br />
Equations 11 and 12 allow for any angle <strong>of</strong> incidence <strong>of</strong><br />
the plane wave on a truncating surface Γtr. Since only<br />
perpendicular incidence on the waveguide and on Γtr are<br />
considered, the use <strong>of</strong> a first-order SOBC is reasonable<br />
- kt =0.<br />
Application <strong>of</strong> the Galerkin method to the well-known<br />
A, v-formulation makes use <strong>of</strong> the n × Ht on the Neu-<br />
- 38 - 15th IGTE Symposium 2012<br />
mann Boundary (see [6]).<br />
− <br />
+ <br />
Ω<br />
+ <br />
ΓH<br />
Ω<br />
∇× Ni · 1<br />
μ ∇× AdΩ<br />
Ni · (n × ( 1<br />
μ ∇× A)) dΓ<br />
<br />
n× H<br />
Ni · (σ + jωɛ)jω( A −∇v)dΩ =0. (13)<br />
On the Neumann boundary (ΓH) the underbraced term<br />
in equation 13 is substituted by the Fourier transformed<br />
integral <strong>of</strong> equation 6 which prescribes the truncation <strong>of</strong><br />
the FE-mesh directly.<br />
B. Surface Impedance Boundary Conditions (SIBC)<br />
An increased incident angle results always in a larger<br />
wave vector kt and obviously the curl curl-terms in<br />
equations 11 and 12 become more and more relevance<br />
to achieve accurate boundary conditions. If the wave<br />
propagates perpendicularly to Γtr, the vector kt equals<br />
zero. This is the considered case for all results presented<br />
herein. Hence the second term on the right-hand-side in<br />
equations 11 and 12 equal zero and first order SIBCs<br />
remain:<br />
n × <br />
<br />
−ωμHt0 μ<br />
Et0 = = −<br />
k<br />
ɛ Ht0 = −Z0 Ht0 (14)<br />
n × <br />
ωɛ <br />
<br />
Et0 ɛ<br />
Ht0 = = −<br />
k μ Et0 = 1<br />
Et0<br />
(15)<br />
Z0<br />
The impedance <strong>of</strong> the mesh-terminating plane Γtr can<br />
now be directly prescribed.<br />
III. SETUP<br />
Fig. 1 shows the setup used for the computation <strong>of</strong> the<br />
plane wave field strength incident on the tube and exiting<br />
it. The right-hand-side <strong>of</strong> the problem area is terminated<br />
by means <strong>of</strong> the introduced SOBC. A plane wave originating<br />
from the stimulus plane penetrates the tube. Only<br />
a fraction <strong>of</strong> the incident power ”leaks” through it, since<br />
at the frequencies considered it represents a waveguidebeyond-cut<strong>of</strong>f<br />
(WBC). This small fraction <strong>of</strong> the incident<br />
wave is terminated reflectionless at Γtr. The detail <strong>of</strong> the<br />
aluminium tube with a square cross-section and lengths<br />
ranging from 20mm ... 80mm is shown in fig. 3.<br />
The grid shown in figure 3 represents the macro<br />
elements used for modelling only.<br />
IV. RESULTS<br />
A. Finite Element Method with SOBC<br />
Since frequencies above 1GHz are <strong>of</strong> interest, simulations<br />
at distinct frequencies in the range <strong>of</strong> 3 ... 18GHz at<br />
a stepwidth <strong>of</strong> 3GHz are considered. At each frequency<br />
the length <strong>of</strong> the tube is stepped through by 10mm in<br />
the range between 20mm and 80mm. The cross-section<br />
<strong>of</strong> the waveguide is kept constant. Fig. 4 shows the<br />
resulting attenuation <strong>of</strong> a plane wave on its way through
Fig. 3. Details <strong>of</strong> the waveguide-beyond-cut<strong>of</strong>f<br />
the WBC. At 15GHz the attenuation <strong>of</strong> the incident<br />
wave starts to approach zero and the tube becomes a<br />
waveguide as known from RF-applications and has also<br />
been described in [3]. As long as the frequencies are<br />
Fig. 4. Attenuation <strong>of</strong> a plane wave at distinct lengths and frequencies<br />
below the cut<strong>of</strong>f-frequency, the attenuation does not only<br />
depend on the ratio between f, the frequency used, and<br />
the cut<strong>of</strong>f-frequency fc <strong>of</strong> the structure, but also depends<br />
on the length <strong>of</strong> the tube. The relationship is non-linear<br />
and therefore clearly contrasting the engineering rules<strong>of</strong>-thumb<br />
as provided in the following section.<br />
The following figure (Fig. 5) shows the computation<br />
<strong>of</strong> the field strengths on either side <strong>of</strong> the waveguidebeyond-cut<strong>of</strong>f.<br />
It is operated at 9GHz and the righthand-side<br />
is terminated by means <strong>of</strong> the SOBC described<br />
before. The colors in the figure refer to the absolute value<br />
<strong>of</strong> the field strengths <strong>of</strong> the electrical component <strong>of</strong> the<br />
plane wave at a particular moment. Due to the necessity<br />
<strong>of</strong> a fine mesh for the computation <strong>of</strong> fields along the<br />
waveguide (coloured grey), no field strengths are visible.<br />
Following the general formula <strong>of</strong> the power density <strong>of</strong> a<br />
plane wave<br />
S = 1<br />
2 Re( E × H∗ ) (16)<br />
and the impedance <strong>of</strong> free space <strong>of</strong><br />
Z0 =<br />
μ0<br />
ɛ0<br />
- 39 - 15th IGTE Symposium 2012<br />
Fig. 5. A waveguide 30mm in length, operated at 9GHz<br />
the attenuation <strong>of</strong> the power through the waveguide can<br />
be calculated. With<br />
at =20lg |Emaxin|<br />
|Emaxout|<br />
(17)<br />
the degree <strong>of</strong> the attenuation (at)[dB] can be determined<br />
based on the field strength <strong>of</strong> the electrical component<br />
<strong>of</strong> the plane wave on the left-hand-side <strong>of</strong> the tube<br />
(|Emaxin|) and on the right-hand-side (|Emaxout|). The<br />
maxima <strong>of</strong> the respective field strengths are taken from<br />
a line parallel to the x-axis along the centre <strong>of</strong> the tube.<br />
B. Engineering Rules<br />
For applications using frequencies below approximately<br />
1GHz [2] proposes the use <strong>of</strong> simple ”design<br />
rules”:<br />
fc = 150<br />
b , fc[GHz], diameter[mm] (18)<br />
at = 27.3<br />
b l, at[dB],<br />
b =<br />
diameter, length[mm] (19)<br />
√ 2a, forsquare cross − section[mm]<br />
f ≤ fc<br />
10 , usablefrequencyf (20)<br />
with fc being the cut<strong>of</strong>f-frequency in [GHz], at representing<br />
the shielding effectiveness in [dB] and any<br />
dimension given in [mm]. Formulae 18 to 20 show that<br />
the cut<strong>of</strong>f-frequency only depends on the diameter <strong>of</strong> the<br />
tube which is, to some degree, in accordance with [3].<br />
It has to be distinguished whether a square, rectangular<br />
or circular waveguide is used. As for the rectangular<br />
cross-sections [3] reads that the larger dimension governs<br />
the cut<strong>of</strong>f-frequency fc. For circular shapes the diameter<br />
counts. One may also have noticed that the engineering<br />
rules do not account for any matter in the waveguide<br />
but free space. Since the WBC is used as a vent with<br />
shielding properties its cut<strong>of</strong>f-wavelength follows<br />
λc = c0<br />
fc<br />
≈ 2a. (21)<br />
This is in line with [3] and equation 22 if ɛ = ɛ0 and<br />
μ = μ0. Waveguides filled with dielectric matter for<br />
transmission properties are beyond the scope <strong>of</strong> this work<br />
since they are neither useful as vents nor as a shielding<br />
component.
As long as the frequency <strong>of</strong> interest is below the<br />
highest usable frequency as given in equation 20 the<br />
tube yields an attenuation according to equation 19.<br />
Application <strong>of</strong> this set <strong>of</strong> formulae to the waveguide<br />
under consideration at 12GHz provides the following<br />
graph (fig. 6):<br />
Fig. 6. Engineering rules applied at 12GHz<br />
Figure 6 shows the application <strong>of</strong> the engineering rules<br />
at the cut<strong>of</strong>f-frequency fc = 12GHz. The calculation<br />
<strong>of</strong> the shielding effectiveness with the engineering rules<br />
(blue dashed line) naturally exceed the limits obtained<br />
by means <strong>of</strong> the numerical value since equation 20 has<br />
not been considered so far. This equation is obviously a<br />
very rough estimate <strong>of</strong> the maximum usable frequency.<br />
It requires this waveguide not to be used above 1.2GHz.<br />
This is very conservative, since the green solid line<br />
(the uppermost line) shows the course <strong>of</strong> the shielding<br />
effectiveness at 9GHz <strong>of</strong> this particular waveguide. The<br />
engineering rules yield similar results, but on the safe<br />
side. Since it is not clear which limit in terms <strong>of</strong> shielding<br />
effectiveness underlies this set <strong>of</strong> easy-to-use engineering<br />
rules, one has to be very careful with its application. Even<br />
if it was possible to adjust equation 20 to this result, the<br />
behaviour <strong>of</strong> a waveguide may render this unreliable due<br />
to its nonlinear attenutation <strong>of</strong> a plane wave as fig. 4<br />
clearly shows.<br />
C. Analytical Approach<br />
When considering a waveguide-beyond-cut<strong>of</strong>f (WBC)<br />
for shielding purposes, the lowest mode <strong>of</strong> a TE or TMwave<br />
propagating through it is <strong>of</strong> interest. It represents<br />
the cut<strong>of</strong>f-frequency fc. For waveguides with a square<br />
cross-section [3] reads for TE10-mode<br />
fc = 1<br />
2 √ 1<br />
(22)<br />
ɛμ a<br />
with a being the length <strong>of</strong> the edge <strong>of</strong> the square. For a<br />
waveguide as used for this work, fc =14.99GHz which<br />
matches the result shown in figure 4. With increasing<br />
frequencies the attenuation <strong>of</strong> the plane waves vanishes<br />
above approximately 15GHz regardless <strong>of</strong> the length <strong>of</strong><br />
it. In other words, illuminating this particular waveguide<br />
at frequencies ≥ 15GHz will render it useless as a shield.<br />
- 40 - 15th IGTE Symposium 2012<br />
Since waveguides are generally used for transmission<br />
<strong>of</strong> electromagnetic energy there are, apart from the engineering<br />
rules above, no analytical formulations available<br />
to determine the attenuation <strong>of</strong> a plane wave penetrating a<br />
waveguide below its cut<strong>of</strong>f-frequency - there is no distinct<br />
mode <strong>of</strong> energy flow in the waveguide. For the same<br />
reason there are no analytical formulations known for<br />
plane waves penetrating a waveguide at other angles than<br />
perpendicular to the cross-section <strong>of</strong> it (see section V).<br />
V. CONCLUSION<br />
This paper shows how Surface Operator Boundary<br />
Conditions (SOBC) can be implemented in an A − v<br />
formulation to be used with the Galerkin method. The<br />
SOBC are used to model a Neumann Boundary Condition<br />
which allows for reflectionless termination <strong>of</strong> a problem<br />
area. The use <strong>of</strong> the SOBC allows for a significant<br />
speed-up <strong>of</strong> the computation <strong>of</strong> the problem because the<br />
absorbing boundary is only a single term which does not<br />
require additional finite elements to be modelled. For the<br />
construction <strong>of</strong> vents in a shielded room, waveguides below<br />
their cut<strong>of</strong>f-frequencies are employed. The described<br />
model has been used for the computation <strong>of</strong> the shielding<br />
effectiveness <strong>of</strong> waveguides at frequencies exceeding<br />
1GHz and compared and contrasted to an analytical<br />
approach and a set <strong>of</strong> easy-to-use engineering rules. It<br />
can now clearly be shown, that well known and verified<br />
analytical solutions can be met by numerical models<br />
as far as the cut<strong>of</strong>f-frequency <strong>of</strong> square waveguides is<br />
concerned. By the same token, it can be shown that<br />
simple design rules are very conservative i.e. delivering<br />
smaller numbers <strong>of</strong> shielding attenuation than actually<br />
can be yielded in real designs. It can not be said, that this<br />
set <strong>of</strong> easy-to-use rules are valid only below ≈ 1GHz.<br />
So far, only plane waves incident perpendicular to<br />
an open end <strong>of</strong> the waveguide have been modelled and<br />
computed. Future efforts will be put on different angles<br />
<strong>of</strong> incidence. There exist hints, that stacked arrays <strong>of</strong><br />
waveguides (honeycomb structures) suffer a deterioration<br />
<strong>of</strong> total shielding effectiveness compared to the attenuation<br />
provided by a single tube. This behaviour may also<br />
be investigated in the future.<br />
REFERENCES<br />
[1] W. Renhart, C. Magele and C. Tuerk, ”Thin Layer Transition<br />
Matrix Description Applied to the Finite Element Method”,IEEE<br />
Trans on Magn., Vol. 45, No. 3, 2009, pp. 1638- 1641.<br />
[2] Louis T. Gnecco, ”The Design <strong>of</strong> Shielded Enclosures”, Newnes<br />
Press, ISBN 0-7506-7270-6<br />
[3] Karoly Simonyi, ”Theoretische Elektrotechnik”, 6. Auflage, VEB<br />
Deutscher Verlag der Wissenschaften, Berlin 1977<br />
[4] W. Renhart, C. Magele and C. Tuerk, ”Improved FE-mesh truncation<br />
by surface operator implementation to speed up antenna<br />
design” (unpublished).<br />
[5] Sergei Tretyakov, Analytical Modeling in Applied Electromagnetics,<br />
1st ed. Artech House, chapters 2, 3, 2003.<br />
[6] O. Biro, ”Edge element formulations <strong>of</strong> eddy current problems”,<br />
Computer methods in applied mechanics and engineering, vol. 169,<br />
pp. 391-405, 1999.
- 41 - 15th IGTE Symposium 2012<br />
Heat Transfer Analysis on End Windings <strong>of</strong> a Hydro<br />
Generator using a Stator-Slot-Sector Model<br />
1, 2 Stephan Klomberg, 3 Ernst Farnleitner, 3 Gebhard Kastner, 1, 2 Oszkár Bíró<br />
1 Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical Machines,<br />
Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />
2 Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />
3 Andritz Hydro GmbH, Dr.-Karl-Widdmann-Strasse 5, A-8160 Weiz, Austria<br />
E-mail: stephan.klomberg@tugraz.at<br />
Abstract — An accurate evaluation <strong>of</strong> the convective heat transfer coefficient on end windings needs usually large numerical<br />
models. These calculations involve an enormous amount <strong>of</strong> time and are not feasible for finding correlations between the<br />
convective heat transfer coefficient, massflow, rotational speed and geometry. On the basis <strong>of</strong> a parameter study this paper<br />
shows that a simplified stator-slot-sector model is equally accurate as a pole-sector-model but more practicable and faster.<br />
Index Terms—Cooling, Electric machines, Fluid dynamics,<br />
Heat transfer.<br />
I. INTRODUCTION<br />
Large hydro generators are working with a high<br />
efficiency nevertheless the losses <strong>of</strong> these machines can<br />
reach up to several MW’s. These heat losses must be<br />
dissipated from the generator. Designing the cooling <strong>of</strong> a<br />
generator is nowadays well-engineered. The use <strong>of</strong> flow<br />
and thermal networks in this subject is state <strong>of</strong> the art [1].<br />
Flow networks decompose the complex geometry in<br />
discrete network elements to calculate the air flow<br />
through a machine. They include pressure generating<br />
elements (active) like fans and poles or other rotating<br />
components and passive elements like ducts, inlets or<br />
outlets. The fundamentals <strong>of</strong> these components are<br />
determined theoretically, by measurements or<br />
computational fluid dynamics (CFD). The computation <strong>of</strong><br />
the temperature rise in the active parts is handled with<br />
thermal networks where the convective heat transfer and<br />
heat conduction coefficients and a reference air<br />
temperature are major parameters. Examples about flow<br />
and thermal networks are found in [1] and [2].<br />
Strictly speaking, the most important factor, the<br />
convective heat transfer coefficient, has to be calculated<br />
with large numerical generator models. Analyzing these<br />
models needs much time and is inappropriate for<br />
parameter studies. The need <strong>of</strong> coefficients for the<br />
networks makes the consideration <strong>of</strong> an equivalent<br />
smaller model enabling a faster calculation rational.<br />
The standard equation for the convective wall heat<br />
transfer coefficient is<br />
q W<br />
=<br />
(1)<br />
TW- Tref According to [3] this equations is applicable for forced<br />
convection if q W is the wall heat flux density, TWall the<br />
wall temperature and Tref a reference temperature in a<br />
properly control surface in the calculating volume.<br />
In the last years, several investigations have been<br />
carried out on the topic <strong>of</strong> heat transfer on end windings<br />
<strong>of</strong> electrical machines especially for totally enclosed fan<br />
cooled induction motors. The most development has been<br />
on smaller machines in a power class <strong>of</strong> a few kVA. One<br />
method obtains the heat transfer coefficients by<br />
measuring temperatures and implements these<br />
temperatures in thermal resistances [4], [5]. A second<br />
kind <strong>of</strong> approach involves CFD calculations combined<br />
with measurements [6], [7]. The end winding heat<br />
transfer <strong>of</strong> large hydro generators have not yet been<br />
investigated.<br />
The large hydro generator presented in this paper is air<br />
ventilated by a motor-driven fan in radial direction. A<br />
longitudinal section <strong>of</strong> the investigated generator is<br />
shown in Figure 1. The fluid enters the machine at the<br />
inlet (a) without a spin, flows through the end winding<br />
bars (d, e) in the pole area (f) and through the stator ducts<br />
(h) to the outlet (j).<br />
b<br />
c<br />
d<br />
e<br />
axis <strong>of</strong> rotation<br />
a j<br />
Figure 1: Pr<strong>of</strong>ile <strong>of</strong> the investigated hydro generator<br />
showing the (a) inlet, (b) bearing support, (c) support<br />
ring, (d) bottom bar, (e) top bar, (f) salient pole, (g) airgap,<br />
(h) stator ducts, (i) outlet area, (j) outlet, (k) shaft<br />
The purpose <strong>of</strong> this paper is to develop a so called slotsector<br />
model which is smaller and faster to calculate than<br />
k<br />
h<br />
i<br />
f<br />
g
a standard model with all components and an enormous<br />
number <strong>of</strong> elements. The slot-sector model should be<br />
investigated and optimized for calculating an accurate<br />
heat transfer coefficient.<br />
II. NUMERICAL SIMULATION OF THE HEAT FLUX<br />
Turbulence models are one <strong>of</strong> the most important parts<br />
in numerical fluid simulation. Therefore the two most<br />
commonly used models, the standard k- model by Jones<br />
and Launder [8] and the shear stress transport model by<br />
Menter [9] have been compared.<br />
The fundamental equation to calculate the heat flux at<br />
the wall is<br />
W= cpu *<br />
T + (TW -T) (2)<br />
where is the density and cp is the specific heat capacity.<br />
It should be pointed out that the two turbulence models<br />
apply different approaches for the dimensionless near<br />
wall velocity u * and the dimensionless temperature at the<br />
wall T + . Vieser et al tested and explained these heat<br />
transfer predictions in [10] for different test cases.<br />
A strong impact on the wall treatment has the density<br />
<strong>of</strong> the used mesh near walls described by the<br />
dimensionless distance from the wall<br />
y + = u y<br />
<br />
(3)<br />
This parameter depends especially on the height <strong>of</strong> the<br />
first element adjacent to the wall y. The other<br />
parameters are the friction velocity u and the kinematic<br />
viscosity . The smaller y is chosen, the more accurate<br />
the calculated heat transfer coefficient becomes. A<br />
parameter study in section IV will show this correlation.<br />
A short overview <strong>of</strong> the influence <strong>of</strong> y + on the<br />
convective heat transfer coefficient is<br />
T + =fy + <br />
=fW W=fT + , u * y + =f(y)<br />
u * =fy + (4)<br />
<br />
The commercial s<strong>of</strong>tware package ANSYS-CFX-13<br />
[11] has been used for the numerical simulations<br />
described in this paper. There are two main calculation<br />
methods for a rotor stator simulation, the transient and the<br />
steady-state approach. A transient calculation needs large<br />
computing resources and takes a long time. Therefore the<br />
steady-state method has been chosen. There are two<br />
variants, the stage method and the frozen rotor method.<br />
These steady-state approaches are only approximations<br />
because the transient terms in the flow equations are<br />
neglected. Nevertheless, their balance <strong>of</strong> computational<br />
efficiency and accuracy is ideal for parameter studies<br />
with many working points. They differ in the treatment <strong>of</strong><br />
the interface between two components. The stage method<br />
averages the fluxes in circumferential direction on bands<br />
and transmits these fluxes to the downstream component.<br />
- 42 - 15th IGTE Symposium 2012<br />
Only one passage per component has to be modeled, and<br />
furthermore, it can be used for large pitch ratios which<br />
highly reduce the number <strong>of</strong> elements.<br />
The frozen approach works with a frame change at the<br />
interface without averaging the fluxes. Therefore, it needs<br />
to model more passages per component. The conservation<br />
equations for the rotor are solved in a rotating system, the<br />
equations <strong>of</strong> the stator in a static frame <strong>of</strong> reference. The<br />
consistency <strong>of</strong> speed and pressure is combined at the<br />
interface. These relations are illustrated in Figure 2 and<br />
explained in detail in [11].<br />
Figure 2: Steady-state methods: stage and frozen rotor<br />
R1/ R2 and S1/ S2 are rotational periodicities; pR/ pS are<br />
pitch ratios<br />
The standard setting in ANSYS-CFX-13 for ideal gas<br />
is temperature independent, i.e. the thermal conductivity<br />
, the specific heat capacity cp and the dynamic viscosity<br />
are constant. This is a simplification <strong>of</strong> reality which<br />
may make the results more inaccurate. An ideal gas with<br />
temperature dependence has been modeled as a<br />
consequence, and compared to a temperature independent<br />
ideal gas.<br />
The dynamic viscosity (5) and the thermal conductivity<br />
(6) have been modeled with the Sutherland’s formula<br />
[11]. The reference temperature Tref has been set to 325<br />
K. The reference molecular viscosity o, the reference<br />
molecular conductivity o, the Sutherland constants S/ S<br />
and the temperature exponents n/ n are listed below for<br />
both equations.<br />
<br />
0<br />
<br />
0<br />
S = 77.8 K<br />
0 = 1.972 10 -5 Pa s<br />
n = 1.574<br />
= Tref + S <br />
T + S T<br />
n<br />
<br />
Tref (5)<br />
= Tref + S <br />
T + S T<br />
n (6)<br />
Tref S = 60.7 K<br />
0 = 2.82 10 -2 W / m K<br />
n = 1.676<br />
As illustrated in equation (7), the specific heat capacity<br />
has been calculated with the zero pressure polynomial<br />
[11].<br />
cp RS = t 1 +t 2 T+t 3 T 2 +t 4T 3 +t 5T 4 (7)
RS = 287.058 J / kg K<br />
t1 = 3.574<br />
t2 = -4.2691 10 -4<br />
t3 = -4.1854 10 -8<br />
t4 = 3.0986 10 -9<br />
t5 = -2.3848 10 -12<br />
All physical values have been found by automatically<br />
adjusting them to measured thermodynamic properties <strong>of</strong><br />
dry air gathered in [12]. These values are valid in a<br />
temperature range from 260 K to 670 K.<br />
III. EXPLANATION OF THE 3D MODEL<br />
This chapter shows the structure <strong>of</strong> the reference model<br />
called pole-sector model (PSM). Four different slot-sector<br />
models (SSM) will be explained, too.<br />
The reference model has been reduced to one pole<br />
sector <strong>of</strong> the whole circumference <strong>of</strong> the generator. A<br />
rotational periodic condition is given at both<br />
circumferential sides. A symmetry condition in axial<br />
direction further reduces the numerical effort. The model<br />
is shown in Figure 3. It is simulated with the frozen rotor<br />
approach and the number <strong>of</strong> elements is about 30 million.<br />
The calculation time <strong>of</strong> the PSM is longer than a week<br />
because <strong>of</strong> this large amount <strong>of</strong> elements. Nevertheless,<br />
the mesh <strong>of</strong> this model is rather coarse over the whole<br />
volume. This fact is especially important near the wall <strong>of</strong><br />
the end windings.<br />
c<br />
d<br />
b<br />
a<br />
Figure 3: Pole-sector model: (a) inlet, (b) bearing support,<br />
(c) support ring, (d) bottom bar, (e) top bar, (f) salient<br />
pole, (g) air-gap, (h) stator ducts, (i) outlet area, (j) outlet<br />
Measuring temperatures at walls in a large hydro<br />
generator demands a high effort and a long preparation<br />
time. The measuring sensors must be fixed during the<br />
construction <strong>of</strong> the components, which makes such<br />
investigations complicated and expensive. Not least due<br />
to these facts, the calculated wall heat transfer<br />
coefficients (WHTC) <strong>of</strong> the PSM have been taken as<br />
e<br />
j<br />
f<br />
i<br />
h<br />
g<br />
- 43 - 15th IGTE Symposium 2012<br />
reference values. By means <strong>of</strong> simulating several working<br />
points with different volume flow rates and rotational<br />
speeds, a wide scope has been covered. The volume flow<br />
rate is set as the inlet boundary condition and the static<br />
pressure as the outlet condition. All walls, especially the<br />
end winding walls, <strong>of</strong> the model have a fixed<br />
temperature. Conduction in solids is not considered.<br />
The aim <strong>of</strong> the investigations is developing a<br />
simplified model with acceptable computational demands<br />
for a numerical parameter study. The best fitting<br />
computational approach for this issue is the stage model.<br />
The question is how much can the PSM be reduced by<br />
maintaining similar accuracy. To clarify this, four<br />
different simplifications have been modeled.<br />
The first idea was to reduce the model as much as<br />
possible. In order to achieve this, the whole generator has<br />
been reduced to a circumferential section <strong>of</strong> one slot.<br />
Furthermore, the rotor, the stator ducts and the outlet area<br />
are not considered. The interface between the rotor<br />
domain and the inlet domain as well as the air gap serves<br />
as a simplified outlet. Due to this, it is difficult to find an<br />
appropriate boundary condition at the simplified outlet.<br />
Only one end winding bar is considered and a diffuser<br />
has been put in front <strong>of</strong> the inlet to get a radial inflow<br />
onto the end winding area. The number <strong>of</strong> elements is<br />
only about 0.6 million due to all these reductions. This<br />
slot-sector model is named SSM_1.<br />
The next step was extending the model SSM_1 with<br />
the rotor domain to get the second model (SSM_2). The<br />
outlet is moved to the symmetry plane <strong>of</strong> the rotor. The<br />
interface between the rotor and the stator ducts is<br />
assumed to be a fixed wall. The number <strong>of</strong> elements is<br />
less than 1 million.<br />
The third model (SSM_3) is enhanced with the stator<br />
ducts and the outlet area. These parts have also a<br />
circumferential extension <strong>of</strong> one slot only. Because <strong>of</strong><br />
this, the same boundary conditions as in the PSM are<br />
possible. The number <strong>of</strong> elements rises to 1.5 million.<br />
The last remaining problem has been the inflow. A slot<br />
section <strong>of</strong> the inlet domain doesn’t allow a three<br />
dimensional spreading <strong>of</strong> the flow onto the end windings.<br />
The consideration <strong>of</strong> the entire inlet area leads to the last<br />
simplified slot-sector model called SSM_4. The final<br />
model includes all components, but it contains only one<br />
slot with a pitch ratio <strong>of</strong> 22.5, i.e. one end winding bar<br />
and its surrounding stator ducts are modeled. This model<br />
has the best features for using the steady-state approach<br />
stage and a circumferential averaging <strong>of</strong> the WHTC is<br />
expected to be appropriate.<br />
The number <strong>of</strong> elements has been reduced to 2 million.<br />
On the one hand, only one slot section has been modeled,<br />
and on the other hand, the rotor and the inlet domains<br />
have been geometrically simplified and meshed coarser<br />
than the components <strong>of</strong> the PSM. The meshes <strong>of</strong> the end<br />
winding bars <strong>of</strong> the PSM and <strong>of</strong> all four SSM have the<br />
same structure and mesh density.<br />
An accurate prediction <strong>of</strong> the heat transfer coefficient<br />
is possible with a fine near wall mesh only. By means <strong>of</strong><br />
a parameter study, the influence <strong>of</strong> the mesh density on<br />
the WHTC has been investigated. The end windings’<br />
domain is meshed starting from an extremely coarse grid
to a very fine one. These various meshes are illustrated in<br />
Table 1.<br />
TABLE I<br />
DIFFERENT MESHES OF THE END WINDING BAR<br />
y 1.element number <strong>of</strong> <br />
[mm] elements<br />
<br />
5,00 45.000 <br />
3,00 65.000 <br />
2,00 81.000 <br />
1,00 146.000 <br />
0,50 318.000 <br />
0,25 693.000 <br />
0,12 989.000 <br />
0,06 1.492.000 <br />
0,05 1.682.000 <br />
The focus has been on a defined height <strong>of</strong> the first<br />
element at the walls. The rest <strong>of</strong> the volume is<br />
automatically meshed with a defined ratio <strong>of</strong> growth and<br />
a Poisson distribution normal to the wall [13].<br />
IV. RESULTS<br />
The evaluation has been carried out by calculating the<br />
WHTC at the end windings as defined in (1). Further<br />
results are normalized to the reference values for a better<br />
overview. The end winding bar is split into 5 zones to get<br />
the variation <strong>of</strong> the WHTC along the bar. Figure 4 shows<br />
the zones, beginning with T1 and T2 on the top bar. TB3<br />
is the junction from the top to the bottom bar and it is<br />
followed by B2 and B1.<br />
Figure 4: End winding bar with 5 zones<br />
Figure 5 shows the comparison <strong>of</strong> the WHTCs<br />
obtained by the four slot-sector models along an end<br />
winding bar. As a criterion for an acceptable agreement<br />
between the PSM and the SSMs, a ratio PSM / SSM in<br />
the range <strong>of</strong> 0.8 - 1.2 has been chosen. The first 3 slotsector<br />
models cannot fulfill this target, especially the area<br />
TB3 at the end <strong>of</strong> the bar is too inaccurate. The version<br />
SSM_4 is just in the range, except in zone T1. This area<br />
- 44 - 15th IGTE Symposium 2012<br />
<strong>of</strong> the top bar is located at the beginning <strong>of</strong> the air gap<br />
and the rotor and is highly influenced by the motion <strong>of</strong><br />
the rotor. This can be also seen in Figure 6, detail x and y,<br />
where the velocity is very high.<br />
PSM / SSM<br />
2,0<br />
1,8<br />
1,6<br />
1,4<br />
1,2<br />
1,0<br />
0,8<br />
0,6<br />
0,4<br />
0,2<br />
SSM_1 SSM_2<br />
SSM_3 SSM_4<br />
0,0 T1 T2 TB3 B2 B1<br />
Figure 5: Comparison <strong>of</strong> the four slot-sector models<br />
Figure 6 shows the comparison <strong>of</strong> the velocity contours<br />
in the symmetry plane <strong>of</strong> the inlet for the models SSM_4<br />
and SSM_3. The contours <strong>of</strong> SSM_1 and SSM_2 are<br />
similar to SSM_3. These pictures essentially show that<br />
the inflow velocity is higher if the whole inlet is used for<br />
the calculations. Hence the ratios <strong>of</strong> the slot-sector<br />
models 1-3 in the zones TB3, B2 and B1 in Figure 5 are<br />
out <strong>of</strong> the range <strong>of</strong> 0.8 - 1.2.<br />
x y<br />
Figure 6: Velocity in the symmetry plane <strong>of</strong> the inlet<br />
domain <strong>of</strong> a) SSM_4 and b) SSM_3<br />
The interaction between the rotor and the end windings is<br />
illustrated in Figure 7. The turbulent kinetic energy at the<br />
interfaces between the inlet and the rotor as well as<br />
between the top bar and the inlet is shown. There are<br />
vortices with high energy at the PSM contour seen in<br />
Figure 7a. The computation with the model SSM_4
generates the well known circumferential bands (see<br />
Figure 7b) characteristic <strong>of</strong> the stage method. In other<br />
words, by averaging the physical values on<br />
circumferential bands it is not possible to calculate local<br />
vortices. Therefore the use <strong>of</strong> a slot-sector model<br />
underestimates the rotor stator interaction.<br />
Inlet – Top bar<br />
Inlet – Top bar<br />
Inlet - Rotor<br />
Inlet - Rotor<br />
Figure 7: Turbulent kinetic energy on selected interfaces<br />
in a) PSM, b) SSM_4<br />
The graph in Figure 8 shows the heat transfer<br />
coefficient in dependence on the dimensionless distance<br />
from the wall at the top bar. The SST and the k-<br />
turbulence models have been used for this study. The<br />
WHTC increases with decreasing dimensionless distance<br />
from the wall. The coefficient reaches its peak at about y +<br />
= 5 and fluctuates around the maximum value. The factor<br />
y + is very sensitive to varying the near wall velocity due<br />
to a different volume flow rate or rotational speed with<br />
the same mesh and geometry. This mesh refinement study<br />
confirms the investigations <strong>of</strong> [14].<br />
- 45 - 15th IGTE Symposium 2012<br />
y=min<br />
1,1<br />
1,0<br />
0,9<br />
0,8<br />
0,7<br />
0,6<br />
0 1<br />
y<br />
10 100<br />
+ [-]<br />
Figure 8: Mesh refinement study in the zones T1, T2,<br />
TB3 with the k- and the SST turbulence model<br />
Depending on the previous investigations, a parameter<br />
study with various working points has been carried out.<br />
The slot-sector model SSM_4 has been used with<br />
different operating conditions but the SST turbulence<br />
model has always been applied. The results have been<br />
averaged and a standard deviation has been calculated.<br />
Figure 9 shows the results obtained.<br />
PSM / SSM<br />
1,4<br />
1,2<br />
1,0<br />
0,8<br />
0,6<br />
0,4<br />
0,2<br />
T1 SST T1 k-e<br />
T2 SST T2 k-e<br />
TB3 SST TB3 k-e<br />
averaged ratio (y+ < 30, ideal gas temperature independent)<br />
averaged ratio (y+ < 1, ideal gas temperature independent)<br />
averaged ratio (y+ < 1, ideal gas temperature dependent)<br />
standard deviation<br />
0,0 T1 T2 TB3 B2 B1<br />
Figure 9: Normalized WHTC in dependence <strong>of</strong> y + and the<br />
type <strong>of</strong> ideal gas<br />
First, the SSM has been calculated with the same mesh<br />
density near the end winding walls as the PSM. The<br />
results are located in the given range <strong>of</strong> 0.8 – 1.2. The<br />
second simulation run has been done with the finest mesh<br />
<strong>of</strong> the end winding domain. The curve is decreasing<br />
nearly parallel to the first one. The reference model has<br />
been only simulated with a coarse mesh near the end<br />
winding bars, hence an estimation <strong>of</strong> the accuracy is not<br />
possible. These two curves have been calculated with a<br />
temperature independent ideal gas. Therefore, the last<br />
curve has been simulated with an ideal gas with<br />
temperature dependency. The ratio <strong>of</strong> the heat transfer<br />
coefficients increases about 5% with temperature<br />
independence assumed.<br />
Regarding these findings, it can be stated that a slotsector<br />
model with a very fine mesh near walls and an<br />
adjusted ideal gas leads to sufficiently accurate results.
V. CONCLUSION<br />
The comparison <strong>of</strong> a pole-sector model with various<br />
slot-sector models shows that a simplification with less<br />
numerical effort is possible. An extreme reduction <strong>of</strong> the<br />
generator is not recommended, because all components<br />
have to be considered. Due to modeling one slot only, the<br />
stage approach is an adequate and fast calculating method<br />
for this kind <strong>of</strong> model structure. The averaged deviation<br />
<strong>of</strong> the wall heat transfer coefficient from the reference<br />
values is about 12%. Possible improvements <strong>of</strong> the slotsector<br />
model are the use <strong>of</strong> an adjusted ideal gas and a<br />
fine mesh near walls. Furthermore, the influence <strong>of</strong> the<br />
dimensionless distance from the wall has been confirmed.<br />
ACKNOWLEDGMENT<br />
This work has been supported by the Christian<br />
Doppler Research Association (CDG) and by the Andritz<br />
Hydro GmbH.<br />
- 46 - 15th IGTE Symposium 2012<br />
REFERENCES<br />
[1] E. Farnleitner and G. Kastner, "Moderne Methoden der<br />
Ventilationsauslegung von Pumpspeichergeneratoren," e&i, vol.<br />
127, pp. 24-29, 2010.<br />
[2] G. Traxler-Samek, R. Zickermann and A. Schwery, "Cooling<br />
airflow, losses, and temperatures in large air-cooled synchronous<br />
machines," IEEE Transactions on Industrial Electronics, vol. 57,<br />
no. 1, pp. 172-180, Jan. 2010.<br />
[3] H. Herwig, "Kritische Anmerkungen zu einem weitverbreiteten<br />
Konzept: der Wärmeübergangskoeffizient a," Forschung im<br />
Ingenieurwesen, vol. 63, pp. 13-17, 1997.<br />
[4] A. Boglietti and A. Cavagnino, "Analysis <strong>of</strong> the endwinding<br />
cooling effects in TEFC induction iotors," IEEE Transactions on<br />
Industry Applications, vol. 43, no. 5, pp. 1214-1222, Sept.-Oct.<br />
2007.<br />
[5] A. Boglietti, A. Cavagnino, D. Staton and M. Popescu,<br />
"Experimental assessment <strong>of</strong> end region cooling arrangements in<br />
induction motor endwindings," IET Electric Power Applications,<br />
vol. 5, no. 2, pp. 203-209, Feb. 2011.<br />
[6] C. Micallef, S. Pickering, K. Simmons and K. Bradley, "Improved<br />
cooling in the end region <strong>of</strong> a strip-wound totally enclosed fancooled<br />
induction electric machine," IEEE Transactions on<br />
Industrial Electronics, vol. 55, no. 10, pp. 3517-3524, Oct. 2008.<br />
[7] M. Hettegger, B. Streibl, O. Bíró and H. Neudorfer, "Identifying<br />
the heat transfer coefficients on the end-winding <strong>of</strong> an electrical<br />
machine by measurements and simulations," in 19th ICEM, Rome,<br />
2010.<br />
[8] W. P. Jones and B. E. Launder, "The prediction <strong>of</strong> laminarization<br />
with a two-equation model <strong>of</strong> turbulence," International Journal <strong>of</strong><br />
Heat and Mass Transfer, vol. 15, no. 2, pp. 301-314, Feb. 1972.<br />
[9] F. R. Menter, "Two-equation eddy-viscosity turbulence models for<br />
engineering applications," AIAA Journal, vol. 32, pp. 1598-1605,<br />
1994.<br />
[10] W. Vieser, T. Esch and F. Menter, "Heat transfer predictions using<br />
advanced two-equation turbulence models," CFX Technical<br />
Memorandum, vol. CFX-VAL10/0602, 2002.<br />
[11] "ANSYS 13.0 documentation," ANSYS, Inc., Canonsburg, 2010.<br />
[12] F. Kreith, R. M. Manglik and M. S. Bohn, Principles <strong>of</strong> Heat<br />
Transfer, 7 ed., Stamford: Cengage Learning, 2011.<br />
[13] "ANSYS ICEM CFD 13.0 documentation," ANSYS, Inc.,<br />
Canonsburg, 2010.<br />
[14] M. Schrittwieser, A. Marn, E. Farnleitner and G. Kastner,<br />
"Numerical analysis <strong>of</strong> heat transfer and flow <strong>of</strong> stator duct<br />
models," in 20th ICEM, Marseille, 2012.
- 47 - 15th IGTE Symposium 2012<br />
Numerical Investigation <strong>of</strong> Linear Systems Obtained<br />
by Extended Element-Free Galerkin Method<br />
Taku Itoh∗ , Soichiro Ikuno∗ , and Atsushi Kamitani †<br />
∗ Tokyo <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, 1404-1 Katakura, Hachioji, Tokyo 192-0982, Japan<br />
† Yamagata <strong>University</strong>, 4-3-16 Johnan, Yonezawa, Yamagata, 992-8510, Japan<br />
E-mail: taku@m.ieice.org<br />
Abstract—To impose not only the essential boundary condition but also the natural one without any integrations, the<br />
Element-Free Galerkin method (EFG) has been reformulated, and this method is called an eXtended EFG (X-EFG). A<br />
linear system obtained by the X-EFG becomes an asymmetric saddle point problem. Numerical experiments show that,<br />
by using IC-Bi-CGSTAB and IC-GMRES(m), the linear system can be solved more than 9 times faster than the LU<br />
factorization in relatively large problem. However, there are some cases where these iterative methods sometimes do not<br />
converge regardless <strong>of</strong> the condition number. To avoid these cases, and to stably solve the linear system as fast as possible,<br />
a flow chart for choosing an appropriate solver has been constructed by using the results <strong>of</strong> the numerical experiments.<br />
Index Terms—Meshless methods, Element-free Galerkin methods, Saddle point problems, Asymmetric linear systems<br />
I. INTRODUCTION<br />
Meshless methods such as the Element-Free Galerkin<br />
method (EFG) [1] and the Meshless Local Petrov-<br />
Galerkin method (MLPG) [2] have widely been applied<br />
to numerical simulations in a lot <strong>of</strong> fields, including<br />
electromagnetics [3], [4], [5], [6], [7]. In the meshless<br />
methods, elements <strong>of</strong> a geometrical structure are no<br />
longer necessary.<br />
Especially in the EFG, the Lagrange multiplier [1] is<br />
employed for imposing the essential boundary condition.<br />
Recently, the EFG has been reformulated together with<br />
anewimposing method <strong>of</strong> the essential boundary condition<br />
[8]. In this EFG, the essential boundary condition<br />
can be satisfied without using any integrations. However,<br />
it must be noted here that, in this EFG, the natural<br />
boundary condition is imposed by evaluating integrations.<br />
Especially for a three-dimensional (3D) problem, surface<br />
integrals must be evaluated for imposing the natural<br />
boundary condition. For this reason, if there exists an<br />
easier method for imposing the natural boundary condition,<br />
the method is helpful for developing a numerical<br />
code based on the EFG.<br />
The purpose <strong>of</strong> the present study is to reformulate<br />
a 3D EFG so that not only the essential boundary<br />
condition but also the natural one can be imposed without<br />
any integrations. The reformulated method is called an<br />
eXtended EFG (X-EFG). A linear system obtained by<br />
the X-EFG becomes a saddle point problem, and the<br />
coefficient matrix is asymmetric. For the purpose <strong>of</strong><br />
stably obtaining a numerical solution as fast as possible,<br />
appropriate solvers for the asymmetric linear system are<br />
also investigated numerically.<br />
II. EXTENDED ELEMENT-FREE GALERKIN METHOD<br />
A. New Reformulation<br />
In this section, we describe a new reformulation <strong>of</strong><br />
EFG which is different from that described in [8]. For<br />
simplicity, we consider a 3D Poisson problem:<br />
−Δu = f in V, (1)<br />
u =ū on SD, (2)<br />
∂u<br />
∂n =¯q on SN, (3)<br />
where V is a region bounded by a simple closed surface<br />
∂V that consists <strong>of</strong> both SD and SN. Here, SD and SN<br />
satisfy SD ∪ SN = ∂V and SD ∩ SN = φ. In addition, ū<br />
and ¯q are known functions on SD and SN, respectively,<br />
and n is an outward unit normal on ∂V . Furthermore,<br />
f(x) is a given function on V , and x =[x, y, z] T .<br />
From (1), the weak form is derived as<br />
<br />
<br />
∀<br />
w s.t. w∂V : ∇w ·∇u d<br />
=0 3 <br />
x = wf d 3 x . (4)<br />
V<br />
where w(x) is a test function. Note that the constraint <strong>of</strong><br />
w(x) in (4) is different from that in [8].<br />
To discretize (4), the nodes, x1, x2,...,xN are first<br />
placed both in V and on ∂V , and shape functions<br />
φ1(x),φ2(x),...,φN(x) are determined by using the<br />
Moving Least-Squares (MLS) approximation [1], [2], [7].<br />
Here, N is the number <strong>of</strong> nodes in V ∪ ∂V . In the<br />
following, M denotes the number <strong>of</strong> nodes on ∂V .In<br />
addition, the orthonormal system in R N and that in R M<br />
are denoted by {e1, e2,...,eN } and {ē1, ē2,...,ēM },<br />
respectively.<br />
Let us first discretize the weak form (4). To this end,<br />
we assume that u and w can be expanded with φi(x)(i =<br />
1, 2,...,N) as follows:<br />
N<br />
N<br />
u(x) = ûiφi(x), w(x) = ˆwiφi(x). (5)<br />
i=1<br />
By substituting (5) into (4), we obtain<br />
where<br />
i=1<br />
( ˆw,Aû − f) =0, (6)<br />
ˆw =[ˆw1, ˆw2,..., ˆwN ] T , (7)<br />
V
and<br />
û =[û1, û2,...,ûN] T . (8)<br />
In addition, A and f are defined as<br />
N N<br />
<br />
A ≡ ∇φi ·∇φj d 3 xeie T j , (9)<br />
f ≡<br />
i=1 j=1<br />
N<br />
<br />
V<br />
φif d 3 xei . (10)<br />
i=1 V<br />
Next, the constraint w| ∂V =0 in (4) is discretized. To<br />
this end, the constraint is rewritten as the equivalent<br />
proposition:<br />
<br />
∀<br />
β(s, t) :<br />
∂V<br />
β(s, t)w(x(s, t)) dS =0, (11)<br />
and an arbitrary function β(s, t) is assumed to be<br />
contained in span(N1,N2,...,NM ), where N1(s, t),<br />
N2(s, t),...,NM (s, t) are linearly independent functions<br />
on ∂V . Here, s and t are parameters for representing<br />
∂V . By using N1(s, t),N2(s, t),...,NM (s, t), (11) can<br />
be discretized as<br />
( ˆw, ck) =0(k =1, 2,...,M), (12)<br />
where ck (k =1, 2,...,M) are defined as<br />
N<br />
<br />
ck ≡ Nk(s, t)φi(x(s, t)) dS ei. (13)<br />
i=1<br />
∂V<br />
Note that (12) indicate ˆw ∈ V ⊥ , where<br />
V = span(c1, c2,...,cM ). (14)<br />
Hence, the weak form (4) can be discretized as<br />
∀ ˆw ∈ V ⊥ :(ˆw,Aû − f) =0. (15)<br />
Since (V ⊥ ) ⊥ = V, (15) can be written as<br />
Aû − f ∈ V. (16)<br />
Therefore, there exists ˆv ∈ R M such that<br />
Aû + C ˆv = f, (17)<br />
where C ≡ [c1, c2,...,cM ] and ˆv ≡ [ˆv1, ˆv2,...,ˆvM ] T .<br />
Finally, the essential boundary condition (2) and<br />
the natural one (3) are simultaneously discretized. By<br />
the similar procedures for discretizing the constraint<br />
w| ∂V =0 , (2) and (3) can be discretized as<br />
D T û = g, (18)<br />
where D ≡ [d1, d2,...,dM ], and<br />
⎧<br />
N<br />
<br />
Nk(s, t)φi(x(s, t)) dS ei,<br />
⎪⎨ i=1 ∂V<br />
for xk ∈ SD,<br />
dk ≡ N<br />
<br />
Nk(s, t)<br />
⎪⎩ i=1 ∂V<br />
∂φi<br />
(x(s, t)) dS ei,<br />
∂n<br />
for xk ∈ SN<br />
(k =1, 2,...,M). (19)<br />
- 48 - 15th IGTE Symposium 2012<br />
In addition, g ≡ [g1,g2,...,gM ], and<br />
⎧ <br />
⎪⎨ Nk(s, t)ū(s, t) dS, for xk ∈ SD,<br />
SD<br />
gk ≡ <br />
⎪⎩ Nk(s, t)¯q(s, t) dS, for xk ∈ SN<br />
SN<br />
(k =1, 2,...,M). (20)<br />
Note that, for xk ∈ SD, dk is exactly the same asck.<br />
Equations (17) and (18) can be written in the form,<br />
<br />
A C<br />
DT <br />
û f<br />
= . (21)<br />
O ˆv g<br />
Equation (21) is a discretized form <strong>of</strong> the Poisson problem.<br />
Throughout this subsection, the reformulation <strong>of</strong><br />
EFG is finished. In the following, the reformulated EFG<br />
is referred to as an eXtended EFG (X-EFG).<br />
B. Selection <strong>of</strong> linearly independent functions<br />
As mentioned above, the linearly independent functions<br />
Nk (k =1, 2,...,M) are required for discretizing<br />
the essential and natural boundary conditions. Here, the<br />
δ-functions defined on ∂V are employed as Nk (k =<br />
1, 2,...,M) so that the these boundary conditions may<br />
be satisfied exactly. The explicit form <strong>of</strong> Nk(s, t) is given<br />
as<br />
Nk(s, t) = δ(s − sk)δ(t − tk)<br />
<br />
<br />
<br />
∂x ∂x <br />
(k =1, 2,...,M).<br />
× <br />
∂s ∂t <br />
(22)<br />
Note that, on ∂V , the kth boundary node xk is represented<br />
by sk and tk, i.e., xk = x(sk,tk). By using (22),<br />
C, dk and gk(k =1, 2,...,M) can be rewritten as<br />
C =<br />
N<br />
M<br />
φi(x(sk,tk))eiē<br />
i=1 k=1<br />
T<br />
k , (23)<br />
⎧<br />
N<br />
⎪⎨ φi(x(sk,tk))ei, for xk ∈ SD,<br />
dk = i=1<br />
N ∂φi<br />
⎪⎩<br />
∂n<br />
i=1<br />
(x(sk,tk))ei,<br />
(24)<br />
for xk ∈ SN,<br />
<br />
ū(sk,tk), for xk ∈ SD,<br />
gk =<br />
(25)<br />
¯q(sk,tk), for xk ∈ SN.<br />
It must be noted here that, in the X-EFG, the coefficient<br />
matrix is not symmetric except for the case where<br />
∂V = SD. However, the essential and natural boundary<br />
conditions can easily be imposed, since C, D and g can<br />
be evaluated without any integrations.<br />
III. SOLVING LINEAR SYSTEM (21)<br />
A linear system that has a coefficient matrix <strong>of</strong> a 2 ×<br />
2 block structure as in (21) are called a saddle point<br />
problem. In this section, we consider solving the saddle<br />
point problem (21). For the following discussion, (21) is<br />
rewritten as<br />
Aˆx = b, (26)
where<br />
A≡<br />
A C<br />
D T O<br />
<br />
û<br />
, ˆx ≡<br />
ˆv<br />
<br />
f<br />
, and b ≡<br />
g<br />
<br />
. (27)<br />
A. Direct solvers<br />
As a direct solver for saddle point problems, there is a<br />
method that utilizes the 2 × 2 structure <strong>of</strong> A [9]. In this<br />
method, A is decomposed by the Cholesky factorization.<br />
Since A is a main part <strong>of</strong> A, the computational cost may<br />
be decreased by using this method in comparison with<br />
that <strong>of</strong> the Gaussian elimination. However, we do not<br />
employ this method for solving (21). This is because<br />
there were some cases that the Cholesky factorization<br />
<strong>of</strong> A was failed in preliminary numerical experiments.<br />
Thus, we consider that the method in [9] is not stable<br />
for solving (21) in this problem.<br />
It must be noted here that A can be decomposed by the<br />
LU factorization. Hence, we adopt the LU factorization as<br />
a direct solver. In addition, an ordering method is used to<br />
decrease fill-ins before the LU factorization is executed.<br />
B. Iterative Schemes for Solving Saddle Point Problems<br />
As an iterative scheme for solving saddle point problems,<br />
Uzawa’s method [10] is well known. Starting with<br />
initial guesses û0 and ˆv0, Uzawa’s method consists <strong>of</strong><br />
the following coupled iteration:<br />
Aûk+1 = f − C ˆvk, (28)<br />
ˆvk+1 = ˆvk + ω(D T ûk+1 − g), (29)<br />
where ω>0 is a relaxation parameter. In (28), a linear<br />
system depending on the size <strong>of</strong> A must be solved.<br />
Hence, if the size <strong>of</strong> A is large, the computational cost<br />
for solving (28) may be expensive.<br />
On the other hand, the Arrow-Hurwicz method [10]<br />
is also well known as an inexpensive iterative scheme<br />
in comparison with the Uzawa’s method. Starting with<br />
initial guesses û0 and ˆv0, the Arrow-Hurwicz method<br />
consists <strong>of</strong> the following coupled iteration:<br />
ûk+1 = ûk + α(f − Aûk − C ˆvk), (30)<br />
ˆvk+1 = ˆvk + ω(D T ûk+1 − g), (31)<br />
where α is also a relaxation parameter. The Arrow-<br />
Hurwicz method is useful for the case where the size <strong>of</strong><br />
A is large. This is because a linear system do not exist<br />
in this iteration. This iteration can be written in terms<br />
<strong>of</strong> a matrix splitting A = P−Q, i.e., as the fixed-point<br />
iteration,<br />
P ˆxk+1 = Qˆxk + b, (32)<br />
where<br />
<br />
1<br />
P≡ αI O<br />
DT − 1<br />
ω I<br />
<br />
1<br />
, Q≡ αI − A −C<br />
O − 1<br />
ω I<br />
<br />
, (33)<br />
and ˆx T k ≡ ûT k ˆvT <br />
k . In 3D problems, the size <strong>of</strong> A tends<br />
to be large. Thus, we adopt the Arrow-Hurwicz method<br />
as an iterative scheme for solving (21).<br />
- 49 - 15th IGTE Symposium 2012<br />
C. Preconditioned Krylov Subspace Methods<br />
For asymmetric linear systems, the incomplete LU<br />
factorization (ILU) [10] is well known as a preconditioner<br />
for Krylov subspace methods. Although ILU can be applied<br />
to (21), we do not employ ILU. This is because the<br />
coefficient matrix <strong>of</strong> (21) is almost symmetric. Namely,<br />
we consider utilizing the matrix property.<br />
To utilize “almost symmetric”, we adopt the incomplete<br />
Cholesky factorization (IC) [11] as a preconditioner<br />
for Krylov subspace methods. To this end, we propose<br />
a strategy for generating preconditioned matrices. In<br />
this strategy, preconditioned matrices LDLT <strong>of</strong> IC are<br />
generated as<br />
<br />
A D<br />
DT <br />
LDL<br />
O<br />
T , (34)<br />
where L is a lower triangular matrix, and D is a diagonal<br />
matrix. In (34), we assume<br />
<br />
A C<br />
A =<br />
DT <br />
A D<br />
<br />
O DT <br />
. (35)<br />
O<br />
As mentioned above, if xk ∈ SD, the kth column <strong>of</strong>D<br />
is exactly the same as that <strong>of</strong> C. In addition, there is<br />
no difference between the matrix A <strong>of</strong> (21) and that <strong>of</strong><br />
(34). Hence, we consider that the assumption (35) can<br />
be acceptable. Note that we adopt an algorithm <strong>of</strong> IC in<br />
which matrices LDL T are as sparse as the matrix A [11].<br />
Even we use IC as a preconditioner, Krylov subspace<br />
methods for asymmetric linear systems have to be chosen.<br />
As Krylov subspace methods for asymmetric linear<br />
systems, Bi-CGSTAB [12] and GMRES(m) [13] are well<br />
known and these iterative methods have produced a lot<br />
<strong>of</strong> attractive results [14]. Here, m is some fixed integer<br />
parameter, and GMRES(m) restarts every m steps [13].<br />
For solving (21), we adopt both methods with IC. In the<br />
following, Bi-CGSTAB with IC and GMRES(m) with IC<br />
are referred to as IC-Bi-CGSTAB and IC-GMRES(m),<br />
respectively.<br />
IV. NUMERICAL EXPERIMENTS<br />
In this section, some numerical solvers as chosen<br />
in Section III are applied to a linear system (21). To<br />
generate the linear system (21), a 3D Poisson problem is<br />
discretized by using the X-EFG.<br />
Throughout the present section, the region V is assumed<br />
as V =(−0.5, 0.5) × (−0.5, 0.5) × (−0.5, 0.5).<br />
In addition, the natural boundary condition is imposed<br />
on the surface SN defined as −0.25 ≤ x ≤ 0.25,<br />
−0.25 ≤ y ≤ 0.25 and z = 0.5, and the essential<br />
boundary condition is imposed on SD ≡ ∂V − SN.<br />
Moreover, the functions f(x), ū and ¯q are determined<br />
so that the analytic solution <strong>of</strong> the 3D Poisson problem<br />
may be u =exp(−x 2 − y 2 − z 2 ).<br />
The boundary nodes x1, x2,...,xM are uniformly<br />
placed on ∂V , and the nodes xM+1, xM+2,...,xN are<br />
also uniformly placed in V . In addition, the exponential
Fig. 1. Dependence <strong>of</strong> the relative error on the size <strong>of</strong> coefficient<br />
matrix. In this figure, u e and u n are exact and numerical solutions,<br />
respectively.<br />
weight function [1],<br />
⎧<br />
⎨exp[−(r/c)<br />
w(r) ≡<br />
⎩<br />
2 ] − exp[−(R/c) 2 ]<br />
1 − exp[−(R/c) 2 (r ≤ R),<br />
]<br />
(36)<br />
0 (r>R),<br />
is adopted for the MLS approximation. Here, R denotes a<br />
support radius, and c is a user-specified parameter. We set<br />
R =1.9h and c = h, where h is the minimum distance<br />
between two nodes.<br />
In the MLS approximation, the shape functions<br />
φi(x) (i =1, 2,...,N) can be determined by<br />
φi(x) =p T (x)B −1 (x)bi(x), (37)<br />
where p T (x) =[1xyz]. In addition, the matrix B(x)<br />
and the vector bi(x) are defined as<br />
B(x) =<br />
N<br />
wk(x)p(xk)p T (xk), (38)<br />
k=1<br />
bi(x) =wi(x)p(xi), (39)<br />
where wi(x) =w(|x−xi|). In (9), the partial derivatives<br />
<strong>of</strong> φi(x) by X(= x, y, and z) can be obtained as<br />
where<br />
φi,X(x) =p T X(x)B −1 (x)bi(x)<br />
+p T (x)[B −1<br />
X (x)bi(x)+B −1 (x)bi,X(x)], (40)<br />
B −1<br />
X (x) =−B−1 (x)BX(x)B −1 (x). (41)<br />
For evaluating (9) and (10), a cubic cell structure being<br />
independent <strong>of</strong> the nodes is used [1], [7], and the Gauss-<br />
Legendre quadrature is employed. The number NQ <strong>of</strong><br />
quadrature points depends on the number m <strong>of</strong> nodes in<br />
a cell. Throughout this section, NQ is handled on the<br />
similar criterion in [1], i.e., NQ = nQ × nQ × nQ, where<br />
nQ = ⌊ √ m +0.5⌋ +2. In addition, the number NC <strong>of</strong><br />
cells is set as NC = mC × mC × mC, where mC =<br />
⌊N 1/3 ⌋.<br />
As a LU factorization, we adopt the sequential SuperLU<br />
[15]. In addition, the Column Approximate Minimum<br />
Degree Ordering (COLAMD) [16] is employed<br />
- 50 - 15th IGTE Symposium 2012<br />
Fig. 2. Dependence <strong>of</strong> the computational time for solving (21) on the<br />
size <strong>of</strong> coefficient matrix.<br />
as an ordering method. This ordering method can easily<br />
be used by setting options.ColPerm = COLAMD in<br />
the SuperLU. For the Arrow-Hurwicz method, we set<br />
α =1.5 and ω =0.05. In addition, for IC-GMRES(m),<br />
we set m = 200. For all iterative solvers, an initial guess<br />
<strong>of</strong> ˆx in (26) is set as ˆx0 = 0.<br />
Computations were performed on a computer equipped<br />
with a 2.66GHz Intel Core i7 920 processor, 24GB RAM,<br />
Ubuntu Linux ver. 12.04, and g++ ver. 4.6.3. Note that we<br />
only used a single core <strong>of</strong> this processor in the following<br />
experiments. Compiler options were set as “-O3 -Wall<br />
-m64” for all solvers.<br />
A. Determining εtol for Iterative Solvers<br />
For the Arrow-Hurwicz method, the iteration is repeated<br />
until ||ˆxk+1 − ˆxk||/||ˆxk+1|| ≤ εtol is satisfied,<br />
where k is the iteration number and ˆxk is the approximate<br />
solution in the kth iteration. Also, for IC-Bi-<br />
CGSTAB and IC-GMRES(m), the iteration is repeated<br />
until ||rk+1||/||b|| ≤ εtol is satisfied, where rk+1 is the<br />
(k +1)th residual vector that can be obtained in the<br />
algorithm <strong>of</strong> Krylov subspace methods. Note that the<br />
maximum norm is adopted for the definition <strong>of</strong> ||·||.<br />
To determine εtol, the dependence <strong>of</strong> relative error on<br />
the size <strong>of</strong> coefficient matrix is shown in Fig. 1. Here,<br />
the relative error ε ≡||ue−un ||/||ue ||, where ue and un are the exact and numerical solutions <strong>of</strong> u, respectively.<br />
In addition, by the first equation <strong>of</strong> (5), un is evaluated<br />
with û that is determined by the LU factorization. From<br />
Fig. 1, we see ε>10−4 . Hence, we consider that, in this<br />
problem, εtol =10−8is sufficient for obtaining un that<br />
has almost the same accuracy shown in Fig. 1.<br />
B. Performance <strong>of</strong> Direct and Iterative Solvers<br />
Let us first investigate the performance <strong>of</strong> the LU factorization,<br />
the Arrow-Hurwicz method, IC-Bi-CGSTAB<br />
and IC-GMRES(m). To this end, the dependence <strong>of</strong><br />
the computational time for solving (21) by using these<br />
methods on the size <strong>of</strong> coefficient matrix is shown in
(a)<br />
(b)<br />
Fig. 3. Histories <strong>of</strong> the relative residual for IC-Bi-CGSTAB and IC-<br />
GMRES(m), and those <strong>of</strong> the relative error for the Arrow-Hurwicz<br />
method. (a) and (b) are for N + M = 19083 and 42083, respectively.<br />
Fig. 2. We see from this figure that the computational<br />
time <strong>of</strong> the LU factorization is less than that <strong>of</strong> the<br />
Arrow-Hurwicz method. In addition, from this figure,<br />
there is no obvious difference between the computational<br />
time <strong>of</strong> IC-Bi-CGSTAB and that <strong>of</strong> IC-GMRES(m), and<br />
the computational time <strong>of</strong> both methods is less than that<br />
<strong>of</strong> the LU factorization. Especially for the case where the<br />
size N + M <strong>of</strong> the coefficient matrix is relatively large,<br />
the computational time can be decreased by using both<br />
methods, e.g., for N + M = 42083, IC-BiCGSTAB and<br />
IC-GMRES(m) are about 9 and 15 times faster than the<br />
LU factorization. From these results, we consider that the<br />
strategy described in (34) works well for solving (21).<br />
It must be noted here that the Arrow-Hurwicz method<br />
does not converge for N +M = 42083. Similarly, IC-Bi-<br />
CGSTAB and IC-GMRES(m) do not converge for N +<br />
M = 19083. Thus, in Fig. 2, there is no data for these 3<br />
cases. Hence, the iterative solvers are not always stable<br />
in this problem.<br />
To investigate behavior <strong>of</strong> the iterative solvers for<br />
N + M = 19083 and 42083, histories <strong>of</strong> the relative<br />
residual ||rk+1||/||b|| for Krylov subspace methods, and<br />
those <strong>of</strong> the relative error ||ˆxk+1 − ˆxk||/||ˆxk+1|| for<br />
- 51 - 15th IGTE Symposium 2012<br />
Fig. 4. Dependence <strong>of</strong> the condition number <strong>of</strong> A on the size <strong>of</strong><br />
coefficient matrix.<br />
the Arrow-Hurwicz method are shown in Fig. 3. We<br />
see from Fig. 3(a) that IC-BICGSTAB rapidly diverges<br />
and IC-GMRES(m) oscillates. In addition, the Arrow-<br />
Hurwicz method converges though the convergence speed<br />
is slow. Indeed, the iteration number for the Arrow-<br />
Hurwicz method is 35324 when ||ˆxk+1 − ˆxk||/||ˆxk+1||<br />
is satisfied. From Fig. 3(b), we see that IC-Bi-CGSTAB<br />
and IC-GMRES(m) converge rapidly. In addition, the<br />
relative residual <strong>of</strong> IC-GMRES(m) is stably decreased<br />
until restarting. For N +M = 42083, the Arrow-Hurwicz<br />
method does not converge, even after more than 500000<br />
iterations.<br />
Next, we investigate a property <strong>of</strong> the coefficient matrix<br />
<strong>of</strong> (21). To this end, the dependence <strong>of</strong> the condition<br />
number <strong>of</strong> A on the size <strong>of</strong> coefficient matrix is shown<br />
in Fig. 4. We see from this figure that the condition<br />
numbers are not very large, even for N + M = 19083<br />
and 42083. Hence, from the condition numbers, it is<br />
difficult to obtain the reason why the Krylov subspace<br />
methods and the Arrow-Hurwicz method do not converge<br />
for N + M = 19083 and 42083, respectively.<br />
From these results, it is difficult that we recognize an<br />
appropriate solver in advance. Hence, to stably solve (21)<br />
as fast as possible, we suggest that IC-GMRES(m) is<br />
first used in order to choose an appropriate solver. After<br />
ˆn iterations <strong>of</strong> IC-GMRES(m), if<br />
||rk+1||/||b|| ≤ ˆεtol<br />
(42)<br />
is not satisfied, then it is recognized that IC-GMRES(m)<br />
will not converge. In this case, the iteration <strong>of</strong> IC-<br />
GMRES(m) is finished, and (21) is solved by the LU<br />
factorization. If (42) is satisfied after ˆn iterations, then it<br />
is recognized that IC-GMRES(m) will converge. Hence,<br />
in this case, the iteration <strong>of</strong> IC-GMRES(m) is continued.<br />
We consider that ˆεtol =10 −2 and ˆn = max(30, (N +<br />
M)/500) are reasonable choice for this problem.<br />
Although the convergence speed is slow, the Arrow-<br />
Hurwicz method may work for the case where not only<br />
Krylov subspace methods do not converge but also the<br />
LU factorization cannot execute. This may occur when<br />
N + M is too large.
The above suggestions for choosing solvers are summarized<br />
as a flow chart shown in Fig. 5. Note that, in<br />
this flow chart, IC-Bi-CGSTAB can be used instead <strong>of</strong><br />
IC-GMRES(m) by setting ˆεtol and ˆn appropriately. This<br />
is because, in Fig. 2, when IC-GMRES(m) converges,<br />
IC-Bi-CGSTAB also converges, and there is no obvious<br />
difference between the computational time <strong>of</strong> IC-<br />
GMRES(m) and that <strong>of</strong> IC-Bi-CGSTAB.<br />
V. CONCLUSION<br />
To impose not only the essential boundary condition<br />
but also the natural one without any integrations, the EFG<br />
has been reformulated, and this method is called a X-<br />
EFG. A linear system obtained by the X-EFG becomes<br />
an asymmetric saddle point problem. To investigate appropriate<br />
solvers for this problem, the linear system that<br />
is obtained from a 3D Poisson problem discretized by<br />
the X-EFG has been solved by the LU factorization,<br />
the Arrow-Hurwicz method, IC-Bi-CGSTAB, and IC-<br />
GMRES(m) in the numerical experiments. Conclusions<br />
obtained in the present study are summarized as follows:<br />
1) By using the X-EFG, the essential and natural<br />
boundary conditions can be imposed without any<br />
integrations.<br />
2) Although the linear system obtained by the X-<br />
EFG has the asymmetric coefficient matrix, the<br />
incomplete Cholesky factorization works well as a<br />
preconditioner for Bi-CGSTAB and GMRES(m).<br />
3) By using IC-BiCGSTAB and IC-GMRES(m), the<br />
linear system can be solved faster than the LU<br />
factorization in relatively large problems. However,<br />
these iterative methods sometimes do not converge<br />
regardless <strong>of</strong> the condition number.<br />
4) To stably solve the linear system as fast as possible,<br />
an appropriate solver can be chosen by the flow<br />
chart shown in Fig. 5.<br />
As future work, the X-EFG will be applied to more<br />
practical problems in various fields, including electromagnetics.<br />
ACKNOWLEDGMENTS<br />
This work was partially supported by JSPS KAKENHI<br />
Grant Numbers 24700053 and 22360042.<br />
REFERENCES<br />
[1] T. Belytschko, Y. Y. Lu, and L. Gu, “Element-free Galerkin<br />
methods,” Int. J. Numer. Methods Eng., vol. 37, pp. 229–256,<br />
1994.<br />
[2] S. N. Atluri and T. Zhu, “A new meshless local Petrov-Galerkin<br />
(MLPG) approach in computational mechanics,” Comput. Mech.,<br />
vol. 22, pp. 117–127, 1998.<br />
[3] A. Manzin, D. P. Ansalone, and O. Bottauscio, “Numerical<br />
modeling <strong>of</strong> biomolecular electrostatic properties by the elementfree<br />
Galerkin method,” IEEE Trans. on Magn., vol. 47, no. 5, pp.<br />
1382–1385, 2011.<br />
[4] S. Ikuno, T. Hanawa, T. Takayama, and A. Kamitani, “Evaluation<br />
<strong>of</strong> parallelized meshless approach: Application to shielding<br />
current analysis in HTS,” IEEE Trans. on Magn., vol. 44, pp.<br />
1230–1233, 2008.<br />
- 52 - 15th IGTE Symposium 2012<br />
✓<br />
Start<br />
✒<br />
✏<br />
✑<br />
❄<br />
Iteration <strong>of</strong> IC-GMRES(m) is repeated<br />
until iteration number k =ˆn.<br />
❄<br />
✟ ✟✟✟✟✟<br />
❍<br />
❍<br />
❍<br />
❍<br />
||rk+1|| ❍<br />
❍ ≤ ˆεtol? ❍Yes<br />
❍❍❍❍❍ ||b|| ✟<br />
✟<br />
✟<br />
✟<br />
✟<br />
✟<br />
No ✓ ❄<br />
IC-GMRES(m)<br />
✒<br />
❄<br />
✟ ✟✟✟✟✟<br />
❍<br />
❍<br />
❍<br />
❍<br />
❍<br />
❍ Is N + M too large? ❍Yes<br />
✟<br />
❍❍❍❍❍<br />
✟<br />
✟<br />
✟<br />
✟<br />
✟ No ✓ ❄<br />
✓ ❄<br />
Arrow-Hurwicz<br />
✒<br />
✏<br />
LU factorization<br />
✒<br />
✑<br />
✏<br />
✑<br />
✏<br />
✑<br />
Fig. 5. A flow chart for choosing an appropriate solver. Note that<br />
IC-Bi-CGSTAB can be used instead <strong>of</strong> IC-GMRES(m) in this flow<br />
chart.<br />
[5] G. F. Parreira, E. J. Silva, A. Fonseca, and R. Mesquita, “The<br />
element-free Galerkin method in three-dimensional electromagnetic<br />
problems,” IEEE Trans. on Magn., vol. 42, no. 4, pp. 711–<br />
714, 2006.<br />
[6] G. Ni, S. L. Ho, S. Yang, and P. Ni, “Meshless local Petrov-<br />
Galerkin method and its application to electromagnetic field<br />
computations,” International Journal <strong>of</strong> Applied Electromagnetics<br />
and Mechanics, vol. 19, pp. 111–117, 2004.<br />
[7] G. R. Liu, Meshfree Methods: Moving beyond the Finite Element<br />
Method (2nd Edition). Boca Raton: CRC Press LLC, 2009.<br />
[8] A. Kamitani, T. Takayama, T. Itoh, and H. Nakamura, “Extension<br />
<strong>of</strong> meshless Galerkin/Petrov-Galerkin approach without using<br />
Lagrange multipliers,” Plasma and Fusion Research, vol. 6, no.<br />
2401074, 2011.<br />
[9] J. Zhao, “The generalized Cholesky factorization method for<br />
saddle point problems,” Applied Mathematics and Computation,<br />
vol. 92, pp. 49–58, 1998.<br />
[10] Y. Saad, Iterative Methods for Sparse Linear Systems, 2nd Edition.<br />
Philadelphia: SIAM, 2003.<br />
[11] G. H. Golub and C. F. Van Loan, Matrix Computations, 3rd<br />
Edition. Baltimore and London: Johns Hopkins <strong>University</strong> Press,<br />
1996.<br />
[12] H. A. van der Vorst, “Bi-CGSTAB: A fast and smoothly converging<br />
variant <strong>of</strong> Bi-CG for the solution <strong>of</strong> nonsymmetric linear<br />
systems,” SIAM J. Sci. Stat. Comput., vol. 13, no. 2, pp. 631–644,<br />
1992.<br />
[13] Y. Saad and M. H. Schultz, “GMRES: A generalized minimal<br />
residual algorithm for solving nonsymmetric linear systems,”<br />
SIAM J. Sci. Stat. Comput., vol. 7, no. 3, pp. 856–869, 1986.<br />
[14] H. A. van der Vorst, Iterative Krylov Methods for Large Linear<br />
Systems (Cambridge Monographs on Applied & Computational<br />
Mathematics). Cambridge: Cambridge <strong>University</strong> Press, 2003.<br />
[15] J. W. Demmel, S. C. Eisenstat, J. R. Gilbert, X. S. Li, and J. W. H.<br />
Liu, “A supernodal approach to sparse partial pivoting,” SIAM J.<br />
Matrix Analysis and Applications, vol. 20, pp. 720–755, 1999.<br />
[16] T. A. Davis, J. R. Gilbert, S. Larimore, and E. Ng, “A column<br />
approximate minimum degree ordering algorithm,” ACM Trans.<br />
Mathematical S<strong>of</strong>tware, vol. 30, pp. 353–376, 2004.
- 53 - 15th IGTE Symposium 2012<br />
Electromagnetic Wave Propagation Simulation<br />
in Corrugated Waveguide using Meshless Time<br />
Domain Method<br />
Soichiro Ikuno∗ , Yoshihisa Fujita∗ , Taku Itoh∗ , Susumu Nakata † and Atsushi Kamitani ‡<br />
∗Tokyo <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, 1404-1 Katakura, Hachioji, Tokyo 192-0982, Japan<br />
† Ritsumeikan <strong>University</strong>, 1-1-1 Nojihigashi, Kusatsu, Shiga 525-8577, Japan<br />
‡ Yamagata <strong>University</strong>, 4-3-16 Johnan, Yonezawa, Yamagata, 992-8510, Japan<br />
E-mail: s.ikuno@ieee.org<br />
Abstract—The simulation <strong>of</strong> the electromagnetic wave propagation in complex shaped corrugated waveguide using Meshless<br />
Time Domain Method (MTDM) based on the Radial Point Interpolation method is numerically investigated. MTDM does<br />
not require finite elements or meshes <strong>of</strong> a geometrical structure as well as other meshless method. In MTDM, only the<br />
necessary information is the location <strong>of</strong> nodes, and the arrangement <strong>of</strong> the node structure <strong>of</strong> electric fields and magnetic<br />
fields. By using the simulation code for analyzing a magnetic wave propagation phenomenon in a complex shaped waveguide,<br />
the influence <strong>of</strong> node alignment on a wave propagation is numerically evaluated. Moreover, the influence <strong>of</strong> frequencies<br />
and pitch <strong>of</strong> corrugate on the dumping rate is evaluated. The results <strong>of</strong> computation show that the node alignment based<br />
on the staggered grid that is generally used in standard FDTD is suitable for the numerical calculation. In addition, the<br />
relationship between a pitch <strong>of</strong> corrugate and a frequency is numerically evaluated.<br />
Index Terms—FDTD, RPIM, wave propagation, corrugated waveguide<br />
I. INTRODUCTION<br />
In the Large Helical Device (LHD), the electron cyclotron<br />
heating device is used for plasma heating. The<br />
electrical power that is made by the gyrotron system<br />
transmits to LHD by using long corrugated waveguide.<br />
However, it is not clear that the shape <strong>of</strong> curvature <strong>of</strong> the<br />
waveguide or transmission gain <strong>of</strong> electromagnetic wave<br />
propagation theoretically.<br />
Finite Difference Time Domain (FDTD) method has<br />
provided the solution <strong>of</strong> Maxwell’s equation directly,<br />
and the method is applied for electromagnetic wave<br />
propagation simulation frequently [1], [2]. Furthermore,<br />
FDTD method has great advantages in terms <strong>of</strong> parallelization<br />
and treatment <strong>of</strong> problems and so on. However,<br />
the numerical domain should be divided into rectangle<br />
meshes if FDTD method is applied for the simulation,<br />
and it is difficult to treat the problem in the complex<br />
domain.<br />
As is well known that the meshless approach does<br />
not require finite elements or meshless <strong>of</strong> a geometrical<br />
structure. And various meshless approaches such as the<br />
diffuse element method [3], the element-free Galerkin<br />
(EFG) method [4] and the meshless local Petrov-Galerkin<br />
(MLPG) [5] method and the radial point interpolation<br />
method (RPIM) has been developed [6]. And these<br />
methods are applied to a variety <strong>of</strong> engineering fields<br />
and the fields <strong>of</strong> computational magnetics [7], [8], [9].<br />
Particularly, meshless approaches based on RPIM are<br />
applied to time dependent problems [10]. Meshless Time<br />
Domain Method (MTDM) [11] does not require finite<br />
elements or meshes <strong>of</strong> a geometrical structure as well as<br />
other meshless method. In MTDM, only the necessary<br />
information is the location <strong>of</strong> nodes, and the arrangement<br />
<strong>of</strong> the node structure <strong>of</strong> electric fields and magnetic<br />
fields. Thus, MTDM can be easily applied for the time<br />
dependent simulation <strong>of</strong> the problem in the complex<br />
shaped domain.<br />
The purpose <strong>of</strong> the present study is to develop the<br />
numerical code for analyzing electromagnetic wave propagation<br />
in corrugated waveguide, and to investigate the<br />
optimal shape <strong>of</strong> corrugated waveguide.<br />
II. SHAPE FUNCTION BASED ON RPIM<br />
First, we scatter N nodes x1, x2, ··· , xN in the target<br />
domain and the boundary, and assign the Radial Basis<br />
Function (RBF) w1(x),w2(x), ··· ,wN (x) with compact<br />
support to the nodes. Then, the solution u(x) can<br />
be expanded as<br />
u(x) =[w(x) T , p(x) T ]G −1<br />
<br />
u<br />
= φ(x)u, (1)<br />
0<br />
where the vector w(x), p(x), u(x) and φ(x) are defined<br />
by<br />
w(x) =[w1(x),w2(x), ··· ,wN (x)] T , (2)<br />
p(x) =[p1(x),p2(x), ··· ,pM(x)] T , (3)<br />
u =[u1,u2, ··· ,uN ] T , (4)<br />
φ(x) =[φ1(x),φ2(x), ··· ,φN(x)] T . (5)<br />
where φi(x) denotes a shape function on i−th node.<br />
The components <strong>of</strong> the vector p(x) are monomials <strong>of</strong><br />
the space variables. For example, p(x) T =[1,x,y] and<br />
p(x) T =[1,x,y,x2 ,xy,y2 ] are monomials for the linear<br />
and the quadratic approximation. Furthermore, the matrix<br />
G is defined by following equation.<br />
<br />
W P<br />
G =<br />
P T <br />
, (6)<br />
O
Here, the matrices W and P are defined by following<br />
equations.<br />
W =[w(x1), w(x2), ··· , w(xn)] T , (7)<br />
P =[p(x1), p(x2), ··· , p(xn)] T . (8)<br />
In the present study, following three functions are<br />
adopted for the weight function.<br />
⎧<br />
⎨ e<br />
wi(xj) =<br />
⎩<br />
−(r/c)2 − e−(R/c)2 1.0 − e−(R/c)2 , r < R,<br />
(9)<br />
0, r ≥ R,<br />
<br />
r<br />
2 wi(xj) =1.0− 6.0<br />
R<br />
<br />
r<br />
3 <br />
r<br />
4 +8.0 − 3.0<br />
(10)<br />
R R<br />
r<br />
<br />
2<br />
−0.5<br />
wi(xj) = +1.0<br />
(11)<br />
R<br />
Here, R denotes a support radius <strong>of</strong> the influence domain<br />
and c denotes a constant. Moreover, r is defined by<br />
r = |x − xi|. Under the above assumptions, the shape<br />
function and its derivative can be expressed as<br />
N<br />
M<br />
φk(x) = wi(x)gi,k + pj(x)gN+j,k, (12)<br />
∂φk<br />
∂x =<br />
∂φk<br />
∂y =<br />
N<br />
i=1<br />
N<br />
i=1<br />
i=1<br />
∂wi(x)<br />
∂x gi,k +<br />
∂wi(x)<br />
gi,k +<br />
∂y<br />
j=1<br />
M<br />
j=1<br />
M<br />
j=1<br />
∂pj(x)<br />
∂x gN+j,k, (13)<br />
∂pj(x)<br />
gN+j,k, (14)<br />
∂y<br />
where, gi,j denotes the (i, j) element <strong>of</strong> matrix G −1 .<br />
Note that the shape function satisfy the Kronecker<br />
delta function property, i.e.<br />
<br />
1, i = j,<br />
φi(xj) =<br />
(15)<br />
0, i = j.<br />
From this property the function can be expanded by using<br />
the shape function based on RPIM as follows.<br />
u(xi) =<br />
N<br />
φi(xj)ui = ui. (16)<br />
j=1<br />
In the next section, Meshless Time Domain Method is<br />
formulated by using above shape function.<br />
III. MESHLESS TIME DOMAIN METHOD<br />
In the present study, 2D electromagnetic wave propagation<br />
<strong>of</strong> TM mode is adopted for the evaluation. The<br />
governing equation <strong>of</strong> the problem is defined by<br />
ε ∂Ez<br />
∂t = −σEz + ∂Hy<br />
∂x<br />
μ ∂Hx<br />
∂t<br />
μ ∂Hy<br />
∂t<br />
∂Hx<br />
− , (17)<br />
∂y<br />
= −∂Ez , (18)<br />
∂y<br />
∂Ez<br />
= , (19)<br />
∂x<br />
- 54 - 15th IGTE Symposium 2012<br />
where, Hx and Hy denote the magnetic field <strong>of</strong> x and<br />
y component, and Ez denotes the electric field <strong>of</strong> z<br />
component. In addition, ε, σ and μ denote permitivity,<br />
permeability and electroconductivity, respectively.<br />
The system is discretized with respect to time by<br />
applying Leap Frog Method, and it is transformed to<br />
following equations.<br />
ε n+1<br />
Ez − E<br />
Δt<br />
n z<br />
+ σE n+ 1<br />
2<br />
z<br />
= ∂Hn+ 1<br />
1<br />
2<br />
2<br />
y ∂Hn+ x<br />
− ,<br />
∂x ∂y<br />
μ<br />
<br />
H<br />
Δt<br />
(20)<br />
n+1/2<br />
x − H n−1/2<br />
<br />
x = − ∂En z<br />
,<br />
∂y<br />
μ<br />
<br />
H<br />
Δt<br />
(21)<br />
n+1/2<br />
y − H n−1/2<br />
<br />
y = ∂En z<br />
.<br />
∂x<br />
(22)<br />
As we mentioned above, the shape function <strong>of</strong> RPIM<br />
has the Kronecker delta function property (15). By using<br />
the shape function and the property, the system can be<br />
discretized with respect to space as follows.<br />
E n+1<br />
<br />
ε σ<br />
<br />
z,i = α − E<br />
Δt 2<br />
n z,i<br />
⎤<br />
N 1 n+ 2<br />
+ H<br />
N 1 n+ 2 H ⎦ , (23)<br />
H<br />
H<br />
1 n+ 2<br />
x,i<br />
1 n+ 2<br />
y,i<br />
j=1<br />
y,j<br />
1<br />
2 = Hn− x,i<br />
1<br />
2 = Hn− y,i<br />
Here, φ E i and φH i<br />
∂φ H j<br />
∂x −<br />
Δt<br />
−<br />
μ<br />
Δt<br />
+<br />
μ<br />
N<br />
j=1<br />
N<br />
j=1<br />
j=1<br />
x,j<br />
E n ∂φ<br />
z,j<br />
E j<br />
∂y<br />
E n ∂φ<br />
z,j<br />
E j<br />
∂x<br />
∂φ H i<br />
∂y<br />
, (24)<br />
. (25)<br />
denote the shape function for electric<br />
field and magnetic field, and the parameter α is defined<br />
as following equation.<br />
Note that, the average <strong>of</strong> E n z,i<br />
1<br />
α = ε σ . (26)<br />
+<br />
Δt 2<br />
and En+1 z,i is adopted for<br />
E n+1/2<br />
z,i . By solving (23), (24) and (25) alternately in<br />
each time step, we can obtain the result that describes<br />
the time dependent behavior <strong>of</strong> the electromagnetic wave<br />
propagation in various shape <strong>of</strong> wave guide.<br />
In the present study, the Perfectly Matched Layer<br />
(PML) and the Perfect Magnetic Conductor (PMC) are<br />
used for absorbing boundary condition and boundary<br />
condition. The electric field <strong>of</strong> z component is divided<br />
into<br />
Ez = Ezx + Ezy, (27)<br />
where components are governed by following equations.<br />
jωεEzx + σxEzx = ∂Hy<br />
, (28)<br />
∂x<br />
jωεEzy + σyEzx = − ∂Hx<br />
. (29)<br />
∂x
Here, j denotes a imaginary unit, and ω denotes a angular<br />
frequency. By using (27), the basic governing equation<br />
<strong>of</strong> PML is written as follows.<br />
ε ∂Ezx<br />
∂t = −σxEzx + ∂Hy<br />
, (30)<br />
∂x<br />
ε ∂Ezy<br />
∂t = −σyEzy − ∂Hx<br />
, (31)<br />
∂x<br />
μ ∂Hx<br />
∂t = −σ∗ yHx − ∂Ez<br />
, (32)<br />
∂y<br />
μ ∂Hy<br />
∂t = −σ∗ yHy + ∂Ez<br />
. (33)<br />
∂x<br />
Here, μ denotes permeability. Taking into account the<br />
delta function property <strong>of</strong> the shape function based on<br />
RPIM, and discretizing respect to time using the Leap-<br />
Flog method, we can obtain following discretized equations<br />
for PML<br />
E n zx,m =<br />
E n zy,m =<br />
H<br />
H<br />
1 n+ 2<br />
x,m =<br />
1 n+ 2<br />
y,m =<br />
ε<br />
Δt<br />
ε<br />
Δt<br />
<br />
σx<br />
− E<br />
2<br />
n−1<br />
zx,m +<br />
N<br />
H<br />
i=1<br />
ε σy<br />
+<br />
Δt 2<br />
<br />
σx<br />
− E<br />
2<br />
n−1<br />
zy,m +<br />
N<br />
H<br />
i=1<br />
ε σy<br />
+<br />
Δt 2<br />
<br />
μ<br />
Δt − σ∗ <br />
1<br />
y n− 2 Hx,m −<br />
2<br />
μ<br />
Δt + σ∗ y<br />
2<br />
<br />
μ<br />
Δt − σ∗ <br />
1<br />
x n− 2 Hy,m +<br />
2<br />
μ<br />
Δt + σ∗ x<br />
2<br />
N<br />
i=1<br />
N<br />
i=1<br />
n− 1<br />
2<br />
y,i<br />
n− 1<br />
2<br />
x,i<br />
∂φ H i<br />
∂x<br />
∂φ H i<br />
∂y<br />
E n ∂φ<br />
z,i<br />
E i<br />
∂y<br />
E n ∂φ<br />
z,i<br />
E i<br />
∂x<br />
, (34)<br />
, (35)<br />
, (36)<br />
, (37)<br />
where Δt denotes a step size <strong>of</strong> time and superscript n<br />
denotes number <strong>of</strong> steps.<br />
In MTDM, nodes for electric field and magnetic field<br />
should be separated, and following four types <strong>of</strong> node<br />
alignment is adopted for accuracy evaluation as shown<br />
in Fig. 1.<br />
Fist alignment type is based on normal meshless<br />
method that means a node for electric field and magnetic<br />
field is located same position as shown in Fig. 1 (a). The<br />
node for magnetic field located a center <strong>of</strong> diagonally <strong>of</strong><br />
nodes for electric field in second type as shown in Fig.<br />
1 (b). Third type is based on staggered grid which is<br />
generally used in standard FDTD, and fourth type is a<br />
mixed version <strong>of</strong> second and third type as shown in Fig.<br />
1 (c) and (b), respectively.<br />
IV. INFLUENCE OF NODE ALIGNMENT<br />
As is well known that FDTD is an explicit method.<br />
Thus, the method must be satisfies the Courant condition,<br />
- 55 - 15th IGTE Symposium 2012<br />
(a) (b)<br />
(c) (d)<br />
Fig. 1. The schematic view <strong>of</strong> four types <strong>of</strong> node alignment <strong>of</strong> electric<br />
field and magnetic field.<br />
i.e.,<br />
Δt < 1<br />
v<br />
1<br />
<br />
2 1<br />
+<br />
Δx<br />
<br />
1<br />
Δy<br />
<br />
,<br />
2<br />
(38)<br />
where Δx and Δy denote a division size <strong>of</strong> x and y<br />
direction, and v denotes a wave speed. On the other<br />
hand, MTDM has not concept <strong>of</strong> mesh as we mentioned<br />
above. Therefore, following criterion is derived for stable<br />
calculation [11].<br />
min |xi − x|<br />
i<br />
Δt <<br />
. (39)<br />
v<br />
Here, min |xi − x| denotes a distance <strong>of</strong> neighboring<br />
i<br />
node, and the step size <strong>of</strong> time Δt is determined so as<br />
to satisfy the criterion (39).<br />
To evaluate the influence <strong>of</strong> node alignment, value <strong>of</strong><br />
the dumping rate RD is introduced.<br />
<br />
RD =<br />
<br />
Γout<br />
Γin<br />
Pz dl<br />
Pz dl<br />
(40)<br />
Here, Pz denote a pointing vector P = B × E <strong>of</strong> z<br />
component, and Γin, Γout denote a the source input line<br />
and the observation line, respectively. We can see from<br />
above equation, if the value <strong>of</strong> RD satisfies RD =1,<br />
the waveguide regards as an ideal zero loss waveguide.<br />
In addition, physical parameters for the calculation are<br />
shown in Table I
TABLE I<br />
PHYSICAL PARAMETERS FOR THE CALCULATION. HERE λ DENOTES<br />
A WAVE LENGTH.<br />
Damping rate, R D<br />
Input Wave Sine wave<br />
Amplitude 1.0 [V/m]<br />
Frequency 1.0, 15.0, 30.0 [GHz]<br />
Wave speed 3.0 × 10 8 [m/s]<br />
Distance <strong>of</strong> neighboring node 20/λ<br />
Number <strong>of</strong> layer for PML 16<br />
Dimension <strong>of</strong> PML 4<br />
Reflectance factor <strong>of</strong> PML −80 [dB]<br />
1.4<br />
1.2<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
(a)<br />
(b)<br />
(c)<br />
0<br />
0.04 0.042 0.044 0.046 0.048 0.05 0.052 0.054 0.056 0.058 0.06<br />
Support radius, R<br />
Fig. 2. The values <strong>of</strong> dumping rate RD are plotted as a function <strong>of</strong><br />
support radius R in case <strong>of</strong> the first type node alignment as shown in<br />
Fig. 1 (a). Note that lines (a), (b) and (c) are evaluated by using Eq.<br />
(9), (10) and (11), respectively.<br />
In Fig. 2, 3, 4 and 5, we show the influence <strong>of</strong><br />
support radius R on dumping rate RD with various<br />
weight functions. Note that a normal line waveguide is<br />
adopted for the evaluation, and same value <strong>of</strong> support<br />
radius R is adopted for electric field shape function and<br />
magnetic field shape function. In addition, a frequency<br />
<strong>of</strong> input wave is fixed as 1 [GHz]. We can see from<br />
these figures that the values <strong>of</strong> dumping rate RD are not<br />
strictly stable in case <strong>of</strong> spline weight function is adopted<br />
for the shape function construction. On the other hand,<br />
if the Gauss type weight function is adopted for weight<br />
function the value <strong>of</strong> dumping rate RD generally continue<br />
to be flat around unit value in case <strong>of</strong> all the types<br />
<strong>of</strong> node alignment. From this result, Gauss type weight<br />
function (9) is suitable for MTDM weight function, and<br />
for the rest <strong>of</strong> this Gauss type weight function is adopted<br />
for following calculation. Furthermore, we can see from<br />
these figures that node alignments <strong>of</strong> third type (see Fig.<br />
1 (c)) lead us stable calculation. Thus, in the following<br />
calculation node alignment <strong>of</strong> third type is adopted.<br />
V. WAVE PROPAGATION SIMULATION IN<br />
CORRUGATED WAVEGUIDE<br />
Let us first show the distribution <strong>of</strong> electric field in<br />
curved corrugated waveguide. The schematic view <strong>of</strong><br />
the curved corrugated waveguide which is used in the<br />
calculation is shown in Fig. 6 (a). The pitch <strong>of</strong> the<br />
- 56 - 15th IGTE Symposium 2012<br />
Damping rate, R D<br />
1.4<br />
1.2<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
(a)<br />
(b)<br />
(c)<br />
0<br />
0.04 0.042 0.044 0.046 0.048 0.05 0.052 0.054 0.056 0.058 0.06<br />
Support radius, R<br />
Fig. 3. The values <strong>of</strong> dumping rate RD are plotted as a function <strong>of</strong><br />
support radius R in case <strong>of</strong> the second type node alignment as shown<br />
in Fig. 1 (b). Note that lines (a), (b) and (c) are evaluated by using Eq.<br />
(9), (10) and (11), respectively.<br />
Damping rate, R D<br />
1.4<br />
1.2<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
(a)<br />
(b)<br />
(c)<br />
0<br />
0.04 0.042 0.044 0.046 0.048 0.05 0.052 0.054 0.056 0.058 0.06<br />
Support radius, R<br />
Fig. 4. The values <strong>of</strong> dumping rate RD are plotted as a function <strong>of</strong><br />
support radius R in case <strong>of</strong> the third type node alignment as shown in<br />
Fig. 1 (c). Note that lines (a), (b) and (c) are evaluated by using Eq.<br />
(9), (10) and (11), respectively.<br />
Damping rate, R D<br />
1.4<br />
1.2<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
(a)<br />
(b)<br />
(c)<br />
0<br />
0.04 0.042 0.044 0.046 0.048 0.05 0.052 0.054 0.056 0.058 0.06<br />
Support radius, R<br />
Fig. 5. The values <strong>of</strong> dumping rate RD are plotted as a function <strong>of</strong><br />
support radius R in case <strong>of</strong> the fourth type node alignment as shown<br />
in Fig. 1 (d). Note that lines (a), (b) and (c) are evaluated by using Eq.<br />
(9), (10) and (11), respectively.
y(m)<br />
0.4<br />
0.35<br />
0.3<br />
0.25<br />
0.2<br />
0.15<br />
0.1<br />
0.05<br />
0<br />
0 0.05 0.1 0.15 0.2<br />
x(m)<br />
(a) (b)<br />
2.5<br />
2<br />
1.5<br />
1<br />
0.5<br />
0<br />
-0.5<br />
-1<br />
-1.5<br />
-2<br />
E z<br />
Fig. 6. (a): The schematic view <strong>of</strong> the analytic region and (b): the<br />
distribution <strong>of</strong> electric field Ez in corrugated waveguide in case <strong>of</strong><br />
W =50mm.<br />
(a) (b)<br />
Fig. 7. Analytic models for evaluating dumping rate RD. (a): Line<br />
wave guide, (b): Curved wave guide<br />
corrugate made at regular intervals for straight part, and<br />
unequally-spaced gaps are made on a curved part as<br />
shown in Fig. 6 (a). The distribution <strong>of</strong> electric field<br />
<strong>of</strong> z component Ez in case <strong>of</strong> W = 50 mm is also<br />
shown in Fig. 6 (b). In this figure, the reflected wave<br />
is observed at the curved part <strong>of</strong> waveguide. Note that<br />
the reflected wave increase as the width <strong>of</strong> waveguide<br />
W increases. In other words, the damping rate increase<br />
as the value <strong>of</strong> W increase, and this phenomenon also<br />
relevant to wavelength and curvature <strong>of</strong> waveguide.<br />
Next, we evaluate the influence <strong>of</strong> frequencies and<br />
pitch <strong>of</strong> corrugate on the dumping rate RD. The analytic<br />
models for evaluating dumping rate RD are line corrugated<br />
waveguide (see Fig. 7 (a)) and curved corrugated<br />
waveguide (see Fig. 7 (b)). The pitch <strong>of</strong> corrugate shape<br />
is Cλ where C denotes a constant, and the pitch <strong>of</strong><br />
the corrugate made at regular intervals for straight part<br />
and unequally-spaced gaps are made on a curved part<br />
in curved corrugated waveguide as well as previous<br />
evaluation.<br />
By using the analytic models, the influence <strong>of</strong> frequen-<br />
-2.5<br />
- 57 - 15th IGTE Symposium 2012<br />
Damping rate, R D<br />
2<br />
1.5<br />
1<br />
0.5<br />
1GHz<br />
15GHz<br />
30GHz<br />
0<br />
0λ 0.2λ 0.4λ 0.6λ 0.8λ 1λ 1.2λ<br />
Pitch <strong>of</strong> Corrugated waveguide<br />
Fig. 8. The influence <strong>of</strong> frequencies and pitch <strong>of</strong> corrugate on dumping<br />
rate RD in the line waveguide.<br />
Damping rate, R D<br />
2.5<br />
2<br />
1.5<br />
1<br />
0.5<br />
1GHz<br />
15GHz<br />
30GHz<br />
0<br />
0λ 0.2λ 0.4λ 0.6λ 0.8λ 1λ 1.2λ<br />
Pitch <strong>of</strong> Corrugated waveguide<br />
Fig. 9. The influence <strong>of</strong> frequencies and pitch <strong>of</strong> corrugate on dumping<br />
rate RD in the curved waveguide.<br />
cies and pitch <strong>of</strong> corrugate D on dumping rate RD in the<br />
corrugated waveguide is evaluated, and the results are<br />
shown in Fig. 8 and 9. In the line waveguide, the magnetic<br />
wave propagates stationary in case <strong>of</strong> 0.0λ
• The values <strong>of</strong> dumping rate RD are not strictly stable<br />
in case <strong>of</strong> spline weight function is adopted for the<br />
shape function construction.<br />
• On the other hand, if the Gauss type weight function<br />
is adopted for weight function the value <strong>of</strong> dumping<br />
rate RD generally continue to be flat around unit<br />
value in case <strong>of</strong> all the types <strong>of</strong> node alignment.<br />
• The node alignment based on the staggered grid that<br />
is generally used in standard FDTD should be used<br />
for MTDM simulation.<br />
• The reflected wave increase as the width <strong>of</strong> waveguide<br />
W increases. In other words, the damping rate<br />
increase as the value <strong>of</strong> W increase, and this phenomenon<br />
also relevant to wavelength and curvature<br />
<strong>of</strong> waveguide.<br />
• In the line corrugated waveguide, the magnetic wave<br />
propagated stationary in case <strong>of</strong> 0.0λ
- 59 - 15th IGTE Symposium 2012<br />
Optimization <strong>of</strong> Permanent Magnet Linear Actuator<br />
for Braille Screen<br />
*Ivan S. Yatchev, *Iosko S. Balabozov, *Krastio L. Hinov, *Vultchan T. Gueorgiev and<br />
**Dimitar N. Karastoyanov<br />
* Faculty <strong>of</strong> Electrical Engineering, Technical <strong>University</strong> <strong>of</strong> S<strong>of</strong>ia, 8, Kliment Ohridsky Blvd., 1000 S<strong>of</strong>ia, Bulgaria<br />
** Institute <strong>of</strong> Information and Communication Technologies, Bulgarian Academy <strong>of</strong> Sciences, Acad. G. Bonchev St.,<br />
Block 2, 1113 S<strong>of</strong>ia, Bulgaria<br />
E-mail: yatchev@tu-s<strong>of</strong>ia.bg<br />
Abstract—Permanent magnet linear actuator intended for driving a needle in Braille screen has been optimized. The mover<br />
<strong>of</strong> the actuator is a combined one - it consists <strong>of</strong> permanent magnet and ferromagnetic discs. The optimization is carried out<br />
with respect to minimal magnetomotive force ensuring required minimum electromagnetic force on the mover. The<br />
optimization factors are dimensions <strong>of</strong> the cores and mover parts under additional constraint for overall dimension <strong>of</strong> the<br />
actuator. Finite element analysis, response surface methodology and design <strong>of</strong> experiments have been employed for the<br />
optimization. The obtained optimal solution is verified again by finite element analysis.<br />
Index Terms—actuators, Braille screen, optimization, secondary models.<br />
I. INTRODUCTION<br />
Application <strong>of</strong> permanent magnets in the constructions<br />
<strong>of</strong> different actuators has been intensively increased in<br />
recent years. One <strong>of</strong> the reasons for their application is<br />
the possibility for development <strong>of</strong> energy efficient<br />
actuators. New constructions <strong>of</strong> permanent magnet<br />
actuators are employed for different purposes. One such<br />
purpose is the facilitation <strong>of</strong> perception <strong>of</strong> images by<br />
visually impaired people using the so called Braille<br />
screens. Recently, different approaches have been utilized<br />
for the actuators used to move Braille dots [1]-[6].<br />
Typical view <strong>of</strong> a Braille screen is shown in Fig. 1.<br />
Figure 1: Braille screen with needles (dots) driven by<br />
linear actuators.<br />
In the present paper, recently developed permanent<br />
magnet linear actuator for driving a needle (dot) in Braille<br />
screen is optimized using response surface methodology<br />
(RSM) and design <strong>of</strong> experiments (DoE).<br />
The nature <strong>of</strong> the main application puts very firm<br />
requirements about the driver <strong>of</strong> the Braille screen<br />
needles. These requirements can be summarized as<br />
follows:<br />
- firm dimension constraints-especially in radial<br />
direction: outer diameter <strong>of</strong> the driver 3-6 mm;<br />
- holding force 02-05 N;<br />
- minimum energy consumption.<br />
The minimum energy consumption can be achieved by<br />
polarized construction <strong>of</strong> the driving electromagnet<br />
actuator because no power will be consumed at steady<br />
state.<br />
II. ACTUATOR CONSTRUCTION<br />
The principal actuator construction is shown in Fig. 2.<br />
The moving part is axially magnetized cylindrical<br />
permanent magnet with two ferromagnetic discs on both<br />
sides.<br />
The two coils are connected in series in such way that<br />
they create magnetic flux <strong>of</strong> opposite directions in the<br />
region <strong>of</strong> the permanent magnet. In this way, depending<br />
on the polarity <strong>of</strong> the power supply, the permanent<br />
magnet will move either up or down. When motion up is<br />
needed, the upper coil should create flux in the air gap<br />
coinciding with the flux <strong>of</strong> the permanent magnet. Lower<br />
coil at the same time will create opposite flux and the<br />
permanent magnet will move in upper direction. When<br />
motion down is needed, the polarity <strong>of</strong> the power supply<br />
is reversed. The motion is transferred to the Braille dot<br />
using the non-magnetic shaft.<br />
Figure 2: Principal construction <strong>of</strong> the studied actuator.<br />
1–upper shaft; 2–upper core; 3–outer core; 4–upper coil; 5-upper disc;<br />
6–magnet; 7–lower disc; 8–lower coil; 9–lower core; 10–lower shaft
The actuator features increased energy efficiency, as<br />
the power supply is needed only during the switching<br />
between the two end positions <strong>of</strong> the mover. In each end<br />
position, the permanent magnet creates holding force,<br />
which keeps the mover in this position.<br />
III. STATIC FORCE CHARACTERISTICS<br />
Static magnetic field <strong>of</strong> the actuator is modeled using<br />
the finite element method and the program FEMM [7].<br />
Axisymmetric model is adopted as the actuator features<br />
rotational symmetry. The electromagnetic force acting on<br />
the mover is obtained using the weighted stress tensor<br />
approach.<br />
Typical static force characteristics <strong>of</strong> the actuator are<br />
shown in Fig. 3. The stroke <strong>of</strong> the actuator, denoted with<br />
x, is set to zero when the shaft is situated symmetrically<br />
between the upper and lower cores.<br />
c1=-1,c2=1<br />
c1=1,c2=-1<br />
1.2<br />
1<br />
F, N<br />
c1=0,c2=0<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
-0.6 -0.4 -0.2 -0.2 0 0.2 0.4 0.6<br />
-0.4<br />
-0.6<br />
-0.8<br />
-1<br />
-1.2<br />
x, mm<br />
Figure 3: Typical force-stroke characteristics <strong>of</strong> the<br />
studied construction.<br />
c1 and c2 show the direction <strong>of</strong> MMF in both coils <strong>of</strong> the construction.<br />
c1=-1, c2=1 – upward movement <strong>of</strong> the shaft;<br />
c1=1, c2=-1 –downward movement <strong>of</strong> the shaft; c1=0, c2=0 –non<br />
energized coils, the force is due to the permanent magnet only.<br />
The upper and lower curves in Fig. 3 represent the<br />
force when the shaft is moving in upward and downward<br />
direction. The middle curve shows the force when no<br />
current flows in the coils. In that case the force is due to<br />
the magnetic flux <strong>of</strong> the permanent magnet. The<br />
characteristic is symmetrical towards the origin <strong>of</strong> the<br />
force-stroke coordinate system and its final values (when<br />
the shaft is close to upper or lower cores) is called<br />
holding force – Fh. This is the only force that keeps the<br />
shaft in both stable position – upper and lower and it<br />
should resist to the force created by the touching fingers<br />
and the mover’s own weight.<br />
The starting force – Fs is the initial force that acts on<br />
the shaft when it is in its final upper position and both<br />
coils are energized in such a manner to create force in<br />
downward direction or the opposite – the shaft is in final<br />
lower position and force is acting upward.<br />
The construction should guarantee overcoming <strong>of</strong> the<br />
holding force, created by the permanent magnet, when<br />
the coils are properly energized.<br />
The upper coil excites in the upper core magnetic flux<br />
that is equal or bigger than the flux <strong>of</strong> the permanent<br />
magnet but contrary directed. At the same time, the flux<br />
excited by the lower coil is coincident with the flux <strong>of</strong> the<br />
- 60 - 15th IGTE Symposium 2012<br />
permanent magnet.<br />
The construction minimizes the requirements towards<br />
the starting force and guarantees that it will start moving<br />
even for small value <strong>of</strong> the starting force if only it<br />
exceeds the own weight <strong>of</strong> the shaft.<br />
IV. SECONDARY MODELS<br />
Finite element method, DoE and RSM have been used<br />
for creation <strong>of</strong> the secondary models. Full factorial design<br />
has been applied.<br />
The fixed geometric parameters are shown in Fig. 4 and<br />
their values are given in Table 1.<br />
Figure 4. Fixed parameters <strong>of</strong> the actuator.<br />
TABLE I<br />
FIXED GEOMETRIC PARAMETERS<br />
Dimension<br />
Designation<br />
(in Fig. 4)<br />
Value<br />
(in mm)<br />
Outer core diameter D 5<br />
Outer magnet diameter Dm 2<br />
Inner coil diameter Dw1 2.4<br />
Outer coil diameter Dw2 4<br />
Shaft diameter Ds 1<br />
Inner core diameter Dc 1.2<br />
Core thickness hc 2<br />
The varied parameters are:<br />
- The length <strong>of</strong> the upper and lower cores - hw,<br />
- The axial dimension <strong>of</strong> the ferromagnetic disks -<br />
hd,<br />
- The length <strong>of</strong> the permanent magnet - hm,<br />
- The current density in the coils - J.<br />
The varied parameters with geometric representations<br />
are shown in Fig. 5.
Figure 5: Varied geometric parameters <strong>of</strong> the actuator.<br />
The DoE methodology has been used for varied<br />
parameters to create polynomial secondary models. For<br />
each combination <strong>of</strong> values <strong>of</strong> the varied parameters a<br />
family <strong>of</strong> static force-stroke characteristics was obtained.<br />
Based on them secondary models for holding force Fh,<br />
starting force Fs and ampere-turns <strong>of</strong> the coils – NI have<br />
been made.<br />
The precision <strong>of</strong> secondary models has been estimated<br />
by the relative error between the value obtained by te<br />
secondary model and corresponding value obtained by<br />
the FEM model. The difference between secondary and<br />
FEM models for 27 calculation points is given in Fig.6.<br />
relative error, %<br />
0.15<br />
0.1<br />
0.05<br />
0<br />
-0.05<br />
-0.1<br />
-0.15<br />
-0.2<br />
-0.25<br />
0 5 10 15 20 25 30<br />
number <strong>of</strong> calculation points<br />
Figure 6: Relative error between secondary and FEM<br />
models.<br />
V. OPTIMIZATION<br />
The objective function is minimal magnetomotive force<br />
<strong>of</strong> the coils. The optimization parameters are dimensions<br />
<strong>of</strong> the permanent magnet, ferromagnetic discs and the<br />
cores. As constraints, minimal electromagnetic force<br />
acting on the mover, minimal starting force and overall<br />
outer diameter <strong>of</strong> the actuator have been set. The<br />
Fs<br />
Fh<br />
- 61 - 15th IGTE Symposium 2012<br />
optimization is carried out using sequential quadratic<br />
programming.<br />
The canonic form <strong>of</strong> the optimization problem is:<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
where:<br />
- NI — ampere-turns — minimizing energy<br />
consumption with satisfied force requirements;<br />
- Fh — holding force — mover (shaft) in upper<br />
position, no current in the coils;<br />
- Fs — starting force — mover (shaft) in upper or<br />
lower position and energized coils;<br />
- J — coils current density;<br />
- hw, hm, hd—geometric dimensions according to<br />
the sketch in Fig. 6.<br />
Minimization <strong>of</strong> magneto-motive force NI is direct<br />
subsequence <strong>of</strong> the requirement for minimum energy<br />
consumption.<br />
Constraints for Fs and Fh have already been discussed.<br />
The lower bounds for the dimensions are imposed by the<br />
manufacturing limits and the upper bound for the current<br />
density is determined by the thermal balance <strong>of</strong> the<br />
actuator.<br />
The radial dimensions <strong>of</strong> the construction are directly<br />
dependent by the outer diameter <strong>of</strong> the core – D which<br />
fixed value was discussed earlier. The influence <strong>of</strong> those<br />
parameters on the behavior <strong>of</strong> the construction have been<br />
studied in previous work [8] that make clear that there is<br />
no need radial dimensions to be included in the set <strong>of</strong><br />
optimization parameters.<br />
The optimization is carried out by sequential quadratic<br />
programming. The optimization results are as follows:<br />
<br />
<br />
<br />
<br />
<br />
The optimal parameters were set as input values to the<br />
FEM model. The force-stroke characteristics <strong>of</strong> the<br />
optimal actuator is shown in Fig.7 and Fig.8.<br />
Fh, N<br />
0.3<br />
0.2<br />
0.1<br />
0<br />
-0.1<br />
-0.2<br />
-0.3<br />
Fh - holding force in lower position <strong>of</strong> the shaft<br />
Fh - holding force in upper position <strong>of</strong> the shaft<br />
-0.4 -0.3 -0.2 -0.1 0<br />
x, mm<br />
0.1 0.2 0.3 0.4<br />
Figure 7: Force-stroke characteristic <strong>of</strong> the optimal<br />
actuator. The force is created by the permanent magnet<br />
only (no current in the coils).
F, N<br />
0.8<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
0.2<br />
0.1<br />
0<br />
F - final force (shaft imoved in lower position, coils still energized)<br />
Fs - starting force (shaft in upper position)<br />
Fh - force switches to the holding force when current is ceased<br />
-0.4 -0.3 -0.2 -0.1 0<br />
x, mm<br />
0.1 0.2 0.3 0.4<br />
Figure 8. Force-stroke characteristic <strong>of</strong> the optimal<br />
actuator. Coils are energized. The shaft is displaced from<br />
final upper to final lower position.<br />
In Figs. 9 and 10, the magnetic field <strong>of</strong> the optimal<br />
actuator is plotted for two cases.<br />
Figure 9: Magnetic field <strong>of</strong> the optimal actuator with<br />
shaft in upper position and coils energized to create<br />
downward force.<br />
Figure 10: Magnetic field <strong>of</strong> the optimal actuator with no<br />
current in the coils.<br />
- 62 - 15th IGTE Symposium 2012<br />
The force constraints for Fs and Fh are active which<br />
can be expected when minimum energy consumption is<br />
required. The active constraint for hw is also expected<br />
because longer upper and lower cores size which<br />
respectively means longer coils will increase the leakage<br />
coil flux and corrupted coil efficiency.<br />
VI. CONCLUSION<br />
The employed approach has confirmed its robustness<br />
for solution to the optimization problem for the actuator.<br />
The obtained optimal solution satisfies the specific<br />
requirements for actuators for Braille screen.<br />
[1]<br />
REFERENCES<br />
Nobels T., F. Allemeersch, K. Hameyer, “Design <strong>of</strong> a high power<br />
density electromagnetic actuator for a portable Braille display.“<br />
Int. Conf. EPE-PEMC 2002, Dubrovnik & Cavtat, 2002.<br />
[2] Kawaguchi Y., K. Ioi, Y. Ohtsubo, “Design <strong>of</strong> new Braille display<br />
using inverse principle <strong>of</strong> tuned mass damper.” Proc.<strong>of</strong> SICE<br />
annual conference 2010, Taipei, Taiwan, Aug. 18-21, pp. 379-383.<br />
[3] Kwon, H.J., Lee, S.W., Lee, S. Braille code display device with a<br />
PDMS membrane and thermopneumatic actuator. IEEE<br />
[4]<br />
international conference on micro electro mechanical systems<br />
(XXI), MEMS, Tucson, 2008, pp. 527-530.<br />
Chaves, D., Peixoto, I., Lima, A., Vieira, M., de Araujo, C.<br />
Microtuators <strong>of</strong> SMA for Braille display system. IEEE<br />
international workshop on medical measurements and<br />
[5]<br />
applications, MeMeA Cetraro, Italy, May 20-30, 2009, pp. 64-68.<br />
Hernandez, H., Preza, E., Velazquez, R. Characterization <strong>of</strong> a<br />
piezoelectric ultrasonic linear motor for braille displays.<br />
Electronics, robotics and automotive mechanics conference<br />
CERMA Cuernavaca, Mexico, Sep. 22-25, 2009, pp. 402-407.<br />
[6] Cho, H.C., Kim, B.S., Park, J.J., Song, J.B. (2006) Development<br />
<strong>of</strong> a Braille display using piezoelectric linear motors.<br />
[7]<br />
International joint conference SICE-ICASE, 2006, Busan, Korea,<br />
Oct. 18-21, pp. 1917-1921.<br />
D. Meeker, Finite element method magnetics version 3.4, 2005.<br />
[8] Yatchev I., K. Hinov, V. Gueorgiev, D. Karastoyanov, I.<br />
Balabozov, Force characteristics <strong>of</strong> an electromagnetic actuator<br />
for Braille screen, <strong>Proceedings</strong> <strong>of</strong> Thirteenth International<br />
Conference on Electrical Machines, Drives and Power Systems<br />
ELMA 2011, 21-22 October 2011, Varna, Bulgaria, pp. 338-341.
- 63 - 15th IGTE Symposium 2012<br />
3D Finite Element Analysis <strong>of</strong> Induction Heating<br />
System for High Frequency Welding<br />
*Ilona I. Iatcheva, *Georgi H. Gigov , *Georgi C. Kunov and *Rumena D. Stancheva<br />
*Technical <strong>University</strong> <strong>of</strong> S<strong>of</strong>ia, Kliment Ohridski 8, S<strong>of</strong>ia 1000, Bulgaria<br />
E-mail: iiach@tu-s<strong>of</strong>ia.bg<br />
Abstract—The aim <strong>of</strong> the work is investigation <strong>of</strong> induction heating system used for longitudinal, high frequency pipe<br />
welding. The problem was considered as 3D coupled electromagnetic and temperature field problem and has been solved<br />
using finite element method and COMSOL 4.2 s<strong>of</strong>tware package. Time harmonic electromagnetic and transient thermal fields<br />
have been studied in order to estimate system efficiency and factors influencing on the quality <strong>of</strong> the welding process and<br />
required energy.<br />
Index Terms— finite element method, high frequency welding, 3D coupled field analysis.<br />
small scale, carbon steel tubes and pipes. It consists <strong>of</strong><br />
spiral inductor, which induced a voltage across the edges<br />
<strong>of</strong> the moving open pipe material. The induced voltage<br />
causes high frequency currents, concentrated on the<br />
surface layer due to the skin and proximity effects. The<br />
currents flow along the two edges in opposite directions<br />
in so called “V”-zone (Fig.2) to the point where they<br />
meet, causing rapid heating <strong>of</strong> the metal and surface<br />
melting. The weld squeeze rolls are used to apply<br />
pressure, which forces the heated metal into contact and<br />
forms welding bond.<br />
I. INTRODUCTION<br />
The induction heating is widely used in the heat<br />
treatment <strong>of</strong> conducting details due to its advantages:<br />
high quality and efficiency <strong>of</strong> the heating processes, good<br />
accuracy in heating <strong>of</strong> certain zones in a short time and<br />
clean operating conditions [ 1]-[4].<br />
The aim <strong>of</strong> the present research is investigation <strong>of</strong><br />
induction heating system used for high frequency<br />
longitudinal pipe welding. The main task is to determine<br />
optimal factors and parameters influencing on quality <strong>of</strong><br />
the welding process and required energy: welding<br />
frequency, welding speed, ‘vee’ angle, presence <strong>of</strong> the<br />
ferrite impeder (inner and outer), tube thickness and etc.<br />
The solution <strong>of</strong> the problem is based on the precise 3Dmodelling<br />
and FEM analysis <strong>of</strong> the electromagnetic and<br />
thermal processes, taking place in the investigated system.<br />
Detailed determination <strong>of</strong> the electromagnetic and<br />
temperature field distribution and its dependence on the<br />
mentioned above parameters is important condition for<br />
effective control and management <strong>of</strong> the welding process.<br />
II. INVESTIGATED INDUCTION HEATING SYSTEM<br />
The principal geometry <strong>of</strong> the investigated system is<br />
shown in Fig.1.<br />
squeeze<br />
point <strong>of</strong> roll<br />
closure<br />
direction <strong>of</strong><br />
movement<br />
impeder<br />
core<br />
inductor<br />
steel pipe<br />
cooling water<br />
welded<br />
bond<br />
Figure 1: Geometry <strong>of</strong> the investigated induction system<br />
The system is designed for high frequency welding <strong>of</strong><br />
Figure 2: In the “V”-zone HF currents flow along the two edges in<br />
opposite directions.<br />
The system includes also inner ferrite impeder, which<br />
concentrates magnetic flux and improves the welding<br />
efficiency. The cooling water flows inside the inductor<br />
and impeder for system cooling.<br />
As it can be seen from the geometry in Fig.1 the<br />
impeder is located not along the pipe axe, but moved<br />
closer to the welded region - i.e. the system is not<br />
axesymmetric and has to be analysed as three<br />
dimensional.<br />
The system has been investigated and electromagnetic<br />
and thermal processes have been modelled for the<br />
parameters shown in Table I.<br />
TABLE I<br />
PARAMETERS OF THE SYSTEM<br />
Parameter Value<br />
Applied current I 1000 A<br />
Voltage U 500 V<br />
cos 0,1<br />
Frequency f 200kHz 500kHz<br />
End heating temperature 13001450 0 C<br />
Cooling water<br />
temperature<br />
40 0 C
III. MATHEMATICAL MODEL OF THE COUPLED FIELD<br />
PROBLEM<br />
Mathematical modeling <strong>of</strong> the processes in the<br />
investigated system for high frequency welding are based<br />
on the analysis <strong>of</strong> coupled – electromagnetic and<br />
temperature field distribution in the considered device.<br />
As it has been already mention the geometry <strong>of</strong> the object<br />
is a complex, nonsymmetric and electromagnetic and<br />
thermal field have to be studied as three-dimensional.<br />
The present work deals with modeling <strong>of</strong> the 3D time<br />
harmonic electromagnetic field. The eddy current losses,<br />
obtained in electromagnetic field analysis are field<br />
sources in modeling <strong>of</strong> the transient thermal field<br />
The electromagnetic field problem has been studied not<br />
only in the system elements, but also in wide buffer zone<br />
around the devise. It helps to define correct boundary<br />
conditions in field modeling. In Fig.3 is shown<br />
investigated region, used in electromagnetic field<br />
modeling. It includes domains: 1- inductor; 2impeder;<br />
3- welded pipe; 4- cooling water; 5- buffer<br />
zone with air.<br />
1<br />
4<br />
Figure 3: Investigation domains<br />
2<br />
5<br />
3<br />
Electromagnetic field distribution can be described with<br />
equations (1) and (2):<br />
<br />
<br />
-1<br />
A<br />
<br />
( A)<br />
J e <br />
(1)<br />
t<br />
<br />
E j<br />
A V<br />
(2)<br />
where A is magnetic vector potential , J is current<br />
density, E is electrical strength , V is scalar electric<br />
potential, is electric conductivity and is magnetic<br />
permeability.<br />
The boundary conditions are A 0<br />
<br />
for the buffer zone<br />
boundaries.<br />
The time varying electromagnetic field produces eddy<br />
currents:<br />
- 64 - 15th IGTE Symposium 2012<br />
<br />
J jA<br />
(3)<br />
and corresponding Joule losses – source <strong>of</strong> the heating in<br />
the region:<br />
<br />
1<br />
*<br />
[ ] JJ<br />
Q <br />
2<br />
<br />
(4)<br />
The transient thermal field is modeled by equation:<br />
T<br />
. C (<br />
kT<br />
) Q (5)<br />
t<br />
where k is thermal conductivity , T is temperature, is<br />
density, C is heat capacity and Q is heat source, obtained<br />
in electromagnetic field analysis.<br />
IV. FEM ANALYSIS - 3D COUPLED PROBLEM<br />
Numerical simulation <strong>of</strong> the coupled - electromagnetic<br />
and thermal fields was carried out using FEM and<br />
COMSOL 4.2 package [4].<br />
In Fig.4 is shown investigated system with the buffer<br />
zone around it and Fig.5 presents FEM mesh, used in<br />
solving the problem.<br />
Figure 4: Investigated system with the buffer zone around it.<br />
Figure 5: FEM mesh.
Some results, obtained in solving the problem for<br />
frequency 300 KHz are shown in Fig. 6, Fig. 7, Fig. 8,<br />
Fig. 9 and Fig. 10.<br />
The analysis <strong>of</strong> electromagnetic field distribution<br />
indicates that maximal value <strong>of</strong> the magnetic flux density<br />
is about 0.19T. These values are reached in the “V” zone<br />
and around the inductor. Two different cross sections<br />
illustrate distribution <strong>of</strong> the magnetic flux density in the<br />
system in Fig.6 and Fig.7.<br />
Figure 6: Distribution <strong>of</strong> magnetic flux density in the<br />
investigated region, f= 300 KHz.<br />
Figure 7: Distribution <strong>of</strong> magnetic flux density along the<br />
“V” zone, f= 300 KHz.<br />
The results, obtained for current density distribution in<br />
the entire region are shown in Fig.8. Two specific for the<br />
problem cross sections - around “point <strong>of</strong> closure” and<br />
spiral inductors are picking out. The maximal value is<br />
- 65 - 15th IGTE Symposium 2012<br />
1,13x10 9 A/m 2 . Current density distribution around the<br />
“point <strong>of</strong> closure” is shown in Fig.9 and in Fig.10 around<br />
the spiral inductor.<br />
.<br />
Figure 8: Current density distribution in the entire region<br />
Figure 9: Current density distribution around “point <strong>of</strong><br />
closure”<br />
V. CONCLUSION<br />
3D-coupled electromagnetic and temperature field<br />
problem and has been solved using finite element method<br />
and COMSOL 4.2 s<strong>of</strong>tware package in order to<br />
investigate induction heating system for high frequency
pipe welding. The obtained temperature value around the<br />
“point <strong>of</strong> closure” is about 1400 0 C.<br />
REFERENCES<br />
[1] R.Baumer, Y.Adonyi ”Transient High-Frequency Welding<br />
[2]<br />
Simulations <strong>of</strong> Dual-Phase Steels”, Welding Journal, October<br />
2009, vol. 88, pp. 193 – 201<br />
D.Kim, T. Kim, Y.Park, K.Sung, M.Kang, C.Kim, I.Lee and<br />
S.Rhee, “Estimation <strong>of</strong> weld quality in high-frequency electric<br />
resistance welding”, Welding Journal, March 2007, pp. 27 – 31.<br />
[3] A. Shamov, I. Lunin, V. Ivanov, High frequency metal welding,<br />
[4]<br />
Leningrad, ‘Mashinostroenie”1977 (In Russian).<br />
COMSOL Version 4.2 User’s Guide, 2011.<br />
- 66 - 15th IGTE Symposium 2012
- 67 - 15th IGTE Symposium 2012<br />
Optimization Algorithms in the View <strong>of</strong> State<br />
Space Concepts<br />
M. Neumayer∗ , D. Watzenig∗ , G. Steiner∗ , and B. Brandstätter †<br />
∗Institute <strong>of</strong> Electrical Measurement and Measurement Signal Processing, <strong>Graz</strong> <strong>University</strong> <strong>of</strong> <strong>Technology</strong>,<br />
Kopernikusgasse 24/4, A-8010 <strong>Graz</strong>, Austria, E-mail: neumayer@TU<strong>Graz</strong>.at<br />
† Elin Motoren GmbH, Elinmotorenstrasse 1, A-8160 Preding/Weiz, Austria<br />
Abstract—The working principles <strong>of</strong> optimization algorithms <strong>of</strong>fer several characteristics which naturally arise in state<br />
estimation, or more generally when dealing with state space systems. In this paper we will treat similarities between the<br />
two disciplines and show how concepts <strong>of</strong> state estimation, including the incorporation <strong>of</strong> model uncertainty information,<br />
can be used in optimization.<br />
Index Terms—optimization, state space methods<br />
I. INTRODUCTION<br />
Numerical optimization is generally referred to as<br />
solving a problem <strong>of</strong> form [1]<br />
x ∗ = argminΨ(x) (1)<br />
s.t. C(x) ≤ 0, (2)<br />
where Ψ:R N → R 1 is called the objective function<br />
and the vector x ∈ R N contains the variables <strong>of</strong> interest.<br />
Possible constraints on the vector x are formulated by<br />
the vectorial function C(x) as a set <strong>of</strong> equalities and<br />
inequalities. By this the space <strong>of</strong> the feasible solutions<br />
becomes a subspace <strong>of</strong> R N .<br />
An enormous variety <strong>of</strong> algorithms and solution<br />
strategies for such problems has been the output <strong>of</strong><br />
research activities in the past years. Yet it has to be<br />
mentioned that the presented form <strong>of</strong> the optimization<br />
problem is only a part <strong>of</strong> actual problems. I.e. the<br />
discipline <strong>of</strong> multi objective optimization asks for an<br />
optimal solution given several objective functions Ψi [2].<br />
Such formulations are <strong>of</strong> importance in multi physical<br />
problem scenarios. Another important class are robust<br />
optimization approaches which aim for a stable solution<br />
under the scenario <strong>of</strong> uncertainty or tolerances in the<br />
objective function [3]. Thus, existing manufacturing<br />
tolerances can be incorporated to optimization based<br />
design process. A general distinction between the number<br />
<strong>of</strong> the different (iterative) optimization algorithms used<br />
today can be made by separating them into deterministic<br />
and stochastic methods.<br />
Deterministic methods most <strong>of</strong>ten make use <strong>of</strong> gradient<br />
and curvature information <strong>of</strong> the objective function Ψ<br />
in order to efficiently detect the minimum. Hereby efficiency<br />
is typically defined by the number <strong>of</strong> evaluations<br />
<strong>of</strong> Ψ. Classical first and second order deterministic methods<br />
try to minimize Ψ by determining a descent direction<br />
out <strong>of</strong> gradient or gradient and Hessian information. I.e.<br />
the classical steepest descent method uses the iteration<br />
xk+1 = xk − sg(xk), (3)<br />
to find x∗ in a step by step approach. Hereby<br />
g(xk) = ∇Ψ is defined as the gradient <strong>of</strong> Ψ with<br />
respect to the elements <strong>of</strong> xk. Due to this nature the<br />
result <strong>of</strong> deterministic methods can be strongly affected<br />
by the starting point x0. Also local minima <strong>of</strong> Ψ will<br />
result in a termination <strong>of</strong> the algorithm before the global<br />
minima is found. Stochastic methods rely on some<br />
sort <strong>of</strong> randomness to explore the parameter space in<br />
search for the minimum. A main difference with respect<br />
to deterministic methods is their ability to overcome<br />
local minima <strong>of</strong> the objective function. For stochastic<br />
optimization algorithms it has become common to let<br />
them run for a certain time or a number <strong>of</strong> evaluations <strong>of</strong><br />
Ψ. Of course, hybrid algorithms have been proposed to<br />
combine the advantages <strong>of</strong> the two classes <strong>of</strong> algorithms.<br />
Although we have only pointed out the basics <strong>of</strong> some<br />
fundamental concepts <strong>of</strong> optimization we can observe,<br />
that in all algorithms some kind <strong>of</strong> evolution from the<br />
vector xk to the vector xk+1 occurs. Modern system<br />
theory uses so called state space models as a unified<br />
framework to describe dynamical systems [4]. The general<br />
form <strong>of</strong> a discrete time, nonlinear, time-variant state<br />
space model is given by<br />
xk+1 = F k(xk)+Bk(uk)+wk, (4)<br />
yk = Hk(xk)+vk, (5)<br />
where F k : R N → R N presents the system dynamics,<br />
Bk : R L → R N describes the affect onto xk+1 due<br />
to an input term uk ∈ R L , and Hk : R N → R M<br />
describes a measurement process. The terms wk ∈ R N<br />
and vk ∈ R M are referred to as process noise and<br />
measurement noise. We observe some similarities<br />
between the state space concept and the topics discussed<br />
in concern with optimization. Yet we have to say that<br />
state space methods and models follow a quite organized
scheme.<br />
In this paper we will point out similarities between optimization<br />
techniques and state space models and methods.<br />
The paper is structured as follows. In section II we<br />
give a short introduction about the state space concept.<br />
In section II we review optimization techniques in the<br />
sense <strong>of</strong> state space methods and present similarities<br />
as well as mathematical tools potential for a general<br />
description. Section IV lists several state space techniques<br />
which relate with topics <strong>of</strong> optimization and thus could<br />
potentially be used to improve optimization. Finally we<br />
present an exemplary hybrid optimization scheme which<br />
we derive from the state space view and demonstrate its<br />
behavior using some <strong>of</strong> the suggested approaches.<br />
II. THE STATE SPACE CONCEPT IN MORE DETAIL<br />
uk Bk<br />
wk<br />
xk+1<br />
z −1<br />
F k<br />
Hk<br />
Fig. 1. Diagram <strong>of</strong> the state space model given by equation (4) and (5).<br />
Figure 1 depicts the structure <strong>of</strong> a state space model<br />
given by equations (4) and (5). The core purpose <strong>of</strong> a<br />
state space model is to describe the evolution <strong>of</strong> the<br />
state vector x over the time or the discrete time steps,<br />
respectively. As can be observed by the equations (4)<br />
and (5) or by figure 1, this evolution is determined<br />
by a deterministic drift due to the dynamics <strong>of</strong> F k<br />
and the input uk and a stochastic diffusion due to<br />
the process noise wk. The system is referred to be an<br />
autonomous system if B is zero. The function Hk<br />
provides a deterministic measure about the internal state.<br />
In addition the measurement noise vk acts as an additive<br />
disturber. We can already observe that the state space<br />
concept is able to provides several aspects which we<br />
pointed out in the introductional part about optimization<br />
algorithms in a natural way.<br />
It should be mentioned that a state space model is<br />
referred to be linear if all system components are matrices.<br />
This class is <strong>of</strong> large importance as many technical<br />
processes can be described by this.<br />
A. State Space Methods<br />
In this part <strong>of</strong> section II we want to give a<br />
short introduction about two important disciplines in<br />
association with state space models. These are state<br />
estimation and state control.<br />
State estimation is referred to the task to find an<br />
estimate ˆxk <strong>of</strong> the state xk using the measurements y k ,<br />
the input uk and the model.<br />
vk<br />
y k<br />
- 68 - 15th IGTE Symposium 2012<br />
State control is a special kind <strong>of</strong> feedback control<br />
where the system input uk is formed as a function <strong>of</strong><br />
the state vector xk. A notable controller out <strong>of</strong> this class<br />
is the dead beat control system. This approach enables<br />
a control system to reach the steady state within a finite<br />
number <strong>of</strong> iterations.<br />
III. OPTIMIZATION AND STATE SPACE METHODS<br />
Looking onto all points discussed so far we can consider<br />
a relation between the measurement function H k<br />
and the objective function Ψ. I.e. for a design problem<br />
where one is interested to meet a desired output yd, Ψ<br />
could be <strong>of</strong> form<br />
Ψ(x) =(H k(x) − y d ) T W (Hk(x) − y d ) . (6)<br />
Hereby the positive definite matrix W presents a weighting<br />
matrix. From a system theoretic point <strong>of</strong> view we<br />
could consider the function Ψ as a (nonlinear) control<br />
plant <strong>of</strong> MISO (multiple input single output) type.<br />
A. Classical Deterministic Methods Reviewed<br />
Classical deterministic optimization methods like the<br />
already mentioned steepest descent method (see equation<br />
(3)) take use <strong>of</strong> local gradient or curvature information<br />
<strong>of</strong> the function Ψ. While the steepest descent algorithm<br />
just takes use <strong>of</strong> the gradient information the well known<br />
Gauss-Newton (GN) method defined by<br />
xn+1 = xn + sG −1<br />
k gk, (7)<br />
takes use <strong>of</strong> the Hessian G matrix which provides curvature<br />
information about Ψ to improve the convergence<br />
behavior. In both schemes, the steepest descent method<br />
and the GN method, the system matrix F isgivenbythe<br />
identity matrix I. For objective functions Ψ <strong>of</strong> form (6),<br />
the practical realization <strong>of</strong> the GN method is given by<br />
xn+1 = xn − s(JJ T ) −1 Jr (8)<br />
where J is the Jacobian <strong>of</strong> the system H with respect<br />
to the state vector x. Herebyr =(y − y d) defines the<br />
residual vector <strong>of</strong> the output <strong>of</strong> F with respect to y d .<br />
The gradient g = ∇xΨ <strong>of</strong> the objective function (6) with<br />
respect to x (to keep the notation short we set the matrix<br />
W to be the identity matrix I) isgivenasg = J(y−y d)<br />
and (JJ T ) approximates the Hessian G [1].<br />
<br />
JJ T −1<br />
z −1<br />
I<br />
−sJ<br />
H k<br />
Fig. 2. State space representation <strong>of</strong> a second order scheme.<br />
y k<br />
−y d<br />
Figure 2 depicts the GN scheme as a control system for<br />
the objective function (plant) Ψ following equation (8).<br />
For the steepest descent method the matrix B is replaced<br />
by the identity matrix I. Note, that all matrices depend
on the iteration index k. A control system with this<br />
property is referred to as a time varying control system.<br />
We observe that neither the steepest descent algorithm<br />
nor the GN method are state space control systems as<br />
these methods do not take use <strong>of</strong> the state vector x.<br />
However, we can observe a closed loop scheme in figure<br />
2. It is hard to argue whether we see the steepest descent<br />
algorithm as a drive system or as a closed loop control<br />
system as for B = −g and s replacing the input u<br />
(scalar) no closed loop is required. However, with respect<br />
to the different input matrix B the powerfulness <strong>of</strong> the<br />
GN-method becomes clear from a system theoretic point<br />
<strong>of</strong> view.<br />
B. Stochastic Methods Reviewed<br />
With the availability <strong>of</strong> more and more computational<br />
power stochastic optimization methods have become<br />
<strong>of</strong> increased interest for many practical problems.<br />
Interesting issues for the application <strong>of</strong> stochastic<br />
methods is their ability to overcome local minima, and<br />
the not given necessity for derivative information. This<br />
is <strong>of</strong> concern for not differentiable or not continuous<br />
problems<br />
In contrast to deterministic methods, stochastic<br />
methods most <strong>of</strong>ten rely on a set <strong>of</strong> N individual vectors<br />
x N which explore the objective function on their own.<br />
Over the time a mutual exchange <strong>of</strong> information from<br />
the the different realizations x N is performed which<br />
mixes the individuals. Concepts about the individual<br />
exploration <strong>of</strong> each individual on Ψ as well as the<br />
exchange <strong>of</strong> mutual information between the individuals<br />
is <strong>of</strong>ten based on concepts <strong>of</strong> nature like evolution<br />
principles resulting in the class <strong>of</strong> genetic algorithms<br />
(GA). I.e. certain elements <strong>of</strong> two arbitrarily selected<br />
vectors xi and xj are exchanged, replaced by a weighted<br />
mean or just individually disturbed by a random variable.<br />
A contrastable aspect with respect to the behavior <strong>of</strong><br />
deterministic methods is the fact, that the combining<br />
principles do not automatically remove the weakest<br />
individual (the realization with the highest value <strong>of</strong> Ψ).<br />
Instead also the strongest individual can be removed by<br />
some random procedure. Exactly this property enables<br />
the behavior that stochastic methods can overcome local<br />
minima. Other well known strategies for stochastic<br />
optimization are particle swarm optimization (PSO),<br />
nitching evolution techniques or differential evolution<br />
(DE) [5]. Also the behavior <strong>of</strong> an ant colony or bacteria<br />
in a nutrient solution [6] have been used as strategies to<br />
find a solution minimizing Ψ.<br />
The enormous variety <strong>of</strong> differently labeled stochastic<br />
algorithms [7] makes it <strong>of</strong>ten hard to distinguish the<br />
differences between. More important it is hard to<br />
charge the efficiency <strong>of</strong> the different methods and their<br />
suitability for different applications. In the following we<br />
will provide an approach to present several aspects <strong>of</strong><br />
- 69 - 15th IGTE Symposium 2012<br />
stochastic optimization within the unified framework <strong>of</strong><br />
state space techniques.<br />
In state space models randomness has the unified<br />
entrance into the system formulated by the process noise<br />
w. By setting the deterministic input vector u to zero the<br />
resulting system becomes an autonomous system. While<br />
different stochastic optimization strategies are originated<br />
by more or less random inspiritments, system theory<br />
takes use <strong>of</strong> probabilistic methods to describe the behavior<br />
in concern with randomness. Hereby any random<br />
process is described by a probability density function<br />
(pdf) denoted by π(·). The mathematical framework used<br />
to describe stochastic behavior is based on Bayes law<br />
π(x|y) = π(y|x)π(x)<br />
∝ π(y|x)π(x), (9)<br />
π(y)<br />
where π(y|x) is referred to as the likelihood function<br />
and π(x) is referred to as prior. The evidence π(y) has<br />
the role <strong>of</strong> a normalization constant and can be skipped<br />
leading to the right hand side formula <strong>of</strong> the posterior<br />
distribution π(x|y). The likelihood function provides a<br />
probability measure for x originating a certain output y.<br />
The prior π(x) gives a probability statement about x.<br />
We can already link this concepts to the optimization<br />
problem given by equation (1) and the constraints<br />
formulated in equation (2), as the likelihood function<br />
obviously is able to express C(x) by becoming zero for<br />
infeasible solutions. However, this concept also enables<br />
the possibility <strong>of</strong> a continuous measure for the state x,<br />
i.e. we can incorporate ”gray regions” for the solution.<br />
The understanding <strong>of</strong> the likelihood is maybe not that<br />
obvious. For easier explanation we write the likelihood<br />
corresponding to the objective function (6) as<br />
<br />
π(yd |x) ∝ exp − (Hk(x) − yd ) T <br />
W (H k(x) − yd ) .<br />
(10)<br />
The likelihood function is the exponential <strong>of</strong> the negative<br />
objective function but it states Ψ as a probability measure.<br />
It has to be noted that 0 < <br />
N<br />
exp(−Ψ(x))dx < ∞<br />
has to hold in the Lebesgue sense, to form a likelihood<br />
function from an objective function. Such a formulation<br />
is known from simulated annealing (SA). Hereby a<br />
stochastic algorithm seeks for the modes (maxima) <strong>of</strong><br />
the function exp(−Ψ(x)/T ), where T is an artificial<br />
temperature which decreases over time. Note, that due<br />
to the temperature T , SA is different with respect to<br />
Bayesian inference as the likelihood has a physical<br />
meaning where no term like T occurs. Given all these<br />
aspects stochastic optimization can be fully seen in the<br />
context <strong>of</strong> state estimation and we can work out some<br />
conceptual ideas that are used in state estimation in the<br />
next section.<br />
The exchange <strong>of</strong> mutual information depends on a so<br />
called resampling scheme which stays outside the state<br />
space model. While different stochastic methods have<br />
brought up a variety <strong>of</strong> exchange schemes also state
estimation methods have brought up unified methods like<br />
residual, stratified, or systematic resampling, etc. [8]. We<br />
will not focus on these aspects <strong>of</strong> stochastic optimization<br />
methods at this point, but we will provide a description<br />
about the stochastic diffusion <strong>of</strong> states in the state space<br />
view <strong>of</strong> optimization.<br />
While deterministic methods select the state update<br />
from gradient or curvature information in order to decrease<br />
Ψ and thus follow strictly deterministic rules,<br />
the probabilistic change is summarized by means <strong>of</strong><br />
pdf’s π(·). State space theorists have developed the<br />
ChapmanKolmogorov equation<br />
<br />
π(xk|yd )= π(xk|xk−1)π(xk−1|yd )dxk−1, (11)<br />
R N<br />
to provide a probabilistic measure about the state evolution<br />
given the current state and its posterior. While<br />
equation (11) is not hard to derive using the mathematical<br />
tool <strong>of</strong> marginalization, it provides two interesting insides<br />
about the update in stochastic optimization methods.<br />
• The state update is described by π(xk|xk−1) and<br />
does not depend on the current value <strong>of</strong> the objective<br />
function.<br />
• The update probability depends on π(xk−1|y d ),but<br />
there is no guarantee that xk−1 will be changed.<br />
The transition kernel π(xk|xk−1) describes the probability<br />
<strong>of</strong> the state exchange from state xk−1 to the state xk.<br />
A remarkable point about this formulation is the fact, that<br />
the update is independent from the current value <strong>of</strong> Ψ or<br />
the posterior. This is an important fact that explains the<br />
powerfulness <strong>of</strong> stochastic methods. If the kernel would<br />
depend on Ψ, stochastic methods would end with the<br />
same stalling behavior in local minima as deterministic<br />
methods do, as then a deterministic drift is present.<br />
In most cases the kernel π(xk|xk−1) is even reduced<br />
to π(xk). The pdf π(xk−1|y d) in equation (11) induces<br />
another important principle in stochastic optimization<br />
which can be directly connected to the mutual information<br />
exchange. It states, that the update <strong>of</strong> the state due<br />
to the proposal kernel is not guaranteed. Instead we can<br />
see the result π(xk|y d) only provides a relative number<br />
for the new state π(xk) to be accepted.<br />
IV. STATE SPACE METHODS FOR OPTIMIZATION<br />
In this section we want to discuss some more state<br />
space concepts and their use for stochastic optimization.<br />
We have selected these methods as we see them<br />
to be important with nowadays needs. State estimation<br />
techniques are among the algorithms which have seen<br />
one <strong>of</strong> the strongest developments in the past decades.<br />
The early origin was given by the Apollo space flight<br />
programm in the 1960’s where the Kalman filter has<br />
seen it’s breakthrough. Since then both, single point<br />
and population-based methods have been developed, to<br />
regain knowledge from the hidden states <strong>of</strong> a system<br />
given the actually observed function values for an optimal<br />
designed objective function in order to recover x from<br />
- 70 - 15th IGTE Symposium 2012<br />
noisy observations. In this sense we first have to discuss<br />
the meaning <strong>of</strong> the likelihood function π(x|y d) in more<br />
detail. Following the definition <strong>of</strong> a multivariate Gaussian<br />
random variable y<br />
y ∝ exp −(y − μ) T Σ −1 (y − μ) , (12)<br />
where μ expresses the mean and Σ is the covariance<br />
matrix, we observe, that the likelihood function has the<br />
mean <strong>of</strong> a Gaussian distribution expressing uncertainty<br />
about y. In this sense the measurement noise v becomes<br />
relevant for a first as the likelihood function expressed<br />
this noise in terms <strong>of</strong> a probability measure. This will<br />
lead us directly to the aspects brought in the following<br />
subsection.<br />
The consequent use <strong>of</strong> this approach brought up powerful<br />
stochastic state estimation algorithms like state observers,<br />
sequential Monte Carlo methods like the already<br />
mentioned Kalman filter or Particle filters, or even more<br />
powerful Markov chain Monte Carlo (MCMC) methods.<br />
A. Enhanced Error Model<br />
A matter <strong>of</strong> concern with the solution <strong>of</strong> physical<br />
motivated optimization problems are the computational<br />
costs in concern with the evaluation <strong>of</strong> the objective<br />
function Ψ. This especially holds if the underlying problem<br />
requires the solution <strong>of</strong> partial differential equations<br />
(PDE’s) which has to be done by numerical methods<br />
like the finite element method (FEM). Recently the<br />
use <strong>of</strong> approximation techniques has become popular in<br />
both, state estimation and optimization [9], [10]. Hereby<br />
the computational costly evaluation <strong>of</strong> H k is replaces<br />
by a cheap approximation or surrogate function H ∗ k.<br />
Subsequently this leads to the cost function Ψ∗ due to<br />
the approximation error<br />
e = H ∗ k − H k. (13)<br />
We can reformulate the relation between H k and H ∗ k to<br />
H ∗ k = Hk +(H ∗ k − H k) =Hk + e. (14)<br />
This is an interesting formulation as we can look on<br />
the approximation error e as an additive error similar to<br />
the measurement noise v depicted in figure 1. Although<br />
the approximation error e depends on the state x, and<br />
thus is a deterministic error, we can think about a<br />
probabilistic description about e in the concept <strong>of</strong> a<br />
Gaussian distribution. This is an approach <strong>of</strong>ten taken<br />
in several fields <strong>of</strong> state estimation and system theory.<br />
It ends up exactly in the idea covered by the so called<br />
enhanced error model [11]. Although the approximation<br />
error e is <strong>of</strong> deterministic nature a probabilistic model is<br />
built from samples about the state space R N . Then the<br />
likelihood function π ∗ (y d|x) becomes<br />
π ∗ (y d|x) ∝ exp −(y ∗ − y d + μ e) T Σ −1<br />
e (y ∗ − y d + μ e) ,<br />
(15)<br />
and the optimization can be performed on this<br />
computational less costly function. Given the degree
<strong>of</strong> accuracy <strong>of</strong> the approximation H ∗ k the solution can<br />
be seen as good as a solution obtained by Hk, orthe<br />
approximation approach can be used to find a good<br />
initial solution which can be refined in less optimization<br />
steps using the accurate model.<br />
In general the determination <strong>of</strong> the mean μ e and the<br />
covariance matrix Σe requires a large number <strong>of</strong> samples.<br />
However, during the setup and model testing phase for the<br />
optimization problem typically enough data is generated<br />
to describe e in the presented way.<br />
B. Hybrid Schemes<br />
Another useful aspect about the use <strong>of</strong> state space<br />
schemes for optimization is the natural possibility to incorporate<br />
both, deterministic and stochastic methods for<br />
building hybrid optimization schemes. This can be easily<br />
done by enabling the input vector uk and building an<br />
outer feedback system as discussed in subsection III-A.<br />
The natural representation <strong>of</strong> the interaction between<br />
the deterministic drift and the stochastic interaction is<br />
therefore <strong>of</strong> interest, as it illustrates the powerfulness<br />
<strong>of</strong> the combination. I.e. if only some elements <strong>of</strong> the<br />
gradient g are available because the function Ψ is not<br />
steady with respect to this variables, we can only use<br />
the available gradient information for the input vector<br />
u. The other components <strong>of</strong> x are updated by the<br />
stochastic algorithm. In addition, an outer resampling<br />
scheme retains the property <strong>of</strong> a stochastic optimization<br />
scheme to overcome local minima.<br />
C. Robust Schemes<br />
Uncertainty is in many aspects a concerning topic in<br />
state estimation. This is given due to the fact, that models<br />
<strong>of</strong>ten do not cover all physical aspects due to reduction.<br />
Also optimization engineers have developed robust target<br />
functions in order to find solutions insensitive with<br />
respect to parameter variations <strong>of</strong> x [3]. Such robust<br />
objective functions are typically <strong>of</strong> form<br />
min max Ψ(x, ξ), (16)<br />
x ξ<br />
where ξ describes an immanent given uncertainty in the<br />
parameters. Most <strong>of</strong>ten the absolute value <strong>of</strong> ξ is limited.<br />
Robust state estimation has brought up the H∞ concept<br />
[4], where the estimation error e = x − ˆx is minimized<br />
using an approach <strong>of</strong> form<br />
min max J (x, ˆx, v, w). (17)<br />
ˆx v,w<br />
Hereby no limitations about the process noise w and<br />
the measurement noise v are assumed. The H∞ filter<br />
seeks for the best estimate under worst case conditions.<br />
Mostly game theoretic approaches are used to formulate<br />
the function J . We pointed this out, as control scientist<br />
have gained a lot <strong>of</strong> experience in the field and there<br />
might be useful aspects for optimization.<br />
- 71 - 15th IGTE Symposium 2012<br />
V. A NUMERICAL EXAMPLE<br />
To provide a numerical example about the presented<br />
considerations <strong>of</strong> state space methods for optimization<br />
we want to present a simple optimization problem<br />
consisting <strong>of</strong> an inverse problem for a resistor network<br />
example. Figure 3(a) depicts the resistor network under<br />
investigation. The black lines illustrate resistors with<br />
a value <strong>of</strong> R1 = 1Ω. The gray colored lines mark a<br />
circular disk <strong>of</strong> radius r where resistors with a value <strong>of</strong><br />
R2 are placed. Hereby the mapping between the circle<br />
radius and the resistor values is discontinuous by the<br />
way, that the resistor has to be fully placed inside the<br />
circle. It is now aim to find the radius r <strong>of</strong> the circle<br />
and the resistor value R2 from some electrical boundary<br />
measurements. These measurement built the vector yd. A problem <strong>of</strong> this kind is a classical inverse problem<br />
where we aim on the determination <strong>of</strong> the state vector<br />
x = T r R2 from measurements yd.<br />
Figure 3 exemplary depicts a part <strong>of</strong> the cost function.<br />
Hereby the R2 and r were set to R2 =0.5Ω and r =<br />
0.5m. The corner length was set to r =1mand was<br />
discretized by 40 resistors. A current is injected at the<br />
upper left corner and 5 equidistant measurement points<br />
(ampere meters) are connected to the lower edge.<br />
(a) Resistor network.<br />
0.7<br />
0.6<br />
0<br />
x 10<br />
8<br />
7<br />
6<br />
5<br />
4<br />
3<br />
2<br />
1<br />
−4<br />
Fig. 3. Test example and objective function.<br />
Ψ<br />
0.5<br />
0.4<br />
r (m)<br />
0.55<br />
0.5<br />
0.45<br />
0.3<br />
0.2 0.4<br />
R (Ω)<br />
2<br />
(b) Objective function Ψ.<br />
We now want to apply a hybrid optimization approach<br />
where the corner points about the algorithm can be stated<br />
by the following:<br />
• We use a population based scheme.<br />
• We use gradient information about the resistor<br />
value R2.<br />
• We work on a reduced model (only half the number<br />
<strong>of</strong> resistor elements per edge).<br />
In state estimation such an algorithm belongs to the class<br />
<strong>of</strong> sequential Monte Carlo (SMC) methods and is mostly<br />
referred to as Particle filter (PF) [12].<br />
Arguable one <strong>of</strong> the most interesting points in this list<br />
is the use <strong>of</strong> a reduced model to solve the optimization<br />
problem. Figure 4(a) depicts the objective function (6)<br />
(W was set to be the identity matrix) when using the<br />
reduced model for solving the optimization problem with<br />
data from the fine model. One can obtain, that the depicted<br />
part <strong>of</strong> the objective function does not even include<br />
a minima. Figure 4(b) depicts the likelihood <strong>of</strong> form (15),<br />
using an enhanced error model. As we can see, the point
where the likelihood function has its maxima presents<br />
the true solution. Thus, if our optimization algorithm is<br />
designed to minimize the corresponding objective function<br />
is should be possible to find the solution although<br />
working on the reduced model. Figure 5 depicts the<br />
Ψ *<br />
0.2<br />
0.15<br />
0.1<br />
0.05<br />
0<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
r (m)<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
R (Ω)<br />
2<br />
(a) Objective function Ψ ∗ .<br />
π(r,R 2 )<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
r (m)<br />
0.3<br />
0.4<br />
R 2 (Ω)<br />
(b) Posteriori probability π ∗ (r, R2).<br />
Fig. 4. Determination <strong>of</strong> r and R2 using a reduced model.<br />
behavior and the result <strong>of</strong> the proposed hybrid scheme<br />
for the given problem using the reduced model for the<br />
solution. Figure 5(a) depicts the state <strong>of</strong> the population.<br />
As can be seen, the population is clustered around the<br />
correct solution. Hereby the background color depicts<br />
the objective function for the fine model but as stated<br />
the coarse model is used! Figure 5(b) and figure 5(c)<br />
depict the decrease <strong>of</strong> the objective function and the<br />
increase <strong>of</strong> the likelihood function, respectively. The dots<br />
illustread the spread <strong>of</strong> the population. As can be seen<br />
both, the likelihood function and the objective function<br />
can become smaller or larger, respectively. Thus, the<br />
property <strong>of</strong> stochastic methods is given.<br />
Ψ(r,R 2 )<br />
0.035<br />
0.03<br />
0.025<br />
0.02<br />
0.015<br />
0.01<br />
0.005<br />
r (m)<br />
0.7<br />
0.65<br />
0.6<br />
0.55<br />
0.5<br />
0.45<br />
0.4<br />
0.35<br />
0<br />
1 2 3 4 5 6<br />
Iteration<br />
7 8 9 10<br />
(b) Objective function.<br />
0.3<br />
0.3 0.4 0.5 0.6 0.7<br />
R (Ω)<br />
2<br />
(a) Particles.<br />
Fig. 5. Output <strong>of</strong> the particle filter.<br />
π(r,R 2 )<br />
1<br />
0.9<br />
0.8<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
0.2<br />
0.1<br />
0<br />
1 2 3 4 5 6<br />
Iteration<br />
7 8 9 10<br />
(c) Posteriori probability.<br />
VI. OUTLOOK<br />
In this paper we demonstrated a state space view on<br />
optimization algorithms. Both, deterministic and stochastic<br />
methods were exploited in the content <strong>of</strong> the unified<br />
state space representation. We demonstrated that<br />
0.5<br />
0.6<br />
0.7<br />
- 72 - 15th IGTE Symposium 2012<br />
deterministic approaches can be considered as standard<br />
feedback systems, whereas stochastic methods can be<br />
directly linked to state estimation. We explored features<br />
<strong>of</strong> stochastic state estimation using Bayes law and subsequently<br />
demonstrated the usefulness <strong>of</strong> state estimation<br />
techniques for optimization. All our considerations are<br />
summarized in a hybrid optimization algorithm working<br />
on a reduced model where we demonstrated the natural<br />
interaction <strong>of</strong> deterministic and stochastic methods using<br />
state space descriptions. Further research will focus in<br />
two directions. First, we would extend the presented<br />
hybrid scheme using some more sophisticated methods.<br />
Second we consider work on a formal description <strong>of</strong><br />
different stochastic algorithms using methods from probability<br />
theory.<br />
REFERENCES<br />
[1] R. Fletcher, Practical Methods <strong>of</strong> Optimization; (2nd Ed.), Wiley-<br />
Interscience, New York, USA, 1987.<br />
[2] L. dos Santos Coelho and P. Alotto, Multiobjective Electromagnetic<br />
Optimization Based on a Nondominated Sorting Genetic Approach<br />
With a Chaotic Crossover Operator, IEEE Transactions on Magnetics,<br />
vol.44, no.6, pp.1078-1081, 2008.<br />
[3] P. Alotto, C. Magele, W. Renhart, A. Weber, G. Steiner Robust<br />
target functions in electromagnetic design, COMPEL: The International<br />
Journal for Computation and Mathematics in Electrical<br />
and Electronic Engineering, Vol. 22 Iss: 3, pp.549 - 560, 2003.<br />
[4] D. Simon, Optimal state estimation, Kalman, H∞ and nonlinear<br />
approaches, Wiley - Interscience, John Wiley & Sons, Inc., New<br />
Jersey, 2006.<br />
[5] R. Storn and K. Price, Differential evolution - a simple and efficient<br />
heuristic for global optimization over continuous spaces, Journal<br />
<strong>of</strong> Global Optimization 11: pp.341-359, 1997.<br />
[6] L. dos Santos Coelho, C. da Costa Silveira, C.A. Sierakowski, and<br />
P. Alotto, Improved Bacterial Foraging Strategy Applied to TEAM<br />
Workshop Benchmark Problem, IEEE Transactions on Magnetics,<br />
vol.46, no.8, pp.2903-2906, Aug. 2010.<br />
[7] O. Hajji, S. Brisset, and P. Brochet, Comparing stochastic optimization<br />
methods used in electrical engineering, Systems, Man<br />
and Cybernetics, 2002 IEEE International Conference on , vol.7,<br />
no., pp. 6 pp. vol.7, 6-9 Oct. 2002.<br />
[8] R. Douc, O. Cappe, and E. MoulinesComparison <strong>of</strong> resampling<br />
schemes for particle filtering, In 4th International Symposium on<br />
Image and Signal Processing and Analysis (ISPA), pp.64-69, 2005.<br />
[9] Albunni, M.N.; Rischmuller, V.; Fritzsche, T.; Lohmann, B.; ,<br />
Multiobjective Optimization <strong>of</strong> the Design <strong>of</strong> Nonlinear Electromagnetic<br />
Systems Using Parametric Reduced Order Models, IEEE<br />
Transactions on Magnetics, vol.45, no.3, pp.1474-1477, March<br />
2009.<br />
[10] A. I. Forrester, A. Sóbester and A. J. Keane, Engineering Design<br />
via Surrogate Modelling A Practical Guide, Wiley, 2008.<br />
[11] J. P. Kaipio and E. Somersalo, Statistical and computational<br />
inverse problems, New York: Applied Mathematical Sciences,<br />
Springer, 2004.<br />
[12] M.S. Arulampalam, S. Maskell, N. Gordon, and T. Clapp,<br />
A tutorial on particle filters for online nonlinear/non-Gaussian<br />
Bayesian tracking IEEE Transactions on Signal Processing 50 (2),<br />
pp.174188, 2002.
- 73 - 15th IGTE Symposium 2012<br />
Quasi TEM Analysis <strong>of</strong> 2D Symmetrically Coupled<br />
Strip Lines with Finite Grounded Plane using HBEM<br />
*Saša S. Ilić, *Mirjana T. Perić, *Slavoljub R. Aleksić and *Nebojša B. Raičević<br />
*<strong>University</strong> <strong>of</strong> Niš, Faculty <strong>of</strong> Electronic Engineering <strong>of</strong> Niš, Aleksandra Medvedeva 14, 18000 Niš, Serbia<br />
E-mail: sasa.ilic@elfak.ni.ac.rs<br />
Abstract—The hybrid boundary element method (HBEM), based on combination between equivalent electrodes method<br />
(EEM) and boundary element method (BEM), is applied for characteristic parameters determination <strong>of</strong> symmetrically coupled<br />
strip lines with a finite width grounded plane. Even and odd modes are considered in the paper. All results for the characteristic<br />
impedance and the effective dielectric permittivity are compared with the finite element method (FEM).<br />
Index Terms—Characteristic impedance, Equivalent Electrodes Method (EEM), Finite Element Method (FEM), Hybrid<br />
Boundary Element Method (HBEM).<br />
strip lines parameters, when the strip line is above an infinite-width<br />
grounded plane [14]. The HBEM can be also<br />
applied to analysis <strong>of</strong> corona effects [15] and metamaterial<br />
structures [16]. A problem <strong>of</strong> symmetrically<br />
coupled strip line placed above infinite grounded plane is<br />
investigated in [17].<br />
The HBEM is applied, in this paper, to calculate the<br />
even- and odd- mode characteristic impedance <strong>of</strong> 2D<br />
symmetrically coupled strip lines with finite grounded<br />
plane, shown in Fig. 1. The quasi TEM analysis is used in<br />
this paper.<br />
I. INTRODUCTION<br />
Over the years, many authors have analyzed<br />
symmetrically and asymmetrically coupled or ordinary<br />
strip lines with width-limited dielectric substrate using<br />
numerous numerical and analytical methods [1]-[9]. The<br />
variational method [1], the Garlekin’s method, the method<br />
<strong>of</strong> moments [2]-[4], the boundary element method [5], the<br />
conformal mapping, the moving perfect electric wall<br />
method [6]-[9] etc. are some <strong>of</strong> the commonly used<br />
methods. On the other side, the problem <strong>of</strong> the widthlimited<br />
microstrip grounded plane has not been so <strong>of</strong>ten<br />
investigated, although these forms <strong>of</strong> microstrips are<br />
typical in practice. In [7]-[9] the microstrip line with<br />
finite-width dielectric and grounded plane was analyzed.<br />
A so-called moving perfect electric wall method (MPEW)<br />
in conjunction with the conformal mapping method<br />
(CMM) was applied in those papers.<br />
An application <strong>of</strong> boundary element method (BEM)<br />
usually contains singular and nearly singular integrals<br />
whose evaluation is difficult although original problems<br />
are not singular. In order to avoid numerical integrations,<br />
it is possible to substitute small boundary segments by<br />
total charges placed at their centres. The Green’s function<br />
for the electric scalar potential <strong>of</strong> the charges, placed in<br />
the free space at the boundary <strong>of</strong> two dielectrics, is used<br />
and the proposed method is called the hybrid boundary<br />
element method (HBEM) [10-17].<br />
This method presents a combination <strong>of</strong> BEM and<br />
equivalent electrodes method (EEM). The basic idea is in<br />
replacing an arbitrary shaped electrode by equivalent<br />
electrodes (EEs), and an arbitrary shaped boundary<br />
surface between any two dielectric layers by discrete<br />
equivalent total charges per unit length placed in the air.<br />
The basic Green’s function for the electric scalar potential<br />
<strong>of</strong> the charges placed in the free space at the boundary<br />
surface <strong>of</strong> two dielectrics is used. The method is based on<br />
the EEM, on the point-matching method (PMM) for the<br />
potential <strong>of</strong> the perfect electric conductor (PEC)<br />
electrodes and for the normal component <strong>of</strong> the electric<br />
field at the boundary surface between any two dielectric<br />
layers.<br />
The HBEM is applied, until now, to solving<br />
multilayered electromagnetic problems [10], grounding<br />
systems [12], electromagnetic field determination in vicinity<br />
<strong>of</strong> cable terminations [13], as well as to calculation <strong>of</strong><br />
Figure 1: Symmetrically coupled strip line with finite grounded plane.<br />
Symmetrically coupled strip lines can be used as basic<br />
elements for filters, phase shifters, directional couplers,<br />
baluns and combiners [18].<br />
II. THEORETICAL BACKGROUND<br />
The HBEM is applied and corresponding model is formed,<br />
Fig. 2.<br />
Figure 2: Corresponding HBEM model.
Using the existing symmetry, the electric scalar potential<br />
<strong>of</strong> whole system from Fig. 2 is determined:<br />
(e (e, o)<br />
2<br />
B ln l<br />
4<br />
B ln l<br />
3<br />
B ln l<br />
Ki<br />
A<br />
i 1 k 1<br />
( x<br />
Ki<br />
i 3 k 1<br />
( x<br />
Mi<br />
i 1 m 1<br />
( x<br />
q<br />
0<br />
d<br />
ik<br />
2<br />
q<br />
2<br />
2<br />
dik<br />
a<br />
ik<br />
q<br />
x<br />
x<br />
0<br />
a ik<br />
t<br />
im<br />
x<br />
0<br />
t im<br />
)<br />
)<br />
ln<br />
)<br />
2<br />
2<br />
2<br />
ln<br />
ln<br />
( x<br />
( y<br />
( x<br />
( y<br />
( y<br />
( x<br />
x<br />
y<br />
dik<br />
y<br />
y<br />
dik<br />
x<br />
a ik<br />
)<br />
t im<br />
)<br />
a ik<br />
x<br />
)<br />
)<br />
2<br />
2<br />
)<br />
2<br />
t im<br />
)<br />
2<br />
2<br />
2<br />
,<br />
( y<br />
( y<br />
( y<br />
y<br />
dik<br />
y<br />
)<br />
a ik<br />
y<br />
2<br />
)<br />
t im<br />
where the coefficients A and B have following values:<br />
0,<br />
odd(o)<br />
mode;<br />
A<br />
1,<br />
even(e)<br />
mode.<br />
B<br />
1,<br />
odd(o)<br />
mode;<br />
1,<br />
even(e)<br />
mode .<br />
The electric field is E grad( g ( ) . The total number <strong>of</strong><br />
unknowns N tot , will be denoted by:<br />
4<br />
3<br />
N K M A .<br />
tot<br />
i<br />
i 1 i 1<br />
A relation between the normal component <strong>of</strong> the<br />
electric field and the total surface charges is given with<br />
Eq. (2):<br />
n ˆi ( 0<br />
Eim<br />
)<br />
0<br />
( 0<br />
)<br />
t<br />
im i , t im i<br />
q<br />
t im i<br />
,<br />
l im<br />
(2)<br />
where i M m , , 1 , 3 , 2 , i 1 , nˆ i ( nˆ 1 nnˆ<br />
ˆ2<br />
yyˆ<br />
ˆ , nˆ 3 xxˆ<br />
ˆ )<br />
are unit normal vectors oriented from the layer<br />
the layer 0 .<br />
towards<br />
Using the PMM for the potential <strong>of</strong> the perfect<br />
conductors given by (1), the PMM for the normal<br />
component <strong>of</strong> the electric field (2), and the electrical<br />
neutrality condition (3) (only for the even mode!), it is<br />
possible to determine unknown free charges per unit<br />
length on conductors, the total charges per unit length on<br />
the boundary surfaces between two dielectric layers and<br />
the unknown constant 0 .<br />
The electrical neutrality condition is:<br />
2<br />
Ki<br />
4 Ki<br />
q d dik<br />
qa<br />
a ik 0 0.<br />
(3)<br />
i 1 k 1 i 3 k 1<br />
After solving the system <strong>of</strong> linear equations, it is<br />
possible to calculate the capacitance per unit length <strong>of</strong> the<br />
i<br />
2<br />
)<br />
2<br />
(1)<br />
- 74 - 15th IGTE Symposium 2012<br />
strip line given by (4):<br />
K1<br />
K3<br />
(e, o) 1<br />
C qd1k<br />
qa<br />
3k<br />
. (4)<br />
U<br />
k 1 k 1<br />
With the developed program code, the characteristic<br />
impedance <strong>of</strong> the symmetrically coupled strip line is<br />
calculated as<br />
(e, o) (e, o) eff ef (e, o)<br />
Z c Zc<br />
0 / r ,<br />
where<br />
ef eff (e, o) ( (e, o) ( (e, o)<br />
r<br />
C<br />
/C 0<br />
(e, o)<br />
is the effective dielectric permittivity, and Z c0<br />
is the<br />
characteristic impedance <strong>of</strong> the symmetrically coupled<br />
strip line without dielectric layer (free space), for even (e)<br />
and odd (o) modes, respectively.<br />
In order to verify the obtained numerical results for the<br />
characteristic impedance and the effective dielectric<br />
permittivity, the finite element method (FEM) [19] is<br />
used.<br />
III. RESULTS<br />
The results convergence and computation time for the<br />
even and odd modes can be noticed from Table I, for<br />
parameters: r 3 , d / w1<br />
4 , h / d 0 0.<br />
5 , t 1 / w1<br />
0<br />
. 1 ,<br />
s / w1<br />
1<br />
. 0 , w 2 / w1<br />
6 6.<br />
0 and t 2 / t1<br />
2<br />
. 0 , where N tot<br />
is the total number <strong>of</strong> unknowns.<br />
N tot<br />
TABLE I<br />
CONVERGENCE OF RESULTS AND CPU TIME<br />
Even mode Odd mode<br />
eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ]<br />
t(s)<br />
298 2.119 158.860 1.817 77.293 4.4<br />
370 2.120 158.871 1.823 77.207 6.9<br />
444 2.121 158.901 1.827 77.162 9.7<br />
585 2.123 158.906 1.832 77.093 16.7<br />
655 2.123 158.917 1.833 77.077 20.9<br />
726 2.124 158.905 1.835 77.048 25.7<br />
800 2.124 158.916 1.836 77.039 31.5<br />
872 2.124 158.916 1.837 77.026 38.8<br />
940 2.125 158.905 1.838 77.007 45.4<br />
1014 2.125 158.914 1.839 77.003 51.4<br />
1085 2.125 158.904 1.840 76.987 58.1<br />
1155 2.125 158.911 1.840 76.986 65.4<br />
1225 2.125 158.917 1.840 76.984 74.1<br />
1296 2.126 158.909 1.841 76.973 86.5<br />
1370 2.125 158.915 1.841 76.972 97.6<br />
First, a very good convergence <strong>of</strong> values <strong>of</strong> both<br />
parameters is achieved for the both modes. Second, a<br />
computation time was much shorter comparing to the time<br />
required by FEM: we needed up to 97.6 seconds for the<br />
system <strong>of</strong> 1370 unknowns, while FEM for solving the<br />
same problem took about 15 minutes with a few hundreds<br />
<strong>of</strong> thousands <strong>of</strong> finite elements.<br />
Equipotential contours and distributions <strong>of</strong> polarized
charges per unit length along boundary surface are shown<br />
in Figs. 3-6 (even and odd modes, respectively) for para-<br />
meters:<br />
r 3 , d / w1<br />
4 , h / d 0 0.<br />
5,<br />
t 1 / w1<br />
0 0.<br />
1,<br />
s / w1<br />
1<br />
. 0 , w 2 / w1<br />
6<br />
. 0 and t 2 / t1<br />
2<br />
. 0 .<br />
Figure 3: Equipotential contours (Even mode).<br />
Figure 4: Equipotential contours (Odd mode).<br />
t 2<br />
t1<br />
1<br />
2<br />
3<br />
4<br />
- 75 - 15th IGTE Symposium 2012<br />
Figure 5: Distribution <strong>of</strong> polarized charges per unit length along<br />
boundary surface (Even mode).<br />
Figure 6: Distribution <strong>of</strong> polarized charges per unit length along<br />
boundary surface (Odd mode).<br />
TABLE II<br />
COMPARED RESULTS FOR CHARACTERISTIC IMPEDANCE OF STRIP LINE VERSUS 2 1 t t AND h d FOR PARAMETERS:<br />
r 3 , d / w1<br />
4 , t 1/<br />
w1<br />
0 0.<br />
05 , s / w1<br />
1<br />
. 0 AND w 2 / w1<br />
6 6.<br />
0 .<br />
h<br />
d<br />
Even mode Odd mode<br />
HBEM FEM HBEM FEM<br />
eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ]<br />
0.2 2.3967 84.950 2.3969 84.864 2.0470 64.218 2.0558 63.942<br />
0.4 2.2182 138.000 2.2185 137.829 1.9009 77.198 1.9120 76.805<br />
0.6 2.0856 182.116 2.0862 181.851 1.8642 81.055 1.8757 80.626<br />
0.8 1.9849 220.428 1.9863 220.066 1.8550 82.450 1.8676 81.983<br />
1.0 1.9059 254.371 1.9076 253.892 1.8533 83.020 1.8666 82.318<br />
0.2 2.3938 84.935 2.3965 84.783 2.0470 64.215 2.0555 63.948<br />
0.4 2.2144 137.901 2.2172 137.624 1.9008 77.192 1.9119 76.801<br />
0.6 2.0817 181.901 2.0844 181.526 1.8642 81.048 1.8757 80.618<br />
0.8 1.9810 220.107 1.9844 219.602 1.8543 82.457 1.8676 81.981<br />
1.0 1.9021 253.950 1.9056 253.368 1.8533 83.016 1.8670 82.545<br />
0.2 2.3923 84.907 2.3957 84.741 2.0469 64.213 2.0553 63.949<br />
0.4 2.2122 137.786 2.2156 137.492 1.9008 77.188 1.9119 76.794<br />
0.6 2.0793 181.675 2.0827 181.265 1.8641 81.043 1.8755 80.620<br />
0.8 1.9785 219.777 1.9826 219.241 1.8543 82.453 1.8676 81.978<br />
1.0 1.8997 253.521 1.9038 252.901 1.8532 83.013 1.8666 82.554<br />
0.2 2.3912 84.878 2.3942 84.739 2.0468 64.212 2.0557 63.939<br />
0.4 2.2105 137.677 2.2139 137.412 1.9007 77.184 1.9118 76.792<br />
0.6 2.0775 181.465 2.0809 181.083 1.8641 81.039 1.8756 80.600<br />
0.8 1.9766 219.468 1.9809 218.918 1.8543 82.449 1.8675 81.973<br />
1.0 1.8978 253.120 1.9021 252.483 1.8532 83.010 1.8666 82.548
- 76 - 15th IGTE Symposium 2012<br />
TABLE III<br />
COMPARED RESULTS FOR CHARACTERISTIC IMPEDANCE OF STRIP LINE VERSUS 2 1 w w FOR PARAMETERS:<br />
r 3 , d / w1<br />
4 , t 1/<br />
w1<br />
0 0.<br />
05 , h / d 0<br />
. 5 , s / w1<br />
1<br />
. 0 AND t 2 / t1<br />
2<br />
. 0 .<br />
Even mode Odd mode<br />
w 2 HBEM FEM HBEM FEM<br />
w1<br />
eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ]<br />
4.5 2.2100 168.392 2.2120 168.080 1.8799 80.064 1.8915 79.655<br />
5.0 2.1823 165.314 2.1845 165.004 1.8784 79.885 1.8899 79.479<br />
5.5 2.1605 162.805 2.1630 162.490 1.8772 79.743 1.8888 79.344<br />
6.0 2.1430 160.759 2.1458 160.438 1.8749 79.656 1.8876 79.242<br />
8.0 2.0981 155.589 2.1026 155.229 1.8718 79.427 1.8858 78.982<br />
10.0 2.0753 152.991 2.0805 152.606 1.8670 79.352 1.8848 78.898<br />
15.0 2.0492 150.340 2.0575 149.885 1.8598 79.343 1.8598 79.343<br />
TABLE IV<br />
COMPARED RESULTS FOR CHARACTERISTIC IMPEDANCE OF STRIP LINE VERSUS 1 1 w t FOR PARAMETERS:<br />
r 3 , d / w1<br />
4 , w 2 / w1<br />
6 6.<br />
0 , h / d 0 0.<br />
5 , s / w1<br />
1<br />
. 0 AND t 2 / t1<br />
2<br />
. 0 .<br />
Even mode Odd mode<br />
t1<br />
HBEM FEM HBEM FEM<br />
w1<br />
eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ] eff ef<br />
r<br />
Zc<br />
[ ]<br />
0.01 2.1768 162.282 2.1632 162.257 1.9031 82.453 1.9251 81.744<br />
0.02 2.1600 161.998 2.1583 161.804 1.8971 81.628 1.9147 81.052<br />
0.03 2.1526 161.595 2.1537 161.316 1.8899 80.913 1.9051 80.416<br />
0.04 2.1474 161.167 2.1496 160.857 1.8827 80.257 1.8963 79.801<br />
0.05 2.1430 160.759 2.1458 160.438 1.8749 79.656 1.8876 79.242<br />
0.06 2.1388 160.362 2.1422 160.021 1.8679 79.074 1.8800 78.670<br />
0.07 2.1352 159.976 2.1388 159.641 1.8610 78.519 1.8754 78.222<br />
0.08 2.1319 159.605 2.1356 159.271 1.8543 77.985 1.8645 77.640<br />
0.09 2.1288 159.250 2.1264 158.700 1.8447 77.470 1.8577 77.130<br />
0.10 2.1258 158.909 2.1293 158.592 1.8413 76.973 1.8507 76.655<br />
To validate the accuracy <strong>of</strong> the presented method, a<br />
comparison is made with the finite element method results<br />
obtained using the FEM s<strong>of</strong>tware [19]. Those results are<br />
shown in Tables II-IV.<br />
As presented in the tables, numerical results for the<br />
effective dielectric permittivity and the characteristic<br />
impedance obtained using the HBEM are obviously in<br />
very good agreement with the FEM values (with few<br />
hundreds <strong>of</strong> thousands finite elements) with divergence<br />
less than 0.4% for the most <strong>of</strong> the cases.<br />
Distributions <strong>of</strong> characteristic impedance versus s / w1<br />
for different values <strong>of</strong> dielectric permittivity r are<br />
shown in Figs. 7 and 8, for:<br />
d / w1<br />
4 , h / d 0 0.<br />
5,<br />
t 1 / w1<br />
0 0.<br />
1 , w 2 / w1<br />
6 6.<br />
0<br />
and t 2 / t1<br />
2 2.<br />
0 .<br />
Fig.7 shows that increasing the values <strong>of</strong> parameter<br />
s / w1,<br />
decreasing the characteristic impedance for even<br />
mode. But, for the odd mode, Fig. 8, increasing the<br />
parameter s / w1,<br />
increasing the characteristic impedance<br />
too. The lowest values for the characteristic impedance<br />
are obtained for the highest value <strong>of</strong> dielectric<br />
permittivity.<br />
The obtained values are compared with the FEM<br />
results, also. A very good results agreement is obtained.<br />
Figure 7: Distribution <strong>of</strong> characteristic impedance versus s / w1<br />
for<br />
different values <strong>of</strong> dielectric permittivity (Even mode).<br />
IV. CONCLUSION<br />
A newly developed hybrid boundary element method is<br />
applied to quasi TEM analysis <strong>of</strong> 2D symmetrically<br />
coupled strip lines with finite grounded plane. Two quasistatic<br />
parameters are calculated: effective dielectric<br />
permittivity and characteristic impedance <strong>of</strong> the line. We<br />
have compared the values <strong>of</strong> parameters with those<br />
obtained by the finite element method. A very good<br />
agreement <strong>of</strong> the results is achieved: maximal relative
error <strong>of</strong> the characteristic impedance is less than 0.4%.<br />
Figure 8: Distribution <strong>of</strong> characteristic impedance versus s / w1<br />
for<br />
different values <strong>of</strong> dielectric permittivity (Odd mode).<br />
All calculations were performed on computer with dual<br />
core INTEL processor 2.8 GHz and 4 GB <strong>of</strong> RAM.<br />
This method can be successfully applied to static,<br />
stationary and quasi-stationary electromagnetic fields, as<br />
well as to the analysis <strong>of</strong> the fields in mechanics, fluid<br />
dynamics, conductive heat flow etc.<br />
Acknowledgement<br />
This research was partially supported by funding from<br />
the Serbian Ministry <strong>of</strong> Education and Science in the<br />
frame <strong>of</strong> the project TR 33008.<br />
REFERENCES<br />
[1] T. Fukuda, T. Sugie, K. Wakino, Y.-D. Lin, and T. Kitazawa,<br />
“Variational method <strong>of</strong> coupled strip lines with an inclined<br />
dielectric substrate,” in Asia Pacific Microwave Conference –<br />
APMC 2009, December 7-10, 2009, pp. 866-869.<br />
[2] R. F. Harrington, Field computation by Moment Methods. New<br />
York: Macmillan, 1968.<br />
[3] T. G. Bryant and J. A. Weiss, “Parameters <strong>of</strong> microstrip<br />
transmission lines and <strong>of</strong> coupled pairs <strong>of</strong> microstrip lines,” IEEE<br />
Trans. Microwave Theory Tech., vol. MMT-16, pp. 1021-1027,<br />
Dec. 1968.<br />
[4] A. Farrar and A. T. Adams, “Characteristic impedance <strong>of</strong><br />
microstrip by the method <strong>of</strong> moments,” IEEE Trans. Microwave<br />
Theory Tech., vol. MMT-18, pp. 65-66, Jan. 1970.<br />
[5] K. Li, and Y. Fujii, “Indirect boundary element method <strong>of</strong> applied<br />
- 77 - 15th IGTE Symposium 2012<br />
to generalized microstrip line analysis with applications to sideproximity<br />
effect in MMICs,” IEEE Trans. Microwave Theory and<br />
Techniques, vol. 40, pp. 237–244, Feb. 1992.<br />
[6] C.E. Smith, and R.S. Chang, “Microstrip transmission line with<br />
finite width dielectric,” IEEE Trans. Microwave Theory and<br />
Techniques, vol. 28, pp. 90–94, Feb. 1980.<br />
[7] J. Svacina, “Analytical models <strong>of</strong> width-limited microstrip lines,”<br />
Microwave and Optical <strong>Technology</strong> Letters, vol. 36, pp. 63–65,<br />
Jan. 2003.<br />
[8] J. Svacina, “New method for analysis <strong>of</strong> microstrip with finitewidth<br />
ground plane”, Microwave and Optical <strong>Technology</strong> Letters,<br />
Vol. 48, No. 2, pp. 396-399, Feb. 2006.<br />
[9] C.E. Smith, and R.S. Chang, “Microstrip transmission line with<br />
finite width dielectric and ground plane,” IEEE Trans. Microwave<br />
Theory and Techniques, vol. 33, pp. 835–839, Sept. 1985.<br />
[10] N. B. Raičević, S. R. Aleksić and S. S. Ilić, “A hybrid boundary<br />
element method for multilayer electrostatic and magnetostatic<br />
problems,” J. Electromagnetics, No. 30, pp. 507-524, 2010.<br />
[11] N. B. Raičević, S. R. Aleksić, “One method for electric field determination<br />
in the vicinity <strong>of</strong> infinitely thin electrode shells,”<br />
Journal Engineering Analysis with Boundary Elements, Elsevier,<br />
No. 34, pp. 97-104, 2010.<br />
[12] S. S. Ilić, N. B. Raičević, and S. R. Aleksić, “Application <strong>of</strong> new<br />
hybrid boundary element method on grounding systems,” in 14th<br />
International IGTE'10 Symp., <strong>Graz</strong>, Austria, Sept. 19-22, 2010,<br />
pp. 160-165.<br />
[13] N. B. Raičević, S. S. Ilić, and S. R. Aleksić, “Application <strong>of</strong> new<br />
hybrid boundary element method on the cable terminations,” in<br />
14th International IGTE'10 Symp., <strong>Graz</strong>, Austria, Sept. 19-22,<br />
2010, pp. 56-61.<br />
[14] S. S. Ilić, S. R. Aleksić, and N. B. Raičević, “TEM analysis <strong>of</strong><br />
strip line with finite width <strong>of</strong> dielectric substrate by using new<br />
hybrid boundary element method,” in 10-th International Conf.<br />
on Applied Electromagnetics ПЕС 2011, Niš, Serbia, September<br />
25-29, Sept. 2011, CD Proc. О8-4.<br />
[15] B. Petković, S. Ilić, S. Aleksić, N. Raičević, and D. Antić, “A<br />
novel approach to the positive DC nonlinear corona design,” J.<br />
Electromagnetics, vol. 31, no. 7, pp. 505-524, Oct. 2011.<br />
[16] N. B. Raicevic, and S. S. Ilic, “One hybrid method application on<br />
complex media strip lines determination,” in 3rd International<br />
Congress on Advanced Electromagnetic Materials in Microwaves<br />
and Optics, METAMATERIALS 2009, London, United Kingdom,<br />
2009, pp. 698-700.<br />
[17] S. S. Ilić, M. T. Perić, S. R. Aleksić, and N. B. Raičević, “Quasi<br />
TEM analysis <strong>of</strong> 2D symmetrically coupled strip lines with<br />
infinite grounded plane using HBEM,” in Proc. XVII-th<br />
International Symposium on Electrical Apparatus and<br />
Technologies SIELA 2012, Bourgas, Bulgaria, 28–30 May, 2012,<br />
pp.147-155.<br />
[18] A. M. Abbosh, “Analytical closed-form solutions for different<br />
configurations <strong>of</strong> parallel-coupled microstrip lines”, in IET<br />
Microwaves, Antennas & Propagation, Vol. 3, Iss. 1, pp. 137-<br />
147, 2009.<br />
[19] D. Meeker, FEMM 4.2, Available:<br />
http://www.femm.info/wiki/Download
- 78 - 15th IGTE Symposium 2012<br />
Design Approach for a Line-Start Internal Permanent<br />
Magnet Synchronous Motor<br />
1,2 V. Elistratova, 1 M. Hecquet, 1 P. Brochet, 2 D. Vizireanu and 2 M. Dessoude<br />
1 L2EP, Ecole Centrale de Lille, Cité Scientifique - BP 48 - 59651 Villeneuve d'Ascq, France<br />
2 EDF R&D, 1 avenue du Général de Gaulle, 92141 Clamart Cedex, France<br />
E-mail: vera.elistratova@ec-lille.fr<br />
Abstract—The work described in this paper deals with the analytical design and optimization <strong>of</strong> a line-start permanent<br />
magnet synchronous motor (LSPM) with radial magnet configuration. The design approach considers a LSPM as an<br />
induction motor (IM) combined with a permanent magnet rotor arrangement and takes into account the characteristics <strong>of</strong><br />
both asynchronous and synchronous regimes and the motor thermal behavior.<br />
Index Terms — LSPM, Eco-design, Optimization, Multi-physical model.<br />
I. INTRODUCTION<br />
A large amount <strong>of</strong> the primary energy resources are<br />
converted into electric energy. As the main portion <strong>of</strong><br />
greenhouse gases is produced by fossil fuels, electricity<br />
generation is responsible for the worldwide air pollution<br />
and global warming [1].<br />
Electric motors are one <strong>of</strong> the main sources <strong>of</strong><br />
electricity consumption (Fig.1) and this expense is up to<br />
70% in the industrial processes in Europe [2]. As so, the<br />
electric motors are responsible for a huge share <strong>of</strong><br />
emission <strong>of</strong> CO2. Moreover, there are a total potential <strong>of</strong><br />
improving the energy efficiency <strong>of</strong> applications using<br />
electric motors in the range <strong>of</strong> 20-30%. The main factors<br />
<strong>of</strong> such improvements are the use <strong>of</strong> variable speed drives<br />
and the use <strong>of</strong> energy efficient motors. Therefore, electric<br />
motor optimization for a better efficiency is essential for<br />
energy saving and the reduction <strong>of</strong> CO2 emissions.<br />
Figure 1: Distribution <strong>of</strong> the industrial electric consumption [2]<br />
Until now low- and medium - power induction motors<br />
(IMs) are widely used in many industrial applications,<br />
such as pumps and fans. In spite <strong>of</strong> the low cost, IMs<br />
normally suffer from relatively poor operational<br />
efficiency and power factor [3]. Although a permanent<br />
magnet synchronous machine (PMSM) can achieve high<br />
operational efficiency and power factor, it lacks the<br />
starting capability <strong>of</strong> the IM. For last few decades the<br />
line-start permanent magnet synchronous motor (LSPM)<br />
has been designed, constructed and tested. Compared<br />
with an IM, a LSPM has a lot <strong>of</strong> advantages: synchronous<br />
speed, higher power factor and efficiency, small size, etc.<br />
Besides it has an ability to start when connected directly<br />
to the mains.<br />
II. PERFORMANCE DESIGN<br />
The objective <strong>of</strong> this research is to find an analytical<br />
model for the LSPM with different magnet<br />
configurations. In the present paper the LSPM with radial<br />
magnet arrangement (Fig.2) is designed. This<br />
configuration has a number <strong>of</strong> advantages: simple and<br />
robust structure, better protection <strong>of</strong> buried permanent<br />
magnets (PMs) from demagnetization. Moreover, in<br />
comparison with the other topologies this one has one <strong>of</strong><br />
the best asynchronous loading capabilities [4, 10].<br />
Figure 2: LSPM architecture under study<br />
An analytical model <strong>of</strong> a PMSM with the same rotor<br />
architecture may be found in [6, 11]. In our case this<br />
model can be applied under the assumption that during<br />
the steady-state regime a LSPM operates as a PMSM.<br />
In general, the structure <strong>of</strong> a LSPM is similar to an IM<br />
but the rotor includes both cage and inserted permanent<br />
magnets. Hence, the LSPM combines IMs and PMSM<br />
structure features: the LSPM will start due to the resultant<br />
<strong>of</strong> two torque components i.e. the asynchronous torque<br />
and magnet opponent torque (braking torque). If for the<br />
entire speed range during starting the asynchronous<br />
torque is higher than the sum <strong>of</strong> the braking and load<br />
torque, the motor will reach the synchronous regime [4].<br />
Therefore, the design <strong>of</strong> a LSPM has to take into account<br />
both types <strong>of</strong> performances: starting capacity and<br />
efficiency in steady state regime.<br />
To simplify the LSPM design process, the proposed<br />
procedure applied in this article treats separately the<br />
running modes: the motor can be considered as an IM<br />
during its start and synchronization and as a PMSM<br />
during the steady-state regime. Figure 3 shows the<br />
workflow diagram applied for the design approach.
Figure 3: Diagram summarizing the design procedure<br />
At the first stage <strong>of</strong> the design we enter the data<br />
concerning the power and the stator parameters. At this<br />
stage the same design methodology as for an IM could be<br />
applied. For economic reasons, the stator <strong>of</strong> the LSPM is<br />
identical to the IM <strong>of</strong> the same power.<br />
The stage 2 is to choose a squirrel cage that gives the<br />
value <strong>of</strong> the rotor cage resistance, the level <strong>of</strong> saturation<br />
in a rotor tooth and the number <strong>of</strong> rotor bars.<br />
At the next stage a configuration <strong>of</strong> the permanent<br />
magnets has to be chosen. As soon as we know the<br />
geometry <strong>of</strong> the rotor and the PMs, the d- and q-axis<br />
reactances Xd and Xq and the no-load EMF could be<br />
computed. Using these parameters, it can be simulated<br />
the start <strong>of</strong> the LSPM taking into account the braking<br />
torque caused by PMs. If the motor is not able to start, the<br />
designer has to go back either to the Stage 2 in order to<br />
change the rotor squirrel cage to improve the starting<br />
torque or to the Stage 3 to change the configuration <strong>of</strong><br />
PMs and to reduce the breaking torque.<br />
Finally, after the successful start <strong>of</strong> the LSPM (Stage<br />
5), performances and steady-state characteristics are<br />
calculated. If they are acceptable, the design solution is to<br />
be considered as one for the optimization procedure (see<br />
chapters III, IV). Otherwise, the design procedure is to be<br />
repeat starting from the Stage 2.<br />
Each stage <strong>of</strong> the diagram will be further detailed.<br />
A. Asynchronous Torque<br />
The electromagnetic design <strong>of</strong> induction motor is a<br />
well-known problem. In this paper the asynchronous part<br />
design is based on the methodology proposed in [8].<br />
The classical expression for the asynchronous torque<br />
can be written as follows:<br />
- 79 - 15th IGTE Symposium 2012<br />
'<br />
2 R2<br />
3pV<br />
Tc<br />
<br />
s<br />
'<br />
R2<br />
2 ' 2<br />
2 f ( Rsc ) ( X1 cX2)<br />
s<br />
<br />
<br />
where V is the phase RMS voltage, p is the number <strong>of</strong><br />
poles, c=1+X1σ/Xm, X1σ , X`2σ are the stator and rotor<br />
leakage reactances, Xm is the magnetizing reactance.<br />
B. Braking torque<br />
The braking torque is found as a function <strong>of</strong> the back-<br />
EMF and the stator resistance [5]:<br />
T<br />
br<br />
2<br />
2 2 2<br />
(1 s)( Rs Xsq(1 s)<br />
)<br />
s<br />
s<br />
2<br />
RsXsd Xsq 2 2<br />
s<br />
3p<br />
ER<br />
<br />
2 ( (1 ) )<br />
where Rs is the stator resistance, ωs is synchronous<br />
electrical speed, E is the RMS value <strong>of</strong> back-EMF, s is<br />
the slip, Xsd, Xsq are the direct and quadrature<br />
synchronous reactances respectively.<br />
The braking torque peaks the maximum at low speed<br />
and declines near synchronous speed.<br />
C. Steady state regime<br />
During the steady state regime the rotor <strong>of</strong> LSPM<br />
rotates at synchronous speed and its cage has no<br />
influence. In this state the performance <strong>of</strong> the machine<br />
could be calculated as for PMSM.<br />
As stated in [10] the RMS armature current is a<br />
function <strong>of</strong> the motor equivalent electrical parameters:<br />
2 2<br />
Ia I ad Iaq,<br />
(3)<br />
where the axis currents are<br />
I<br />
ad<br />
I<br />
V( X cos R sin )<br />
EX <br />
ad<br />
sq s<br />
2<br />
Xsq Xsd Rs<br />
sq<br />
V( R cos X sin )<br />
ER <br />
,<br />
s sd<br />
2<br />
Xsq Xsd Rs<br />
s<br />
where δ is the load angle.<br />
The input power <strong>of</strong> the motor is<br />
2<br />
in 3[ aq ad aq( sd sq) s a ],<br />
,<br />
(1)<br />
(2)<br />
(4)<br />
(5)<br />
P I EI I X X R I<br />
(6)<br />
Neglecting the stator core losses the electromagnetic<br />
power is<br />
P 3[ I EI I ( X X )].<br />
(7)<br />
elm aq ad aq sd sq<br />
The electromagnetic torque developed by a PMSM is<br />
Pelm<br />
Telm<br />
,<br />
(8)<br />
2<br />
ns<br />
where ns is the synchronous speed <strong>of</strong> the rotating<br />
magnetic field.
Taking into consideration the Joule losses Pj, the<br />
mechanical losses Pm and the stator core losses Ps, the<br />
efficiency can be expressed as<br />
Pin PjPsPm .<br />
(9)<br />
P<br />
in<br />
D. Calculation <strong>of</strong> the back-EMF and the direct and<br />
quadrature synchronous reactances<br />
The analytical model <strong>of</strong> the d-q machine parameters is<br />
interesting as it provides the fast evaluation <strong>of</strong> LSPM<br />
performances at steady-state regime, the obtainment <strong>of</strong> all<br />
the characteristics and their integration into the<br />
optimization procedure. For example, it permits us to find<br />
the optimal volume <strong>of</strong> magnets in terms <strong>of</strong> the improved<br />
efficiency and reduced braking torque.<br />
To compute the d- and q-axis reactances Xd and Xq and<br />
the back-EMF, the finite element simulation or<br />
experimental testing are usually used [3-5, 9, 12].<br />
However, there are a number <strong>of</strong> papers where all these<br />
parameters are analytically expressed as function <strong>of</strong> the<br />
studied machine geometry [6, 10, 11].<br />
According to the model in [6] the flat-topped value <strong>of</strong><br />
the flux density in the air gap is<br />
B<br />
ag<br />
2Br<br />
emehhmp <br />
(4eagehhmmp2eagempRh ,<br />
e e R 2 e e pR e e R )<br />
ag m h m h r m h r<br />
(10)<br />
where em is the magnet width, hm is the magnet height, eag<br />
is the air gap length, eh is the hub thickness, Rh is the<br />
external hub radius (in our case as the shaft is made <strong>of</strong> the<br />
non-magnetic material eh = Rh), Rr is the rotor radius, μ0<br />
is the permeability <strong>of</strong> vacuum, μm is the magnet relative<br />
permeability, Br is the remanent magnetization,<br />
α=em/(2∙Rh), β=/(2·Rr) are geometrical coefficients.<br />
The first harmonic <strong>of</strong> the flux density in the air gap is<br />
4 i Bag1 Bag<br />
sin ,<br />
(11)<br />
2<br />
where αi=2/π is the ratio <strong>of</strong> the average-to-maximum<br />
value <strong>of</strong> the normal component <strong>of</strong> the air gap magnetic<br />
flux density.<br />
The first harmonic <strong>of</strong> the back EMF can be expressed<br />
32RbLact f NskwBremehhmsin( p)<br />
E <br />
,<br />
pe ( (4 eh pe( 2 p) R) ehR)<br />
ag h m m m h h m r<br />
(12)<br />
where Lact is the rotor active length, Ns is the number <strong>of</strong><br />
turns in series per phase, kw is the global winding<br />
coefficient.<br />
The direct and quadrature synchronous inductances<br />
could be found as a function <strong>of</strong> the machine geometry<br />
and winding arrangement:<br />
- 80 - 15th IGTE Symposium 2012<br />
2<br />
<br />
4sin( p<br />
)<br />
2p <br />
p<br />
<br />
<br />
<br />
Ld<br />
<br />
<br />
2 eagempRhemeagRh) <br />
<br />
<br />
<br />
2 2<br />
6Lact0kwNsRb<br />
Lq <br />
2p sin(2 p)<br />
2 2<br />
eag p <br />
<br />
<br />
2(6eag Rb2(4 eag Rb)cos(<br />
)<br />
<br />
<br />
(2 eag Rb)(cos(2<br />
) sin(2 )))<br />
<br />
.<br />
2<br />
p(2 eag Rb)<br />
eag<br />
<br />
<br />
<br />
2 2<br />
6Lact0kNR w s b sin(2 p) 8eeR<br />
m h r sin( p)<br />
,<br />
2 2<br />
eag p p(4eagehhmpehemRr (13)<br />
(14)<br />
III. OPTIMIZATION PROBLEM<br />
The goal <strong>of</strong> optimization process consists in finding the<br />
set <strong>of</strong> optimal configurations R * taking into account<br />
parameters and constraints imposed by the design<br />
specification. Table I presents the specification for the<br />
studied LSPM. In the presented study the dependency<br />
between the efficiency and the magnet braking torque is<br />
analyzed.<br />
Table I. Specification <strong>of</strong> the designed LSPM machine<br />
Parameter Value/Feasible interval<br />
Power, [kW] 7.5<br />
Voltage LL, [V] 400<br />
Supply frequency f, [Hz] 50<br />
Rated speed, [rpm] 1500<br />
Height <strong>of</strong> the shaft axe, [mm] 132<br />
Rated Torque, [Nm] 47.75<br />
Overload conditions, Tmax/Trated<br />
≥1.6<br />
Ambient temperature, Tamb [°C] [-10; 40]<br />
Stator winding temperature rise<br />
average, [K]<br />
80<br />
Stator winding hot spot, [K] 90<br />
Load torque, [p.u.]<br />
Linearly from zero to nominal<br />
speed, starting from 0.8pu to 1<br />
p.u.<br />
Power factor, cosφ ≥0.8<br />
Efficiency η, [%] To be maximized<br />
Geometry <strong>of</strong> permanent magnets Radial magnet configuration<br />
The design vector X =[x1, x2,…, xn] T identifies the set<br />
<strong>of</strong> design variables. The design variables can be freely<br />
varied by the designer to define a designed object [7].<br />
The permanent magnet geometry is analytically<br />
predetermined form the imposed specification (Table I).<br />
Consequently, the design vector <strong>of</strong> the studied problem is<br />
composed <strong>of</strong> 3 variables: x1 – length <strong>of</strong> the air gap eag; x2<br />
– magnet height hm; x3 – magnet width em. According to<br />
equations (12-14) these 3 parameters are sufficient to<br />
compute the d- and q-axis reactances Xd and Xq and the<br />
back-EMF. Due to manufacturing constraints all <strong>of</strong> the<br />
components <strong>of</strong> design vector X are discontinuous and<br />
standardized.
Formally, the problem is expressed as follows:<br />
<br />
minimize1<br />
η, Tbr ,<br />
X<br />
<br />
(15)<br />
subject to GX ( )= g 1( X), g 2( X),..., g n(<br />
X)<br />
0,n<br />
=2.<br />
Electromagnetic constraints <strong>of</strong> the problem G(X) are<br />
specified in the Table II.<br />
Table II. Constraints <strong>of</strong> the optimization problem<br />
Function Constraint level<br />
Power factor cosφ, p.u. ≥0.8<br />
≥1.6<br />
Tmax/Trated<br />
Where {Tmax/Trated, cosϕ} are the feasible domains for<br />
the maximum torque ratio for synchronous operation and<br />
power factor.<br />
Taking into consideration the fact that all the<br />
components are discrete, in order to find R* a lot <strong>of</strong><br />
configurations have to be investigated.<br />
IV. OPTIMIZATION TECHNIQUE<br />
The optimization method applied for the considering<br />
problem (15) is the exhaustive enumeration (EE) [7, 13].<br />
It is an exact method with evaluations <strong>of</strong> all possible<br />
combinations <strong>of</strong> the PM dimensions and air gap length.<br />
The method doesn’t have any heuristic rules at all.<br />
Because <strong>of</strong> the presence <strong>of</strong> several objective functions,<br />
the aim <strong>of</strong> multi-objective evolutionary algorithms is to<br />
find compromise solutions rather than a single optimal<br />
point as in scalar optimization problems [14].<br />
These trade<strong>of</strong>f solutions are usually called Pareto<br />
optimal solutions. The EE was applied in order to obtain<br />
a genuine Pareto-Front. The method is not pretended to<br />
be the best one in terms <strong>of</strong> total time <strong>of</strong> calculation, but<br />
on the other hand, it gives reliable results.<br />
Input parameters were:<br />
Design vector:<br />
X =[x1, x2,…, xn] T in our case n = 3.<br />
Objective functions:<br />
F(X) = {f1(X),f2(X),…, fm(X)} in our case m=2<br />
and F(X) = {(1- η), Tbr}.<br />
Constraints:<br />
G(X) = {g1(X), g2(X),…, gk(X)} in our case k =2.<br />
The feasible set Ω= {ω1, ω2,…, ωn},where ωi is the<br />
subset which contains all feasible values for the<br />
component xi <strong>of</strong> the design vector, for i=1…n. In<br />
our case n = 3. As all <strong>of</strong> the components <strong>of</strong> design<br />
vector X are discrete, Ω is a finite set that is<br />
composed <strong>of</strong> the possible standardized values.<br />
Output parameters:<br />
The set <strong>of</strong> optimal solutions:<br />
R * = {Xi * X 0 |G (Xi * ) ≤, for i=1,..m}, where<br />
Xi * is the degenerate interval, and each component<br />
*<br />
<strong>of</strong> X is a Pareto optimal solution. Therefore Xi<br />
has following features:<br />
- 81 - 15th IGTE Symposium 2012<br />
*<br />
fl( ) fl( i) for l 1...<br />
m,<br />
*<br />
f j( ) f j( i)<br />
for at least one index j.<br />
The problem (15) was treated and a total <strong>of</strong> 360<br />
combinations has been enumerated. Among these 360<br />
combinations there are 160 that belong to the feasible<br />
domain defined by optimization constraints. In Fig.4 the<br />
feasible set <strong>of</strong> solutions for the EE and the Pareto frontier<br />
are presented.<br />
Figure 4: Pareto front <strong>of</strong> efficiency versus braking torque<br />
V. DESIGN RESULTS<br />
A boundary point <strong>of</strong> the maximal efficiency from the<br />
Pareto frontier has been chosen for a deeper investigation.<br />
Based on this optimal solution and solving the set <strong>of</strong><br />
equations (1-14) all the characteristics for steady-state<br />
regime and optimal dimensions <strong>of</strong> permanent magnets<br />
and air gap length have been found (Table III).<br />
Table III. Optimal solution for <strong>of</strong> the designed LSPM<br />
Parameter Value<br />
Efficiency η, [%] 91.2<br />
Braking torque, Nm 11.66<br />
Power factor cosφ, p.u. 0.983<br />
Air gap length eag, mm 0.7<br />
Magnet height hm, mm 29.0<br />
Magnet width em, mm 15.0<br />
Overload condition, Tmax/Trated<br />
1.783<br />
Table III shows that the efficiency <strong>of</strong> the designed<br />
LSPM compared with a premium efficiency class<br />
induction motor (PEIM) is greater than 0.8% [20]. The<br />
power factor <strong>of</strong> the PEIM (0.9 p.u.) is much lower<br />
compared with the designed LSPM (0.983 p.u.). It means<br />
the LSPM can achieve a very high power factor in a wide<br />
output power range. This feature assists in saving energy<br />
when the motor is running at different loads.<br />
Based on the designed data the static and dynamic<br />
characteristics were obtained. Figures 5-7 show that the<br />
designed LSPM is able to start and synchronize even at<br />
85% <strong>of</strong> the rated voltage.
Figure 5: Torque versus speed curve <strong>of</strong> the studied motor<br />
with supplied phase voltage equal 231V<br />
Figure 6: Torque versus speed curve <strong>of</strong> the studied motor<br />
with supplied phase voltage equal 85% * 231V<br />
Figure 7: Motor speed during transient start<br />
supplied with different voltages<br />
VI. THERMAL MODEL<br />
An increase in motor temperature can cause the stator<br />
winding insulation degradation and permanent magnet<br />
material decreased performances. According to the design<br />
specification (Table I), the acceptable heating in the<br />
LSPM doesn’t have to exceed 90K. To predict the motor<br />
transient thermal behavior an analytical model based on<br />
the general cylindrical component [21] was developed<br />
(Fig. 8).<br />
Figure 8: A simplified model <strong>of</strong> LSPM as the heating body<br />
- 82 - 15th IGTE Symposium 2012<br />
This model corresponds to the system <strong>of</strong> equations:<br />
dcu<br />
cu 12 ( ) 1<br />
<br />
P A cu st A cu C1 ,<br />
dt<br />
<br />
dst<br />
P 2 12<br />
st A st A ( cu st ) C2 .<br />
<br />
dt<br />
(16)<br />
where ∆θcu and ∆θst are the average heating in the copper<br />
and respectively the stator laminations, A1, A2, A12 are<br />
the heat transfer coefficients, C1 and C2 represent the heat<br />
capacities <strong>of</strong> stator core and stator winding. In order to<br />
determine the temperature, a simplified equivalent<br />
thermal network (ETN) model <strong>of</strong> the LSPM is considered<br />
(Fig. 9).<br />
Figure 9: Simplified equivalent thermal network <strong>of</strong> LSPM<br />
(<br />
cu, a cu, c ) cu -cu, a a, in - cu, c s, st Pcu,<br />
<br />
(<br />
cu, a rot, c c, f ) s, st cu, c curot, c rot <br />
<br />
c, ff Ps,<br />
st,<br />
(17)<br />
<br />
<br />
(<br />
rot, a rot, c ) s, st rot, a a, in rot, s s, st Prot<br />
<br />
<br />
( cu, a rot, a a, f ) a, in cu,<br />
a curot, a rot <br />
a, ff Pa,<br />
in,<br />
<br />
( c, fa, f f) fc, fs, sta, fa, in0.<br />
where Δθcu is the heating in the copper winding; Δθs,st -<br />
heating in the steel stator pack, Δθrot - heating in the rotor;<br />
Δθa,in - heating in the air gap; Δθf - heating in the motor<br />
case, Рcu - source <strong>of</strong> losses in the copper winding, Рs,st -<br />
source <strong>of</strong> losses in the stator pack, Рrot - source <strong>of</strong> losses<br />
in the rotor, Рa,in - source <strong>of</strong> mechanical and additional<br />
losses, Λcu,c - thermal conductivity between the slot<br />
winding and stator core, Λcu,a – thermal conductivity<br />
between the winding and the air gap, Λrot,a – thermal<br />
conductivity between the rotor and the air gap, Λrot,с –<br />
thermal conductivity between the rotor and the stator<br />
core, Λa,f – thermal conductivity between the air inside<br />
the motor and the motor case, Λс,f – thermal conductivity<br />
between the stator core and the motor frame; Λf – thermal<br />
conductivity between the motor frame and the external<br />
air.<br />
The solution <strong>of</strong> the systems (16, 17) enabled us to<br />
model overheating in the main parts <strong>of</strong> the designed<br />
motor (Figs. 10, 11).
Figure 10: The increase <strong>of</strong> winding temperature<br />
Figure 11: The increase <strong>of</strong> rotor core temperature<br />
According to figures 10, 11 maximal overheating in<br />
winding is 79.4°C, maximal overheating in rotor is about<br />
<strong>of</strong> 26.3°C that is in compliance with the specification<br />
requirements (Table I).<br />
VII. CONCLUSION<br />
A design method for a LSPM motor considering the<br />
asynchronous starting capacity and the synchronous<br />
steady state performances is proposed in order to find out<br />
an optimal design solution for the given motor topology.<br />
The approach is based on the design <strong>of</strong> an asynchronous<br />
machine incorporating the effect <strong>of</strong> magnets. The present<br />
analytical model takes into account the radial magnet<br />
topology and is to be extended for the other LSPM<br />
architectures.<br />
It has been shown that the efficiency and the power<br />
factor <strong>of</strong> the designed LSPM is greater compared with a<br />
PEIM <strong>of</strong> the same power.<br />
Thereafter, an analytical thermal model was developed.<br />
The proposed thermal model allows predicting the<br />
overheating in the main parts <strong>of</strong> the motor. During the<br />
design process the thermal model didn’t take part in<br />
optimization procedure.<br />
In future investigations it might be possible to combine<br />
the electro-magnetic and thermal optimization problems<br />
in order to integrate them into optimization procedure.<br />
The verification <strong>of</strong> the analytical approach will be<br />
provided by both finite element and experimental models.<br />
- 83 - 15th IGTE Symposium 2012<br />
REFERENCES<br />
[1] Key world energy statistics. International Energy Agency, 2010.<br />
[2] La rentabilité énergétique les entrainements, Mesures 803, Mars<br />
2008, www.mesures.com.<br />
[3] Jian Li and Jungtae Song and Yunhyun Cho. A High-Performance<br />
Line-Start Permanent Magnet Synchronous Motor Amended From<br />
a Small Industrial Three-Phase Induction Motor. In Industrial<br />
Electronics, 2010 IEEE International Symposium, pp. 1308 -1313.<br />
[4] T. Ruan, H. Pan, Y. Xia « Design and Analysis <strong>of</strong> Two Different<br />
Line-Start PM Synchronous Motors», Artificial Intelligence,<br />
Management Science and Electronic Commerce (AIMSEC), 2011.<br />
[5] Soulard, J.; Nee, H.-P.; , "Study <strong>of</strong> the synchronization <strong>of</strong> linestart<br />
permanent magnet synchronous motors," Industry<br />
Applications Conference, 2000. Conference Record <strong>of</strong> the 2000<br />
IEEE , vol.1, no., pp.424-431 vol.1, 2000.<br />
[6] X. Jannot, J.-C. Vannier, J. Saint-Michel and M. Gabsi, An<br />
Analytical Model for Interior Permanent-Magnet Synchronous<br />
Machine with Circumferential Magnetization Design, IEEE,<br />
10.1109/ELECTROMOTION.2009.5259155, July 2009.<br />
[7] P.Venkataraman, Applied Optimization with<br />
Matlab Programming, A Wiley - Interscience publication, John<br />
Wiley & Sons, New York, 2001.<br />
[8] I.P. Kopylov, Electric Machines: M., Energoatomizdat, 1986.<br />
[9] K. Kurihara, M. Azizur Rahman, High Efficiency Line-Start<br />
Interior Permanent Magnet Synchronous Motors, IEEE Trans.<br />
Industry Applications, Vol. 40 Issue 3, May 2004.<br />
[10] J.F.Gieras, M. Wing, Permanent Magnet Motor <strong>Technology</strong>,<br />
USA, Marcel Dekker, 2002.<br />
[11] D.Fodorean, A. Miraoui, Dimensionnement rapide des machines<br />
synchrones à aimants permanents (MSAP), Techniques de<br />
l’ingénieur, Nov. 10, 2009.<br />
[12] H-P. Nee, L. Lefevre, P. Thelin, J. Soulard, Determination <strong>of</strong> d<br />
and q reactances <strong>of</strong> permanent magnet synchronous motors<br />
without measurements <strong>of</strong> the rotor position, IEEE Trans. on<br />
Industry Applications, Vol. 36, No. 5, 1330-1335, Oct. 2000.<br />
[13] D. Samarkanov, F. Gillon, P.Brochet, D. Laloy , Optimal design<br />
<strong>of</strong> induction machine using interval algorithms, COMPEL: The<br />
International Journal for Computation and Mathematics in<br />
Electrical and Electronic Engineering, Vol. 31, N°.5, pages. 1492 -<br />
1502, ISBN. 0332-1649, 8-2012.<br />
[14] P. Alotto, U. Baumgartner, F. Freschi, M. Jaindl, A. Köstinger,<br />
Ch. Magele, W. Renhart, and M. Repetto, SMES Benchmark<br />
Extended: Introducing Pareto Optimal Solutions Into TEAM22,<br />
IEEE Transactions on Magnetics, Vol. 44, No.6, pp. 1066-1069,<br />
2008.<br />
[15] Mellor, P.H.; Roberts, D.; Turner, D.R.; , "Lumped parameter<br />
thermal model for electrical machines <strong>of</strong> TEFC design," Electric<br />
Power Applications, IEE <strong>Proceedings</strong> B , vol.138, no.5, pp.205-<br />
218, Sep 1991.<br />
[16] IEC 60034-30, Standard on efficiency classes for low voltage AC<br />
motors, 2008.<br />
[17] D. Stoia, M. Antonoaie, D. Ilea, M. Cernat, Design <strong>of</strong> Line Start<br />
PM Motors with High Power Factor, Proc. POWERENG 2007,<br />
Setubal, Portugal, 12-14 April, 2007, published on CD-Rom,<br />
IEEE Catalog Number 07EX1654C, ISBN: 1-4244-0895-4, paper<br />
186.<br />
[18] T. Miller, Synchronization <strong>of</strong> line-start permanent magnet AC<br />
motor, IEEE Trans. Power Apparatus and Systems, vol. PAS-103,<br />
July 1984, pp 1822-1828.<br />
[19] T. Tran, S. Brisset, P. Brochet, A Benchmark for Multi-objective,<br />
Multi-Level and Combinatorial Optimizations <strong>of</strong> a Safety<br />
Isolating Transformer, COMPUMAG 2007, Aachen, Germany,<br />
6- 2007<br />
[20] X. Feng, L. Liu, J. Kang, Y. Zhang, Super Premium Efficient<br />
Line Start-up Permanent Magnet Synchronous Motor, Proc. Of<br />
XIX International Conference on Electrical Machines, ICEM2010,<br />
Roma, Italy, Sept. 6-8, 2010.<br />
[21] A.I. Borisenko Cooling <strong>of</strong> industrial electrical machinery,<br />
Energoatomizdat, 1983.
- 84 - 15th IGTE Symposium 2012<br />
Speed-up <strong>of</strong> Nonlinear Magnetic Field Analysis using a Modified<br />
Fixed-Point Method<br />
Norio Takahashi 1 , Kousuke Shimomura 1 , Daisuke Miyagi 2 and Hiroyuki Kaimori 3<br />
1 Dept. Electrical and Electronic Eng., Okayama <strong>University</strong>, Okayama 700-8530 Japan<br />
2 Dept. Electrical Eng., Tohoku <strong>University</strong>, Sendai 980-8579 Japan<br />
3 Science Solutions Int. Lab., Inc., Tokyo 153-0065 Japan<br />
The nonlinear finite element analysis <strong>of</strong> magnetic fields using the Fixed-Point method (FPM) requires a number <strong>of</strong> iterations and<br />
long CPU time compared with those using the Newton-Raphson method (NRM). On the other hand, the Fixed-Point method has an<br />
advantage that the convergence can be obtained even for a complicated nonlinear anisotropy problem, <strong>of</strong> which the convergence is<br />
very difficult using a conventional Newton-Raphson method. Moreover, it has an advantage that a s<strong>of</strong>tware can be easily obtained by<br />
slightly modifying a linear FEM s<strong>of</strong>tware. We then achieved the speed-up <strong>of</strong> the Fixed-Point method by updating the reluctivity at each<br />
iteration (This is called a modified Fixed-Point method). It is shown that the formulation <strong>of</strong> the Fixed-Point method using the<br />
derivative <strong>of</strong> reluctivity is almost the same as that <strong>of</strong> the Newton-Raphson method. The convergence properties <strong>of</strong> these methods are<br />
compared. It is shown that the modified Fixed-Point method has an advantage that the programming is easy and it has a similar<br />
convergence property to the Newton-Raphson method for an isotropic nonlinear problem.<br />
Index Terms—finite element method, Fixed-Point method, Newton-Raphson method, nonlinear electromagnetic analysis<br />
I. INTRODUCTION<br />
The Fixed-Point method [1,2] has an advantage that the<br />
convergence can be obtained even for a complicated nonlinear<br />
problems [3] such as the analysis considering vector magnetic<br />
properties treating an anisotropic material [4, 5], in which the<br />
convergence is sometimes difficult. In addition, it has an<br />
advantage that the s<strong>of</strong>tware for nonlinear analysis can be<br />
easily obtained by adding a small change to that for linear<br />
analysis. But, the Fixed-Point method requires a number <strong>of</strong><br />
iterations and long CPU time compared with those <strong>of</strong> the<br />
Newton-Raphson method [6]. It is reported that the CPU time<br />
can be reduced by using a constant reluctivity in the<br />
beginning <strong>of</strong> nonlinear iterations [7,8 ]. However, nearly ten<br />
times longer CPU time is still necessary compared with the<br />
Newton-Raphson method.<br />
In this paper, a modified Fixed-Point method, which<br />
updates the derivative <strong>of</strong> reluctivity at each iteration, is<br />
proposed. Furthermore, it is pointed out that the formulation <strong>of</strong><br />
the Fixed-Point method using the derivative <strong>of</strong> reluctivity is<br />
the same as the Newton-Raphson method. The convergence<br />
characteristic <strong>of</strong> the newly proposed Fixed-Point method is<br />
compared with those <strong>of</strong> the Newton-Raphson method.<br />
II. FORMULATION OF NRM AND FPM<br />
A. Newton-Raphson Method<br />
There are two kinds <strong>of</strong> methods which deal with the<br />
nonlinearity in the Newton-Raphson method (NRM). One is<br />
the method A (NRM(B 2 )) which uses ν-B 2 curve. In this<br />
method, the magnetic field strength H is given by<br />
2<br />
H ( B ) B<br />
(1)<br />
B is the flux density. The reluctivity ν is given by<br />
2 H(<br />
B)<br />
( B ) <br />
(2)<br />
B<br />
The other is the method B (NRM(B)) which uses the B-H<br />
curve directly. In this method, the magnetic field strength H is<br />
given by<br />
B<br />
H H(<br />
B )<br />
(3)<br />
B<br />
1) Method A (NRM(B 2 )<br />
The static magnetic field equation can be written as follows<br />
in the case <strong>of</strong> the Newton-Raphson method using the -B 2<br />
curve:<br />
H <br />
( <br />
A)<br />
J<br />
(4)<br />
0<br />
where, A is the magnetic vector potential. J0 is the forced<br />
current density. The Galerkin equation G * i(A (k) ) <strong>of</strong> (4) is given<br />
by<br />
* ( k )<br />
( k 1)<br />
( k 1)<br />
Gi ( A ) <br />
N i ( A ) dV N iJ<br />
0dV<br />
(5)<br />
where, Ni is the interpolation function <strong>of</strong> the edge element.<br />
The residual Gi(A) at the k-th nonlinear iteration is given by<br />
( k ) * ( k )<br />
( k )<br />
G ( A ) G i ( A ) G<br />
( A )<br />
i<br />
( k )<br />
j<br />
i<br />
( k 1)<br />
( k 1)<br />
N i ( A ) dV N iJ<br />
0dV<br />
<br />
( k 1)<br />
( k 1)<br />
N i ( A ) dV A<br />
A<br />
* ( k )<br />
G i ( A ) N ( <br />
<br />
i<br />
( k 1)<br />
Ν ) dV<br />
( k 1)<br />
<br />
( k 1)<br />
( k )<br />
N dV<br />
i<br />
A A<br />
( k )<br />
i<br />
A<br />
j<br />
2<br />
<br />
B<br />
<br />
B<br />
2B<br />
(7)<br />
2<br />
2<br />
Aj<br />
B<br />
Aj<br />
B<br />
Aj<br />
where, A, ν etc. in (6) and (7) are values at the k-th iteration.<br />
∂ν/∂B 2 is the term which represents nonlinear magnetic<br />
properties. The process <strong>of</strong> calculation is as follows:<br />
1) The initial value <strong>of</strong> ν is determined.<br />
2) δA (0) is set to zero.<br />
3) A is updated by A (k) =A (k-1) +δA (k) using δA (k) calculated by<br />
(6).<br />
j<br />
( k )<br />
i<br />
1<br />
(6)
4) ν (k) is calculated using the ν-B 2 curve from B obtained by<br />
A (k) .<br />
5) The process from 3) to 5) is repeated.<br />
6) It is judged to be converged if δB(A (k) ) is less than a<br />
specified small value.<br />
2) Method B (NRM(B))<br />
The static magnetic field equation can be written as follows<br />
in the case <strong>of</strong> the Newton-Raphson method using the B-H<br />
curve:<br />
H J<br />
(8)<br />
0<br />
The Galerkin equation G * i(A) <strong>of</strong> (8) is given by<br />
<br />
* ( k )<br />
G i ( A ) <br />
N i HdV<br />
N iJ<br />
0dV<br />
(9)<br />
The residual Gi(A) at the k-th nonlinear iteration is given by<br />
( k ) * ( k )<br />
( k )<br />
G ( A ) G i ( A ) G<br />
( A )<br />
i<br />
i<br />
( k 1)<br />
( k )<br />
N dV dV dV<br />
i H N iJ<br />
N i H<br />
0<br />
( k 1)<br />
* ( k )<br />
H<br />
( B ) ( k ) (10)<br />
G i ( A ) N <br />
dV<br />
i<br />
Bi<br />
B<br />
( k 1)<br />
* ( k )<br />
H<br />
( B )<br />
( k )<br />
G i ( A ) N i A<br />
dV<br />
B<br />
∂H(B)/∂B is the term which represents nonlinear magnetic<br />
properties.<br />
The process <strong>of</strong> calculation is as follows:<br />
1) The initial value <strong>of</strong> ∂H(B (0) )/∂B is determined.<br />
2) δA (0) is set to zero.<br />
3) A is updated by A (k) =A (k-1) +δA (k) using δA (k) calculated by<br />
(10).<br />
4) H (k) is calculated using the B-H curve from B obtained by<br />
A (k) .<br />
5) The process from 3) to 5) is repeated.<br />
6) It is judged to be converged if δB(A (k) ) is less than a<br />
specified small value.<br />
B. Fixed-Point Method<br />
In the Newton-Raphson method, the reluctivity is updated<br />
in each nonlinear iteration as explained above. In the Fixed-<br />
Point method, the reluctivity is fixed at the first step and it is<br />
not changed during the nonlinear iterations.<br />
According to the concept <strong>of</strong> the Fixed-Point method [1], the<br />
magnetic field strength is given by<br />
H( B)<br />
ν B H<br />
(11)<br />
FP<br />
FP<br />
where, FP is the Fixed-Point reluctivity which is constant<br />
during the nonlinear iterations, HFP is an additional magnetic<br />
field strength.<br />
The static magnetic field equation can be written as follows<br />
in the case <strong>of</strong> the Fixed-Point method:<br />
FP 0 )<br />
( J H B FP<br />
(12)<br />
where, HFP (k) at the k-th nonlinear iteration can be obtained by<br />
the following equation:<br />
( k )<br />
( k1)<br />
( k1)<br />
H FP H(<br />
B ) νFPB<br />
(13)<br />
where, H(B (k-1) ) is the magnetic field strength vector on the B-<br />
H curve corresponding to the flux density B (k-1) at the (k-1)-th<br />
nonlinear iteration. HFP (k) converges to some value after<br />
iterations. The residual Gi(A) <strong>of</strong> (12) is given by<br />
<br />
- 85 - 15th IGTE Symposium 2012<br />
<br />
<br />
( k )<br />
Gi<br />
( A) <br />
( k )<br />
N i ( FP<br />
A ) dV N iJ<br />
0dV<br />
(14)<br />
<br />
( k )<br />
N i H FP dV<br />
By substituting HFP (k) in (13) into HFP (k) in (14), we obtain<br />
( k ) * ( k )<br />
G ( A ) G ( A ) N H<br />
i<br />
i<br />
)<br />
dV<br />
* ( k )<br />
( k 1)<br />
( k 1)<br />
G ( A ) N ( H(<br />
B ) <br />
B ) dV<br />
i<br />
<br />
<br />
<br />
* ( k )<br />
( k )<br />
G ( A ) N ( A<br />
) dV<br />
i<br />
i<br />
i<br />
i<br />
<br />
<br />
FP<br />
FP<br />
FP<br />
i<br />
i<br />
i<br />
FP<br />
( k<br />
FP<br />
( k )<br />
( k 1)<br />
N ( A ) dV N J dV N H ( B ) dV<br />
( k 1)<br />
N ( A ) dV<br />
i 0<br />
( k 1)<br />
( k 1)<br />
N ( A<br />
) dV N J dV N H(<br />
B ) dV<br />
where, δA (k) =A (k) -A (k-1) .<br />
Gi * (A (k) ) is given by<br />
<br />
<br />
*<br />
Gi i FP<br />
i 0<br />
<br />
<br />
i<br />
0<br />
FP<br />
<br />
<br />
<br />
<br />
i<br />
i<br />
2<br />
(15)<br />
( k )<br />
( k )<br />
( A ) <br />
N (<br />
<br />
A ) dV N J dV<br />
(16)<br />
In the actual calculation, (14) is used in the Fixed-Point<br />
method.<br />
The process <strong>of</strong> calculation is as follows:<br />
1) The initial value <strong>of</strong> FP is determined.<br />
2) HFP (0) is set to zero.<br />
3) B (k) is obtained from A (k) which is calculated by (14).<br />
4) HFP (k) is obtained by (13).<br />
5) The right hand side <strong>of</strong> (14) is updated and the process from<br />
3) to 5) is repeated.<br />
6) It is judged to be converged if the change <strong>of</strong> B (k) is less<br />
than the specified small value.<br />
According to (15), we found that HFP (k) is given by<br />
H <br />
A<br />
H H<br />
(17)<br />
( k )<br />
( k )<br />
( k )<br />
( k1)<br />
FP FP<br />
FP<br />
FP<br />
(17) means that the difference HFP (k) is the same as H in (9)<br />
<strong>of</strong> the Newton-Raphson method and it can be used as the<br />
judgment <strong>of</strong> the convergence.<br />
Fig.1 shows the concept <strong>of</strong> the nonlinear magnetic field<br />
analysis using the Fixed-Point method. A white circle on the B<br />
axis is a convergence target. In this method, the reluctivity FP<br />
shown in Fig.1 is given as an initial value, and FP is not<br />
changed during the iterations. The flux density B (1) is obtained<br />
by the linear magnetic field analysis. Next, the HFP (1) which<br />
corresponds to the flux density B (1) on the B-H curve and<br />
FPB (1) on the line <strong>of</strong> FP shown in Fig.1(a) is obtained. During<br />
iterations, HFP (k) becomes the same value, which means the<br />
difference HFP (k) becomes almost zero. Then, the converged<br />
result can be obtained.<br />
C. Modified Fixed-Point Method<br />
In the modified Fixed-Point method, the derivative <strong>of</strong><br />
reluctivity is updated at each iteration. In this expression, the<br />
HFP (k) at the k-th nonlinear iteration in (13) can be rewritten by<br />
the following equation:<br />
( k 1)<br />
( k )<br />
( k 1)<br />
H(<br />
B ) ( k 1)<br />
H H(<br />
B ) B<br />
FP<br />
(18)<br />
B<br />
The residual Gi(A (k) ) is given by<br />
k<br />
k<br />
k<br />
k<br />
k<br />
Gi<br />
G i i<br />
dV <br />
( 1)<br />
( ) * ( )<br />
( 1)<br />
H(<br />
B ) ( 1)<br />
<br />
( A ) ( A ) N H(<br />
B ) B (19)<br />
<br />
B<br />
<br />
(19) can be written as follows:
H<br />
FPB (1) FPB (1)<br />
HB (1) HB νFP<br />
(1) νFP<br />
H (1)<br />
FP<br />
H<br />
FPB (2) FPB (2)<br />
HB (2) HB (2) <br />
H (1)<br />
FP<br />
H (2)<br />
FP<br />
H<br />
FPB (2) FPB (2)<br />
HB (3) HB (3) <br />
H (2)<br />
FP<br />
H (3)<br />
FP<br />
ν<br />
ν<br />
FP<br />
ν<br />
FP<br />
FP<br />
ν<br />
ν<br />
FP<br />
FP<br />
(a)<br />
B (1) B (1)<br />
B (2) B (2)<br />
(b)<br />
<br />
H<br />
B (3) B (3)<br />
<br />
H<br />
( 3 )<br />
FP<br />
<br />
H<br />
( 2 )<br />
FP<br />
( 1 )<br />
FP<br />
B-H curve<br />
( k 1)<br />
( k ) H<br />
( B )<br />
( k ) <br />
G ( A ) NAdV NJdV<br />
i<br />
i<br />
i 0<br />
B<br />
<br />
( k 1)<br />
( k 1)<br />
H<br />
( B )<br />
( k 1)<br />
<br />
<br />
N i H(<br />
B ) dV <br />
N i<br />
A dV<br />
B<br />
(20)<br />
( k 1)<br />
( k 1)<br />
H<br />
( B )<br />
( k ) <br />
<br />
N H(<br />
B ) dV N J dV <br />
N A<br />
dV<br />
i<br />
i 0<br />
i<br />
B<br />
<br />
( k 1)<br />
* ( k ) H<br />
( B )<br />
( k ) <br />
G ( A ) <br />
N A<br />
dV<br />
i<br />
i<br />
B<br />
<br />
(10) and (20) denote that the formulation <strong>of</strong> the modified<br />
Fixed-Point method is the same as that <strong>of</strong> the Newton-<br />
Raphson method.<br />
In the actual calculation <strong>of</strong> the modified Fixed-Point<br />
method, (19) is used.<br />
The process <strong>of</strong> calculation is as follows:<br />
1) The initial value <strong>of</strong> ∂H(B (0) )/∂B is determined.<br />
2) HFP (0) is set to zero.<br />
3) B (k) is obtained from A (k) which is calculated by (19).<br />
4) HFP (k) is obtained by (18).<br />
B<br />
B-H curve<br />
B-H curve<br />
(c)<br />
Fig. 1 Conceptual diagram <strong>of</strong> Fixed-Point method. (a) 1 st step. (b) 2 nd step.<br />
(c) 3 rd step.<br />
B<br />
- 86 - 15th IGTE Symposium 2012<br />
5) The right hand side <strong>of</strong> (19) is updated and the process from<br />
3) to 5) is repeated.<br />
6) It is judged to be converged if the change <strong>of</strong> B (k) is less<br />
than the specified small value.<br />
Fig.2 shows the concept <strong>of</strong> the nonlinear magnetic field<br />
analysis using the modified Fixed-Point method. In this<br />
method, the reluctivity νFP shown in Fig.2 (a) is given as an<br />
initial value, and the derivative ∂H/∂B is updated at each<br />
iteration. At the initial iteration, the linear magnetic field<br />
analysis is carried out using the given ∂H/∂B, and the flux<br />
density B (1) is obtained. Next, H(B (1) ) corresponding to the<br />
flux density B (1) on the B-H curve and<br />
∂H(B (1) )/∂B·B (1) =VFPB (1) on the line ∂H(B (1) )/∂B shown in<br />
Fig.2(a) is obtained. At the first step, HFP (k) =HFP (1) following<br />
the definition <strong>of</strong> HFP (1) in (17). The iteration is carried out<br />
H(B (1) H(B ) (1) )<br />
H<br />
FPB (1) FPB (1)<br />
H (1)<br />
FP<br />
0<br />
H<br />
H(B (2) H(B ) (2) )<br />
FPB (2) FPB (2)<br />
H (1)<br />
FP<br />
0<br />
H<br />
H(B (3) H(B ) (3) )<br />
FPB (3) FPB (3)<br />
H (2)<br />
FP<br />
0<br />
( 1 )<br />
H ( B )<br />
<br />
B<br />
( 2 )<br />
B (1) B (1)<br />
(a)<br />
H ( B )<br />
( 2 )<br />
H FP<br />
<br />
B<br />
<br />
H<br />
<br />
H<br />
( 1 )<br />
FP<br />
( 2 )<br />
FP<br />
B (2) B<br />
( 2)<br />
H(<br />
B )<br />
B<br />
(2)<br />
( 2)<br />
H(<br />
B )<br />
B<br />
(b)<br />
( 3 )<br />
H ( B )<br />
( 3 )<br />
H FP<br />
<br />
B<br />
B (3) B (3)<br />
( 3 )<br />
H ( B )<br />
<br />
B<br />
H ( B )<br />
<br />
B<br />
<br />
H<br />
B-H curve<br />
( 1 )<br />
B<br />
B-H curve<br />
B<br />
B-H curve<br />
( 3 )<br />
FP<br />
B<br />
(c)<br />
Fig. 2 Conceptual diagram <strong>of</strong> Modified Fixed-Point method. (a) 1 st step. (b)<br />
2 nd step. (c) 3 rd step.<br />
3
until δHFP becomes near to zero. H (which corresponds to<br />
HFP in (17)) can be directly obtained by using the Newton-<br />
Raphson method as shown in (10). The modified Fixed-Point<br />
method needs two steps (Eqs. (18) and (19)), but the concept<br />
is the same as that <strong>of</strong> the Newton- Raphson method.<br />
III. ANALYZED MODEL<br />
The modified Fixed-Point method is applied to the analysis<br />
<strong>of</strong> the magnetic field in the billet heater model [9] shown in<br />
Fig.3. Analysis domain <strong>of</strong> the model is 1/8. The material <strong>of</strong><br />
the yoke is 35A230(non-oriented electrical steel). The material<br />
<strong>of</strong> the billet is S45C(carbon steel). The numbers <strong>of</strong> elements<br />
and nodes are 107632 and 115101, respectively. The ampere<br />
turns <strong>of</strong> the coil are set as 70000AT (60Hz). The CPU time<br />
and number <strong>of</strong> iteration <strong>of</strong> the Fixed-Point method (FPM),<br />
modified Fixed-Point method (MFPM), Newton-Raphson<br />
method using ν-B 2 curve (NRM(B 2 )), and Newton-Raphson<br />
method using B-H curve (NRM(B)) are compared. For<br />
simplicity, only the calculation <strong>of</strong> the 1st step <strong>of</strong> the step by<br />
step method for the nonlinear eddy current analysis is carried<br />
out in order to compare the performance <strong>of</strong> each method. As<br />
the total CPU time is almost equal to the multiple <strong>of</strong> number<br />
<strong>of</strong> steps, the comparison <strong>of</strong> only the 1st step is sufficient for<br />
the comparison <strong>of</strong> each method.<br />
IV. RESULTS AND DISCUSSION<br />
Fig.4 shows an example <strong>of</strong> distribution <strong>of</strong> flux density <strong>of</strong><br />
NRM(B) and MFPM. The results <strong>of</strong> NRM(B 2 ) and FPM are<br />
also the same as Fig.4. The comparison <strong>of</strong> the CPU time and<br />
the number <strong>of</strong> iterations are shown in Table I. The<br />
convergence property is shown in Fig.5. The convergence<br />
criterion is B(A) < 2.010 -3 . The convergence criterion<br />
( )<br />
G n<br />
/ G<br />
( 0)<br />
<strong>of</strong> the ICCG method is chosen as less than 10 -5 .<br />
Intel Core2 Duo E8400@ 3.16GHz, 3GB RAM is used. These<br />
results suggest that the convergence property <strong>of</strong> MFPM is near<br />
to that <strong>of</strong> NRM. Especially, MFPM is faster than NRM (B). It<br />
is also clarified that NRM(B 2 ) is faster than NRM(B).<br />
fire-resistant material<br />
billet<br />
y<br />
150<br />
z<br />
x<br />
unit:mm<br />
200<br />
100 100<br />
<br />
yoke<br />
15 15 2510 2510 50 50<br />
adiabator<br />
billet<br />
iron core<br />
<br />
V. CONCLUSIONS<br />
z<br />
x<br />
unit:mm<br />
The obtained results can be summarized as follows:<br />
(a) The formulation <strong>of</strong> the modified Fixed-Point method<br />
y<br />
10<br />
200<br />
25<br />
200<br />
300<br />
15<br />
50<br />
(a) (b)<br />
Fig. 3 Analyzed model <strong>of</strong> billet heater. (a) bird’s eye biew (1/8 region). (b) xy<br />
plane.<br />
- 87 - 15th IGTE Symposium 2012<br />
148<br />
TABLE I<br />
COMPARISON OF CPU TIME AND ITERATIONS<br />
Method CPU Time (sec) Iterations<br />
NRM(B2) 370.08 13<br />
NEM(B) 654.83 28<br />
FPM 3432.24 101<br />
MFPM 781.69 22<br />
PC performance : Intel Core2 Duo E8400@ 3.16GHz, 3GB RAM<br />
Flux density B[T]<br />
3.16<br />
3.24<br />
2.88<br />
2.52<br />
2.16<br />
1.80<br />
1.44<br />
1.08<br />
0.72<br />
0.36<br />
0.00<br />
y<br />
x<br />
(a) (b)<br />
Fig.4 Comparison <strong>of</strong> numerical results <strong>of</strong> Flux distribution using NRM(B)<br />
and MFPM. (a) NRM(B). (b) MFPM.<br />
Number <strong>of</strong> nonconverged<br />
elements<br />
<br />
<br />
<br />
<br />
<br />
<br />
Fig. 5 Convergence Property.<br />
NRM(B2 NRM(B )<br />
NRM(B)<br />
FPM<br />
MFPM<br />
2 )<br />
NRM(B)<br />
FPM<br />
MFPM<br />
<br />
<br />
Iterations<br />
(MFPM) using the derivative <strong>of</strong> reluctivity is almost the<br />
same as that <strong>of</strong> the Newton-Raphson method (NRM).<br />
(b) The modified Fixed-Point method (MFPM) has an<br />
advantage that the CPU time is less than that <strong>of</strong> the<br />
Newton-Raphson method (NRM) in some condition, or<br />
MFPM has almost the same performance as NRM.<br />
Moreover,the programming is easy compared with NRM. <br />
[1]<br />
REFERENCES<br />
F.I.Hantila, G.Preda, and M.Vasiliu : “Polarization method for static<br />
fields”, IEEE Trans. Magn., vol.36, no.4, pp.672-675, 2000.<br />
[2] M.Chiampi, D.Chiarabaglio, and M.Repetto: “A Jiles-Atherton and<br />
fixed-point combined technique for time periodic magnetic field<br />
problems with hysteresis” , IEEE Trans. Magn., vol.31, no.6, pp.4306-<br />
4311, 1995.<br />
[3] D.Miyagi, K.Shimomura, N. Takahashi, H. Kaimori: “Usefulness <strong>of</strong><br />
fixed point method in electromagnetic field analysis in consideration <strong>of</strong><br />
nonlinear magnetic anisotropy”, Digest <strong>of</strong> IEEE CEFC, 2012.<br />
[4] S.Urata, M.Enokizono, T.Todaka, and H.Shimoji: “Magnetic<br />
[5]<br />
characteristic analysis <strong>of</strong> the motor considering 2-D vector magnetic<br />
property”, IEEE Trans. Magn., vol.42, no.4, pp.615-618, 2006.<br />
K.Fujiwara, T.Adachi, and N.Takahashi: “A proposal <strong>of</strong> finite-element<br />
analysis considering two-dimension magnetic properties”, IEEE Trans.<br />
Magn., vol.38, no.2, pp.889-892, 2002.<br />
[6] T.Nakata, N.Takahashi, K.Fujiwara, and N.Okamoto: “Improvements <strong>of</strong><br />
convergence characteristics <strong>of</strong> Newton-Raphson method for nonlinear<br />
4
magnetic field analysis”, IEEE Trans. Magn., vol.28, no.2, pp.1048-<br />
1051, 1992.<br />
[7] E.Dlala, A.Belahcen, and A.Arkkio : “Locally convergent fixed-point<br />
method for solving time-stepping nonlinear field problems”, IEEE Trans.<br />
Magn., vol.43, no.11, pp.3969-3975, 2007.<br />
[8] E.Dlala, A.Belahcen, and A.Arkkio : “A fast fixed-point method for<br />
solving magnetic field problems in media <strong>of</strong> hysteresis”, IEEE Trans.<br />
Magn., vol.44, no.6, pp. 1214 -1217, 2008.<br />
[9] N.Takahashi, S.Nakazaki, D.Miyagi, N.Uchida, K.Kawanaka, and<br />
H.Namba: “3-D optimal design <strong>of</strong> laminated yoke <strong>of</strong> billet heater for<br />
rolling wire rod using ON/OFF method”, Archives <strong>of</strong> Electrical<br />
Engineering, vol.61, no.1, pp.115-123, 2012.<br />
- 88 - 15th IGTE Symposium 2012<br />
5
- 89 - 15th IGTE Symposium 2012<br />
S<strong>of</strong>tware Agent Based Domain Decomposition Method<br />
1) M. Jüttner, 1) A. Buchau, 2) M. Rauscher, 1) W. M. Rucker, and 2) P. Göhner<br />
1) Institute for Theory <strong>of</strong> Electrical Engineering, Pfaffenwaldring 47, D-70569 Stuttgart, Germany<br />
2) Institute <strong>of</strong> Industrial Automation and S<strong>of</strong>tware Engineering, Pfaffenwaldring 47, D-70569 Stuttgart, Germany<br />
E-mail: ite@ite.uni-stuttgart.de<br />
Abstract—A workbench is described, able to divide complex coupled three dimensional multiphysics simulations into<br />
smaller parts based on existing domain decomposition techniques. These parts are calculated by s<strong>of</strong>tware agents allowing to<br />
widely distributes the calculation over multiple distributed computers and even into the cloud to speed up the performance,<br />
to make larger simulations possible and to actively manipulate and control the strategy and the process <strong>of</strong> solving.<br />
Index Terms— cloud computing, coupled multiphysics problems, domain decomposition, s<strong>of</strong>tware agents<br />
different resources like idle workstations, laptops or<br />
smartphones and even online resources located within the<br />
cloud. These resources can be used by established and<br />
multifunctional solving methods like FEM or BEM. FEM<br />
is able to solve non-linear and anisotropic material effects<br />
and lead to large systems <strong>of</strong> non-linear equations. An<br />
alternative to the FEM is the BEM. At BEM only the<br />
surface <strong>of</strong> a model needs to be discretised. This leads to a<br />
much smaller system <strong>of</strong> equations represented in a dense<br />
matrix. The calculation time for the matrix gets<br />
acceptable if we use matrix compression. Therefore the<br />
fast multipole method (FMM) and the adaptive cross<br />
approximation (ACA) can be used [7], allowing to<br />
calculate hysteresis effects for magnetic fields [8].<br />
I. INTRODUCTION<br />
Finding a solution for complex three dimensional<br />
coupled field problems more efficiently is the goal <strong>of</strong> this<br />
approach. Therefore, established methods for numerical<br />
solutions like finite element methods (FEM) and<br />
boundary element methods (BEM) are combined with the<br />
idea <strong>of</strong> s<strong>of</strong>tware agents.<br />
S<strong>of</strong>tware agents are a way to develop flexible and<br />
efficient s<strong>of</strong>tware based on the concept <strong>of</strong> agent oriented<br />
s<strong>of</strong>tware development [1]. Therefore, the system is<br />
divided into autonomous and self-organized agents.<br />
These agents are independent <strong>of</strong> each other and capable<br />
to make decisions within there possibilities. Therefore<br />
they are able to interact with each other via messages and<br />
data exchange. Based on this the agents negotiate to reach<br />
their individual goals. The communication between the<br />
agents also allows a dynamic handling <strong>of</strong> multiple<br />
situations per agent as well as for the global system to<br />
grant dynamic and well fitting agent behaviour. Within<br />
the context <strong>of</strong> agent based systems a systematic<br />
distribution <strong>of</strong> the functions, necessary for solving a<br />
problem, to different agents grant a limited coupling<br />
between different agents and results into an even more<br />
flexible and manageable system. This flexibility and<br />
dynamic allow the approach described in section II to<br />
perfectly handle systems with multiple boundary<br />
conditions and to solve weak coupled systems. The<br />
approach <strong>of</strong> s<strong>of</strong>tware agents is currently well established<br />
in automation technology and used for example for selfmanagement<br />
in automation systems [2], modelling smart<br />
grids [3], prioritization <strong>of</strong> test cases [4] or optimising<br />
electromagnetic field problems [5].<br />
Because <strong>of</strong> the big influence <strong>of</strong> available computer<br />
resources for solving numerical problems nowadays<br />
workstations including multiple multicore CPUs and a<br />
relatively large RAM could be used. To handle these<br />
resources modern programming languages are available<br />
and grant a quite good usage <strong>of</strong> all <strong>of</strong> these resources.<br />
Based on the increase <strong>of</strong> calculation power the problems<br />
getting larger and coupled effects are considered as well.<br />
Nowadays large simulation problems are mostly solved<br />
on huge computer clusters with identical computers. The<br />
usage <strong>of</strong> temporary available resources grant, due to<br />
modern operation systems a large performance alternative<br />
[6]. Products like the Micros<strong>of</strong>t High Performance<br />
Computing Server or the Enterprise Linux Cluster from<br />
Redhat or Suse <strong>of</strong>fer a simple way to spread tasks to<br />
II. COUPLING SOFTWARE AGENTS AND SOLVER<br />
Considering a large coupled simulation, the creation <strong>of</strong><br />
equations including all effects is mostly not reasonable.<br />
Splitting the problem into smaller parts that can be solved<br />
iteratively can reduce the total expense <strong>of</strong> the large nonlinear<br />
problem [9], especially when small changes in the<br />
partial problems can be ignored and do not lead to further<br />
iterations <strong>of</strong> the calculation. Therefore a so called<br />
coordination agent is created. This agent splits the<br />
coupled problem into partial problems. Examples for<br />
different classes <strong>of</strong> partial problems are different single<br />
physic problems as well as geometry or material based<br />
partial problems. All functions <strong>of</strong> the coordination agent<br />
are described in section II.B. Then, the partial problems<br />
are assigned to different calculation agents. Fig. 1<br />
describes the cooperation <strong>of</strong> s<strong>of</strong>tware agents.<br />
Fig. 1: Concept <strong>of</strong> cooperation agents
Each calculation agent solves its problem with an<br />
optimized approach for its partial problem. Due to that a<br />
combination <strong>of</strong> multiple methods like FEM or BEM for<br />
different partial problems are possible. This allows a<br />
combination <strong>of</strong> the advantages <strong>of</strong> FEM and BEM for<br />
multiple different calculation resources. For the<br />
calculation agents there is no need to be within one<br />
system. Different resources can be used if the calculation<br />
agents are distributed to multiple computers or even the<br />
cloud as displayed in Fig. 2. The calculation agents are<br />
described in detail in section II.C. The collection <strong>of</strong> these<br />
s<strong>of</strong>tware agents is able to solve coupled problems. The<br />
necessity <strong>of</strong> this new approach handling multiple physics<br />
and large systems can be seen in [10]. The simulation was<br />
only possible with height effort to reach convergence.<br />
<br />
Fig. 2: Distributed Agents<br />
A. Steps to a successful simulation<br />
Setting the approach into its context and describing the<br />
process <strong>of</strong> creating and solving a complex coupled<br />
problem with this approach is topic <strong>of</strong> this subsection.<br />
The process is visualized in Fig. 3.<br />
Build a finer<br />
mesh<br />
Modelling the system with a FEM-s<strong>of</strong>tware<br />
Including mesh and boundary conditions<br />
Export mesh and boundary conditions<br />
Divide mesh into smaller parts<br />
Mapping the boundary conditions<br />
Mesh-management within the agent-system<br />
Calculation Agent 1<br />
Solve partial problem<br />
Parallel and independent<br />
Exchange <strong>of</strong> boundary<br />
conditions<br />
Exchange <strong>of</strong> status<br />
Exchange <strong>of</strong> results<br />
no<br />
status,<br />
boundary<br />
conditions<br />
convergence<br />
yes<br />
Combine solutions<br />
Calculation<br />
Agent<br />
2<br />
...<br />
conceivable<br />
Calculation<br />
Agent<br />
n<br />
Fig. 3: The process for a simulation<br />
- Initially a model estimating the actual problem<br />
needs to be created. This approach does not set any<br />
special requirements to the model itself. So the model can<br />
be created with commonly used CAD-s<strong>of</strong>tware tools. The<br />
same holds for the creation <strong>of</strong> the mesh <strong>of</strong> the model, so<br />
common meshing-tools can be used. For complex<br />
coupled problems it is important to consider all effects<br />
within one single mesh because the calculation is<br />
influenced by the geometry <strong>of</strong> all physics as well as their<br />
coupling.<br />
- To allow any solver to create a suitable solution, the<br />
boundary conditions for the given mesh including all<br />
coupled physics has to be set. The boundary conditions as<br />
well as the previously created mesh have to be exported<br />
in a way it can be handed over to the coordination agent.<br />
- 90 - 15th IGTE Symposium 2012<br />
- The mesh and the boundary conditions are now<br />
handed to a coordination agent. The coordination agent is<br />
responsible for finding a solution for the given problem.<br />
Therefore it splits the problem into smaller parts. The<br />
quantity <strong>of</strong> smaller parts depends on the number <strong>of</strong><br />
available calculation agents or is defined manually. In<br />
case <strong>of</strong> a resource based splitting the assignment <strong>of</strong> the<br />
partial problems to the different calculation agents is<br />
intuitive. In case <strong>of</strong> a manual quantity <strong>of</strong> partial problems<br />
each partial problem is solved iteratively according to<br />
availability <strong>of</strong> resources. The partial problems are now<br />
distributed to the available calculation agents.<br />
- The calculation agents now start solving their<br />
allocated partial problem. Because <strong>of</strong> each calculation<br />
agent running on its own hardware the agent is able to use<br />
all resources <strong>of</strong> this hardware to solve its partial problem<br />
fast and highly parallel. If a new calculation agent,<br />
including new hardware resources, appears in the system<br />
the coordination agents has to determine if the resource<br />
should used and which agent splits its partial problem. In<br />
case <strong>of</strong> a drop out <strong>of</strong> a calculation agent the coordination<br />
agent needs to attach the partial problem to another<br />
calculation agent. This behaviour allows a dynamic<br />
adaption <strong>of</strong> the system. To do so, information based on<br />
status <strong>of</strong> different calculation agents has to be distributed.<br />
- To allow the system <strong>of</strong> agents to solve a coupled<br />
problem, the calculation agents has to exchange results<br />
between each other as soon as they are available. If a<br />
calculation agent is able to understand and interpret the<br />
results <strong>of</strong> another calculation agent, new boundary<br />
conditions can be derived and integrated into the own<br />
calculation. This process continues until all calculation<br />
agents finished. If some partial problems do not reach<br />
convergence the calculation can be interrupted and<br />
reinitialised with a new set <strong>of</strong> partial problems without<br />
recalculating successfully solved parts.<br />
- Finally the coordination agent combines all results<br />
depending on the way they were split before and returns<br />
the result to the user.<br />
An example describing the advantages <strong>of</strong> this approach<br />
is shown in section II.E.<br />
B. The Coordination Agent<br />
The coordination agent is an independent program with a<br />
small set <strong>of</strong> functions. It’s visible to the in- and outside<br />
and represents the interface between the users and the<br />
calculation agents. The graphical user interface (GUI)<br />
provides the interface to the user. The GUI is controls all<br />
functions described below. The internal interface is<br />
realised via a message system handling different types <strong>of</strong><br />
messages received from other agents. In addition to that<br />
the coordination agent <strong>of</strong>fers process variables like the<br />
convergence criteria and the overall progress, so each<br />
problem needs to be assigned to at least one coordination<br />
agent. The different functions <strong>of</strong> the coordination agent<br />
are summarized in Fig. 4 and described in detail in the<br />
following.<br />
- Via its GUI the coordination agent <strong>of</strong>fers the<br />
interface to load a problem. A second problem can only<br />
be loaded, if the first is solved and the results are either<br />
collected by the user or the actual calculations are<br />
interrupted and possible results are dropped. The GUI is<br />
additionally used to display calculated results.
- Solving a model only gets possible if calculation<br />
agents are available. These agents need to be able to solve<br />
all different classes <strong>of</strong> the actual problem. Therefore the<br />
coordination agent collects and manages information<br />
about all available calculation agents and their<br />
possibilities to solve problems. In this context economical<br />
aspects can also be considered within the process <strong>of</strong><br />
solving by the possibility to weight different agents. In<br />
case <strong>of</strong> multiple coordination agents working in the same<br />
surrounding it’s necessary to care about the status <strong>of</strong><br />
agents to avoid multiple tasks for the same agent.<br />
- To instruct a calculation agent to solve a partial<br />
problem, the partial problem must be created. In case <strong>of</strong> a<br />
simple problem and a single calculation agent able to<br />
solve the problem, the partial problem can be the problem<br />
and can directly be handed over to the calculation agent.<br />
In all other cases the problem must be split.<br />
An obvious splitting for weak coupled systems is based<br />
on the different types <strong>of</strong> physics. Further opportunities<br />
for splitting results out <strong>of</strong> the method <strong>of</strong> BEM-FEM<br />
coupling (combining the positive effects on both methods<br />
by calculating non-linear equations with FEM while the<br />
linear once are calculated with BEM). An approach<br />
splitting the different physics and considering BEM/FEM<br />
coupling is realised for electromagnetic field problems in<br />
[11]. There it was shown that iterative coupling <strong>of</strong> BEM<br />
and FEM results in an increase <strong>of</strong> convergence compared<br />
to a strong coupling for the different physics. So this<br />
segmentation based on the different type <strong>of</strong> equations and<br />
physics is used.<br />
Another way to decompose different domains is based<br />
on regions solvable with the same numerical method. The<br />
regions are usually segmented by borders <strong>of</strong> the different<br />
materials. This is especially useful for distributed<br />
calculation and for different discretisation size within one<br />
model. The idea as well as a domain decomposition based<br />
on the number <strong>of</strong> available resources is realised for FEM<br />
in FETI [12] and for BEM in BETI [13].<br />
Another idea is based on overlapping regions only<br />
considering Dirichlet boundary conditions [14]. This is <strong>of</strong><br />
special interest, if we take a look at the amount <strong>of</strong> data<br />
exchanged between different agents.<br />
All mentioned methods for domain decomposition<br />
have in common, that the decomposition has to be done<br />
before the actual solving is done. A flexible or dynamic<br />
adaption to results or partial solutions gets possible with<br />
this agent based approach. This rapidly increases the<br />
speed <strong>of</strong> convergence for a complex simulation. So this<br />
approach gets more flexible, more dynamic and more<br />
adapted to available resources compared to existing<br />
domain decomposition algorithms.<br />
- In a next step the partial problems need to be<br />
distributed to the different calculation agents. Therefore<br />
the coordination agent reserves required calculation<br />
agents. Further it shares all necessary information<br />
including the actual partial problem and initialises the<br />
solving process.<br />
If a calculation agent finishes, a notification is received<br />
by the coordination agent. The coordination agent then<br />
updates its progress variables and checks, if all other<br />
agents working on the same problem have finished. In<br />
- 91 - 15th IGTE Symposium 2012<br />
this case the overall solution is available. If other agents<br />
are still working, only a partial solution can be <strong>of</strong>fered. In<br />
case <strong>of</strong> no solution can be found a creation <strong>of</strong> a finer<br />
mesh and an initialisation <strong>of</strong> the splitting process are done<br />
in hope to find solvable partial problems. The user finally<br />
gets informed about these circumstances.<br />
Fig. 4: The Coordination agent<br />
C. The Calculation Agent<br />
Calculation agents are independent programs. At their<br />
start up all necessary parameters are set independent from<br />
a usage by a coordination agent. The functions <strong>of</strong> the<br />
calculation agent are summarized in Fig. 5 and described<br />
in detail in the following.<br />
- Before a calculation agent can <strong>of</strong>fer its service to a<br />
coordination agent, it has to complete its description.<br />
Therefore a unique name must be set for each calculation<br />
agent while it’s initialised. Also the status the agent is<br />
currently in and the problem classes the calculation agent<br />
is capable <strong>of</strong> solving needs to be specified as well. The<br />
problem classes the calculation agent is able to solve<br />
depend on the specific solver the calculation agent is<br />
connected to. To reach a flexible system and to allow<br />
calculation agents to connect to different solvers, a solver<br />
interface is created capable <strong>of</strong> handling all data and<br />
information connected to the solver. The solver interface<br />
is described in detail in section II.D. A important<br />
information is the commissioner <strong>of</strong> the actual tasks the<br />
calculation agent is working for. Therefore the name <strong>of</strong><br />
the coordination agent is stored within each agent to send<br />
notifications to the commissioner if this gets necessary.<br />
- To manage the solver interface and to satisfy its<br />
needs is the major task for a running calculation agent.<br />
Therefore the agent provides all information requested by<br />
a solver and pass them onto the solver interface.<br />
Examples are the initial boundary conditions that are<br />
received from the coordination agent and the tolerances<br />
for the solver.<br />
The calculation agent also has to make sure, when ever<br />
another calculation agent reports an available result, the<br />
calculation agent has to check whether the result does<br />
influence the own calculation or not. In case <strong>of</strong> an<br />
influence, a re-initialisation <strong>of</strong> the solver process is<br />
necessary. This includes a stop <strong>of</strong> the actual solver<br />
process, an update <strong>of</strong> the boundary conditions after<br />
calculation agent has received the result from the other<br />
calculation agent and a start <strong>of</strong> the new initialized solver.<br />
In case <strong>of</strong> a successful calculated partial solution the<br />
calculation agent notify all calculation agents connected<br />
to the problem about this result and distribute the result<br />
about the new boundary conditions if they are requested.<br />
Also the coordination agent needs to be informed about
the successful calculation and the availability <strong>of</strong> the<br />
results. In case <strong>of</strong> a failure or a not converging solving<br />
algorithm chosen by the calculation agent, the<br />
coordination agent also has to be informed. The same<br />
holds for a drop <strong>of</strong> available solver resources.<br />
- Whenever a calculation agent is started a GUI<br />
provided by each calculation agent is displayed. This GUI<br />
allows setting the calculation agent parameters as well as<br />
connecting it to a solver interface includes setting<br />
necessary parameters therefor. Examples for these<br />
parameters are the host the solver is running on, the port<br />
this host allows to establish a connection and the problem<br />
classes that could be solved with the given solver. All<br />
functions the GUI provides are needed whenever the<br />
coordination agent instructs a calculation agent to solve a<br />
problem. Therefore the GUI provides functions like<br />
loading a model, starting the calculation, extracting the<br />
results and a possibility to cancel the actual solving<br />
process. These functions can also being used without a<br />
connection to a coordination agent. In this case the<br />
calculation agent solves simple problems on its own.<br />
- To track the actual process <strong>of</strong> solving a partial<br />
problem and to understand the behaviour <strong>of</strong> a calculation<br />
agent each calculation agent <strong>of</strong>fers a separate function <strong>of</strong><br />
writing a log file and displaying it within the GUI.<br />
To grant the availability for the coordination agent during<br />
the complete process <strong>of</strong> solving and to allow the<br />
calculation agent to react flexible to information from<br />
other agents at least two threads are created within the<br />
calculation agent. The first thread represents the<br />
functionality <strong>of</strong> the calculation agent. Further threads are<br />
used by the solver and its interface to solve the partial<br />
problem. Only the realisation with multiple threads<br />
allows the calculation agent to handle requests form the<br />
coordination agent as well as checking for changes in the<br />
boundary setting and interrupt in case <strong>of</strong> necessity while<br />
the solver is calculating a solution for the problem. A<br />
quick and efficient message exchange allows sending and<br />
receiving as well as processing messages with very little<br />
delays is another important part to grant the flexibility <strong>of</strong><br />
the system. Consequences for the solver <strong>of</strong> received<br />
messages are handled by the solver interface.<br />
Fig. 5: The Calculation Agent<br />
D. The Solver Interface<br />
The solver interface is part <strong>of</strong> the calculation agent. It is<br />
the bridge between the calculation agent and a solver. It<br />
allows the calculation agent to solve at least on problem<br />
class. The functions <strong>of</strong> the solver interface are described<br />
in the following and summarize Fig. 6.<br />
- To establish a connection between the calculation<br />
agent and different solvers the solver interface can be<br />
understood as a collection <strong>of</strong> libraries controlling a<br />
variety <strong>of</strong> solvers. While starting the calculation agent the<br />
specific type <strong>of</strong> solver must be selected and parameters<br />
- 92 - 15th IGTE Symposium 2012<br />
for the reachability <strong>of</strong> the solver must be set. Examples<br />
are the host name and the network port the solver can be<br />
reached or the local running solver application. In case <strong>of</strong><br />
multiple solver interfaces managed by a single calculation<br />
agent it gets possible to create calculation agents with the<br />
possibility to solve multiclass problems.<br />
- If a connection to a solver is established the<br />
necessary parameters need to be set. Therefore the partial<br />
problem received by the calculation agent must be<br />
translated to a form the solver does understand and<br />
passed to the solver.<br />
- In the next step the solver interface starts the solver.<br />
- While the solver is calculating the major task <strong>of</strong> the<br />
solver interface is to control the solver and manage its<br />
output. This includes monitoring all information created<br />
by the solver as well as interrupting the solver for<br />
checking possible changes due to results <strong>of</strong> other agents.<br />
Additional information like the convergence behaviour <strong>of</strong><br />
the actual partial problem, the availability <strong>of</strong> the solver,<br />
its resources and a guess for the remaining calculation<br />
time are collected by the solver interface and passed to<br />
the calculation agent. The analyses <strong>of</strong> the information<br />
allow a quick reaction from the calculation agent and also<br />
the coordination agent to any changes <strong>of</strong> the system. An<br />
example is a temporary unavailable solver. Due to that<br />
act the calculation agent gets temporary unable to solve<br />
problems and has to disconnect itself from the<br />
coordination agent and the problem. Then the<br />
coordination agent has to find an alternative for the<br />
calculation agent to successfully solve the problem.<br />
Another example concerns the possibility <strong>of</strong> convergence.<br />
If the calculation agent recognizes a convergence is<br />
unlikely, the calculation agent has to reconsider the<br />
chosen form <strong>of</strong> the solver or in the worst case a message<br />
has to be sent to the coordination agent to replace the<br />
actual splitting by a different on.<br />
- After a successful calculation the solver interface<br />
notifies the calculation agent to inform other agents about<br />
the available result. All information connected to the<br />
message exchange between different agents is handled by<br />
the calculation agent. The solver interface only takes care<br />
about the information directly connected to the solver.<br />
If the result is requested, the solver interface extracts the<br />
result and translates it into a form that can be shared with<br />
other agents. In case the result or the solver is no longer<br />
needed the solver interface detach the connection to the<br />
solver, release reserved resources and initialise the<br />
deregistering process <strong>of</strong> the calculation agent.<br />
Fig. 6: The solver interface<br />
E. Processing Details<br />
The way a problem is solved do significantly depend<br />
on the timing <strong>of</strong> the agents finishing their calculations<br />
and informing others agents about their results. So in<br />
coupled systems not every effect has the same meaning at<br />
each moment within the process <strong>of</strong> solving. The time to
calculate a partial result for two physics depends for an<br />
identic mesh on the linearity <strong>of</strong> the materials for the<br />
different physics as well as on the resources each agent is<br />
able to use. Therefore the dependent partial results are<br />
<strong>of</strong>fered at a different time and the timing issue to the<br />
global system needs to be taken special care <strong>of</strong>.<br />
As an example the temperature at a circuit board after a<br />
certain time should be calculated like it’s shown in Fig. 7.<br />
The system consists <strong>of</strong> one coordination agent and two<br />
calculation agents. The first calculation agent calculates<br />
the electric field and as a side effect, the resistive losses.<br />
The second agent takes care <strong>of</strong> the calculation <strong>of</strong> the heat<br />
conduction from the transistor. In the described approach<br />
the recalculation <strong>of</strong> the overall temperature simulation<br />
will automatically be initialised if the result <strong>of</strong> the electric<br />
simulation including the resistive losses is present and do<br />
significantly change the result <strong>of</strong> the temperature<br />
simulation. Because <strong>of</strong> the parallel calculation <strong>of</strong> the<br />
circuit board it’s only necessary to recalculate some<br />
values <strong>of</strong> the matrix. Fig. 8 and Fig. 9 show two different<br />
scenarios for handling the different calculation times.<br />
Fig. 7: Coupled Problem<br />
In Fig. 8 a calculation procedure is assumed where the<br />
calculation agent responsible for solving the electric field<br />
problem has finished its calculation first. In that case the<br />
calculation agent responsible for the temperature has to<br />
check whether the heat radiated from the electric current<br />
does significantly change the own result or can be<br />
ignored. In the example the heat has to be considered.<br />
Therefore the temperature calculation agent has to update<br />
its calculation based on the result <strong>of</strong> the electric<br />
calculation agent. Therefore it adapts its boundary<br />
conditions to the result and recalculates again. If the<br />
temperature calculation agent also finishes and in case <strong>of</strong><br />
no more calculation agents working on the problem the<br />
- 93 - 15th IGTE Symposium 2012<br />
coordination agent requests all calculated partial results<br />
and combines them to a single result that is finally<br />
<strong>of</strong>fered to the user.<br />
Fig. 8: Solution case I<br />
In Fig. 9 the opposite case is considered. The<br />
calculation agent responsible for solving the temperature<br />
problem finishes first. In this simplified case the<br />
temperature does not have any influence on the electric<br />
conductivity so the electric calculation agent responses a<br />
“not acknowledge” (NACK) to the given information<br />
about a result. This NACK means that there is no<br />
influence expected from the partial result <strong>of</strong> the<br />
temperature calculation agent to the electric calculation<br />
agent. Then the electric calculation agent passes on and<br />
finishes its calculation regularly. If the temperaturecalculation<br />
agent obtains the result <strong>of</strong> the electriccalculation<br />
agent it checks its result and recalculates it as<br />
in the previous case. The following continues equally.<br />
Fig. 9: Solution case II
III. IMPLEMENTATION ENVIRONMENT<br />
A. The Agent System<br />
The actual implementation is using the Java Agent<br />
Development Framework (JADE) as a middle-ware to<br />
implement the mentioned agents. JADE is a Java agent<br />
based development framework. It is distributed by<br />
Telecom Italia and currently available under Lesser<br />
General Public License Version 2 (LGPLv2) with the<br />
latest version 4.2.0 and a release on 26 th June 2012. The<br />
implementation in Java, grants a system and operating<br />
independent environment for usage. The minimal system<br />
requirement is a running Java 1.4 runtime-environment<br />
available for mostly every system and even smartphones.<br />
The agent communication is based on a protocol<br />
containing seven layers that ensure the correct<br />
transmission and reception <strong>of</strong> message from different<br />
types. The complete system is thereby based on a<br />
standard <strong>of</strong>fered by the “Foundation for Intelligent<br />
Physical Agents” (FIPA) that was inherited by IEEE in<br />
2005<br />
B. The Solver<br />
Each calculation agent can connect itself to two different<br />
solvers. As a FEM-s<strong>of</strong>tware, COMSOL Multiphysics<br />
[15] <strong>of</strong>fers a bench <strong>of</strong> modern algorithms able to solve<br />
coupled problems. It also includes the creation <strong>of</strong> a mesh<br />
with multiple mesh types. In this context it is quite useful<br />
that all elements available in the s<strong>of</strong>tware can be reached<br />
via the Java-API COMSOL Multiphysics <strong>of</strong>fers. It is also<br />
possible to run the s<strong>of</strong>tware as a server and connect to the<br />
server via an <strong>of</strong>fered jar library for Java-programs. The<br />
library also passes results that can be visualised within a<br />
GUI. The prototype <strong>of</strong> the GUI for the calculation agent<br />
is based on the example for the usage <strong>of</strong> the COMSOL<br />
API. Within the API the GUI and the solver are already<br />
realised in parallel threats and notifications are send when<br />
a task starts or finishes. This helps initialising further<br />
events. Necessary functions to control the solver and its<br />
behaviour are also implemented. As a BEM-s<strong>of</strong>tware the<br />
calculation agents are prepared to connect to FAMU [8].<br />
IV. CONCLUSION<br />
This approach will efficiently solve highly complex<br />
three dimensional coupled field problems based on the<br />
idea <strong>of</strong> s<strong>of</strong>tware agents spread onto multiple distributed<br />
computers including the cloud. The distributed computers<br />
run a so called calculation agent, able to solve smaller<br />
problems. The total expenditure for finding a solution is<br />
reduced by the creation <strong>of</strong> multiple smaller equation<br />
systems. The complex coupled problem is split by a<br />
variation <strong>of</strong> already established domain decomposition<br />
methods into these smaller problems. The domain<br />
decomposition used, is based on different physics and<br />
different material properties as well as geometrical<br />
aspects. Every partial problem is then assigned to a<br />
calculation agent and handled for its own. Finding a<br />
solution <strong>of</strong> a coupled problem only gets possible by the<br />
communication and negotiation between the different<br />
agents. The communication also allows to dynamically<br />
adapt the system to new surrounding and to find<br />
convergence in coupled systems<br />
- 94 - 15th IGTE Symposium 2012<br />
An additional approach to reduce the effort for finding<br />
a solution is reached by the independent decision agents<br />
are allowed to take. This concerns especially the way the<br />
calculation agents solving the given partial problem. This<br />
includes the decision for a numerical method like FEM or<br />
BEM and the reaction on changed boundary conditions.<br />
Meaning, slightly modified boundary conditions are<br />
skipped for the partial result if no change in the result is<br />
expected instead <strong>of</strong> initialisation a new calculation cycle.<br />
The domain decomposition and the coordination <strong>of</strong> the<br />
interworking <strong>of</strong> different calculation agents are handled<br />
by a so called coordination agent. In this paper the<br />
realisation <strong>of</strong> a calculation agent as well as the realisation<br />
<strong>of</strong> a coordination agent with their different functions and<br />
their interworking is described.<br />
REFERENCES<br />
[1] N. Jennings, "Agent-Oriented S<strong>of</strong>tware Engineering," in Multi-<br />
Agent System Engineering, Berlin, Springer, 1999, pp. 1-7.<br />
[2] H. Mubarak and P. Göhner, "An agent-oriented approach for selfmanagement<br />
<strong>of</strong> industrial automation systems," 8th International<br />
Conference on Industrial Informatics, pp. 721-726, 2010.<br />
[3] M. Pipattanasomporn, H. Feroze and S. Rahman, "Multi-agent<br />
systems in a distributed smart grid: Design and implementation,"<br />
Power Systems Conference and Exposition, pp. 1-8, 2009.<br />
[4] C. Malz, N. Jazdi and P. Göhner, "Prioritization <strong>of</strong> Test Cases<br />
Using S<strong>of</strong>tware Agents and Fuzzy Logic," 5th Conference on<br />
S<strong>of</strong>tware Testing, Verification and Validation, pp. 483-486, 2012.<br />
[5] D. G. Lymperopoulos, N. L. Tsitsas and D. I. Kaklamani, "A<br />
Distributed Intelligent Agent Platform for Genetic Optimization in<br />
CEM: Applications in a Quasi-Point Matching Method,"<br />
Transactions on Antennas and Propagation, vol. 55, no. 3, pp.<br />
619-628, 2007.<br />
[6] A. Buchau, S. M. Tsafak, W. Hafla and W. M. Rucker,<br />
"Parallelization <strong>of</strong> a Fast Multipole Boundary Element Method<br />
with Cluster OpenMP," Transactions on Magnetics, vol. 44, no. 6,<br />
pp. 1338-1341, 2008.<br />
[7] A. Buchau, W. M. Rucker, O. Rain, V. Rischmuller, S. Kurz and<br />
S. Rjasanow, "Comparison between different approaches for fast<br />
and efficient 3-D BEM computations," Transactions on Magnetics,<br />
vol. 39, no. 3, pp. 1107- 1110, 2003.<br />
[8] A. Buchau, W. Hafla, F. Groh and W. M. Rucker, "Fast multipole<br />
method based solution <strong>of</strong> electrostatic and magnetostatic field<br />
problems," Computing and Visualization in Science, vol. 8, no. 3,<br />
pp. 137-144, 2005.<br />
[9] V. Rischmuller, S. Kurz and W. M. Rucker, "Parallelization <strong>of</strong><br />
coupled differential and integral methods using domain<br />
decomposition," Transactions on Magnetics, vol. 38, no. 2, pp.<br />
981-984, 2002.<br />
[10] P. Alotto, M. Guarnieri and F. Moro, "A Fully Coupled Three-<br />
Dimensional Dynamic Model <strong>of</strong> Polymeric Membranes for Fuel<br />
Cells," Transactions on Magnetics, vol. 46, no. 8, pp. 3257-3260,<br />
2010.<br />
[11] J. Albert, R. Banucu, W. Hafla and W. M. Rucker, "Simulation<br />
Based Development <strong>of</strong> a Valve Actuator for Alternative Drives<br />
Using BEM-FEM Code," Transactions on Magnetics, vol. 45, no.<br />
3, pp. 1744-1777, 2009.<br />
[12] C. Farhat and F.-X. Roux, "A method <strong>of</strong> finite element tearing and<br />
interconnecting and its parallel solution algorithm," International<br />
Journal for Numerical Methods in Engineering, vol. 32, no. 6, pp.<br />
1205-1227, 1991.<br />
[13] U. Langer and O. Steinbach, "Boundary Element Tearing and<br />
Interconnecting Methods," Computing, vol. 71, no. 3, pp. 205-228,<br />
2003.<br />
[14] D. Lavers, I. Boglaev and V. Sirotkin, "Numerical solution <strong>of</strong><br />
transient 2-D eddy current problem by domain decomposition<br />
algorithms," Transactions on Magnetics, vol. 32, no. 3, pp. 1413-<br />
1416, 1996.<br />
[15] COMSOL AB, Tegnérgatan 23, SE-111 40 Stockholm.
- 95 - 15th IGTE Symposium 2012
- 96 - 15th IGTE Symposium 2012
- 97 - 15th IGTE Symposium 2012
- 98 - 15th IGTE Symposium 2012
- 99 - 15th IGTE Symposium 2012
- 100 - 15th IGTE Symposium 2012
- 101 - 15th IGTE Symposium 2012<br />
Human exposure to the magnetic field<br />
produced by MFDC spot welding systems<br />
D. Bavastro ∗ , A. Canova ∗ , L. Giaccone ∗ , M. Manca ∗ , M. Simioli †<br />
∗ Politecnico di Torino - Dipartimento Energia, C.so Duca degli abruzzi 24, 10129 Torino, Italy<br />
† KGR S.p.a, via Nicolao Cena 65, 10032 Brandizzo, Italy<br />
E-mail: luca.giaccone@polito.it<br />
Abstract—In this paper the magnetic field emission <strong>of</strong> Medium Frequency Direct Current (MFDC) spot<br />
welding system is analyzed with reference to the exposure <strong>of</strong> working population. In the first part <strong>of</strong> the<br />
paper experimental measurements have been carried out in order to get the magnetic field emission <strong>of</strong><br />
a selected MFDC system. The measurement is performed in time domain acquiring the waveform <strong>of</strong> the<br />
magnetic field. In the second part <strong>of</strong> the paper the methodologies suggested by the ICNIRP guidelines have<br />
been adopted for the analysis <strong>of</strong> the waveforms: the equivalent frequency method, the multiple frequency method<br />
and the weighted multiple frequency method. An exhaustive comparison <strong>of</strong> the possible methodology suggested<br />
by the guidelines is given and contextualized in the regulatory framework. The emission <strong>of</strong> MFDC spot<br />
welding system has been completely characterized. By means <strong>of</strong> the spectral analysis it is found that the<br />
overcoming <strong>of</strong> the limit is mainly due to the 2000 Hz and 4000 Hz components. This result is useful for<br />
manufacturers because, in order to minimize the overall emission, it is possible to think about a mitigation<br />
system that encloses only the related internal components.<br />
Index Terms—pulsed magnetic fields, quasi-static magnetic fields, spot welding, MFDC<br />
I. INTRODUCTION<br />
Protection <strong>of</strong> the working population against<br />
the possible effects <strong>of</strong> extremely low frequency<br />
(ELF) electromagnetic fields is a concern <strong>of</strong> the<br />
European Community, which has published 2004<br />
Directive 2004/40/EC [1]. The Directive refers to<br />
the risk to the health and safety <strong>of</strong> workers due<br />
to known short-term adverse effects in the human<br />
body caused by the circulation <strong>of</strong> induced currents,<br />
by energy absorption, and contact currents. One <strong>of</strong><br />
the most important points stated in the Directive is<br />
the rationale <strong>of</strong> exposure at low frequency which<br />
is defined in accordance with ICNIRP 1998 guidelines<br />
[2]. These Guidelines report that in the ELF<br />
frequency range, the risk to the health and safety <strong>of</strong><br />
workers is due to known short-term adverse effects<br />
caused by the circulation <strong>of</strong> induced currents in the<br />
human body.<br />
INCIRP guidelines and directive 2004/40/EC<br />
provide a definition <strong>of</strong> reference or action values<br />
(i.e., values which can be directly measured like<br />
magnetic flux density) and exposure limit values<br />
(i.e., limit which are based directly on established<br />
health effects and biological considerations like<br />
current density).<br />
In this paper the exposure to the magnetic field<br />
produced by Medium Frequency Direct Current<br />
(MFCD) spot welding devices is analyzed. For this<br />
application the magnetic field waveform is pulsed<br />
and non-sinusoidal. While continuous wave mode<br />
<strong>of</strong> exposure is strictly defined in ICNIRP guidelines,<br />
the evaluation for pulsed or non-sinusoidal<br />
magnetic field waveforms is still an open question<br />
[3], [4], [5], [6], [7], [8], [9]. In the ICNIRP guidelines<br />
(year 1998), the problem <strong>of</strong> non-sinusoidal<br />
waveforms was tackled by means <strong>of</strong> superposition<br />
<strong>of</strong> harmonic values. This approach, even if possible,<br />
has been highly criticized afterwards because<br />
<strong>of</strong> an excessive conservative estimates <strong>of</strong> exposure<br />
levels. Due to the increasing importance <strong>of</strong> nonsinusoidal<br />
sources <strong>of</strong> magnetic fields, in 2003<br />
ICNIRP has published a new guideline for pulsed<br />
and complex non-sinusoidal waveforms [10]. This<br />
document is strongly based on the result obtained<br />
from K. Jokela [11]. It addresses the exposure evaluation<br />
in non-sinusoidal conditions by means <strong>of</strong><br />
proper weighting factors to be applied to different<br />
harmonic components <strong>of</strong> the waveform spectrum.<br />
In 2010 the ICNIRP published a new set <strong>of</strong><br />
guidelines [13]. There are two main differences<br />
between this document and the older one: 1) the<br />
dosimetric quantity used for ELF electromagnetic<br />
field is E (V/m) instead <strong>of</strong> J (A/m2 ). 2) The limits<br />
imposed on action values have been increased<br />
as can be observed in Fig. 1.<br />
From the analysis <strong>of</strong> the current literature, several<br />
papers analyze the same problem by computing<br />
the spectrum <strong>of</strong> the measured welding current.
Fig. 1. Reference levels for occupational exposure to time<br />
varying magnetic field. Comparison between 1998 and 2010<br />
values.<br />
Afterward, the welder is usually modeled as a<br />
coil and, simulations are performed in frequency<br />
domain for each spectral component <strong>of</strong> the current<br />
in order to derive the current density inside a<br />
human model or a simplified and standardized<br />
model [8], [9], [12]. Even if these kind <strong>of</strong> simulations<br />
are not an easy task, the procedure is <strong>of</strong>ten<br />
preferred because it is easier to measure the current<br />
in time domain rather than the magnetic field,<br />
excpecially for quasi-rectangular waveform [14].<br />
The drawback <strong>of</strong> this procedure is that assuming<br />
a spectrum for the current, the generated magnetic<br />
field is characterized by the same spectrum in all<br />
the surrounding point due to the neglection <strong>of</strong> the<br />
nonlinear electrical devices inside the body <strong>of</strong> the<br />
welder.<br />
In this paper the different methodologies provided<br />
by the ICNIRP to analyze pulsed and nonsinusoidal<br />
magnetic field will be applied to the<br />
MFCD spot welding devices. The actual waveform<br />
<strong>of</strong> the magnetic field have been measured taking<br />
care to the possible measurement problem [14].<br />
Finally, in order to compute the exposure level<br />
the limit provided by the ICNIRP 1998 has been<br />
employed due to the fact that currently the Italian<br />
regulation framework refers to those guidelines.<br />
II. MFDC SYSTEMS<br />
In Fig. 2 the conversion chain <strong>of</strong> the MFDC<br />
system is represented. The supply power, taken<br />
from the three-phase system at 50 Hz, is driven<br />
by means <strong>of</strong> a rectifier to an IGBT switch. The<br />
waveform at the input/output <strong>of</strong> the transformer is<br />
characterized by a frequency <strong>of</strong> 1000 Hz and, after<br />
a full-wave rectification (f = 2000 Hz), is applied<br />
to the welder terminals. The welder terminal can<br />
be considered as a R-L load. The switching <strong>of</strong><br />
the IGBT bridge is controlled so that the welding<br />
current reaches a desired (constant) value. The<br />
- 102 - 15th IGTE Symposium 2012<br />
welding process can be performed by means <strong>of</strong><br />
a single impulse or with more the one impulse.<br />
In Fig. 3 the waveform <strong>of</strong> the weld current is<br />
shown. As it can be observed, the actual current<br />
is not perfectly rectangular because the conversion<br />
system is not able to nullify completely a ripple at<br />
2000 Hz (and higher harmonics) that is superposed<br />
to the weld current. The main weld parameter are:<br />
the current peak (Ip) that is usually in the order<br />
<strong>of</strong> some kA, the weld time that is the duration <strong>of</strong><br />
the single pulse and the hold time that is a period<br />
when the current is zero after the welding, but the<br />
electrodes are still applied to the sheet to chill the<br />
weld.<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
Fig. 2. MFDC spot welding device under analysis. The<br />
reference system for the measurement is placed in the center<br />
<strong>of</strong> the welding coil.<br />
current<br />
peak<br />
weld time hold time<br />
Fig. 3. Weld current and its main parameters. Weld time =<br />
140 ms, hold time = 40 ms.<br />
In the spot welding sector the MFDC welder are<br />
<strong>of</strong>ten preferred to the AC ones because <strong>of</strong> some<br />
benefit: a shorter weld times is required due to the<br />
the DC output current, hence significant energy<br />
saving is obtained. Moreover, MFDC systems are<br />
very stable in working condition far from the rating<br />
power (useful range: 20-95%). Conversely, AC<br />
systems are unstable and inefficient when used<br />
outside the 70-90% <strong>of</strong> the rating power.<br />
III. EXPERIMENTAL MEASUREMENT<br />
In this paper the magnetic field emission <strong>of</strong><br />
a MFDC welder produced by KGR S.p.A. is<br />
analyzed. In Fig. 4 the layout with dimensions
is shown. The MWG model is a manual welder,<br />
therefore the operator is quite close to the device<br />
during the welding operation. In Fig. 5 a classical<br />
working configuration is reported in frontal and<br />
lateral view.<br />
Several observation points have been defined<br />
in order to evaluate the human exposure in the<br />
working configuration represented in Fig. 5. It<br />
has to be stressed that the considered working<br />
configuration is also the most critical one because<br />
the operator is faced to the welder coil.<br />
The measurements points are summarized in Table<br />
I. The coordinates are related to the reference<br />
system in Fig. 6.<br />
Fig. 4. Weld current and its main parameters. Weld time =<br />
140 ms, hold time = 40 ms.<br />
(a) (b)<br />
Fig. 5. working configuration: front view (a) and side view<br />
(b)<br />
TABLE I<br />
FIELD POINTS<br />
field point x (m) y (m) z (m)<br />
P1 0 0 0.28<br />
P2 0 0 0.5<br />
P3 0.5 0 0.28<br />
P4 0.5 0 0.5<br />
P5 0.8 0 0.28<br />
P6 0.8 0 0.5<br />
Al the measurement will be referred to the<br />
current represented in Fig. 3: current peak (Ip)<br />
equal to 12 kA, wled time equal to 140 ms and<br />
hold time equal to 40 ms.<br />
For each measurement point the waveform <strong>of</strong><br />
the magnetic field has been measured along the<br />
three axis (x, y and z). Finally the rms values<br />
- 103 - 15th IGTE Symposium 2012<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
Fig. 6. Reference system for measurement points definition<br />
has been computed. The total field waveforms for<br />
each observation points are shown in Fig. 7. By<br />
comparing those waveforms it is possible to derive<br />
some considerations: 1) as expected, the maximum<br />
value is observed for the field points P1 and P2<br />
that are faced to the welder coil. 2) P1, P3 and P5<br />
present a higher peak values with respect to P2,<br />
P5 and P6 because they are closer to the welder.<br />
3) by defining two groups <strong>of</strong> points with the same<br />
distance from the welder [P1, P3, P5] and [P2,<br />
P4, P6] it is possible to note that the peak value<br />
decreases by moving far from the welder coil. 4)<br />
the decreasing low for the peak value is not true<br />
for the medium frequency ripple superposed to the<br />
waveforms. It seems that the observation points<br />
P3 and P4 are characterized by the higher ripple.<br />
This consideration will be better investigated in the<br />
following by analyzing the spectrum components<br />
<strong>of</strong> all the waveforms. At this stage, a qualitative<br />
explanation can be given considering that P3 and<br />
P4 are located in front <strong>of</strong> the full wave rectifier<br />
connected to the secondary winding <strong>of</strong> the medium<br />
frequency transformer. Hence, P3 and P4 are the<br />
points more influenced by the conversion system.<br />
IV. HUMAN EXPOSURE EVALUATION<br />
The magnetic field produced by MFDC spot<br />
welding system is a pulsed magnetic field (see<br />
Fig. 7). With reference to the ICNIRP guidelines<br />
[2], [10], [11], it can be analyzed with three different<br />
approaches: the equivalent frequency method,<br />
the multiple frequency method and the weighted<br />
multiple frequency method.<br />
A. Equivalent frequency method<br />
The equivalent frequency method is introduced<br />
with the note 4 <strong>of</strong> Table 4 and 6 in the ICNIRP<br />
guidelines [2] . Afterward, it is better detailed in<br />
the guidelines focused on pulsed magnetic fields<br />
[10]. The method simply takes into account the<br />
pulse duration (tp) and maximum value <strong>of</strong> the<br />
waveform during impulse. Finally it refers to the<br />
equivalent and continuous sinusoidal field with a<br />
frequency feq =1/tp in order to test compliance<br />
with the reference levels.
(a)<br />
(b)<br />
(c)<br />
Fig. 7. P1 and P2 (a) P3 and P4 (c) P5 and P6<br />
Fig. 8. Equivalent frequency method: application to the<br />
measurement point P1<br />
If a single pulse <strong>of</strong> the magnetic field measured<br />
in the observation point P1 is analyzed by means<br />
<strong>of</strong> the equivalent frequency method the result <strong>of</strong><br />
Fig. 8 is obtained. The single pulse is characterized<br />
- 104 - 15th IGTE Symposium 2012<br />
by tp = 140 ms that leads to a continuous sinusoidal<br />
field with frequency equal to feq =3.5 Hz.<br />
The amplitude <strong>of</strong> the sinusoidal field is imposed<br />
to the maximum valued observed during the single<br />
pulse, i.e. 6298.3 μT.<br />
It is now simple to identify the reference level<br />
for a 3.5 Hz sinusoidal field from Fig. 1, that is<br />
16327 μT. It must be stressed that reference level<br />
is a rms value, therefore it must be compared with<br />
6298.3/ √ 2 = 4453.5 μT. Finally, by means <strong>of</strong> the<br />
equivalent frequency method, the welder produces<br />
a magnetic field that is 3.5 times lower than the<br />
applicable reference level in the observation point<br />
with the higher emission.<br />
B. Multiple frequency method<br />
The multiple frequency method is suggested for<br />
non-coherent waveform, i.e. waveform that can not<br />
be measured with repeatability property because <strong>of</strong><br />
their intrinsic variation in time.<br />
The procedure is summarized in the following<br />
steps:<br />
• selection <strong>of</strong> the signal to be analyzed<br />
• perform the Fourier Transform <strong>of</strong> the signal<br />
• computing the global indicator defined as:<br />
where:<br />
Is =<br />
65 kHz<br />
Bj<br />
BL,j<br />
j=1 Hz<br />
+<br />
10MHz j>65 kHz<br />
Bj<br />
b<br />
(1)<br />
• Bj is the magnetic flux density at frequency<br />
j;<br />
• BL,j is the magnetic flux density reference<br />
level;<br />
• b is 30.7 μT (rms) for occupational exposure<br />
The exposure is compliant with the limit if Is < 1.<br />
In order to test the compliance with this method<br />
a single impulse in each observation point has<br />
been selected in order to perform the Fourier<br />
Transform. Finally, in order to better point out the<br />
rationale <strong>of</strong> this methodology, the Is representation<br />
is provided:<br />
• graphically: in the x-axis is represented<br />
the frequency and in the y-axis the ration<br />
Bj/BL,j. With this it is possible to understand<br />
what are the frequency components that<br />
exceed the relative limit.<br />
• numerically: computation <strong>of</strong> Is with (1)<br />
For the sake <strong>of</strong> brevity the graphical result are<br />
shown just for the measurement points P1 and P3.<br />
In Fig. 9 it is possible to see the spectrum <strong>of</strong> the P1<br />
and P3 waveforms. It is possible to observe that the<br />
harmonic content is mainly located in the range 0-<br />
100 Hz with significant values <strong>of</strong> DC component.<br />
The spectrum is also characterized from a 2000 Hz
(a)P1:x=0m,y=0m,z=0.28 m<br />
(b)P3:x=0.5m,y=0m,z=0.28 m<br />
Fig. 9. Spectral analysis <strong>of</strong> the measured waveform<br />
(a)P1:x=0m,y=0m,z=0.28 m<br />
(b)P3:x=0.5m,y=0m,z=0.28 m<br />
Fig. 10. Computation <strong>of</strong> the ICNIRP limit by means <strong>of</strong> the<br />
multiple frequency method<br />
- 105 - 15th IGTE Symposium 2012<br />
component and its multiple. As explained before<br />
those components are generated from the final<br />
full wave rectifier. In fact, the observation points<br />
P3 that is faced to the rectifier, present in its<br />
spectrum the higher harmonic content for this set<br />
<strong>of</strong> frequencies.<br />
From the analysis <strong>of</strong> Fig. 10 it comes out that<br />
none <strong>of</strong> the observation points can be considered<br />
complaint if the multiple frequency method is<br />
adopted because the Is values exceed significantly<br />
the unitary reference level (see also Table II for the<br />
other measurement points). Remembering that the<br />
y-axis <strong>of</strong> Fig. 10 represents the summation terms<br />
<strong>of</strong> (1) it is possible to observe that the overcoming<br />
<strong>of</strong> the limit is mainly due to the 2000 Hz and 4000<br />
Hz components. For point P3 those harmonics are<br />
about 5 times higher than the relative reference<br />
level. It is worth noting that reference levels become<br />
stricter for higher frequencies. Moreover,<br />
above 820 Hz eq. (1) imposes a constant limit<br />
equal to 30.7 μT even if in Fig. 1 the trend is<br />
still decreasing above 65 kHz.<br />
Finally, it must be stressed that this procedure<br />
is based on the assumption that the spectral components<br />
add in phase, i.e., all maxima coincide at<br />
the same time and results in a sharp peak. This is<br />
a realistic assumption when the number <strong>of</strong> spectral<br />
components is limited and their phases are not<br />
coherent, i.e., they vary randomly. For fixed coherent<br />
phases the assumption may be unnecessarily<br />
conservative [2], [11], [13].<br />
C. Weighted multiple frequency method<br />
The weighted multiple frequency method is proposed<br />
for coherent waveform, i.e. waveform that<br />
can be measured with repeatability property. For<br />
these signals the multiple frequency method always<br />
leads to a conservative exposure assessments.<br />
This modification provides an alternative method<br />
based on weighted peak that more closely recognizes<br />
the nature <strong>of</strong> biological interactions.<br />
The procedure is summarized in the following<br />
steps:<br />
• selection <strong>of</strong> the signal to be analyzed<br />
• perform the Fourier Transform <strong>of</strong> the signal<br />
• computing the global indicator defined as:<br />
<br />
<br />
<br />
<br />
<br />
<br />
Iw = max <br />
(WF) j<br />
Bj cos (2πfjt + θj + ϕj) <br />
<br />
j<br />
<br />
(2)<br />
where:<br />
• Bj is the amplitude <strong>of</strong> the j-th frequency<br />
component;
• WFj is the weighting function where the<br />
magnitude is equal with the inverse <strong>of</strong> the<br />
peak reference level at j-th frequency<br />
• θj is the phase <strong>of</strong> the i-th component <strong>of</strong> B<br />
• ϕj is the phases <strong>of</strong> the weighting function<br />
<strong>of</strong> the i-th component. It should satisfy the<br />
conditions:<br />
– ϕ(f) =π/2 if ffc<br />
• fc = 820 Hz for occupational exposure<br />
The exposure is compliant with the limit if Iw < 1.<br />
Conversely from the multiple frequency method<br />
here the spectrum <strong>of</strong> the waveform is weighted by<br />
a complex function. It is possible to observe that<br />
the ICNIRP definition <strong>of</strong> the weighting function<br />
for magnetic flux density waveform is a high-pass<br />
filter with cut-<strong>of</strong>f frequency equal to fc = 820 Hz<br />
[10], [11].<br />
In order to better point out the rationale <strong>of</strong> this<br />
methodology, the Iw representation is provided:<br />
• graphically: it is provided the<br />
graph <strong>of</strong> the function <strong>of</strong> time:<br />
<br />
j (WF) j Bj cos (2πfjt + θj + ϕj)<br />
• numerically: computation <strong>of</strong> Iw with (2)<br />
From the analysis <strong>of</strong> the results presented in<br />
Fig. 11 it is possible to observe that even applying<br />
the weighting methods none <strong>of</strong> the observation<br />
points can be considered compliant with<br />
the unitary reference level (see also Table II for<br />
the other measurement points). In spite <strong>of</strong> this<br />
result, the weighting procedure allows a significant<br />
reduction for some observation points (see P1<br />
and P3 values). This is in accordance to the fact<br />
that for pulsed fields associated to spot welding<br />
machine the multiple frequency method is a too<br />
conservative assessment procedure.<br />
V. CONCLUSIONS<br />
In this paper the magnetic field generated from<br />
MFDC spot welding systems have been analyzed.<br />
All the possible methodology suggested in the<br />
international guidelines provided by the ICNIRP<br />
have been employed: the equivalent frequency<br />
method, the multiple frequency method and the<br />
weighted multiple frequency method.<br />
The main conclusion is that the three methodologies<br />
lead to contrasting result. By means <strong>of</strong><br />
equivalent frequency method the emission is compliant<br />
with the reference level even in the maximum<br />
emission point (3.5 times lower than the<br />
limit). On the other hand, the other two approaches<br />
provide opposite results, i.e., none <strong>of</strong> the surrounding<br />
points is compliant with the limit.<br />
In Table II the results for weighted and nonweighted<br />
multiple frequency method are summarized.<br />
For most <strong>of</strong> the observation points, a high<br />
- 106 - 15th IGTE Symposium 2012<br />
(a)P1:x=0m,y=0m,z=0.28 m<br />
(b)P3:x=0.5m,y=0m,z=0.28 m<br />
Fig. 11. Computation <strong>of</strong> the ICNIRP limit by means <strong>of</strong> the<br />
wighted multiple frequency method<br />
difference in the result is observed. In our opinion<br />
for the spot welding process the multiple frequency<br />
method is too conservative because the waveform<br />
is clearly coherent due to the supply control system.<br />
However, result for weighted multiple frequency<br />
method are still quite far from the unity<br />
limit.<br />
The definition <strong>of</strong> exposure limits for pulsed<br />
magnetic fields is still an open problem. Therefore<br />
no Standard and guidelines define precisely<br />
what is the constraint and the procedure to be<br />
observed for spot welding machine as well as other<br />
technologies characterized by similar emission <strong>of</strong><br />
pulsed magnetic field (e.g MRI devices). One <strong>of</strong><br />
the consequence is that the Directive 2004/40/EC<br />
has been modified extending the deadline for the<br />
application <strong>of</strong> the reference levels [15], [16] that<br />
now is fixed for October 31th 2013. Concerning<br />
the EU directives, it must be stressed that reference<br />
levels are always related to acute effects and not to<br />
possible long term effect. In the literature, as well<br />
as in the real life applications, there is no evidence<br />
<strong>of</strong> acute effects. For possible long term effects, the<br />
epidemiological studies recognizes that it is hard<br />
to estimate the exposure because may arise other<br />
type <strong>of</strong> exposure in the same workplace which are<br />
correlated to the same EMF exposure and which<br />
may affect the health <strong>of</strong> workers. An example<br />
concerns exposure to welding fumes which may
increase lung cancer risks among welders [17],<br />
[18].<br />
TABLE II<br />
COMPARISON BETWEEN WEIGHTED AND NON-WEIGHTED<br />
MULTIPLE FREQUENCY METHOD<br />
field point Is Iw<br />
P1 99.68 16.77<br />
P2 13.60 4.23<br />
P3 125.28 36.41<br />
P4 18.23 7.51<br />
P5 23.64 7.80<br />
P6 6.38 4.61<br />
From the technical point <strong>of</strong> view it is quite impossible<br />
to apply a mitigation system to the welder<br />
coil for (obvious) operating reason. The analysis <strong>of</strong><br />
Table II together with Fig. 9 allows to understand<br />
that the overcoming <strong>of</strong> the limit is mainly due to<br />
the 2000 Hz and 4000 Hz components. For point<br />
P3 those harmonics are about 5 times higher than<br />
the relative reference level. Therefore, the future<br />
development <strong>of</strong> this work is to design a shielding<br />
case for the transformer and the full wave rectifier.<br />
Obviously the mitigation system will not reduce<br />
the maximum value <strong>of</strong> the magnetic field (that is<br />
generated from the welder coil). The aim is just to<br />
reduce as much as possible the medium frequency<br />
components that seem to play a significant role in<br />
equations (1) and (2).<br />
REFERENCES<br />
[1] Directive <strong>of</strong> the european parliament and <strong>of</strong> the council<br />
<strong>of</strong> 29 april 2004 on the minimum health and safety<br />
requirements regarding the exposure <strong>of</strong> workers to the<br />
risks arising from physical agents (electromagnetic fields)<br />
european parliament and council.<br />
[2] ICNIRP. Guidelines for limiting exposure to time varying<br />
electric, magnetic and electromagnetic fields (up to 300<br />
GHz). Health Phys, 74(4):494–522, 1998.<br />
[3] H. Heinrich. Assessment <strong>of</strong> non-sinusoidal, pulsed or<br />
intermittent exposure to low frequency electric and magnetic<br />
fields. Health Phys, 96(6):541–546, 2007.<br />
[4] R. Scorretti, N. Burais, A. Fabregue, and O. Nicolas, and<br />
L. Nicolas. Computation <strong>of</strong> the induced current density<br />
into the human body due to relative LF magnetic field<br />
generated by realistic devices. IEEE Transactions on<br />
Magnetics, 40(2):643–646, 2004.<br />
[5] D. Desideri and A. Maschio. Magnetic field emissions<br />
up to 400 kHz from a welding equipment. In Proc.<br />
Int. Symp. Electromagnetic Compatibility, pages 151–<br />
156, Barcelona, 2006.<br />
[6] D. Desideri, A. Maschio, and P. Mattavelli. Human<br />
exposure topulsed current waveforms below 100 kHz.<br />
In 391-396, editor, Proc. Int. Symp. Electromagnetic<br />
Compatibility, Hamburg, Sep. 8–12, 2008.<br />
[7] G. Kang and O. Gandhi. Comparison <strong>of</strong> various safety<br />
guidelines for electronic article surveillance devices with<br />
pulsed magnetic fields. IEEE Transactions on Biomedical<br />
Engineering, 50(1), 2003.<br />
[8] A. Canova, F. Freschi, and M. Repetto. Evaluation <strong>of</strong><br />
workers expo- sure to magnetic fields. The European<br />
Physical Journal Applied Physics, 52(2), 2010.<br />
- 107 - 15th IGTE Symposium 2012<br />
[9] A. Canova, F. Freschi, L. Giaccone, and M. Repetto. Exposure<br />
<strong>of</strong> working population to pulsed magnetic fields.<br />
IEEE Transaction on Magnetics, 46(8):2819–2822, 2010.<br />
[10] ICNIRP. Guidance on determining compliance <strong>of</strong> exposure<br />
to pulsed and complex non-sinusoidal waveform<br />
below 100 kHz with icnirp guidelines. Health Phys,<br />
84(3):383–387, 2003.<br />
[11] K. Jokela. Restricting exposure to pulsed and broadband<br />
magnetic fields. Health Phys, 79(4):373–388, 2000.<br />
[12] F. Dughiero, M. Forzan, and E. Sieni. A numerical<br />
evaluation on electromagnetic fields exposure on real<br />
human body models until 100 khz. COMPEL, 29:1552–<br />
1561, 2010.<br />
[13] ICNIRP. Guidelines for limiting exposure to time-varying<br />
electric and magnetic fields (1 Hz to 100 kHz). Health<br />
Phys, 99(6):818–836, 2010.<br />
[14] G. Crotti and D. Giordano. Problems in the detection<br />
<strong>of</strong> quasi-rectangular magnetic flux density waveforms. In<br />
18th Symposium IMEKO TC4, Natal (Brasil), September<br />
2001.<br />
[15] Directive <strong>of</strong> the european parliament and <strong>of</strong> the council <strong>of</strong><br />
23 april 2008 amending directive 2004/40/ec on minimum<br />
health and safety requirements regarding the exposure<br />
<strong>of</strong> workers to the risks arising from physical agents<br />
(electromagnetic fields) (18th individual directive within<br />
the meaning <strong>of</strong> article 16(1) <strong>of</strong> directive 89/391/eec).<br />
[16] Directive <strong>of</strong> the european parliament and <strong>of</strong> the council <strong>of</strong><br />
19 april 2012 amending directive 2004/40/ec on minimum<br />
health and safety requirements regarding the exposure<br />
<strong>of</strong> workers to the risks arising from physical agents<br />
(electromagnetic fields) (18th individual directive within<br />
the meaning <strong>of</strong> article 16(1) <strong>of</strong> directive 89/391/eec).<br />
[17] R.M. Sterns. Cancer incidence among welders: possible<br />
effects <strong>of</strong> exposure to extremely low frequency<br />
electromagnetic radiation (elf) and to welding fumes.<br />
Environmental Health Perspectives, 76:221–229, 1987.<br />
[18] Review <strong>of</strong> the scientific evidence for limiting exposure to<br />
electromagnetic fields (0-300 ghz). Technical Report Vol.<br />
15 N.3, National Radiological Protection Board, 2004.
- 108 - 15th IGTE Symposium 2012<br />
A Circuital Approach for Eddy Currents Fast<br />
Evaluation in Beam-like Structures<br />
A. Formisano<br />
Dipartimento di Ingegneria Industriale e dell’Informazione<br />
Seconda Università di Napoli, via Roma 29, I-81031 Aversa (CE), Italy<br />
E-mail: Alessandro.Formisano@unina2.it<br />
Abstract — The electromagnetic analysis <strong>of</strong> mechanic or civil structures composed by an interconnection <strong>of</strong> beam-like<br />
elements, mechanically interconnected to create a structural mesh, can be formulated in terms <strong>of</strong> an equivalent lumped<br />
elements electric network. This is the case <strong>of</strong> the eddy currents evaluation in a truss bridge or a bridge crane exposed to a<br />
time varying electromagnetic field or in the reinforcement <strong>of</strong> buildings concrete. In such cases, if compatible with the<br />
accuracy needs, the network approach can be very effective thanks to the strong reduction <strong>of</strong> the model complexity. The<br />
paper proposes such a kind <strong>of</strong> formulation, based on concept <strong>of</strong> partial inductance. Advantage is taken from automated treebuilding<br />
algorithms for electric networks, and on minimum order formulations based on loop currents to further reduce<br />
computational complexity.<br />
Index Terms— Eddy Currents, Electric Circuit Theory, Filamentary Structures<br />
I. INTRODUCTION<br />
The use <strong>of</strong> metallic materials in mechanical and civil<br />
engineering is a common practice, due to the extremely<br />
favorable behavior <strong>of</strong> such materials with respect to<br />
stresses and mechanical solicitations. On the other hand,<br />
when exposed to time varying electromagnetic fields,<br />
metallic structures react by generating a field due to<br />
induced currents in their volume. The effect <strong>of</strong> such fields<br />
may reveal critical in some particular applications, such<br />
as when forces on the structures must be taken under<br />
control, or when aging phenomena are facilitated by<br />
electric currents in the metal, or when the electromagnetic<br />
field map must be strictly controlled in critical regions<br />
not far from the structures (e.g. to reduce interference on<br />
electronic devices or to avoid impact on physical<br />
phenomena requiring controlled field maps, such as<br />
Nuclear Magnetic Resonance, or finally to limit human<br />
exposure to electromagnetic energy).<br />
In these cases, the possible interactions with<br />
surrounding electromagnetic field sources must be<br />
considered in the design phase. Unfortunately, fully 3D<br />
numerical analysis would usually be required, since no<br />
particular symmetry or simplification can be expected to<br />
reduce model complexity. On the other hand, such a<br />
model would require a quite large computational effort,<br />
although just an estimate <strong>of</strong> the electromagnetic effects<br />
due to the mechanical structure are <strong>of</strong>ten enough in the<br />
design step.<br />
In the particular cases when the mechanical structures<br />
can be modeled using interconnects <strong>of</strong> beam-like<br />
elements (e.g. truss bridges, bridge cranes, iron rebar in<br />
reinforced concrete, etc.), the electromagnetic analysis, in<br />
the magneto-quasi-static limit and assuming linear<br />
behavior <strong>of</strong> all the materials, can be formulated in terms<br />
<strong>of</strong> an equivalent electric network, composed by lumped<br />
elements. The current in each branch <strong>of</strong> the network is<br />
related to the current density in the corresponding beamlike<br />
element <strong>of</strong> the structure thanks to a filamentary<br />
current element approximation <strong>of</strong> the actual beam. Each<br />
current element, or current stick, is defined by stick tips,<br />
and a (scalar) stick currents.<br />
The interconnection <strong>of</strong> sticks is defined through a<br />
suitable incidence matrix, defined from the actual 3D<br />
topology. The lumped network is created by associating<br />
to each single beam <strong>of</strong> the original structure a lumped<br />
parameter model, falling in the typical circuit classes. It<br />
follows that a resistive parameter has to be used to model<br />
the Ohmic behavior <strong>of</strong> the metallic element. In addition, a<br />
set <strong>of</strong> inductances should be used to represent the<br />
induction phenomena among sticks and their capability to<br />
accumulate magnetic energy. Finally, an electromotive<br />
force (typical <strong>of</strong> the voltage sources in the circuits) can be<br />
used to represent the induction phenomena from assigned<br />
external currents.<br />
In principle, also capacitive parameters should be<br />
considered to take into account the capability to<br />
accumulate electric energy, but in the range <strong>of</strong><br />
frequencies here considered the impact <strong>of</strong> capacitive<br />
phenomena can be neglected.<br />
The mathematical tools able to face such a class <strong>of</strong><br />
systems are the well known Kirchh<strong>of</strong>f laws, replacing the<br />
most general Maxwell ones, significantly reducing the<br />
complexity <strong>of</strong> the model.<br />
Although the lumped network approach to treat similar<br />
structures is quite diffused in the electromagnetic analysis<br />
<strong>of</strong> mechanical structures [1-4], the network analysis is<br />
usually performed using standard computer codes, not<br />
necessarily guaranteeing the minimum computational<br />
effort. In this paper, an automated fundamental loop<br />
method is proposed to achieve a minimum complexity<br />
resolution <strong>of</strong> the network.<br />
Once currents in each branch are known, simple closed<br />
formulas can be used to estimate the field produced by<br />
each stick, forces on the structure elements, and Ohmic<br />
losses due to induction phenomena. This approach is<br />
particularly useful when a quick, yet not accurate<br />
estimation <strong>of</strong> the impact <strong>of</strong> metallic structures, is
equired.<br />
II. MATHEMATICAL MODELING<br />
Let's consider a structure composed <strong>of</strong> Nb metallic<br />
beams, connected in a general way at their tips in Nn<br />
nodes.<br />
The equivalent lumped network will be composed <strong>of</strong><br />
Nb branches, with the same topology as the mechanical<br />
structure. According to the geometrical characteristics <strong>of</strong><br />
the beam and, in addition, to the electromagnetic<br />
characteristics <strong>of</strong> the materials, each branches is endowed<br />
with a suitable set <strong>of</strong> circuit elements, including a<br />
resistance, an inductance, some mutual inductances, and a<br />
suited number <strong>of</strong> voltage sources.<br />
A very simple example is reported in Fig. 1(a), while<br />
in Fig. 1(b) the equivalent electric network is sketched.<br />
Since the assembly is immersed in a time varying field,<br />
we will assume that each stick, arbitrarily oriented,<br />
carries a current ik, k=1, 2...Nb, that represents the<br />
unknown to be determined. The currents are induced by<br />
an external field, but their value depends also on the<br />
structure itself, through material properties and<br />
geometrical relationships.<br />
voltage<br />
source<br />
+ -<br />
(a)<br />
branch<br />
resistance<br />
branch<br />
current ik<br />
(b)<br />
self inductance<br />
and mutual with<br />
all other branches<br />
Figure 1: (a) An example <strong>of</strong> mechanical interconnect <strong>of</strong> beam like<br />
metallic elements (the structure <strong>of</strong> a truss bridge); (b) its representation<br />
as an electric network.<br />
Each stick can be characterized by a resistance Rk.<br />
depending from its resistivity k, length Lk and cross<br />
section Sk. If assuming that the penetration depth at the<br />
highest frequency involved is smaller than the transverse<br />
dimension <strong>of</strong> the beam modeled by stick, and that both<br />
- 109 - 15th IGTE Symposium 2012<br />
resistivity and cross section are uniform along the beam,<br />
the resistance associated the k-th stick can be computed<br />
as [5]:<br />
Rk=k*Lk/Sk<br />
Of course, if any <strong>of</strong> the above exposed hypotheses<br />
falls, the more general expression using the line integral<br />
along the beam axis <strong>of</strong> k(l)/Sk(l) can be used.<br />
The resistances are then assembled into a diagonal<br />
resistance matrix R.<br />
In addition, if assuming a linear magnetic behavior, the<br />
sticks assembly is characterized also by an inductance<br />
matrix Mb, whose elements describe the mutual<br />
inductance between sticks or, on the diagonal, their self<br />
inductance. Under the same assumptions used for (1)<br />
about skin depth, the self inductance Mkk <strong>of</strong> the k-th stick<br />
can be computed using [6]:<br />
4 2Lk <br />
Mkk 210 Lkln<br />
1 r<br />
<br />
Lk<br />
where r is the geometric mean distance and is the<br />
arithmetic mean distance on the corresponding k-th beam<br />
cross section. (2) provides self inductance in Henry is Lk<br />
is in meters.<br />
The mutual inductance Mjk between the j-th and k-th<br />
stick can be computed using formulas from [6]; as a<br />
possible alternative, assuming a limited dimension <strong>of</strong> the<br />
cross section, the mutual inductance can be evaluated also<br />
by line integrating (numerically) the vector potential<br />
Ak(x) <strong>of</strong> stick k on the axis <strong>of</strong> stick j:<br />
ˆ<br />
M A x tˆdl<br />
jk k j dl j<br />
j<br />
where j is the centerline along the j-th beam, xj is a<br />
generic point along j, and ˆt is the centerline tangent unit<br />
vector. The following expression has been used in this<br />
study for Ak [5]:<br />
j <br />
where a, b and c are defined in Fig. 2, â is the unit<br />
vector along the k-th stick, and suitable countermeasures<br />
have been taken to avoid singularities when sticks are<br />
aligned [7].<br />
(1)<br />
(2)<br />
(3)<br />
I<br />
k 0 ˆ<br />
cba A x a ln<br />
4<br />
<br />
cba <br />
(4)<br />
<br />
c<br />
Ak b<br />
xj<br />
Figure 2: Basic elements form computation <strong>of</strong> vector potential using<br />
(3).<br />
Note that eq. (3) can be easily generalized to massive<br />
a
conductors, if the thin beam approximation may reveal<br />
too crude for the analysis, while this is not the case for<br />
closed form expressions found in [6].<br />
The structure is supposed to be immersed in the timevarying<br />
magnetic field produced by another set <strong>of</strong> Ne<br />
external currents ie, linked with the sticks by means <strong>of</strong> a<br />
mutual inductance matrix Me.<br />
Elements <strong>of</strong> Me can be easily evaluated using<br />
expressions based on (3). For the particular shape <strong>of</strong> field<br />
source, suitable analytical (possibly approximate)<br />
expression are available; e.g., the mutual inductance <strong>of</strong> a<br />
stick and a power line can be easily computed using<br />
formulas from [6]. Of course, the more general procedure<br />
based on suitable decomposition in elementary sticks and<br />
a numerical evaluation <strong>of</strong> (3) can be used.<br />
If assuming that external sources are given (because<br />
not influenced by eddy currents induced in the structure<br />
or some other factor) in each branch, the induced emf can<br />
be circuitally described as a voltage source. The set <strong>of</strong> the<br />
voltages is given by<br />
e= Me die/dt + d Me /dt ie<br />
where the last term vanishes in case <strong>of</strong> time invariance <strong>of</strong><br />
the matrix Me.<br />
Within these hypotheses, the system can be regarded as<br />
a R-L circuit, where sticks play the role <strong>of</strong> branches and<br />
Nn nodes represent the stick tips.<br />
The network topology is described by the incidence<br />
matrix (Nn rows and Nb column) providing, for each<br />
branch, the couple <strong>of</strong> starting-ending nodes. The<br />
incidence matrix can be easily recovered from CAD<br />
schemes for the mechanical assembly, where available, or<br />
by survey <strong>of</strong> the drawings.<br />
It is well known that the rank <strong>of</strong> incidence matrix is<br />
lower than the number <strong>of</strong> nodes and its value, for a<br />
connected network is Nn–1; consequently, <strong>of</strong>ten the<br />
“reduced” incidence matrix A is used, as it will be done in<br />
the following. The graph theory guarantees that<br />
independent columns in an incidence matrix do not form<br />
loops. It follows that a basis <strong>of</strong> the columns set defines a<br />
set <strong>of</strong> branches able to connect all the nodes <strong>of</strong> the<br />
network, i.e. a tree <strong>of</strong> the network<br />
A fast and effective way to search for independent<br />
columns is to determine the echelon form <strong>of</strong> A [8] and<br />
select the branch corresponding to the leading<br />
coefficients Of course, in general several trees can be<br />
defined for an assigned network; the choice <strong>of</strong> the<br />
extracted tree among all the possible ones can be<br />
controlled by ordering the columns <strong>of</strong> the incidence<br />
matrix in such a way that column corresponding to<br />
favourite branches are the leftmost ones.<br />
In order to simplify a number <strong>of</strong> automatic treatment <strong>of</strong><br />
the network topology, it is recommended to rearrange the<br />
branch numbering <strong>of</strong> the matrix in such a way to include<br />
in the first NT positions (Nn-1 in case <strong>of</strong> connected<br />
networks), the columns <strong>of</strong> the tree branches. Then, the<br />
incidence matrix A can be partitioned as:<br />
(5)<br />
A =(AT; AC) (6)<br />
- 110 - 15th IGTE Symposium 2012<br />
where, AT is the NTxNT non singular matrix corresponding<br />
to tree branches, and AC the NTxNb matrix corresponding<br />
to co-tree branches.<br />
Several effective methods can be used to face with the<br />
analysis <strong>of</strong> this network.<br />
One <strong>of</strong> the most effective and popular is the nodal<br />
technique whose unknowns are the nodes potential set vn.<br />
Here, for simplicity just the formulation in case <strong>of</strong> linear,<br />
memory free, voltage controlled components is<br />
highlighted:<br />
Yn vn = Jn<br />
where Yn and Jn are the nodal matrix and the nodal drive<br />
equivalent current vector, respectively. Both can be easily<br />
evaluated by the branch parameters. In particular<br />
Yn = A Yb A T , where Yb is the NbxNb matrix with the<br />
conductances (self or mutual) <strong>of</strong> the branches. The<br />
method can be extended to circuits with linear current<br />
controlled components or linear dynamical components<br />
and, in addition, also in presence <strong>of</strong> non linear<br />
components. Of course, in any case, the existence and the<br />
uniqueness <strong>of</strong> solution has to be assessed.<br />
Here the classical dual formulation is proposed, whose<br />
unknowns are the principal loop currents IL [9]. The<br />
model, for general dynamic systems, can be stated in time<br />
domain; here, taking advantage from linearity, the more<br />
compact formulation in the Laplace space is used:<br />
M<br />
s s s <br />
L M<br />
(7)<br />
Z I E (8)<br />
where IL and EM are the arrays <strong>of</strong> the principal loop<br />
currents and driven voltages, respectively, and s is the<br />
complex frequency.<br />
In (8) for simplicity, the hypotheses <strong>of</strong> linear, voltage<br />
controlled components has been assumed.<br />
T<br />
The loop impedance matrix BZ<br />
M b B<br />
Z can be<br />
deduced from the branch impedance matrix Zb = (Rb+sMb)<br />
and from the topological NLxNb matrix B <strong>of</strong> the principal<br />
loops related to the tree [8], where NL = Nb-NT is the<br />
maximum number <strong>of</strong> independent loops. Similar<br />
expressions hold for loop voltage sources EM, driven by<br />
external currents.<br />
It should be noticed that the partitioned form <strong>of</strong> B<br />
B = B ; 1 includes an identity matrix for the co-tree<br />
<br />
T L<br />
columns. In addition, the first partition B can be easily<br />
T<br />
deduced by the reduced incidence matrix:<br />
T -1<br />
T T L<br />
B =- A A<br />
(9)<br />
Once loop currents are known, currents in each branch<br />
are easily computed as I = B IL.<br />
From branch currents, estimates <strong>of</strong> the other electrical<br />
quantities can be easily recovered using closed form<br />
expression for stick currents. This is the case, for<br />
example, <strong>of</strong> the flux density produced in any points <strong>of</strong>
space [5], or the total Ohmic power, or, finally, the net<br />
force acting on the structure.<br />
III. NUMERICAL EXAMPLES<br />
In this section, firstly a simple example is discussed to<br />
illustrate the various steps <strong>of</strong> the proposed method; then a<br />
more complex case is presented to show the effectiveness<br />
<strong>of</strong> the approach.<br />
1. As a first example, the field produced by a reinforced<br />
concrete beam near a power line is considered. The<br />
beam is 2.4 m long, with a transverse dimension <strong>of</strong> 30<br />
cm, a reinforcement diameter <strong>of</strong> 2 cm, a resistivity <strong>of</strong><br />
5x10 -5 m and is 5 m away from the line. The line<br />
carries 100 A <strong>of</strong> current at 50 Hz, and is assumed 20<br />
m long.<br />
The networks has 16 nodes; the 15 tree branches are<br />
branches 1-15 in Fig. 3, and the non-trivial part <strong>of</strong> the<br />
fundamental loop matrix BT is reported in table I.<br />
The currents in each branch can easily be computed<br />
using a linear system with 13 equations, and then<br />
post-processed to obtain estimates <strong>of</strong> the desired<br />
quantities.<br />
The highest currents are in the “longitudinal”<br />
branches #17 and #19 (1.4 mA), and #21 and #23 (1.5<br />
mA).<br />
A 2D FEM model, neglecting connecting elements in<br />
the mesh, and correcting conductivity to take into<br />
account the finite length <strong>of</strong> actual geometry, provides<br />
a current <strong>of</strong> 1.5 mA in the four “longitudinal” beams.<br />
The "disturbance" magnetic field produced by the<br />
beam is 1.72 nT at a point 10 cm away from the<br />
power line. Note that in the 2D FEM model, the<br />
reinforcement contribution was hidden by the<br />
numerical errors.<br />
2. The second example compares computational<br />
complexity in the case <strong>of</strong> a fully 3D geometry either<br />
using a commercial FEM package (COMSOL<br />
Multiphysics Ver. 4.2a, [10]) or using the proposed<br />
approach.<br />
The aim is to estimate total Ohmic losses in the<br />
metallic structure <strong>of</strong> the truss beam depicted in Fig. 1.<br />
The bridge is 16 m long, 4 m large and 4 m high.<br />
Each beam is a square with a 0.4 m side.<br />
The bridge is made <strong>of</strong> non magnetic structural steel,<br />
with a conductivity <strong>of</strong> 4x10 6 S/m.<br />
The excitation field is provided by a circular coil<br />
radius 5 m, hanging 10 m above the bridge. Of course<br />
such excitation is not realistic, but has been adopted<br />
for its ease <strong>of</strong> modelling with FEM package.<br />
The FEM model is solved with a mesh composed <strong>of</strong><br />
34,000 2 nd order tetrahedral elements, giving<br />
300,000 unknowns, and requires 165 s to be solved<br />
on a i7-based PC, with 4GB RAM.<br />
The total Ohmic losses are 4 mW using FEM model<br />
and 3.5 mW using the proposed method. A map <strong>of</strong><br />
losses density is reported in Fig.4.<br />
- 111 - 15th IGTE Symposium 2012<br />
0.3 m<br />
Figure 3: Graph <strong>of</strong> the equivalent network for case 1: capital letters<br />
indicate nodes, number indicate branches, dashed lines are co-tree<br />
branches<br />
Loops<br />
C<br />
3<br />
2<br />
B<br />
1<br />
16<br />
A<br />
19<br />
D<br />
18<br />
2.4 m<br />
G<br />
7<br />
6<br />
4<br />
F<br />
5<br />
17<br />
23<br />
H<br />
22<br />
20<br />
E<br />
TABLE I<br />
NON TRIVIAL PART OF THE FUNDAMENTAL LOOP MATRIX<br />
Branches<br />
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15<br />
1 -1 -1 -1 0 0 0 0 0 0 0 0 0 0 0 0<br />
2 -1 -1 -1 -1 1 1 1 0 0 0 0 0 0 0 0<br />
3 0 -1 -1 -1 0 1 1 0 0 0 0 0 0 0 0<br />
4 0 0 -1 -1 0 0 1 0 0 0 0 0 0 0 0<br />
5 0 0 0 0 -1 -1 -1 0 0 0 0 0 0 0 0<br />
6 0 0 0 0 -1 -1 -1 -1 1 1 1 0 0 0 0<br />
7 0 0 0 0 0 -1 -1 -1 0 1 1 0 0 0 0<br />
8 0 0 0 0 0 0 -1 -1 0 0 1 0 0 0 0<br />
9 0 0 0 0 0 0 0 0 -1 -1 -1 0 0 0 0<br />
10 0 0 0 0 0 0 0 0 -1 -1 -1 -1 1 1 1<br />
11 0 0 0 0 0 0 0 0 0 -1 -1 -1 0 1 1<br />
12 0 0 0 0 0 0 0 0 0 0 -1 -1 0 0 1<br />
13 0 0 0 0 0 0 0 0 0 0 0 0 -1 -1 -1<br />
Figure 4: Ohmic Losses density plot for FEM solution <strong>of</strong> example 2.<br />
IV. CONCLUSIONS<br />
A prompt method to assess the effect <strong>of</strong> mechanical<br />
structures composed by interconnected conducting<br />
beams, exposed to low frequency magnetic fields, has<br />
been presented. The method is based on equivalent<br />
electric network analysis, and adopts a minimum order<br />
formulation to solve the network.<br />
ACKNOWLEDGEMENTS<br />
11<br />
L 26<br />
O<br />
15<br />
14<br />
P<br />
12<br />
N 28<br />
13 M<br />
The author wishes to thank pr<strong>of</strong>. R. Martone for<br />
fruitful discussions and for his support.<br />
This work has been partly supported by Seconda<br />
Università di Napoli, Italy, under PRIST grant<br />
“Generazione distribuita di energia da fonti tradizionali e<br />
8<br />
21<br />
K<br />
10<br />
J<br />
9<br />
27<br />
24<br />
I<br />
25
innovabili: aspetti ingegneristici e giuridici-economiciambientali”<br />
REFERENCES<br />
[1] A. Ruehli, "Equivalent Circuit Models for Three-Dimensional<br />
Multiconductor Systems", IEEE Trans. on Microwave Th. and<br />
Tech., vol. MTT-22, pp. 216-221, 1974.<br />
[2] W. Pinello, A. Ruehli, “Time Domain Solutions for Coupled<br />
Problems using PEEC Models with Waveform Relaxation”,<br />
<strong>Proceedings</strong> <strong>of</strong> Antennas and Propagation Society International<br />
Symposium AP-S. Digest, pp. 2118-2121, 1996.<br />
[3] A.Y. Wu, K.S. Sun, “Formulation and implementation <strong>of</strong> the<br />
current filament method for the analysis <strong>of</strong> current diffusion and<br />
heating in railguns and homopolar generators”, IEEE Trans. on.<br />
Mag., vol. 25, pp. 610-615, 1989.<br />
[4] B. Azzerboni, E. Cardelli, M. Raugi, ”Network mesh model for<br />
flux compression generators analysis, IEEE Trans. on Magn., vol.<br />
27, pp. 3951-3954, 1991.<br />
[5] H. A. Haus, J. R. Melcher, Electromagnetic Fields and Energy,<br />
Englewood Cliffs, NJ: Prentice Hall, 1989.<br />
[6] F. Grover, Inductance Calculation, New York: Van Nostrand,<br />
1946.<br />
[7] J. D. Hanson, S. P. Hirshman, “Compact expressions for the Biot–<br />
Savart fields <strong>of</strong> a filamentary segment”, Physics <strong>of</strong> Plasmas, vol.<br />
9, pp. 4410-4412, 2002.<br />
[8] L. Chua, I. Lin, Computer-Aided Analysis <strong>of</strong> Electronic Circuits,<br />
3rd ed., vol. 2. Oxford: Clarendon Press, 1982.<br />
[9] J. Nilsson, S. Riedel. Electric Circuits, Englewood Cliffs, NJ:<br />
Prentice Hall, 2010.<br />
[10] www.Comsol.com, last visited on Sept., 4 th 2012.<br />
- 112 - 15th IGTE Symposium 2012
- 113 - 15th IGTE Symposium 2012<br />
Effectiveness <strong>of</strong> the Preconditioned<br />
MRTR Method Supported by Eisenstat’s Technique<br />
in Real Symmetric Sparse Matrices<br />
*Yoshifumi Okamoto, *Tomonori Tsuburaya, † Koji Fujiwara, and *Shuji Sato<br />
*Department <strong>of</strong> Electrical and Electronic Systems Engineering, Utsunomiya <strong>University</strong><br />
7-1-2 Yoto, Utsunomiya, Tochigi 321-8585, Japan<br />
† Department <strong>of</strong> Electrical Engineering, Doshisha <strong>University</strong><br />
1-3 Tataramiyakodani, Kyotanabe, Kyoto 610-0321, Japan<br />
E-mail: okamotoy@cc.utsunomiya-u.ac.jp<br />
Abstract—The Incomplete Cholesky Conjugate Gradient (ICCG) method is widely used to solve the indefinite algebraic<br />
equations obtained from the edge-based finite element method. However, when a linear solver based on the minimum<br />
residual without residual oscillations is used, there is a possibility <strong>of</strong> the elapsed time being shortened. This paper shows the<br />
effectiveness <strong>of</strong> the preconditioned Minimized Residual method based on the Three-term Recurrence formula <strong>of</strong> the CG-type<br />
(MRTR) method with Eisenstat’s technique by making a comparison with ICCG method for real symmetric sparse matrices.<br />
Index Terms— Eisenstat’s technique, ICCG method, MRTR method, split preconditioning.<br />
A x b,<br />
(1)<br />
I. INTRODUCTION<br />
where A is a large sparse n-by-n matrix, x is a solution n-<br />
The ICCG method [1] is widely used as a solver for a vector, and b is a n-vector. Now, suppose that the<br />
real symmetric indefinite linear equation derived from the diagonal scaling has already been applied to (1).<br />
edge-based finite element method. While the behavior <strong>of</strong> The recurrence formula for x in MRTR method is<br />
a residual in CG iterations is oscillatory, a monotonic designed using the expression<br />
decrease in the residual is mathematically ensured in<br />
MRTR method [2], which is an algorithm identical to that<br />
<strong>of</strong> Orthomin(2) [3] and which is based on the minimum<br />
residual. Therefore, there is a possibility that MRTR can<br />
solve linear equations faster than the CG method.<br />
IC factorization is widely recognized as a powerful<br />
preconditioner for a symmetric linear system. However, it<br />
may not necessarily be powerful in various magnetic<br />
field problems. Some <strong>of</strong> the split preconditioners such as<br />
symmetric Gauss-Seidel (SGS) and diagonal IC<br />
factorization (DIC) might be capable <strong>of</strong> improving the<br />
convergence characteristics <strong>of</strong> linear solvers.<br />
SGS and DIC preconditioners, in which the <strong>of</strong>fdiagonal<br />
components in the original linear system are<br />
directly used in the preconditioned matrix, can utilize the<br />
Eisenstat’s technique [4], in which the matrix-vector<br />
product can be replaced by forward and backward<br />
substitutions. Therefore, there is a possibility that SGS<br />
and DIC preconditioners supported by Eisenstat’s<br />
technique can reduce the elapsed time by reducing the<br />
computational cost <strong>of</strong> an iterative process.<br />
This paper shows the effectiveness <strong>of</strong> preconditioned<br />
MRTR method supported by Eisenstat’s technique for the<br />
xk 1 x0<br />
zk<br />
1,<br />
zk<br />
1<br />
K k ( A;<br />
r0<br />
) ,<br />
(2)<br />
where xk+1 is the solution vector in step (k + 1) in the<br />
iterative process and Kk(A;r0) is the Krylov subspace<br />
spanned by A and the initial residual vector r0. The<br />
residual vector rk+1 is comprehended in Kk+1(A;r0) as<br />
follows:<br />
rk 1<br />
b Axk 1<br />
r0<br />
Azk<br />
1<br />
K k 1(<br />
A;<br />
r0<br />
) . (3)<br />
Furthermore, the approximate solution (x0 + z) in step (k<br />
+ 1) satisfies the minimization condition as follows:<br />
min || b A(<br />
x0<br />
z)<br />
|| 2 min || r0<br />
Az<br />
|| 2 , (4)<br />
zS<br />
k<br />
zS<br />
k<br />
where Sk is a subspace comprehended by Kk(A;r0). Using<br />
a three-term recurrence formula involving Lanczos<br />
polynomials, the algorithm <strong>of</strong> MRTR method is given as<br />
follows:<br />
Algorithm 1 (MRTR method). Let x0 be an initial guess,<br />
and put r0 = b – Ax0. Set y0 = – r0 and 0 = (y0, y0).<br />
For k = 0, 1, 2, …, repeat the following steps until the<br />
condition ||rk||2/||b||2 < MR holds:<br />
(<br />
Ark<br />
, rk<br />
) / ( Ark<br />
, Ark<br />
)<br />
( k 0)<br />
<br />
k ( Ark<br />
, r )<br />
k<br />
k<br />
,<br />
( k 1)<br />
<br />
<br />
k ( Ark<br />
, Ark<br />
) ( yk<br />
, Ark<br />
)( Ark<br />
, yk<br />
)<br />
several linear systems derived from the edge-based finite<br />
element method in the magnetic field analysis.<br />
Comparisons have been made with other well-known<br />
preconditioned CG methods.<br />
0<br />
<br />
( yk<br />
, Ark<br />
)( Ark<br />
, r )<br />
k<br />
k<br />
<br />
<br />
k ( Ark<br />
, Ark<br />
) ( yk<br />
, Ark<br />
)( Ark<br />
, yk<br />
)<br />
k 1<br />
k ( Ark , rk<br />
),<br />
( k 0)<br />
,<br />
( k 1)<br />
II. PRECONDITIONED MRTR METHOD<br />
A. MRTR method<br />
The symmetric sparse linear system can be defined as<br />
follows:<br />
k 1<br />
pk rk<br />
k<br />
pk<br />
1,<br />
k<br />
xk 1<br />
xk<br />
k pk<br />
,<br />
yk 1<br />
<br />
k yk<br />
k Ark<br />
,
k 1<br />
rk<br />
yk<br />
1,<br />
where (u, v) denotes the inner product <strong>of</strong> vectors u and v.<br />
The above algorithm is mathematically equivalent to the<br />
conjugate residual (CR) method [5].<br />
TABLE I shows the computational cost <strong>of</strong> MRTR<br />
method, along with a comparison with the CG method.<br />
Au, (u, v), (u + v), and u denote a matrix-vector<br />
product, the inner product, the sum <strong>of</strong> vectors, and a<br />
scalar-vector product, respectively. The computational<br />
cost <strong>of</strong> MRTR method is nearly identical to that <strong>of</strong> the<br />
CG method owing to the same number <strong>of</strong> computations<br />
for the matrix-vector product.<br />
TABLE I<br />
COMPUTATIONAL COST OF LINEAR SOLVERS<br />
linear solver Au (u, v) u + v u<br />
CG 1 2 3 3<br />
MRTR 1 4 4 4<br />
B. Preconditioning<br />
MRTR method can be combined with split<br />
preconditioning techniques as long as the preconditioner<br />
M, which can be written in the form M = CC T (with C : a<br />
lower triangular matrix), is used. The preconditioned<br />
T<br />
matrix Aˆ<br />
1<br />
<br />
C AC retains the symmetry <strong>of</strong> A. Here,<br />
we utilize M and C derived using shifted IC factorization<br />
[6], DIC [1], and SGS preconditioning [7].<br />
IC preconditioner<br />
IC factorization is performed as follows:<br />
ˆ ˆ ˆ T<br />
, ˆ ˆ 1/<br />
2<br />
M LDL<br />
C LD<br />
,<br />
(5)<br />
i1<br />
<br />
2<br />
<br />
<br />
aii<br />
<br />
li<br />
k d k k ( i j),<br />
k 1<br />
lij j1<br />
(6)<br />
aij<br />
<br />
<br />
li<br />
kl<br />
j kd<br />
k k ( i j),<br />
k 1<br />
dii 1 / lii<br />
,<br />
(7)<br />
where lij and dii are components <strong>of</strong> Lˆ and Dˆ and is<br />
the shifted parameter. is determined by performing the<br />
following steps: 1. Set = 1.05. 2. Perform IC<br />
factorization. 3. If all diagonal components become<br />
positive, shifted IC factorization is stopped. Otherwise,<br />
return to step 1, add 0.05 to , and iterate steps 1-3.<br />
If (5) is used for forward and backward substitutions,<br />
there is a possibility <strong>of</strong> cache miss in backward<br />
substitution. Hence, M is modified as follows:<br />
ˆ ˆ ˆ 1<br />
( ) ( ˆ ˆ T<br />
M LD<br />
D LD)<br />
.<br />
(8)<br />
Therefore, the process <strong>of</strong> forward and backward<br />
substitutions to compute the unknown vector u can be<br />
described as follows:<br />
ˆ ˆ ˆ 1 ( ) ( ˆ ˆ T<br />
L D D LD)<br />
u v,<br />
(9)<br />
where v is a known vector and the diagonal components<br />
<strong>of</strong> LDˆ ˆ become 1.0. Forward and backward substitutions<br />
is performed by following a two-step procedure<br />
consisting <strong>of</strong><br />
ˆ ˆ<br />
ˆ 1<br />
( ) ,<br />
( ˆ ˆ T<br />
LD<br />
y v y D LD)<br />
u,<br />
(10)<br />
- 114 - 15th IGTE Symposium 2012<br />
ˆ ˆ T<br />
( LD)<br />
u Dˆ<br />
y.<br />
(11)<br />
Consequently, the computational cost can be reduced by<br />
using the forward substitution (10) and backward<br />
T<br />
substitution (11) instead <strong>of</strong> the expression M Lˆ<br />
Dˆ<br />
Lˆ<br />
.<br />
DIC preconditioner<br />
The large sparse matrix A can be split into three terms<br />
as follows:<br />
T<br />
A L I L ,<br />
(12)<br />
where L is the strictly lower triangular part <strong>of</strong> A and I is a<br />
unit matrix. The diagonal matrix Dˆ obtained using<br />
shifted IC factorization (see (6) and (7)) is utilized. Thus,<br />
M can be defined as follows:<br />
ˆ ˆ 1<br />
ˆ T<br />
( ) ( ) , ( ˆ ) ˆ 1/<br />
2<br />
M L D D L D C L D D . (13)<br />
Similar to the case <strong>of</strong> the IC preconditioner, the<br />
procedure for forward and backward substitution should<br />
be attentively schemed. The forward and backward<br />
substitution for the DIC preconditioner is designed to<br />
make the diagonal component 1.0:<br />
ˆ 1 ( ) ˆ ( ˆ 1<br />
T<br />
L D I D LD<br />
I)<br />
u v,<br />
(14)<br />
ˆ 1<br />
( ) , ˆ ( ˆ 1<br />
T<br />
LD I y v y D LD<br />
I ) u,<br />
(15)<br />
( LDˆ<br />
1<br />
T<br />
I ) u Dˆ<br />
1<br />
y.<br />
(16)<br />
SGS preconditioner<br />
Using (12), M for the SGS preconditioner can be<br />
defined as follows:<br />
T<br />
M ( L I ) ( L I ) , C L I.<br />
(17)<br />
Then, the algorithm <strong>of</strong> the preconditioned CG method is<br />
as follows:<br />
Algorithm 2 (Preconditioned CG method). Let x0 be M –1<br />
b, and put r0 = b – Ax0. Set p0 = M –1 r0 and u0 = p0.<br />
For k = 0, 1, 2, …, repeat the following steps until the<br />
condition ||rk||2/||b||2 < CG holds:<br />
Apk<br />
,<br />
( rk , uk<br />
) / ( pk<br />
, ),<br />
x x <br />
p ,<br />
k <br />
k 1<br />
k k k<br />
rk rk<br />
<br />
k,<br />
1<br />
uk<br />
M r<br />
,<br />
1<br />
1 k 1<br />
k ( rk 1,<br />
uk<br />
1)<br />
/ ( rk<br />
, uk<br />
),<br />
pk 1<br />
uk<br />
1<br />
k pk<br />
.<br />
The algorithm <strong>of</strong> preconditioned MRTR method [8] is as<br />
follows:<br />
Algorithm 3 (Preconditioned MRTR method). Let x0 be<br />
M –1 b, and put r0 = b – Ax0. Set u0 = M –1 r0, y0 = – r0, and<br />
z0 = M –1 y0.<br />
For k = 0, 1, 2, …, repeat the following steps until the<br />
condition ||rk||2/||b||2 < MR holds:<br />
1<br />
AM rk<br />
Auk<br />
,<br />
w<br />
1<br />
1<br />
1<br />
M AM rk<br />
M<br />
,<br />
(<br />
w,<br />
rk<br />
) / ( ,<br />
w)<br />
<br />
<br />
k ( w,<br />
r )<br />
k<br />
k<br />
<br />
<br />
k ( ,<br />
w)<br />
( yk<br />
, w)(<br />
w,<br />
y<br />
k<br />
)<br />
( k 0)<br />
,<br />
( k 1)
0<br />
<br />
( yk<br />
, w)(<br />
w,<br />
r )<br />
k<br />
k<br />
<br />
<br />
k ( ,<br />
w)<br />
( yk<br />
, w)(<br />
w,<br />
y<br />
( w,<br />
r ),<br />
k 1<br />
k k<br />
<br />
p <br />
k 1<br />
k uk<br />
k pk<br />
1<br />
k<br />
x x p<br />
k 1<br />
k k k<br />
yk 1<br />
<br />
k yk<br />
k,<br />
rk<br />
1<br />
rk<br />
yk<br />
1,<br />
zk 1<br />
<br />
k zk<br />
k w,<br />
u u z<br />
k 1<br />
k k 1.<br />
,<br />
,<br />
k<br />
)<br />
( k 0)<br />
,<br />
( k 1)<br />
C. Eisenstat’s technique<br />
Here, Eisenstat’s approach, in which the preconditioned<br />
matrix and vectors are mainly utilized, is applied to the<br />
preconditioned linear solvers in order to reduce the<br />
computational cost for the matrix-vector product.<br />
First, we apply Eisenstat’s technique to the DIC<br />
preconditioner using the expression ˆ 1<br />
( ) ˆ 1/<br />
2<br />
C LD<br />
I D ,<br />
and the preconditioned matrix-vector product Apˆ ˆ<br />
k can be<br />
transformed into<br />
Aˆ<br />
pˆ<br />
k<br />
ˆ 1/<br />
2 ˆ 1<br />
1<br />
T<br />
D ( LD<br />
I)<br />
( L I L )<br />
ˆ 1<br />
T<br />
ˆ 1/<br />
2<br />
( LD<br />
I)<br />
D pˆ<br />
k<br />
ˆ 1/<br />
2<br />
( ˆ 1<br />
1<br />
) {( ˆ 1<br />
D LD<br />
I LD<br />
I)<br />
Dˆ<br />
( 2Dˆ<br />
I)<br />
ˆ ( ˆ 1<br />
T<br />
) }( ˆ 1<br />
T<br />
) ˆ 1/<br />
2<br />
D LD<br />
I LD<br />
I D pˆ<br />
k<br />
ˆ 1/<br />
2<br />
( ˆ 1<br />
T<br />
) ˆ 1/<br />
2 ˆ ˆ 1/<br />
2<br />
( ˆ 1<br />
1<br />
D LD<br />
I D pk<br />
D LD<br />
I)<br />
{ ˆ 1/<br />
2 ˆ ( 2 ˆ )( ˆ 1<br />
T<br />
) ˆ 1/<br />
2<br />
D p<br />
ˆ<br />
k D I LD<br />
I D pk<br />
}<br />
ˆ 1/<br />
2 T<br />
1<br />
ˆ { ˆ 1/<br />
2 ˆ ( 2 ˆ T<br />
D C p<br />
) ˆ<br />
k C D pk<br />
D I C pk<br />
} .<br />
(18)<br />
It is shown that Apˆ ˆ<br />
k can be replaced by one backward<br />
substitution<br />
T<br />
C pˆ k and one forward substitution<br />
1<br />
ˆ 1/<br />
2<br />
{ ˆ ( 2 ˆ T<br />
C D p ) ˆ<br />
k D I C pk<br />
} . On the other hand, the<br />
formulation <strong>of</strong> Apˆ ˆ<br />
k with the SGS preconditioner is as<br />
follows:<br />
Aˆ<br />
pˆ<br />
k<br />
1<br />
T<br />
T<br />
( L I)<br />
( L I L ) ( L I)<br />
pˆ<br />
k<br />
1<br />
T<br />
T<br />
( L I)<br />
{( L I)<br />
I ( L I)<br />
} ( L I)<br />
pˆ<br />
k<br />
(19)<br />
T<br />
1<br />
T<br />
( L<br />
I)<br />
pˆ<br />
( ) { ˆ ( ) ˆ<br />
k L I pk<br />
L I pk<br />
}<br />
T<br />
C pˆ<br />
1<br />
C ( pˆ<br />
T<br />
C pˆ<br />
) .<br />
k<br />
k<br />
The applicable scope <strong>of</strong> Eisenstat’s technique is restricted<br />
to the preconditioned matrix, in which the lower<br />
triangular part <strong>of</strong> the original equation is used as it is. The<br />
preconditioned CG method supported by Eisenstat’s<br />
technique is as follows:<br />
Algorithm 4 (Preconditioned CG method supported by<br />
Eisenstat’s technique). Set<br />
<br />
( LDˆ<br />
C <br />
L I<br />
1<br />
I)<br />
Dˆ<br />
1/<br />
2<br />
k<br />
( DIC)<br />
.<br />
( SGS)<br />
ˆ0 0<br />
Let x0 be M –1 1<br />
b, and put r0 = b – Ax0. Set r C r ,<br />
pˆ rˆ<br />
.<br />
0<br />
0<br />
- 115 - 15th IGTE Symposium 2012<br />
For k = 0, 1, 2, …, repeat the following steps until the<br />
condition ||rk||2/||b||2 < CG holds:<br />
T<br />
u C pˆ<br />
k ,<br />
ˆ 1/<br />
2 1<br />
{ ˆ 1<br />
ˆ ˆ<br />
<br />
<br />
D u C D<br />
A pk<br />
1<br />
u C ( pˆ<br />
k u)<br />
( ˆ , ˆ ) / ( ˆ , ˆ ˆ<br />
k rk<br />
rk<br />
pk<br />
Apk<br />
),<br />
xk 1<br />
xk<br />
<br />
ku,<br />
r rˆ<br />
<br />
Ap ˆ ˆ ,<br />
ˆk 1<br />
k k k<br />
r ˆ k 1 C rk<br />
1,<br />
( ˆ , ˆ ) / ( ˆ , ˆ<br />
k rk 1<br />
rk<br />
1<br />
rk<br />
rk<br />
pˆ ˆ ˆ<br />
k 1<br />
rk<br />
1<br />
k pk<br />
.<br />
/ 2<br />
),<br />
pˆ<br />
( 2Dˆ<br />
I)<br />
u}<br />
k<br />
( DIC)<br />
,<br />
( SGS)<br />
Preconditioned MRTR method supported by Eisenstat’s<br />
technique is as follows:<br />
Algorithm 5 (Preconditioned MRTR method supported<br />
by Eisenstat’s technique). Set<br />
<br />
( LDˆ<br />
C <br />
L I<br />
1<br />
I)<br />
Dˆ<br />
1/<br />
2<br />
( DIC)<br />
.<br />
( SGS)<br />
Let x0 be M –1 1<br />
b and put r0 = b – Ax0. Set rˆ<br />
0 C r0<br />
,<br />
yˆ ˆ 0 r0<br />
.<br />
For k = 0, 1, 2, …, repeat the following steps until the<br />
condition ||rk||2/||b||2 < MR holds:<br />
T<br />
u C rˆ<br />
k ,<br />
ˆ 1/<br />
2 1 ˆ 1/<br />
2<br />
{ ˆ ( 2 ˆ<br />
ˆ<br />
) }<br />
ˆ<br />
<br />
<br />
D u C D rk<br />
D I u<br />
Ark<br />
1<br />
u C ( rˆ<br />
k u)<br />
(<br />
Aˆ<br />
rˆ<br />
ˆ ˆ ˆ ˆ ˆ<br />
k , rk<br />
) / ( Ark<br />
, Ark<br />
)<br />
<br />
<br />
( ˆ<br />
k ˆ , ˆ<br />
k Ark<br />
rk<br />
)<br />
<br />
( ˆ ˆ , ˆ ˆ ) ( ˆ , ˆ ˆ )( ˆ<br />
<br />
A A A Aˆ<br />
, ˆ<br />
k rk<br />
rk<br />
yk<br />
rk<br />
rk<br />
yk<br />
)<br />
0<br />
<br />
<br />
ˆ ˆ<br />
( yˆ<br />
ˆ ˆ ˆ<br />
k , Ark<br />
)( Ark<br />
, r )<br />
k<br />
k<br />
( ˆ ˆ , ˆ ˆ ) ( ˆ , ˆ ˆ )( ˆ<br />
<br />
A A A Aˆ<br />
, ˆ<br />
k rk<br />
rk<br />
yk<br />
rk<br />
rk<br />
y<br />
( ˆ ˆ , ˆ<br />
k 1<br />
k Ark rk<br />
),<br />
k 1<br />
pk u k<br />
pk<br />
1,<br />
<br />
x x p<br />
k 1<br />
k k k<br />
y <br />
yˆ<br />
Ar ˆ ˆ ,<br />
ˆ k 1<br />
k k k k<br />
rˆ<br />
ˆ ˆ<br />
k 1<br />
rk<br />
yk<br />
1,<br />
r ˆ k 1 C rk<br />
1.<br />
k<br />
,<br />
k<br />
)<br />
( DIC)<br />
,<br />
( SGS)<br />
( k 0)<br />
,<br />
( k 1)<br />
( k 0)<br />
( k 1)<br />
In the linear solver using Eisenstat’s technique, the lower<br />
triangular matrix-vector product C r ˆk 1<br />
is computed to<br />
evaluate the residual rk+1.<br />
D. Computational cost <strong>of</strong> preconditioned linear solvers<br />
TABLE II shows the computational cost <strong>of</strong><br />
preconditioned linear solvers. Au, Lu, L -1 u, and L -T u<br />
denote the matrix-vector product, the lower triangular<br />
matrix-vector product, forward substitution, and<br />
backward substitution, respectively. The abbreviations<br />
EDIC and ESGS represent the DIC and SGS<br />
,
preconditioners using Eisenstat’s technique, respectively.<br />
The computational cost <strong>of</strong> the preconditioned linear<br />
solvers using Eisenstat’s technique (EDIC and ESGS) is<br />
lower than that <strong>of</strong> the other preconditioned solvers by<br />
10 %. The reason why the computational cost does not<br />
reduce significantly when Eisenstat’s technique is used is<br />
the additional computation <strong>of</strong> the lower triangular matrixvector<br />
product Cr ˆk 1<br />
, whose cost is approximately equal<br />
to that <strong>of</strong> forward or backward substitution.<br />
TABLE II<br />
COMPUTATIONAL COST OF PRECONDITIONED LINEAR SOLVERS<br />
linear<br />
solver precond. Au Lu L-1u L -T u<br />
app. costs per one ite.<br />
(Au + Lu + L -1 u + L -T u)<br />
IC 1 0 1 1 1.0<br />
DIC 1 0 1 1 1.0<br />
CG EDIC 0 1 1 1 0.9<br />
SGS 1 0 1 1 1.0<br />
ESGS 0 1 1 1 0.9<br />
MRTR<br />
IC 1 0 1 1 1.0<br />
DIC 1 0 1 1 1.0<br />
EDIC 0 1 1 1 0.9<br />
SGS 1 0 1 1 1.0<br />
ESGS 0 1 1 1 0.9<br />
approximate computational costs:<br />
Au = 0.4, Lu = 0.3, L -1 u = 0.3, L -T u = 0.3<br />
III. ANALYSIS MODEL<br />
Figure 1 shows finite element meshes <strong>of</strong> model<br />
problems used for performing a magnetic field analysis.<br />
The unknown numbers in all meshes are determined by<br />
the absolute edge number based on the nodal number.<br />
Figure 1 (a) shows a box shield model [9] in which the<br />
magnetic shielding part is composed <strong>of</strong> four-layer finite<br />
elements in the thickness direction; the shielding<br />
thickness is 1 mm. Magnetostatic and eddy current<br />
analyses are carried out by considering the magnetic<br />
nonlinearity <strong>of</strong> SS400.<br />
Figures 1 (b) and (c) show the permanent-magnet-type<br />
MRI model [10]. For this model, magnetostatic field<br />
analysis is performed by considering the magnetic<br />
nonlinearity <strong>of</strong> the pole piece, yoke, and props. The 2ndorder<br />
hexahedral elements are <strong>of</strong> the Serendipity type.<br />
Finally, Figure 1 (d) shows the IPM motor (D-model)<br />
proposed by the IEEJ committee. A strongly coupled<br />
analysis is performed between the magnetic field and<br />
AC-driven three-phase circuit. The stator and overhung<br />
rotor are considered to be magnetically nonlinear, and the<br />
conductivity <strong>of</strong> the magnet is set to be 6.944 × 10 5 S/m.<br />
The number <strong>of</strong> revolutions per minute is set to 1500, and<br />
the pitch <strong>of</strong> the mechanical angle is 1°. The total number<br />
<strong>of</strong> time steps is set to 360.<br />
- 116 - 15th IGTE Symposium 2012<br />
TABLE III lists the analyzed conditions. The Newton-<br />
Raphson (NR) method, along with the line search<br />
technique based on functional minimization (0, 1.0) [11],<br />
is used as the nonlinear analysis method. GEAR’s<br />
implicit scheme [12], [13] <strong>of</strong> 2nd order is used for the<br />
discretization <strong>of</strong> the time domain based on the A-<br />
formulation.<br />
IV. NUMERICAL RESULTS<br />
A. Verification <strong>of</strong> computational accuracy<br />
The computational accuracy <strong>of</strong> preconditioned MRTR<br />
method is verified for the box shield model. Figure 2<br />
shows the analysis results. The magnetic flux density Bz<br />
in the z-direction on z-axis is shown in Figure 2 (a). The<br />
characteristic <strong>of</strong> the ICCG method coincide with that <strong>of</strong><br />
ESGS-MRTR. The relative error for two characteristics is<br />
less than 10 -4 % at points A and B. Similarly, the relative<br />
error for eddy current loss PJe as shown in Figure 2 (b) is<br />
less than 10 -4 % at these points. It is to be noted that other<br />
preconditioned linear solvers have similar characteristics.<br />
20<br />
240<br />
y<br />
75<br />
20d<br />
y<br />
(unit:mm) magnet: Br = 1.2 T<br />
z<br />
y z<br />
yoke: SS400<br />
z<br />
16<br />
100<br />
(unit: mm)<br />
coil:2 kAT<br />
magnetic shielding<br />
(SS400)<br />
x<br />
(a) Box shield model<br />
prop:<br />
SS400<br />
x gradientcoil<br />
polepiece:<br />
SS400<br />
13<br />
(unit:mm) magnet: Br = 1.2 T<br />
z<br />
240 20<br />
y<br />
75<br />
yoke: SS400<br />
13<br />
prop: SS400<br />
x<br />
gradientcoil<br />
polepiece<br />
: SS400<br />
(b) MRI (1st order tetra.) (c) MRI (2nd order hexa.)<br />
(unit : mm)<br />
45<br />
32.5<br />
TABLE III<br />
ANALYZED CONDITIONS<br />
x<br />
v<br />
w<br />
u<br />
v<br />
w<br />
u<br />
rotor core<br />
(50A350)<br />
stator core<br />
(50A350)<br />
30<br />
shaft<br />
(S45C)<br />
(d) IPM motor<br />
Figure 1: Finite element meshes.<br />
analysis model formul. discret. no. <strong>of</strong> nodes no. <strong>of</strong> elements DoF nonlinear circuit field<br />
box shield<br />
A<br />
A <br />
1st-hexa 72,900 67,980<br />
magnet<br />
enlarged view<br />
CG, MR || B<br />
|| 2<br />
197,472 static 10-3 10-2 <br />
206,427 time domain 10-3 10-2 <br />
MRI<br />
A 1st-tetra 279,090 49,813 323,965 static 10-3 10-3 A 2nd-hexa 93,879 87,120 1,014,600 static 10-3 10-3 IPM motor 1st-hexa 381,197 352,980 1,030,156 time domain 10-3 10-2 <br />
<br />
A
B z [T]<br />
ESGS-MRTR<br />
0.025<br />
0.020<br />
0.015<br />
0.010<br />
0.005<br />
ICCG<br />
point A<br />
point B<br />
0 0.05 0.10 0.15 0.20<br />
z [m]<br />
2.0<br />
1.5<br />
1.0<br />
P Je [W] ESGS-MRTR<br />
0.5<br />
0 0.01 0.02 0.03 0.04<br />
t [s]<br />
point B<br />
point A<br />
ICCG<br />
(a) (b)<br />
Figure 2: Some characteristics <strong>of</strong> the box shield model:<br />
(a) the Bz distribution in the z-axis direction and (b) the<br />
distribution <strong>of</strong> eddy current loss.<br />
log 10(||r (k) || 2 / ||b|| 2)<br />
log 10(||r (k) || 2 / ||b|| 2)<br />
1<br />
0 DIC-CG<br />
-1<br />
-2<br />
EDIC-CG<br />
DIC-MRTR<br />
EDIC-MRTR<br />
SGS-CG<br />
-3<br />
ICCG<br />
ESGS-CG<br />
-4<br />
IC-MRTR<br />
-5<br />
SGS-MRTR<br />
-6 ESGS-MRTR<br />
-7<br />
0 200 400 600<br />
iteration number k<br />
800 1000<br />
1<br />
0<br />
-1<br />
-2<br />
-3<br />
(a)<br />
DIC-CG<br />
EDIC-CG<br />
DIC-MRTR<br />
EDIC-MRTR<br />
SGS-CG<br />
ESGS-CG<br />
-4 IC-MRTR<br />
-5<br />
SGS-MRTR<br />
ESGS-MRTR<br />
-6<br />
ICCG<br />
-7<br />
0 150 300 450 600<br />
iteration number k<br />
750 900<br />
(b)<br />
Figure 3: Convergence characteristics <strong>of</strong> preconditioned<br />
linear solvers for the box shield model. (a) Magnetostatic<br />
field analysis and (b) eddy current analysis.<br />
B. Convergence characteristics and elapsed time<br />
Figure 3 shows the convergence characteristics <strong>of</strong> the<br />
box shield model, obtained by the magnetostatic and<br />
eddy current analyses. The characteristics are normalized<br />
by the initial norm <strong>of</strong> the residual in the 1st NR iteration.<br />
In MRTR method, the monotonic decrease in the residual<br />
has been mathematically proved; nevertheless, there are<br />
some noise spikes in the characteristics in the case <strong>of</strong><br />
preconditioned MRTR method. The generation <strong>of</strong> noise<br />
is likely to be caused by changes in the NR iteration.<br />
Noise generation is also observed for the preconditioned<br />
CG method. The characteristics <strong>of</strong> preconditioned MRTR<br />
method are superior to those <strong>of</strong> the preconditioned CG<br />
method because the monotonic decrease in the residual is<br />
mathematically guaranteed in the former method. The<br />
characteristics <strong>of</strong> the ESGS and EDIC preconditioners<br />
- 117 - 15th IGTE Symposium 2012<br />
linear<br />
solver<br />
CG<br />
MRTR<br />
linear<br />
solver<br />
CG<br />
MRTR<br />
TABLE IV<br />
ANALYSIS RESULTS FOR BOX SHIELD MODEL<br />
(a) MAGNETOSTATIC FIELD ANALYSIS<br />
precond. total linear ite. NR ite. time for precond. [s] elapsed time [s]<br />
IC 544 (1.00) 5 1.07 8.2 (1.00)<br />
DIC 979 (1.80) 5 1.06 13.8 (1.68)<br />
EDIC 979 (1.80) 5 1.07 13.0 (1.59)<br />
SGS 653 (1.20) 5 0.03 8.3 (1.01)<br />
ESGS 653 (1.20) 5 0.03 7.9 (0.96)<br />
IC 448 (0.82) 5 1.01 7.4 (0.90)<br />
DIC 812 (1.49) 5 1.09 12.4 (1.51)<br />
EDIC 812 (1.49) 5 1.06 11.5 (1.40)<br />
SGS 552 (1.01) 5 0.03 7.7 (0.94)<br />
ESGS 552 (1.01) 5 0.03 7.0 (0.85)<br />
(b) EDDY CURRENT ANALYSIS IN TIME DOMAIN (1ST TIME STEP)<br />
precond. total linear ite. NR ite. time for precond. [s] elapsed time [s]<br />
IC 503 (1.00) 4 1.07 8.6 (1.00)<br />
DIC 844 (1.68) 4 1.05 13.6 (1.58)<br />
EDIC 844 (1.68) 4 1.06 12.8 (1.49)<br />
SGS 595 (1.18) 4 0.03 8.7 (1.01)<br />
ESGS 595 (1.18) 4 0.03 8.2 (0.95)<br />
IC 393 (0.78) 4 1.00 7.4 (0.86)<br />
DIC 694 (1.38) 4 1.07 12.1 (1.41)<br />
EDIC 694 (1.38) 4 1.03 11.2 (1.30)<br />
SGS 470 (0.93) 4 0.03 7.4 (0.86)<br />
ESGS 470 (0.93) 4 0.03 6.8 (0.79)<br />
are consistent with those <strong>of</strong> SGS and DIC, respectively,<br />
and the characteristics <strong>of</strong> the DIC preconditioner are<br />
inferior to those <strong>of</strong> other preconditioned solvers. The IC-<br />
MRTR characteristics are the best among all<br />
preconditioned solvers. However, the elapsed time <strong>of</strong><br />
ESGS-MRTR is the shortest among all solvers, as can be<br />
seen in TABLE IV. The reason for this is the reduction in<br />
the computational cost when Eisenstat’s technique is<br />
used. All results are obtained by using a PC (CPU: Intel<br />
Core i7 2600K/4.2 GHz; memory: 16 GB). Following all<br />
problems are solved with the same hardware.<br />
Figure 4 shows the convergence characteristics <strong>of</strong> MRI<br />
models. The convergence characteristics <strong>of</strong><br />
preconditioned MRTR are superior to those <strong>of</strong> the<br />
preconditioned CG method. The DIC preconditioner is<br />
not very effective in improving the convergence<br />
characteristics. While the IC preconditioner is successful<br />
in the case <strong>of</strong> a tetrahedron, the SGS preconditioner is the<br />
most effective for a 2nd-order hexahedron. The<br />
effectiveness <strong>of</strong> the preconditioner depends on the target<br />
problem. TABLE V shows the analysis results for the<br />
MRI model. The elapsed time <strong>of</strong> ESGS-MRTR is the<br />
shortest among all preconditioned solvers.<br />
TABLE VI shows the analysis results for an IPM<br />
motor. The number <strong>of</strong> NR iterations is different for all<br />
solvers owing to the slight discrepancy in the converged<br />
solution in every time step. The elapsed time <strong>of</strong> ESGS-<br />
MRTR is the shortest among all linear solvers.<br />
V. CONCLUSION<br />
This paper shows the suitability <strong>of</strong> preconditioned<br />
MRTR method for solving an algebraic equation derived<br />
from the edge-based finite element method in a magnetic<br />
field. There is a possibility <strong>of</strong> reducing the elapsed time<br />
in the case <strong>of</strong> MRTR method by using the symmetric
Gauss-Seidel preconditioner supported by Eisenstat’s<br />
technique.<br />
log 10(||r (k) || 2 / ||b|| 2)<br />
log 10(||r (k) || 2 / ||b|| 2)<br />
1<br />
0<br />
-1<br />
DIC-CG<br />
EDIC-CG DIC-MRTR<br />
EDIC-MRTR<br />
-2<br />
ICCG<br />
-3<br />
SGS-CG<br />
-4<br />
-5<br />
IC-MRTR<br />
ESGS-CG<br />
-6 SGS-MRTR<br />
-7<br />
-8<br />
ESGS-MRTR<br />
-9<br />
0 200 400 600 800<br />
iteration number k<br />
1000<br />
1<br />
0<br />
-1<br />
-2<br />
-3<br />
(a)<br />
ICCG<br />
SGS-CG<br />
ESGS-CG<br />
DIC-MRTR<br />
EDIC-MRTR<br />
DIC-CG<br />
-4<br />
EDIC-CG<br />
-5<br />
IC-MRTR<br />
-6 SGS-MRTR<br />
-7 ESGS-MRTR<br />
-8<br />
0 900 1800 2700 3600<br />
iteration number k<br />
4500<br />
(b)<br />
Figure 4: Convergence characteristics <strong>of</strong> preconditioned<br />
linear solvers for the MRI model. (a) Tetrahedron and (b)<br />
2nd-order hexahedron.<br />
linear<br />
solver<br />
CG<br />
MRTR<br />
linear<br />
solver<br />
CG<br />
MRTR<br />
TABLE V<br />
ANALYSIS RESULTS FOR THE MRI MODEL<br />
(a) 1ST ORDER TETRAHEDRON<br />
precond. total linear ite. NR ite. time for precond. [s] elapsed time [s]<br />
IC 572 (1.00) 7 0.71 11.5 (1.00)<br />
DIC 958 (1.67) 7 0.73 18.6 (1.62)<br />
EDIC 958 (1.67) 7 0.72 17.6 (1.53)<br />
SGS 649 (1.14) 7 0.03 11.9 (1.03)<br />
ESGS 649 (1.14) 7 0.03 11.3 (0.98)<br />
IC 481 (0.84) 7 0.73 10.8 (0.94)<br />
DIC 777 (1.36) 7 0.72 16.7 (1.45)<br />
EDIC 777 (1.36) 7 0.74 15.3 (1.33)<br />
SGS 542 (0.95) 7 0.03 11.0 (0.96)<br />
ESGS 542 (0.95) 7 0.03 10.1 (0.88)<br />
(b) 2ND ORDER HEXAHEDRON<br />
precond. total linear ite. NR ite. time for precond. [s] elapsed time [s]<br />
IC 3,795 (1.00) 8 25.6 570.7 (1.00)<br />
DIC 4,305 (1.13) 8 25.2 643.5 (1.13)<br />
EDIC 4,219 (1.11) 8 25.3 585.4 (1.03)<br />
SGS 2,590 (0.68) 8 0.50 370.1 (0.65)<br />
ESGS 2,590 (0.68) 8 0.50 342.8 (0.60)<br />
IC 2,785 (0.73) 8 23.9 443.3 (0.78)<br />
DIC 3,357 (0.88) 8 25.7 530.9 (0.93)<br />
EDIC 3,357 (0.88) 8 23.3 470.3 (0.82)<br />
SGS 2,112 (0.56) 8 0.50 316.1 (0.55)<br />
ESGS 2,112 (0.56) 8 0.50 287.6 (0.50)<br />
- 118 - 15th IGTE Symposium 2012<br />
linear<br />
solver<br />
CG<br />
MRTR<br />
TABLE VI<br />
ANALYSIS RESULTS FOR THE IPM MOTOR<br />
precond. total linear ite. total NR ite. time for precond. [h] elapsed time [h]<br />
IC 1,666,584 (1.00) 4,070 (0.80) 4.84 39.1 (1.00)<br />
DIC 1,799,885 (1.08) 5,065 (1.00) 4.81 43.6 (1.12)<br />
EDIC 1,779,282 (1.07) 4,999 (0.99) 4.82 40.9 (1.05)<br />
SGS 1,400,867 (0.84) 4,996 (0.99) 0.03 29.7 (0.76)<br />
ESGS 1,406,236 (0.84) 4,993 (0.99) 0.03 28.6 (0.73)<br />
IC 931,375 (0.56) 4,450 (0.88) 5.24 26.6 (0.68)<br />
DIC 1,088,578 (0.65) 4,716 (0.93) 4.95 29.7 (0.76)<br />
EDIC 1,067,788 (0.64) 5,242 (1.03) 4.96 27.6 (0.71)<br />
SGS 866,637 (0.52) 5,178 (1.02) 0.04 19.6 (0.50)<br />
ESGS 864,545 (0.52) 5,162 (1.02) 0.04 18.2 (0.47)<br />
ACKNOWLEDGMENT<br />
The authors would like to thank Dr. K. Abe and Dr. Y.<br />
Takahashi for their advice and helpful comments. This<br />
work was supported by a Japan Society for the Promotion<br />
<strong>of</strong> Science (JSPS) Grant-in-Aid for Young Scientists (B)<br />
(Grant Number: 23760252).<br />
REFERENCES<br />
[1] J. A. Meijerink and H. A. van der Vorst, “An iterative solution<br />
method for linear systems <strong>of</strong> which the coefficient matrix is a<br />
symmetric M-matrix,” Mathematics <strong>of</strong> Computation, Vol. 31, No.<br />
137, pp. 148-162, Jan. 1977.<br />
[2] K. Abe, S.-L. Zhang, and T. Mitsui, “MRTR method: an iterative<br />
method based on the three-term recurrence formula <strong>of</strong> CG-type for<br />
nonsymmetric matrix,” The Japan Society for Industrial and<br />
Applied Mathematics, Vol. 7, No. 1, pp. 37-50, Mar. 1997. (in<br />
Japanese)<br />
[3] K. Abe and S.-L. Zhang, “A variant algorithm <strong>of</strong> the Orthomin(m)<br />
method for solving linear systems,” Appl. Math. Comput., Vol. 206,<br />
No. 1, pp. 42-49, Dec. 2008.<br />
[4] S. C. Eisenstat, “Efficient implementation <strong>of</strong> a class <strong>of</strong><br />
preconditioned conjugate gradient methods,” SIAM J. Sci. Stat.<br />
Comput., Vol. 2, No. 1, pp. 1-4, Mar. 1981.<br />
[5] S. C. Eisenstat, H. C. Elman, and M. H. Schultz, “Variational<br />
iterative methods for nonsymmetric systems <strong>of</strong> linear equations,”<br />
SIAM J. Numer. Anal., Vol. 20, No. 2, pp. 345-357, Apr. 1983.<br />
[6] K. Fujiwara, T. Nakata, and H. Fusayasu, “Acceleration <strong>of</strong><br />
convergence characteristic <strong>of</strong> the ICCG method,” IEEE Trans.<br />
Magn., Vol. 29, No. 2, pp. 1958-1961, Mar. 1993.<br />
[7] O. Axelsson, “A generalized SSOR method,” BIT Numerical<br />
Mathematics, Vol. 12, No. 4, pp. 443-467, Jul. 1972.<br />
[8] A. Shiode, S. Fujino, and K. Abe, “Preconditioning for symmetric<br />
positive definite matrices <strong>of</strong> MRTR method,” Trans. JSCES, No.<br />
20060007, pp. 231-237, Feb. 2006. (in Japanese)<br />
[9] A. Kameari, “Improvement <strong>of</strong> ICCG convergence for thin<br />
elements in magnetic field analyses using the finite-element<br />
method,” IEEE Trans. Magn., Vol. 44, No. 6, pp. 1178-1181, Jun.<br />
2008.<br />
[10] K. Miyata, K. Ohashi, A. Muraoka, and N. Takahashi, “3-D<br />
magnetic field analysis <strong>of</strong> permanent-magnet type <strong>of</strong> MRI taking<br />
account <strong>of</strong> minor loop,” IEEE Trans. Magn., Vol. 42, No. 4, pp.<br />
1451-1454, Apr. 2006.<br />
[11] Y. Okamoto, K. Fujiwara, and R. Himeno, “Exact minimization <strong>of</strong><br />
energy functional for NR method with line-search technique,” IEEE<br />
Trans. Magn., Vol. 45, No. 3, pp. 1288-1291, Mar. 2009.<br />
[12] C. W. GEAR, Numerical Initial Value Problems in Ordinary<br />
Differential Equations. Englewood Cliffs, NJ: Prentice-Hall, Inc.,<br />
1971.<br />
[13] Y. Okamoto, K. Fujiwara, and Y. Ishihara, “Effectiveness <strong>of</strong><br />
higher order time integration in time-domain finite-element<br />
analysis,” IEEE Trans. Magn., Vol. 46, No. 8, pp. 3321-3324, Aug.<br />
2010.
- 119 - 15th IGTE Symposium 2012<br />
High Frequency Mixing Rule Based Effective<br />
Medium Theory <strong>of</strong> Metamaterials<br />
Zsolt Szabó<br />
Department <strong>of</strong> Broadband Infocommunications and Electromagnetic Theory,<br />
Budapest <strong>University</strong> <strong>of</strong> <strong>Technology</strong> and Economics, Egry József 18, 1111 Budapest, Hungary,<br />
E-mail: szabo@evt.bme.hu<br />
Abstract— The electromagnetic response <strong>of</strong> metamaterials is governed by the collective behavior <strong>of</strong> engineered electric and<br />
magnetic dipoles. Therefore metamaterials may be replaced by hypothetical composites <strong>of</strong> spherical particles embedded in a<br />
host material. The effective electric permittivity and magnetic permeability <strong>of</strong> such systems can be computed with high<br />
frequency extension <strong>of</strong> the Maxwell-Garnett mixing rule. The validity <strong>of</strong> this assumption is discussed and as a benchmark the<br />
effective electromagnetic material parameters <strong>of</strong> a deep subwavelength spherical composite are calculated in three different<br />
ways: with the Maxwell-Garnett mixing rule, high frequency mixing rule and directly extracted from transmission reflection<br />
data. The developed theory is applied to find the parameters <strong>of</strong> a composite with similar magnetic response as a metamaterial<br />
built <strong>of</strong> split ring resonator.<br />
Index Terms—metamaterials, effective medium theory, Maxwell-Garnett mixing, Mie theory.<br />
I. INTRODUCTION<br />
Recently metamaterials are in focus <strong>of</strong> very intensive<br />
research and due to their unique properties are promising<br />
over the full electromagnetic spectrum [1-3]. The research<br />
<strong>of</strong> metamaterials has started with the goal <strong>of</strong> producing<br />
materials with negative refractive index i.e. simultaneous<br />
negative electric permittivity and magnetic permeability<br />
for imaging applications below the diffraction limit [4].<br />
However, the ultimate goal <strong>of</strong> the metamaterial research<br />
is to fabricate materials with arbitrarily configurable<br />
electric and magnetic properties. With an advance in<br />
micro- and nano-manufacturing techniques there are<br />
possibilities to produce subwavelength structures that can<br />
support symmetric and anti-symmetric modes. The<br />
associated current flow produces electric and magnetic<br />
dipole moments. A metamaterial with a customized<br />
optical response can be built as a superposition <strong>of</strong> such<br />
nano-elements. A very common design <strong>of</strong> an artificial<br />
material with tailored negative permittivity is the wire<br />
medium [5]. The most common designs to produce<br />
artificial magnetism are the variations <strong>of</strong> the split ring<br />
resonators [6] or pairs <strong>of</strong> nanorods [7]. The superposition<br />
<strong>of</strong> subwavelength structures with negative electric<br />
permittivity and magnetic permeability can lead to<br />
negative refractive index [8] even at optical frequencies<br />
[9]. However the losses and the finite size <strong>of</strong> the unit cell<br />
results in a cut<strong>of</strong>f frequency, limiting the applicability <strong>of</strong><br />
metamaterials. In addition toward optical frequencies it is<br />
increasingly challenging to fabricate the meta-structures,<br />
especially the negative magnetic response.<br />
The design <strong>of</strong> devices with metamaterials <strong>of</strong>ten<br />
requires the application <strong>of</strong> the effective medium theory.<br />
However robust effective metamaterial parameter<br />
extraction and homogenization are unsolved theoretical<br />
challenges <strong>of</strong> the metamaterial research. In spite <strong>of</strong><br />
considerable progress, researchers are still debating the<br />
fundamental issues and question the validity <strong>of</strong> the<br />
effective medium concept, which is considered by many<br />
as the Achilles-heel <strong>of</strong> this research field.<br />
In this paper it is argued that metamaterials can be<br />
homogenized when their electromagnetic response is<br />
governed by the excitation <strong>of</strong> electric and magnetic<br />
dipoles. The electromagnetic response <strong>of</strong> spherical<br />
particles can be replaced with static dipoles when the<br />
sphere is very small compared to the optical wavelength<br />
<strong>of</strong> the incident electromagnetic wave and by radiating<br />
dipoles, when the size is larger. The analytical formulas<br />
<strong>of</strong> the Mie theory explain precisely the scattering<br />
mechanism. Metamaterials may be equivalent to a<br />
properly chosen hypothetical composite <strong>of</strong> spherical<br />
particles embedded in a host material. Therefore well<br />
developed effective medium theories <strong>of</strong> composite<br />
materials can be applied to metamaterials. The validity <strong>of</strong><br />
this assumption is discussed and as a benchmark the<br />
effective electromagnetic material parameters <strong>of</strong> a deep<br />
subwavelength composite <strong>of</strong> spherical particles are<br />
calculated in three different ways: with the Maxwell-<br />
Garnett mixing rule, high frequency mixing rule and<br />
extracted directly from transmission reflection data. The<br />
developed theory is applied to find the parameters <strong>of</strong> a<br />
composite with similar magnetic response as a<br />
metamaterial built up <strong>of</strong> split ring resonator.<br />
II. EFFECTIVE MEDIUM THEORIES OF METAMATERIALS<br />
Several effective medium theories <strong>of</strong> metamaterials<br />
have been developed. In Fig. 1 two models <strong>of</strong><br />
metamaterial homogenization are presented. The effective<br />
metamaterial parameters can be extracted by replacing the<br />
electromagnetic response <strong>of</strong> the metamaterials with the<br />
electromagnetic response <strong>of</strong> a homogeneous isotropic slab<br />
is it is shown in Fig. 1.b. The model <strong>of</strong> Fig. 1.c replaces<br />
the metamaterial with the hypothetical composite <strong>of</strong><br />
spherical particles embedded in a host material. In both<br />
cases the electromagnetic properties can be determined in<br />
such a way that the metamaterial slab and the slab with<br />
the homogenized material parameters have the same<br />
reflection S 11 and transmission S 21 parameters.<br />
When the metamaterial is replaced with homogeneous<br />
slab, from the Fresnel relations the effective metamaterial<br />
parameters can be expressed. However the extracted wave<br />
impedance is exact only in the quasi static limit [10] and
the unique extraction <strong>of</strong> the refractive index is<br />
cumbersome due to the branching problem <strong>of</strong> the<br />
refractive index; that is the calculation <strong>of</strong> the refractive<br />
index involves the evaluation <strong>of</strong> a complex logarithm that<br />
is a multi-valued function. To remove this ambiguity, the<br />
Kramers–Kronig relation can be applied to estimate the<br />
refractive index from the extinction coefficient [11]. The<br />
physically realistic exact values <strong>of</strong> the refractive index are<br />
determined by selecting those branches <strong>of</strong> the logarithmic<br />
function which are closest to those predicted by the<br />
Kramers–Kronig relation. Finally from the wave<br />
impedance and from the refractive index the electric<br />
permittivity and the magnetic permeability can be<br />
calculated.<br />
(a)<br />
- 120 - 15th IGTE Symposium 2012<br />
x = ε μ ωr<br />
c , where ω is the angular frequency <strong>of</strong><br />
h h<br />
r r 0<br />
the incident radiation and c 0 is the speed <strong>of</strong> light in<br />
vacuum, provides the guideline for the validity <strong>of</strong> the<br />
Maxwell Garnett mixing rule, with the necessary<br />
condition x 1.<br />
However the limits <strong>of</strong> the Mixing-<br />
Garnett mixing rule can be extended. The Mie theory<br />
explains precisely the scattering mechanism <strong>of</strong> standalone<br />
spherical particles <strong>of</strong> any size and <strong>of</strong>fer analytic solution<br />
in form <strong>of</strong> infinite series [14]. When the magnetic<br />
permeability <strong>of</strong> the host material and <strong>of</strong> the spherical<br />
particle is equal, the Mie coefficients are<br />
mΨn( mx) Ψ′ n( x) −Ψn( x) Ψ′<br />
n(<br />
mx)<br />
an<br />
=<br />
,<br />
mΨ mx ξ′ x −ξ x Ψ′<br />
mx<br />
b<br />
n( ) n( ) n( ) n(<br />
)<br />
( mx) ′ ( x) m ( x) ′ ( mx)<br />
( mx) ξ′ ( x) mξ ( x) ′ ( mx)<br />
Ψ Ψ − Ψ Ψ<br />
=<br />
, (4)<br />
n n n n<br />
n<br />
Ψn n − n Ψn<br />
where m =<br />
i i<br />
ε r μr h h<br />
ε r μr<br />
is the contrast <strong>of</strong> the<br />
refractive index and n Ψ and ξ n are the Riccati-Bessel<br />
functions. The radiating electric and magnetic dipole<br />
polarizabilities correspond to the first terms <strong>of</strong> the<br />
expansion and can be expressed with the Mie scattering<br />
coefficients as<br />
3<br />
3<br />
3r<br />
3r<br />
α e = i a1,<br />
α 3 m = i b1.<br />
3<br />
2x<br />
2x<br />
(5)<br />
(b) (c)<br />
Figure 1: Homogenization models <strong>of</strong> metamaterials<br />
Substituting (5) in the Clausius-Mossotti relation leads to<br />
the expressions <strong>of</strong> the effective electric permittivity and<br />
with a similar argument to the expression <strong>of</strong> the effective<br />
magnetic permeability [15, 16]<br />
The Maxwell-Garnett mixing rule [10, 12, 13] can<br />
provide the effective electric permittivity <strong>of</strong> dilute, two<br />
component mixtures and it is derived with the assumption<br />
that the spherical inclusions can be replaced by static<br />
electric dipoles with polarizability<br />
3<br />
eff h x + 3iζa1(<br />
mx)<br />
εr = εr<br />
3<br />
x − 3 iζa1( mx<br />
2 )<br />
3<br />
eff h x + 3iζb1(<br />
mx)<br />
μr = μr<br />
3<br />
x − 3 iζb1( mx<br />
2 )<br />
,<br />
. (6)<br />
i h<br />
εr − εr<br />
3<br />
αe<br />
= r , i h<br />
εr + 2εr<br />
(1)<br />
where 1<br />
h<br />
where ε r is the electric permittivity <strong>of</strong> the host material,<br />
i<br />
ε r is the electric permittivity and r is the radius <strong>of</strong> the<br />
spherical inclusions. The connection between the<br />
eff<br />
polarizability and the effective electric permittivity ε r is<br />
given by the Clausius-Mossotti relation [13]<br />
eff h<br />
εr − εr ζ<br />
= α<br />
eff h 3 e , (2)<br />
εr + 2εr<br />
r<br />
where ζ is the filling factor <strong>of</strong> the spherical inclusion.<br />
When (1) is substituted in the Clausius-Mossotti relation<br />
it results in the Maxwell-Garnett mixing formula<br />
eff h i h<br />
εr −εr εr −εr<br />
= ζ . (3)<br />
eff h i h<br />
εr + 2εr εr + 2εr<br />
In this relation, the size <strong>of</strong> the spherical inclusions is not<br />
appearing in a direct way; the filling factor ζ is the only<br />
geometry factor in the Maxwell-Garnett formula. The<br />
static dipole approximation is valid only for spheres,<br />
which are very small compared to the optical wavelength<br />
<strong>of</strong> the incident electromagnetic wave. The size parameter<br />
a and b 1 are the first terms <strong>of</strong> the Mie scattering<br />
coefficients and i = − 1 is the imaginary unit. The<br />
evaluation <strong>of</strong> a 1 and b1 is trivial, because in (5) for<br />
n = 1 , the Riccati-Bessel functions and the derivatives<br />
can be expressed with simple expression <strong>of</strong> trigonometric<br />
functions as<br />
sin ρ<br />
Ψ 1 ( ρ) = − cos ρ ,<br />
ρ<br />
1 cosρ<br />
Ψ ′ 1 ( ρ) = sin ρ1−<br />
2 +<br />
,<br />
ρ ρ<br />
cos ρ <br />
ξ1( ρ) =Ψ1( ρ) − i + sin ρ<br />
ρ ,<br />
1 <br />
ξ′ 1( ρ) =Ψ ′ 1( ρ) + i Ψ 1( ρ) + cos ρ 2 <br />
ρ .<br />
When the size <strong>of</strong> the spherical inclusions is not small<br />
enough to be replaced with static dipoles, but it is small<br />
enough to disregard all higher order modes <strong>of</strong> (4) then the<br />
high frequency mixing formulas (6) are applicable. Note<br />
that the resonance based magnetic metamaterials are<br />
working under similar conditions [1]. Metamaterials has
finite unit cell sizes, and especially magnetic<br />
metamaterials has unit cells, which are not deep<br />
subwavelength. The strength <strong>of</strong> the resonance decreases<br />
with the size <strong>of</strong> the unit cell and the resonance is not<br />
strong enough to produce negative permeability for<br />
structures with deep sub-wavelength elements.<br />
Metamaterials with larger unit cell can support higher<br />
order modes at frequencies, which are just slightly<br />
different than the frequency region where the double<br />
negative behavior occurs. Special care must be taken<br />
when metamaterial parameters are extracted directly from<br />
transmission reflection data, and it is not sufficient to<br />
enforce the continuity <strong>of</strong> the refractive index, because we<br />
may extract erroneous effective metamaterial parameters<br />
for frequency regions where they do not even exist. The<br />
high frequency mixing, which is based on the Mie theory<br />
provides estimate for the limits <strong>of</strong> the homogenization.<br />
III. EFFECTIVE MATERIAL PARAMETERS OF COMPOSITE<br />
WITH SPHERICAL METALLIC INCLUSIONS<br />
In this section the effective parameters <strong>of</strong> the<br />
composite material with the unit cell illustrated in Fig. 2.a<br />
are calculated. This composite serves as benchmark to<br />
compare the effective material parameters calculated with<br />
the Maxwell Garnett mixing rule, the high frequency<br />
mixing rule and directly extracted from transmission<br />
reflection data. The geometry and the composition are<br />
selected such that the size parameter <strong>of</strong> the spheres at<br />
optical frequencies satisfies the condition x 1.<br />
The<br />
length <strong>of</strong> the cubic unit cell is 15 nm and the radius <strong>of</strong> the<br />
sphere is 3 nm. The spherical inclusions are made <strong>of</strong> Ag<br />
and are embedded in SiO2 host and the calculations take<br />
into account the frequency dispersion <strong>of</strong> the materials<br />
parameters. Fig. 2.b presents the electric permittivity <strong>of</strong><br />
the Ag inclusions, and Fig. 2.c plots the electric<br />
permittivity <strong>of</strong> the SiO2 host [17, 18]. The composite is<br />
considered infinitely large in the x and y directions (see<br />
Fig. 2.a) and only-one-unit-cell thick in the z direction.<br />
The Maxwell-Garnett type mixing rules do not require<br />
cubic unit cells; the requirement is that the inclusions are<br />
separated. For periodically arranged spherical inclusions,<br />
when the filling factor is high, the Maxwell-Garnett<br />
mixing rule has to be modified [12]. On the other hand<br />
disorder and inaccuracy <strong>of</strong> shapes destroys the collective<br />
effects and extends the limits <strong>of</strong> the theory.<br />
The aim <strong>of</strong> the calculation is to determine the effective<br />
parameters <strong>of</strong> this composite in the frequency range from<br />
0.4 to 1 PHz. The calculations <strong>of</strong> the transmission<br />
reflection data (S-parameters) <strong>of</strong> this paper are performed<br />
with the frequency-domain solver <strong>of</strong> the commercial<br />
s<strong>of</strong>tware CST Microwave Studio [19]. Due to periodicity,<br />
one unit cell with perfect electric conducting and perfect<br />
magnetic conducting boundary conditions in the x and the<br />
y directions is sufficient to calculate the S-parameters<br />
below the frequencies where diffraction occurs. In the z<br />
direction additional air regions are added to the<br />
computational space by positioning waveguide ports at<br />
one-unit-cell distance from the surface <strong>of</strong> the composite.<br />
The fundamental mode <strong>of</strong> the waveguide ports is excited<br />
to launch a plane wave, which is propagating along the z<br />
direction; at the same time the waveguide ports act as<br />
- 121 - 15th IGTE Symposium 2012<br />
absorbing boundary condition and permits the automatic<br />
calculation <strong>of</strong> the S-parameters [19]. The online algorithm<br />
[20] is applied to extract the electromagnetic parameters<br />
from the S parameters. To get a good estimate for the<br />
Kramers–Kronig integral, the simulations cover the 0.25–<br />
1.25 PHz frequency interval. When this frequency<br />
interval is even larger, the accuracy <strong>of</strong> the Kramers–<br />
Kronig approximation does not change noticeably in the<br />
frequency range <strong>of</strong> interest.<br />
(a)<br />
(b)<br />
(c)<br />
Figure 2: The geometry <strong>of</strong> the composite material is<br />
shown in (a), the electric permittivity <strong>of</strong> the spherical<br />
inclusions made <strong>of</strong> Ag is presented in (b) and the electric<br />
permittivity <strong>of</strong> the SiO2 host materials is plotted in (c).<br />
Fig. 3.a and 3.b presents the magnitude and phase <strong>of</strong><br />
2 2<br />
the S-parameters. The absorption A = 1− S11<br />
− S21<br />
in<br />
function <strong>of</strong> frequency is plotted as well, showing a<br />
resonant peek at f 1 = 0.7192 PHz.<br />
The effective electric permittivity <strong>of</strong> the composite<br />
material, which is calculated in three different ways, with<br />
the high frequency mixing rule, with the Maxwell-Garnett<br />
mixing and extracted from the S-parameters, are<br />
presented in Fig. 4. The magnetic permeability is obtained<br />
from the high frequency mixing rule and it is extracted<br />
from the S-parameters as well, and it has values close to<br />
one over the frequency range <strong>of</strong> interest. Comparing the<br />
real and imaginary parts <strong>of</strong> the electric permittivity<br />
obtained with the three different methods, a very good<br />
agreement can be observed. The peak in the imaginary<br />
part <strong>of</strong> the electric permittivity corresponds to the<br />
absorption peek <strong>of</strong> Fig. 3.a, which reveals that it is<br />
electric resonance. The electric permittivity has Lorentz<br />
shape and can be successfully fitted with a single<br />
oscillator model
( − )<br />
2<br />
εrs εr∞ω0 εr ( ω) = εr∞+<br />
. (7)<br />
2 2<br />
ω0+ iδω−ω<br />
where the static electric permittivity ε rs = 2.379 , the<br />
electric permittivity at very high frequencies ε r∞<br />
= 2.23 ,<br />
the resonant frequency ω0= 2π ⋅ 0.7228 rad/fs and<br />
damping constant δ = 0.33 1/fs.<br />
(a)<br />
(b)<br />
Figure 3: The S-parameters <strong>of</strong> the one-unit-cell thick<br />
composite material. In (a), the magnitude, and in (b) the<br />
phase <strong>of</strong> the S-parameters is plotted. Note the absorption<br />
peek at f 1 = 0.7192 PHz.<br />
Figure 4: The effective electric permittivity <strong>of</strong> the<br />
composite material calculated with the high frequency<br />
mixing rule, Maxwell-Garnett mixing and extracted from<br />
the S-parameters.<br />
The electric permittivity at the frequency <strong>of</strong> the<br />
absorption peek is ε = 2.5319 + 2.0465i<br />
and the<br />
r<br />
- 122 - 15th IGTE Symposium 2012<br />
corresponding optical wavelength is<br />
λ opt = c0 ( n f1)<br />
= 245.04 nm, which is much larger than<br />
any characteristic dimension <strong>of</strong> the composite (the size <strong>of</strong><br />
the unit cell is 15 nm), showing that the resonant behavior<br />
is related to the composition rather than structuring.<br />
IV. EQUIVALENT COMPOSITES OF METAMATERIALS<br />
DESIGNED WITH THE HIGH FREQUENCY MIXING RULE<br />
In this section the equivalent composite <strong>of</strong> a magnetic<br />
metamaterial is determined. The geometry <strong>of</strong> the<br />
metamaterial is the well studied split ring resonator [1, 2,<br />
3, 8] as it is shown in Fig. 5. The dimensions and the<br />
material parameters are the same as in [8]. The size <strong>of</strong> the<br />
cubic unit cell is 5 mm, the split ring resonators are made<br />
<strong>of</strong> copper, the outer length <strong>of</strong> the exterior split ring<br />
resonators is 3 mm, the width <strong>of</strong> both split rings is<br />
0.25 mm, the thickness is 0.02 mm, the size <strong>of</strong> the gaps<br />
and the distance between the split ring resonators is<br />
0.5 mm. The substrate is made <strong>of</strong> dielectric with<br />
ε r = 3.84 and the thickness <strong>of</strong> the substrate is 0.25 mm.<br />
The metamaterial is periodic in the direction<br />
perpendicular to the propagation <strong>of</strong> the electromagnetic<br />
wave (z direction), the electric field is polarized in y<br />
direction, which means that the magnetic field is<br />
perpendicular to the plane <strong>of</strong> the split ring resonators. The<br />
metamaterial <strong>of</strong> [8] was designed to experimentally<br />
demonstrate the negative refraction. The role <strong>of</strong> the splitring<br />
resonators is to provide the negative magnetic<br />
response, while additional copper wires placed on the<br />
back side <strong>of</strong> the substrate are responsible for producing<br />
the negative electric permittivity, leading to a negative<br />
refractive index at a frequency <strong>of</strong> 10 GHz. In our<br />
numerical simulations the metamaterial is only-one-unitcell<br />
thick.<br />
Figure 5: The unit cell <strong>of</strong> the magnetic metamaterial slab<br />
is composed <strong>of</strong> metallic split-ring resonators.<br />
The reflection, transmission and absorption spectrum<br />
<strong>of</strong> the double negative metamaterial [8] is presented in<br />
Fig. 8.a, while in Fig. 8.b the electromagnetic response <strong>of</strong><br />
the split ring resonators is shown. The simulations reveal<br />
that the position <strong>of</strong> the resonant peek at 10 GHz is not<br />
changed by removing the wires; nevertheless the shape <strong>of</strong><br />
the transmission and reflection curves is greatly affected.
The effective parameters <strong>of</strong> the magnetic metamaterial<br />
built <strong>of</strong> split ring resonators are extracted from the Sparameters<br />
with [20] and are shown in Fig. 7.a. The<br />
magnetic permeability has Lorentz shape with negative<br />
values and it is similar to the magnetic permeability <strong>of</strong><br />
[8]. The effective electric permittivity has a shape <strong>of</strong> antiresonance,<br />
which may be an artifact caused by the<br />
replacement <strong>of</strong> the anisotropic metamaterial structure with<br />
the homogenized model <strong>of</strong> isotropic slab.<br />
(a)<br />
(b)<br />
Figure 6: In (a) the reflection, transmission and<br />
absorption spectrum <strong>of</strong> the double negative metamaterial<br />
is presented, while in (b) the electromagnetic response <strong>of</strong><br />
the split ring resonators is shown.<br />
Spectral fitting is carried out to find the parameters <strong>of</strong><br />
the composite, which is magnetically equivalent to the<br />
metamaterial, built <strong>of</strong> split ring resonators. The<br />
parameters <strong>of</strong> the high frequency mixing rule, the radius<br />
and the electric permittivity <strong>of</strong> the spherical inclusions,<br />
the filling factor and the electric permittivity <strong>of</strong> the host<br />
material are determined by minimizing the mean square<br />
error,<br />
2 2<br />
N HF TR HF TR<br />
Re( μri ) Re( μri ) Im ( μri) Im ( μ <br />
− − ri ) <br />
+ <br />
TR TR <br />
i= 1 Re( μri ) Im(<br />
μri<br />
) <br />
<br />
Ω=<br />
<br />
2N<br />
where N is the number <strong>of</strong> data points in the spectra, Re()<br />
and Im() return the real and imaginary parts <strong>of</strong> the<br />
magnetic permeability<br />
μ extracted from the S-<br />
TR<br />
r<br />
parameters or HF<br />
μ r calculated with the high frequency<br />
- 123 - 15th IGTE Symposium 2012<br />
mixing rule. The minimization is performed with the<br />
differential evolution algorithm [21]. The minimization<br />
provides r = 2.31 mm for the radius <strong>of</strong> the spherical<br />
inclusions, the electric permittivity <strong>of</strong> the inclusions is<br />
i<br />
ε r = 37.67 , the filling factor is ζ = 0.13 and the electric<br />
h<br />
permittivity <strong>of</strong> the host material is ε r = 1.<br />
Note that the<br />
results <strong>of</strong> this optimization are implementable; several<br />
materials exist at microwave frequencies with even higher<br />
electric permittivity and are available as powder or<br />
suspension [7].<br />
(a)<br />
(b)<br />
Figure 7: In (a) the effective magnetic permeability and<br />
electric permittivity <strong>of</strong> the metamaterial made <strong>of</strong> split-ring<br />
resonators extracted from S-parameters is shown. In (b)<br />
the material parameters <strong>of</strong> the equivalent composite are<br />
presented.<br />
The real and imaginary parts <strong>of</strong> the magnetic<br />
permeability and the electric permittivity <strong>of</strong> the equivalent<br />
composite are plotted in Fig. 7. b. As it can be seen there<br />
is a good agreement between the effective magnetic<br />
permeability <strong>of</strong> the metamaterial and the permeability <strong>of</strong><br />
the composite. Comparing the real part <strong>of</strong> the electric<br />
permittivities it can be observed that they are comparable<br />
TR<br />
at low frequencies, for example at 5 GHz ε = 1.57 and<br />
ε = 1.43 , even though there is no optimization goal<br />
HF<br />
r<br />
formulated for permittivity in the mean square error <strong>of</strong> the<br />
minimization procedure. On the other hand there is no<br />
anti-resonant behavior in the electric permittivity <strong>of</strong> the<br />
composite in the frequency region <strong>of</strong> the magnetic<br />
resonance. The magnetic resonance <strong>of</strong> the composite is<br />
followed by electric resonance, which appears at the<br />
r
upper end <strong>of</strong> the investigated frequency region. To move<br />
the electric resonance outside <strong>of</strong> this frequency region,<br />
the bounds <strong>of</strong> the optimization parameters were changed<br />
and several minimization runs were performed. As a<br />
result it can be observed that the model does not provide<br />
enough freedom to maintain the strength and the position<br />
<strong>of</strong> the magnetic resonance and at the same time to change<br />
the position <strong>of</strong> the electric resonance to higher<br />
frequencies. The extension <strong>of</strong> the high frequency model to<br />
ellipsoidal particles may solve this issue.<br />
In Fig. 8 the magnitudes <strong>of</strong> the S-parameters for the<br />
metamaterial built <strong>of</strong> split ring resonator and those for the<br />
equivalent composite are presented. The difference<br />
between the curves is due to the difference in electric<br />
permittivities. The correspondence may be improved by<br />
considering frequency dependent material parameters.<br />
Figure 8: Comparison between the magnitudes <strong>of</strong> the<br />
S-parameters <strong>of</strong> the metamaterial built <strong>of</strong> split ring<br />
resonator and the S-parameters <strong>of</strong> the equivalent<br />
composite.<br />
V. CONCLUSIONS<br />
High frequency mixing rule, which is based on the<br />
Clausius-Mossotti relation and the first terms <strong>of</strong> the Mie<br />
expansion corresponding to radiating dipoles has been<br />
applied to characterize composites and metamaterials. To<br />
validate the model the effective electric permittivity and<br />
magnetic permeability <strong>of</strong> a deep subwavelength<br />
composite were calculated and it was shown that similar<br />
results are produced by the high frequency mixing rule,<br />
the Maxwell-Garnett mixing rule or extracted directly<br />
from the S-parameters.<br />
The developed high frequency model can open<br />
alternative ways to engineer required electromagnetic<br />
properties. It was shown that equivalent composite, which<br />
has similar effective magnetic permeability, can be<br />
assigned to the magnetic metamaterial built <strong>of</strong> split ring<br />
resonators.<br />
VI. ACKNOWLEDGEMENT<br />
This work has been supported by the János Bolyai<br />
Research Fellowship <strong>of</strong> the Hungarian Academy <strong>of</strong><br />
Sciences and OTKA 105996.<br />
- 124 - 15th IGTE Symposium 2012<br />
[1]<br />
REFERENCES<br />
L. Solymár and E. Shamonina, Waves in Metamaterials, Oxford,<br />
<strong>University</strong> Press, 2009.<br />
[2] Marqués R., Martín F., Sorolla M., Metamaterials with<br />
NegativeParameters. John Willey and Sons, 2008.<br />
[3] N. Engheta, R. W. Ziolkowski, Metamaterials Physics and<br />
Engineering Applications, John Willey and Sons, 2006.<br />
[4] J. B. Pendry, Negative Refraction Makes a Perfect Lens, Physical<br />
Review Letters, vol. 85, no. 18, pp. 3966–3969, 2000.<br />
[5] J.B. Pendry, A.J. Holden, D.J. Robbins, and W.J. Stewart,<br />
Magnetism from Conductors and Enhanced Non-Linear<br />
[6]<br />
Phenomena, IEEE Transactions on Microwave Theory and<br />
Techniques, vol. 47, 2075, 1999.<br />
D. R. Smith, W. Padilla, D. Vier, S. Nemat-Nasser and S. Schultz,<br />
Composite Medium with Simultaneously Negative Permeability<br />
and Permittivity, Phys. Rev. Lett., vol. 84, p. 4184, 2000.<br />
[7] V. M. Shalaev, Optical negative-index metamaterials, Nature<br />
Photonics, vol. 1, pp. 41-48, 2006.<br />
[8] R. A. Shelby, D. R. Smith, S. Schultzm, Experimental<br />
Verification <strong>of</strong> a Negative Index <strong>of</strong> Refraction Science, vol 292,<br />
pp. 77-79, 2001.<br />
[9] G. Dolling, M. Wegener, C. Soukoulis and M. S. Linden,<br />
Negative-index metamaterial at 780 nm wavelength, Opt. Let.,<br />
vol. 32, no. 1, pp. 53-55, 2007.<br />
[10] A. F. de Baas (editor), Nanostructured Metamaterials, European<br />
Comission, 2010.<br />
[11] Zs. Szabó, G.-H. Park, R. Hedge, and E.-P. Li, “A unique<br />
extraction <strong>of</strong> metamaterial parameters based on Kramers-Kronig<br />
relationship,” IEEE Trans. Microwave Theory Tech., vol. 58, no.<br />
10, pp. 2646-2653, 2010.<br />
[12] A. Sihvola, Electromagnetic Mixing Formulas and Applications,<br />
The Institution <strong>of</strong> Electrical Engineers, London, United Kingdom,<br />
1999.<br />
[13] D. E. Aspnes, Local-field effects and effective-medium theory: A<br />
microscopic perspective, Am. J. Phys., vol. 50, no. 8, pp. 704-709,<br />
1982.<br />
[14] C. F. Bohren, D. R. Huffman, Absorption and Scattering <strong>of</strong> Light<br />
by Small Particles, Wiley-VCH, 2004.<br />
[15] R. Ruppin, Evaluation <strong>of</strong> extended Maxwell-Garnett theories,<br />
Optics Communications, vol 182, pp. 273–279, 2000.<br />
[16] C. A. Grimes, D. M. Grimes, Permeability and permittivity<br />
spectra <strong>of</strong> granular materials, Phys. Rev. B, vol. 43, pp. 10780–<br />
10788, 1991.<br />
[17] E.D. Palik and G.K. Ghosh, Editors, Handbook <strong>of</strong> Optical<br />
Constants <strong>of</strong> Solids, Academic Press, New York, 1997.<br />
[18] [Online] http://www.sspectra.com/sopra.html<br />
[19] [Online] www.cst.com<br />
[20] [Online] http://effmetamatparam.sourceforge.net/<br />
[21] K. V. Price, R. M. Storn, J. A. Lampinen, Differential Evolution,<br />
A Practical Approach to Global Optimization, Springer, 2005.
- 125 - 15th IGTE Symposium 2012<br />
Enhancement <strong>of</strong> Maximum Starting Torque and<br />
Efficiency in Permanent Magnet Synchronous Motors<br />
Jawad Faiz, Vahid Ghorbanian and Bashir Mahdi Ebrahimi<br />
Center <strong>of</strong> Excellence on Applied Electromagnetic Systems, School <strong>of</strong> Electrical and Computer<br />
Engineering, College <strong>of</strong> Engineering, <strong>University</strong> <strong>of</strong> Tehran, Tehran 1439957131, Iran<br />
(e-mail: jfaiz@ut.ac.ir)<br />
Abstract— This paper presents a new algorithm for enhancement <strong>of</strong> maximum starting torque and steady-state efficiency in<br />
permanent magnet (PM) motors. This algorithm includes two strategies which are used to raise starting torque and decrease<br />
losses in PM motors. Therefore, transient and steady-state operations <strong>of</strong> the PM motor are improved. It is essential to model<br />
the core in the efficiency estimation <strong>of</strong> losses <strong>of</strong> the PM motor. Appropriate control coefficients based on the introduced<br />
algorithm are set for two aforementioned goals. Simulation results are presented the competency <strong>of</strong> the proposed algorithm.<br />
Index Terms— PM motor, control strategy, starting torque, losses minimization, efficiency.<br />
I. INTRODUCTION<br />
Application <strong>of</strong> permanent magnet (PM) motors is<br />
increasing due to the advanced technology in PM<br />
manufacturing, and its high power density, improved<br />
power factor and high efficiency. Some applications such<br />
as electric and hybrid vehicles with huge start-stop<br />
actions need high starting torque to quickly accelerate the<br />
vehicles. Enhancement <strong>of</strong> the starting torque involves the<br />
increase <strong>of</strong> the stator windings currents leading to<br />
windings temperature rise. Therefore, the stator current<br />
must be limited. Since the performance improvement <strong>of</strong><br />
these motors has considerable effects upon the electrical<br />
power consumption over long time applications, the<br />
instantaneous optimal control and appropriate operating<br />
point over different loads and speeds should be<br />
considered.<br />
Nowadays, a wide range <strong>of</strong> the motor speeds and<br />
torques are achieved by application <strong>of</strong> vector control<br />
methods in the motors [1]-[4]. These control methods<br />
provide stability and precise required speed and accurate<br />
response because <strong>of</strong> the feedback in the motor which can<br />
control the flux and torque independently [5]. The<br />
controllable quantities are id and iq currents. By<br />
controlling current id, the motor flux and consequently<br />
speed is adjusted and by controlling current iq, the steadystate<br />
output torque is regulated [6]. In [7], [8], a method<br />
has been introduced to control the torque <strong>of</strong> the PM<br />
machine by limiting the windings currents. This machine<br />
has been used as a generator <strong>of</strong> a wind turbine. When the<br />
wind speed is higher than that <strong>of</strong> the rated speed, torque<br />
rises and the windings insulation may fail. In some<br />
methods [7], there is no need to have a mechanical sensor.<br />
Since magnetic saturation has considerable impact on the<br />
motor behavior at high currents; the saturation effects are<br />
approximately modeled in this paper. Meanwhile, strategy<br />
<strong>of</strong> maximum torque control is normally applied to the<br />
motor in the non-starting case [3].<br />
Different strategies have been so far applied to improve<br />
the efficiency <strong>of</strong> the PM motor. In [9], efficiency <strong>of</strong> the<br />
motor has been improved through introducing a new teeth<br />
and slot structure. In [7], [12], a method based on the dq<br />
model <strong>of</strong> the motor has been proposed in which the core<br />
losses have been taken into account by a resistance in the<br />
equivalent circuit <strong>of</strong> the motor. This technique is called<br />
the “loss model control”, in which copper and core losses<br />
are evaluated as an analytical function <strong>of</strong> equivalent<br />
circuit parameters and id and iq currents <strong>of</strong> the motor. By<br />
obtaining the optimal currents, the motor losses can be<br />
minimized and efficiency maximized. In interior PM<br />
(IPM) motors Lq and Ld are unequal and it is difficult to<br />
obtain the optimal operating point at different speeds and<br />
torques analytically. To overcome this problem, normally<br />
id and iq are expressed as functions <strong>of</strong> the motor speed and<br />
its coefficients are stored in a lookup table. The objection<br />
<strong>of</strong> this method is that by increasing the motor speed and<br />
torque ranges, the tables will be larger and it must be<br />
updated by change <strong>of</strong> the motor parameters. In [1], [5],<br />
[10], motor efficiency has been improved by id =0<br />
method. In the PM motor model, motor reluctance torque<br />
appears as coefficient <strong>of</strong> id and by putting id =0, the<br />
reluctance torque as an opposed torque is eliminated and<br />
consequently its output power increases for a fixed speed.<br />
Applying id=0 to IPM motors needs a high power<br />
inverter. Therefore, this method is normally applied to<br />
surface-mounted PM (SPM) motor [10]. The unity power<br />
factor method can develop less maximum torque<br />
compared to other methods [10]. So, it is not suitable in<br />
the high torque applications. Flux-linkage control method<br />
presents a better performance in IPM.<br />
On contrary to the loss model control which depends<br />
on the motor model accuracy, methods presented in [2],<br />
[3], [13] is called search control method which is<br />
independent <strong>of</strong> the motor and drive parameters. In this<br />
method, attempt has been made to reduce the input power<br />
and this is normally done through the control <strong>of</strong> voltage<br />
or dc link current <strong>of</strong> the inverter. Application <strong>of</strong> this<br />
control method may produce undesirable oscillations in<br />
the torque and speed <strong>of</strong> the motor which leads to<br />
instability [14]. In this method the use <strong>of</strong> a frequency<br />
stabilizer is necessary.<br />
Previous papers have not taken into account both high<br />
starting torque and steady-state efficiency improvement in<br />
PM motor. This paper investigates the control strategy <strong>of</strong><br />
the maximum torque and efficiency enhancement in the<br />
steady-state operation <strong>of</strong> the motor. By application <strong>of</strong> this<br />
method, torque raises up to 4 pu and current up to 2 pu.
Distinction <strong>of</strong> this paper and [10] is the design <strong>of</strong><br />
intelligent system for applying the limitation on id and iq,<br />
So, at any instant sensitivity <strong>of</strong> torque against each current<br />
components is measured and a component that has less<br />
effect in <strong>of</strong> the torque development is limited. To improve<br />
the steady-state efficiency <strong>of</strong> the motor, loss model<br />
control algorithm <strong>of</strong> [5] is used. In section II the motor<br />
model is introduced. In section III control strategy is<br />
described. Section IV and V present the simulation<br />
method and results respectively. Finally section VI<br />
concludes the paper.<br />
II. MODEL OF MOTOR<br />
The proposed control strategies in this paper are based<br />
on the analytical equations <strong>of</strong> the motor model. Since<br />
enhancement <strong>of</strong> the maximum starting torque and steadystate<br />
efficiency <strong>of</strong> the motor are carried out through the<br />
control <strong>of</strong> id and iq current vectors, the Park’s model<br />
converts three-phase abc equations <strong>of</strong> the motor into twophase<br />
dq equations in which id and iq currents <strong>of</strong> the<br />
motor are available. Figure 1 shows the IPM motor<br />
model where Ra is the stator resistance, Ld is the d-axis<br />
inductance and Lq is the q-axis inductance.<br />
Figure 1: Two-axes model <strong>of</strong> IPM motor<br />
These two inductances are not equal in the IPM motors<br />
and they develop a considerable reluctance torque. Rc is<br />
the iron losses equivalent resistance. The iron losses<br />
consist <strong>of</strong> the hysteresis and eddy current losses, and are<br />
modeled by equivalent resistance Rc, which depends on<br />
the temperature and frequency. To simplify the<br />
computations, this resistance is evaluated at the rated<br />
conditions. In [5], leakage and magnetizing inductances<br />
are separated and inserted in different branches. Since Rc<br />
is very larger than that <strong>of</strong> the other impedances, the<br />
current <strong>of</strong> the losses branch is small and the current <strong>of</strong> the<br />
left hand side and right hand side have no considerable<br />
difference. Therefore, sum <strong>of</strong> leakage and magnetizing<br />
inductances are used in the model.<br />
The governing equations <strong>of</strong> the motor model are as<br />
follows:<br />
diod<br />
vd<br />
Raid<br />
<br />
Lqioq<br />
Ld<br />
(1)<br />
dt<br />
- 126 - 15th IGTE Symposium 2012<br />
dioq<br />
vq<br />
Raiq<br />
Ldiod<br />
a Lq<br />
dt<br />
(2)<br />
icd id<br />
iod<br />
(3)<br />
icq iq<br />
ioq<br />
(4)<br />
diod<br />
( Lqioq<br />
Ld<br />
)<br />
i<br />
dt<br />
cd <br />
Rc<br />
(5)<br />
dioq<br />
( (<br />
Ldiod<br />
<br />
a ) Lq<br />
)<br />
i<br />
dt<br />
cq <br />
RC<br />
The developed electromagnetic torque <strong>of</strong> the motor is:<br />
(6)<br />
3P<br />
Te ( )[ aLq<br />
( Ld<br />
Lq<br />
) iodioq<br />
]<br />
2<br />
The dynamic equation <strong>of</strong> the motor is as follows:<br />
(7)<br />
dr<br />
Te<br />
Tm<br />
c sign(<br />
r ) Fr<br />
J<br />
dt<br />
(8)<br />
III. CONTROL STRATEGY<br />
A. Enhancement <strong>of</strong> Maximum Torque<br />
Eqn. (7) indicates that the motor torque depends on id<br />
and iq components <strong>of</strong> currents. At the starting, stator<br />
current does not so much depend on the load, and<br />
normally is 2 to 2.5 times the rated value. The starting<br />
torque <strong>of</strong> the motor can be up to 2.5 times the rated<br />
torque and generally there is no need to reduce it.<br />
However, in some applications such as ABS brake <strong>of</strong> cars<br />
the motor must have very short declaration time (about<br />
fractional <strong>of</strong> ms), and the starting torque, about 2.5 times<br />
the rated torque, cannot response quickly in the no control<br />
mode. Therefore, it is necessary to apply the high starting<br />
torque in the case <strong>of</strong> no control case over longer time in<br />
order to provide an appropriate declaration time. In the<br />
previous studies some control methods have been<br />
proposed to increase the starting torque; however a<br />
limited starting current has not been considered. Here a<br />
novel technique is introduced that optimally limits the<br />
stator current under vector control and also enhanced the<br />
maximum starting torque. By applying this method, the<br />
starting torque rises up to 4 times the rated torque. The<br />
basis <strong>of</strong> this method is the use <strong>of</strong> id and iq components <strong>of</strong><br />
current. Since the equivalent resistance <strong>of</strong> the iron losses<br />
is almost infinite:<br />
i i<br />
q<br />
oq<br />
id iod<br />
(10)<br />
Suppose the stator current is<br />
starting period, iq is as follows:<br />
is constant during the<br />
2 2 2 2 2 2<br />
(11)<br />
i i i i i i<br />
q<br />
d<br />
s<br />
q<br />
s<br />
d<br />
Combining (7) and (11) leads to:<br />
3P<br />
2 2<br />
(12)<br />
Te ( ) is<br />
id<br />
[ a ( Ld<br />
Lq<br />
) id<br />
]<br />
2<br />
where is can be taken as 2 to 2.5 times the rated current.<br />
In practice, the back-emf increases by acceleration <strong>of</strong> the<br />
motor and this decreases the stator current. However, to<br />
simplify the equations it was taken to be constant. The<br />
(9)
optimal id is obtained by putting the derivative <strong>of</strong> the<br />
maximum torque versus id equal to zero:<br />
2<br />
dTe<br />
a<br />
a 2 is<br />
0 id<br />
<br />
( ) <br />
2<br />
2<br />
did<br />
4(<br />
Ld<br />
Lq<br />
) 4(<br />
Ld<br />
Lq<br />
) 2 (13)<br />
For positive id, the reluctance torque increases and the<br />
total torque <strong>of</strong> the motor reduces. So, only the negative<br />
sign is acceptable.<br />
2<br />
a<br />
a 2 is<br />
id <br />
( ) <br />
2<br />
2<br />
4(<br />
Ld<br />
Lq<br />
) 4(<br />
Ld<br />
Lq<br />
) 2<br />
(14)<br />
By applying id from (14) to the reference point id,<br />
torque rises. But the stator current becomes larger than<br />
the permissible current by applying the obtained<br />
components <strong>of</strong> the current. Therefore, the torque<br />
sensitivity versus the current components is calculated<br />
and the current that has less influence the torque<br />
development is limited:<br />
T Te<br />
3P<br />
e<br />
(15)<br />
Si | i cte ( )( Ld<br />
Lq<br />
) iq<br />
d<br />
q <br />
id<br />
2<br />
T Te<br />
3P<br />
e Si | i cte ( )( a ( Ld<br />
Lq<br />
) iq<br />
)<br />
q<br />
d <br />
(16)<br />
iq<br />
2<br />
By applying this limit, the stator current does not rise<br />
further than 2 times the rated current.<br />
B. Improvement <strong>of</strong> Steady-state Efficiency<br />
The major factor in the efficiency reduction <strong>of</strong> the<br />
motor is the increase <strong>of</strong> the copper and iron losses. In the<br />
vector controlled motor supplied by an inverter, high<br />
order harmonics generates additional losses. In spite <strong>of</strong><br />
this, the major part <strong>of</strong> the losses allocated to the<br />
fundamental harmonic. Copper losses directly and iron<br />
losses indirectly is proportional with the motor current.<br />
The copper and iron losses arising from the fundamental<br />
harmonic is optimized by vector control <strong>of</strong> the stator<br />
currents. The high order harmonic losses are<br />
uncontrollable. The basis <strong>of</strong> the losses control is the<br />
estimation <strong>of</strong> the motor losses using the presented model<br />
and its optimization versus the motor currents. Since<br />
efficiency is defined in steady-state, the time derivatives<br />
<strong>of</strong> the dynamic equations are set equal to zero and losses<br />
are calculated as follows:<br />
Lqioq<br />
2 <br />
(<br />
iod<br />
) <br />
3R<br />
2 2 3R<br />
Rc<br />
<br />
Wcu<br />
( iod<br />
, ioq<br />
, ) ( )( id<br />
iq<br />
) ( ) <br />
<br />
2<br />
2 ioq<br />
(<br />
a Ldiod<br />
) 2<br />
(<br />
)<br />
<br />
<br />
<br />
Rc<br />
<br />
(17)<br />
2<br />
2<br />
3R<br />
<br />
( ) <br />
c 2 2 3<br />
Lqioq<br />
W fe(<br />
iod<br />
, ioq,<br />
) ( )( icd<br />
icq)<br />
( ) <br />
<br />
2<br />
2R<br />
2 (18)<br />
c ( a Ldiod<br />
) <br />
where Wcu and Wfe are the copper losses and iron losses<br />
respectively. The motor losses are function <strong>of</strong> iod, ioq and<br />
. In these equations, the influence <strong>of</strong> temperature rise<br />
and higher harmonics on the resistances and magnetic<br />
saturation upon the inductances have been ignored and<br />
taken to be constant. However, the losses increase<br />
nonlinearly due to the high harmonics and considerable<br />
rise <strong>of</strong> the magnetizing current because <strong>of</strong> the saturation.<br />
The magnetic saturation occurs normally at starting, and<br />
at the steady-state mode the motor operates at the knee <strong>of</strong><br />
the magnetization characteristic; therefore, neglecting the<br />
- 127 - 15th IGTE Symposium 2012<br />
saturation is acceptable. By combining (7), (17), (18), the<br />
following equation is obtained:<br />
Wc W fe(<br />
iod<br />
, Te<br />
, ) Wcu<br />
( iod<br />
, Te<br />
, )<br />
Wc<br />
( iod<br />
, Te<br />
, )<br />
(19)<br />
As indicated in (19), the total losses <strong>of</strong> the motor depend<br />
on function <strong>of</strong> the operating point and iod current. At the<br />
operating point with fixed speed and torque, the optimal<br />
iod is obtained for losses reduction using the analytical<br />
derivative <strong>of</strong> (19).However, in the IPM motor, Ld and Lq<br />
are not identical, therefore the equations are complicated<br />
and use <strong>of</strong> analytical derivative is difficult. Sometimes,<br />
the currents <strong>of</strong> the motor are expressed as a polynomial<br />
versus each other where its polynomial coefficients<br />
depending on the speed <strong>of</strong> the motor. The coefficients <strong>of</strong><br />
the polynomials versus the motor operating point are<br />
stored in a look-up table. However, this method is quick<br />
but interpolation over different operating points leads to a<br />
highly approximated method. Meanwhile, the tables over<br />
wide range <strong>of</strong> speed and torque become large, and these<br />
tables must be updated by change <strong>of</strong> the motor type.<br />
Flow-chart reported in [5] has optimized the motor losses<br />
without using analytical derivative and also lookup table.<br />
The presented algorithm is an iterative one and it<br />
normally converges to an appropriate solution after 14<br />
iterations. In the present paper, the section related to the<br />
response <strong>of</strong> the final conditional expression reported in<br />
[5] is modified and therefore the optimal response point is<br />
achieved with lower number <strong>of</strong> iterations at any operating<br />
point. This algorithm increases the developed<br />
electromagnetic torque <strong>of</strong> the motor at a constant <strong>of</strong><br />
operating point and consequently efficiency <strong>of</strong> the motor<br />
improves. Since iq is the torque component <strong>of</strong> the current,<br />
its value depends largely on the load <strong>of</strong> the motor and its<br />
large change will lose the stable operating point.<br />
Therefore, the reluctance torque value is controllable<br />
by change <strong>of</strong> id. In the traditional methods such as id=0,<br />
the reluctance torque is almost zero and demagnetization<br />
effect <strong>of</strong> the stator current diminishes. The idea used in<br />
the new method is to make negative id which makes the<br />
reluctance torque positive and improves the efficiency<br />
Figure 2: Loss minimization algorithm
Figure 3: Motor and control system<br />
<strong>of</strong> the motor at fixed speed. idmax is generally taken to be a<br />
small positive value and idmin a large negative value. d<br />
defines the step variations <strong>of</strong> id and x the mean value <strong>of</strong> id<br />
in every step. To achieve an appropriate response the step<br />
number depends on the value <strong>of</strong> id which is fixed at an<br />
optimal value. The simulation results show the<br />
improvement <strong>of</strong> the motor efficiency by applying the<br />
presented control method compared to id =0 method.<br />
IV. SIMULATION METHOD<br />
The above-mentioned control strategies are applied to a<br />
PM motor under vector control. The output <strong>of</strong> these<br />
control methods provides the reference values <strong>of</strong> the drive<br />
current. Since the motor supply under vector control is<br />
PWM type, the high order odd harmonics are injected to<br />
the motor. Amplitude <strong>of</strong> these harmonics varies with<br />
changing the operating point. Figure. 3 shows the<br />
complete system <strong>of</strong> the motor and drive. The LMA block<br />
is for efficiency improvement strategy in steady-state and<br />
T/A block is for the maximum starting torque<br />
enhancement. Limitation block limits the motor currents.<br />
The reference current values are transformed into the twophase<br />
reference voltages by Vqd_ref. Then the motor<br />
reference voltages are formed by transforming the twophase<br />
to three-phase voltages and applying to the PWM<br />
block. The maximum starting torque algorithm during<br />
transient and efficiency improvement algorithm during<br />
the steady-state periods are applied to the motor. In the<br />
motor model, a low-pass filter is used for eliminating<br />
high-order harmonics from the control process.<br />
Specifications <strong>of</strong> the simulated motor have been<br />
summarized in Table I.<br />
V. SIMULATION RESULTS<br />
The results have been obtained by simulation <strong>of</strong> the<br />
motor under control using Simulink. First, the results <strong>of</strong><br />
applying T/A algorithm during the transient mode to the<br />
motor is considered. By using motor parameters and<br />
- 128 - 15th IGTE Symposium 2012<br />
TABLE I<br />
NAMEPLATES AND PARAMETERS OF IPM<br />
MOTOR<br />
Number <strong>of</strong> poles 6<br />
Rated Torque (Nm) 1.8<br />
Rated rms current (A) 3.6<br />
Rated speed (rpm) 4000<br />
Stator winding resistance Ra ( 2.21<br />
Core loss equivalent resistance Rc ( 840<br />
Direct axis inductance (mH) 9.77<br />
Quadrature axis inductance (mH) 14.94<br />
Permanent magnet flux a (Wb) 0.0844<br />
Mechanical losses (Nm) 0.04<br />
considering constant maximum current <strong>of</strong> the motor, id<br />
component <strong>of</strong> the stator current is calculated by T/A and<br />
applied to the input reference <strong>of</strong> the drive. Since a high<br />
torque is necessary at the starting, at the first instant the<br />
current id increases in the negative direction considerably<br />
(Figure. 4a). Negative current id leads to the positive<br />
reluctance torque and increases the total torque <strong>of</strong> the<br />
motor. By applying T/A, value <strong>of</strong> iq also increases and the<br />
motor current rises over permissible limit.<br />
Therefore, currents are limited. Figure. 4b exhibits the<br />
variations <strong>of</strong> the normalized electromagnetic torque <strong>of</strong> the<br />
motor (based on the rated values) in which the torque<br />
raises up to 3.8 pu due to T/A applications. This torque is<br />
very larger than the case in which the motor is able to<br />
develop with no control strategy. In order to study the<br />
performance <strong>of</strong> the stator current limiter system, threephase<br />
currents <strong>of</strong> the motor are shown in figure. 5 which<br />
indicates that the phase current <strong>of</strong> the motor raises up to<br />
1.8 pu. This current rises up to 2.5 pu when current<br />
limiter is not used. The reason for a constant torque<br />
during the transient mode is that the tolerable peak<br />
current by the stator winding is assumed constant. If this<br />
current as a function <strong>of</strong> the motor emf is applied to the<br />
model, the motor torque will decrease by time. After<br />
completion <strong>of</strong> the transient period, the control algorithm
iabc(pu)<br />
Speed(rpm)<br />
id(A)<br />
torque(pu)<br />
-5<br />
0 0.01 0.02 0.03 0.04 0.05<br />
time(s)<br />
2<br />
1<br />
0<br />
-1<br />
0<br />
-1<br />
-2<br />
-3<br />
-4<br />
4<br />
3.5<br />
3<br />
2.5<br />
2<br />
1.5<br />
1<br />
0.5<br />
0 0.01 0.02 0.03 0.04 0.05<br />
time(s)<br />
-2<br />
0 0.01 0.02 0.03 0.04 0.05<br />
time(s)<br />
5000<br />
4000<br />
3000<br />
2000<br />
1000<br />
(a)<br />
(b)<br />
Figure 4: (a) id and (b) motor torque at starting period<br />
is converted into LMA. Change <strong>of</strong> the control strategy is<br />
also visible in the three-phase currents <strong>of</strong> the motor which<br />
creates some irregularities in the currents. In addition to<br />
limiting the current, reducing overshoot<br />
Figure 5: Three-phase currents <strong>of</strong> controlled motor<br />
with T/A control<br />
Without T/A control<br />
0<br />
0 0.01 0.02 0.03 0.04<br />
time(s)<br />
Figure 6: Time variations <strong>of</strong> motor speed<br />
and shortening settling time <strong>of</strong> the motor speed are other<br />
advantages <strong>of</strong> applying T/A. Figure. 6 compares the<br />
motor speed variations without T/A application and<br />
- 129 - 15th IGTE Symposium 2012<br />
controlling cases. Settling time <strong>of</strong> the motor from 0 to the<br />
rated speed decreases from 0.025 in the no control case to<br />
0.01 in the application <strong>of</strong> T/A and without overshoot. In<br />
fact, by applying the maximum torque control, the motor<br />
becomes more stable. It is noted that the torque jump due<br />
to switching from T/A to LMA is not present in the speed<br />
signal. The reasons are the high inertia and long<br />
mechanical time constant <strong>of</strong> the motor compared to its<br />
electrical time constant. The LMA attempts to find the<br />
optimal id for the efficiency improvement. Normally, id is<br />
chosen negative values by applying this algorithm. The<br />
influence <strong>of</strong> the negative id is the enhancement <strong>of</strong> the<br />
torque and decrease <strong>of</strong> losses in the motor. Simulation <strong>of</strong><br />
PM motor under LMA control over a wide range <strong>of</strong> the<br />
speed and torque has been carried out and the effect <strong>of</strong><br />
this algorithm on the motor variables and system<br />
efficiency has been investigated. Meanwhile, the outputs<br />
<strong>of</strong> LMA with id=0 are compared and advantage <strong>of</strong> this<br />
method over conventional methods is given.<br />
Figure. 7 shows the variations <strong>of</strong> the copper and iron<br />
losses <strong>of</strong> the motor versus speed and torque. By raising<br />
the speed at the rated load, the iron losses increase and in<br />
this case the losses reduction algorithm shows its<br />
dominant effects. Also at fixed speed and high loads,<br />
reduction <strong>of</strong> the total iron and copper losses is<br />
considerable. Meanwhile, by increasing the negative<br />
value <strong>of</strong> id, demagnetization effect <strong>of</strong> PM decreases and<br />
for a fixed output power, supply voltage reduces. This<br />
means the efficiency improvement. Difference between<br />
the motor losses in two cases id=0 and LMA causes the<br />
motor efficiency change.<br />
Figure. 8 shows the efficiency versus speed and torque<br />
<strong>of</strong> the motor. Efficiency <strong>of</strong> the motor has been compared<br />
in two id=0 and LMA cases. According to figure. 8a,<br />
efficiency <strong>of</strong> the motor under LMA control over different<br />
speeds, shows a relative increase by id=0 method. By<br />
increasing the speed <strong>of</strong> the motor, the rate <strong>of</strong> efficiency<br />
improvement also rises. It means that whatever the motor<br />
approaches more to the rated operating point, its<br />
efficiency improves. Figure. 8b shows the efficiency<br />
versus load torque at the rated speed, and it emphasizes<br />
the efficiency improvement <strong>of</strong> the LMA in the motor<br />
compared to that <strong>of</strong> the conventional methods. The<br />
impact <strong>of</strong> this method is higher for higher loads. As<br />
shown in figure. 8a, there is no much difference between<br />
LMA and id=0 at low load levels. So, efficiency will not<br />
be considerably changed. This is not true over the low<br />
speeds. Generally, electrical machines operate in the knee<br />
<strong>of</strong> the magnetization characteristic where they have peak<br />
power density; therefore they have the maximum<br />
efficiency at the rated operating point. By applying LMA<br />
at the rated operating point, a 3% rise <strong>of</strong> the efficiency<br />
occurs. In the references, LMA algorithm has been<br />
applied to the PM motor experimentally. The difference<br />
between the simulation and experimental results is due to<br />
the approximations included in the simulation. The most<br />
important factor is ignoring the magnetic saturation.
Total loss(W)<br />
82<br />
80<br />
78<br />
76<br />
74<br />
72<br />
70<br />
LMA<br />
id=0<br />
68<br />
500 1000 1500 2000 2500 3000 3500 4000<br />
Speed(rpm)<br />
Total losses(W)<br />
120<br />
100<br />
80<br />
60<br />
40<br />
20<br />
LMA<br />
id=0<br />
0<br />
0 0.5 1 1.5 2<br />
torque(Nm)<br />
Comparision <strong>of</strong> the motor efficiencies at rated load (1.8 N.m)<br />
- 130 - versus the angular 15th speed in the case IGTE <strong>of</strong> LMA and id=0 controls. Symposium 2012<br />
Efficiencies [%]<br />
Efficiencies [%]<br />
100<br />
90<br />
80<br />
70<br />
60<br />
50<br />
40<br />
80<br />
60<br />
40<br />
20<br />
0<br />
0 1000 2000 3000 4000<br />
Angular speed [rpm]<br />
(a) (a)<br />
VI. CONCLUSION<br />
Two algorithms for enhancing the maximum starting<br />
torque and steady-state efficiency <strong>of</strong> a PM motor were<br />
investigated. In both methods, stator current components<br />
have been used, transient and steady-state performance <strong>of</strong><br />
a PMSM have been improved by closed-loop control<br />
strategy and application <strong>of</strong> the two algorithms. These<br />
algorithms are independent <strong>of</strong> the PM motor type. The<br />
simulation results shown that by applying two algorithms,<br />
performance <strong>of</strong> the motor is considerably improved<br />
compared to that <strong>of</strong> the conventional control methods. By<br />
applying the stator current limiting method during the<br />
starting period, the stator winding insulation is prevented<br />
against the damage due to high current. The LMA method<br />
improves the steady-state efficiency <strong>of</strong> the motor up to<br />
3% at the rated load and T/A method causes the increase<br />
<strong>of</strong> the starting torque up to 4 times <strong>of</strong> the rated torque.<br />
Therefore, by applying the proposed control methods in<br />
addition to providing a high starting torques without the<br />
risk <strong>of</strong> the short circuit <strong>of</strong> the windings; the extra losses<br />
arising from the imprecise control <strong>of</strong> electrical motors can<br />
be prevented.<br />
AKNOWLEGEMENT<br />
We sincerely thank the Iran’s National Elites<br />
Foundation (INEF) for financial support <strong>of</strong> the project.<br />
VII. REFRENCES<br />
[1] S.Morimoto, Y.Tong, Y.Takeda, and T. Hirasa, “Loss minimization<br />
control <strong>of</strong> permanent magnet synchronous motor drives,”IEEE<br />
Transactions on Industrial Electronics, vol. 41, no. 5, pp. 511-517,<br />
Oct 1994.<br />
[2] C.Mademlis, L.Xypteras, and N.Margaris, “Loss minimization in<br />
surface permanent magnet synchronous motor drives",IEEE<br />
Transactions on Industrial Electronics,vol. 47, no. 1, pp. 115-122,<br />
Feb 2000.<br />
Comparision <strong>of</strong> the motor efficiencies at rated speed (4000)<br />
versus the load torque in the case <strong>of</strong> LMA and <strong>of</strong> id=0 controls<br />
id=0<br />
LMA<br />
id=0<br />
LMA<br />
30<br />
0 0.5 1 1.5 2<br />
Torque(N.m)<br />
(b) (b)<br />
Figure 7: Losses <strong>of</strong> motor versus (a) speed and (b) torque<br />
Figure 8: Efficiency <strong>of</strong> the motor versus (a)speed (b)<br />
torque<br />
[3] Sadegh Vaez,M.A.Rahman, "Adaptive Loss Minimization Control <strong>of</strong><br />
Inverter Fed IPM Motor Drives".IEEE Power Electronics Specialists<br />
Conference, pp. 861-868, vo. 2, 1997.<br />
[4] T.M.Jahns, G.B.Kliman, T.W.Neumann, “Interior permnanent<br />
magnet synchronous motor for adjustable speed drives,” IEEE<br />
Transactions Industry Applications, vol. 22, no. 4, pp. 738-747,<br />
July/August 1986<br />
[5] C.Cavallaro, A.O.Tommaso, R.Miceli, and A.Raciti, “Efficiency<br />
enhansment <strong>of</strong> permanent magnet synchronous motor drives by<br />
online loss minimization approaches,”IEEE Transactions on<br />
Industry Applications, vol. 52, no. 4, pp. 1153-1160, August 2005.<br />
[6] J.S.Yim, S.K.Sul, B.H.Bae, N.R.Patel, and S.Hiti, “Modified current<br />
control schemes for high-performance permanent-magnet ac drives<br />
with low sampling to operating frequency ratio,” IEEE Transactions<br />
Industry Applications, vol. 45, no. 2, pp. 763-771, March/April<br />
2009.<br />
[7] S.Morimoto, H.Nakayama, and M.Sanada, “Sensorless output<br />
maximization control for variable-speed wind generation system<br />
using IPMSG,”IEEE Transactions on Industry Applications, vol.<br />
41, no. 1, pp. 60-67, Jan/Feb 2005.<br />
[8] T.Nakamura, S.Morimoto, m.sanada, and Y.Takada, “Optimum<br />
control <strong>of</strong> IPMSG for wind generation system,” IEEE Power<br />
Conversion Conference, Osaka, pp. 1435-1440, 2002.<br />
[9] C.Chris, G.R.Slemon, and R.Bonert, “Minimization <strong>of</strong> iron loss<br />
<strong>of</strong> permanent magnet synchronous machines,”IEEE Transactions<br />
on Energy Conversion, vol. 20, no. 1, pp. 121- 127, March 2005.<br />
[10] S.Morimoto, Y.Takeda, and T.Hirasa, “Current phase control<br />
methods for permanent magnet synchronous motors,”IEEE<br />
Transactions on Power Electronics, vol. 5, no. 2, pp. 133, April<br />
1990.<br />
[11] S.Morimoto, Y.Takeda, T.Hirasa, and K.Taniguchi, “Expansion <strong>of</strong><br />
operating limits for permanent magnet motor by current vector<br />
control considering inverter capacity,”IEEE Transactions on<br />
Industry Applications, vol. 26, no. 5, pp. 866-871, Sep/Oct 1990.<br />
[12] K.Yamazaki, “Torque and efficiency calculation <strong>of</strong> an interior<br />
permanent magent motor considering harmonic iron losses <strong>of</strong> both<br />
the stator and rotor,” IEEE Transactions Magnetics,vol. 39, no. 3,<br />
pp. 1460-1463, May 2003.<br />
[13] R.S.Colby, and D.W.Novotny, “An efficiency-optimizing<br />
permanent magnet synchronous motor drive,”IEEE Transaction on<br />
Industry Applications, vol. 24, no. 3, pp. 462-469,May/June 1988.<br />
[14] A.Kusko, and D.Galler, “Control means for minimization <strong>of</strong> losses,<br />
in AC and DC motor drives,”IEEE Transactions on Industry<br />
Applications, vol. 19, no. 4, pp. 561-570 ,July/August 1983.
- 131 - 15th IGTE Symposium 2012<br />
Core Losses Estimation Techniques in Electrical<br />
Machines with Different Supplies – A Review<br />
Jawad Faiz, A.M. Takbash and B. M. Ebrahimi<br />
Center <strong>of</strong> Excellence on Applied Electromagnetic Systems, School <strong>of</strong> Electrical and Computer Engineering, College <strong>of</strong><br />
Engineering, <strong>University</strong> <strong>of</strong> Tehran, Tehran, Iran, Email: jfaiz@ut.ac.ir<br />
Abstract—In this paper, different methods for core losses estimation in ferromagnetic materials with non-sinusoidal supply<br />
are studied. At this end, the origin <strong>of</strong> the core losses in the aforementioned materials is addressed. Since magnetization<br />
excitation is the most effective factor upon the core losses, different core losses estimation with six general types <strong>of</strong> excitations<br />
are considered and features <strong>of</strong> these methods, their advantages and disadvantages are investigated.<br />
Index Terms—Core Loss, Finite Element Method, Hysteresis Loop, Steinmetz Equation.<br />
I. INTRODUCTION<br />
The major role <strong>of</strong> ferromagnetic materials in electrical<br />
machines leads to wide research towards a better<br />
realization <strong>of</strong> these materials and their characteristics.<br />
One <strong>of</strong> the most important features in these materials is<br />
their losses. Core losses are generated due to magnetic<br />
flux residual and eddy current. From physics point <strong>of</strong><br />
view, these two factors have an identical origin; they are<br />
movement <strong>of</strong> magnetic domains walls as well as internal<br />
movement <strong>of</strong> the magnetic domains. The magnetic<br />
residual flux is a well-known phenomenon. When an<br />
external magnetic field is applied to a ferromagnetic<br />
material, magnetic dipoles try to align with this external<br />
field. Even after removing the external magnetic field,<br />
some magnetic domains preserve their alignments and<br />
such case the material is magnetized. By changing the<br />
magnetic field a small amount <strong>of</strong> energy is stored in the<br />
material due to existing residual flux. The level <strong>of</strong> this<br />
stored energy depends on the material type. If a conductor<br />
is imposed on a varying magnetic field or moved<br />
appropriately in the magnetic field, eddy current is<br />
induced in the conductor. Any variation in magnetic field<br />
that causes the movement <strong>of</strong> the magnetic domains walls<br />
is a factor inducing eddy current. Eddy current generates<br />
heat and electromagnetic forces. For dc excitation, there<br />
are residual and eddy current losses as well. The reason<br />
for the residual losses is the internal movement <strong>of</strong> the<br />
magnetic domains which itself generates microscopic<br />
currents [1]. Based on these two factors, the core losses<br />
have been classified into two classes [2]. Some categorize<br />
the core losses into three classes [3], in which the third<br />
class is stray losses due to the external factors such as<br />
external magnetic fields. Two factors influence the core<br />
losses. The first factor depends on the magnetic material<br />
alloy, for instance rising Si content in SiFe magnetic alloy<br />
reduces the eddy current losses. The second factor is the<br />
external factor. Magnetic properties <strong>of</strong> magnetic materials<br />
are affected by cutting, pressing and welding processes.<br />
For instance, cutting or welding process <strong>of</strong> magnetic<br />
sheets can increase the residual losses <strong>of</strong> the sheets.<br />
Pressing the sheets causes the increase <strong>of</strong> the eddy<br />
currents. One <strong>of</strong> methods for reduction <strong>of</strong> the eddy<br />
current is using thin sheets. However, this leads to higher<br />
residual losses [4], [5]. Another external factor affecting<br />
the core losses is the magnetizing excitation. In the case<br />
<strong>of</strong> sinusoidal excitation, the well-known classic Steinmetz<br />
equations can be used for core losses estimation:<br />
P k f B k f B<br />
(1)<br />
x 2 2<br />
ir h s m e s m.<br />
where Pir is the core losses, fs is the supply frequency, Bm<br />
is the magnetic flux density magnitude and kh, ke, xare the<br />
Steinmetz factors. Such equations leads to error in the<br />
case <strong>of</strong> non-sinusoidal excitation and new equations must<br />
be introduced. Application <strong>of</strong> different drives with<br />
various switching patterns and also internal faults in<br />
electrical machines leads to non-sinusoidal excitation.<br />
Wide application <strong>of</strong> inverter-fed electrical machines and<br />
different faults such as rotor broken bars and eccentricity<br />
are important research topics in recent years. Therefore,<br />
core losses estimation in the magnetic core over such noncommon<br />
conditions is important. The trend <strong>of</strong> core losses<br />
estimation can be classified as shown in Figure 1. In this<br />
classification six general methods has been introduced<br />
which will be discussed in this paper.<br />
II. FINITE-ELEMENT-BASED METHODS<br />
Finite element methods (FEM) are time consuming and<br />
high precision techniques that take into account the<br />
geometry and physics <strong>of</strong> the machine. There are three<br />
steps in modeling electrical machines. They include<br />
geometrical modeling <strong>of</strong> motor considering physical<br />
characteristics, modeling motor supply considering<br />
electrical characteristics and finally modeling the motor<br />
load taking into account mechanical features. FEMs<br />
provide magnetic field distributions within induction<br />
motor based on its geometrical and magnetic parameters.<br />
Other quantities such as air gap flux density can be also<br />
estimated using the magnetic field distribution. In FEM,<br />
the coupling between electrical and magnetic fields and<br />
motor rotation can be taken into account. Losses in<br />
different parts <strong>of</strong> the machine can be estimated having the<br />
magnetic field distribution in various sections <strong>of</strong> the<br />
motor. A new method has been introduced in [6] for core<br />
losses estimation in a no-load motor with direct-fed and<br />
PWM-fed motor using 2D time stepping FEM in which<br />
the simulation results have been compared to the
Steinmetz<br />
Eq.<br />
Hysteresis<br />
Model<br />
FEM<br />
- 132 - 15th IGTE Symposium 2012<br />
Iron Loss<br />
Calculation<br />
Methods<br />
Equivalent<br />
Circuit<br />
Physical Eq.<br />
Figure 1: Different methods for core losses estimation in non-sinusoidal excitation<br />
experimental results. To include stray losses and<br />
rotational residual losses, a modification factor has been<br />
considered in the losses calculation process. This<br />
modification factor depends on the peak magnetic flux<br />
density and its distortion. In [7], a new model for<br />
laminated core <strong>of</strong> induction motor has been presented<br />
based on 3D FEM. This modeling method is based on the<br />
reduced magnetic potential equations and used to estimate<br />
the core losses with non-sinusoidal excitation caused by<br />
PWM application. The results are more precise than that<br />
<strong>of</strong> the 2D FEM. In [8], the authors emphasize the need for<br />
precise data in core losses estimation from magnetic<br />
fields, therefore 2D FE model is applied; impact <strong>of</strong> nonsinusoidal<br />
supply and its harmonics on the rotor magnetic<br />
field has been investigated and the well-known integral<br />
equations have been then used to estimate the core losses.<br />
Core losses estimation in rotating electrical machines is<br />
more complicated than that <strong>of</strong> the static machine because<br />
<strong>of</strong> more complicated structure and rotating magnetic<br />
fields [9]. So, FEM modeling has been modified in order<br />
to include the stray core losses due to magnetic field<br />
rotating vector and its harmonics. At this end, two types<br />
<strong>of</strong> PM motors with two different structures have been<br />
modeled using a new method and the simulation results<br />
have been compared to the experimental results. In [10],<br />
impact <strong>of</strong> the stator slot shapes upon the core losses has<br />
been discussed and three different structures <strong>of</strong> induction<br />
motor for minimizing the core losses have been<br />
investigated. In this case, core losses are estimated using<br />
magnetic flux density and field intensity and integration<br />
<strong>of</strong> their product. Core losses distribution over core crosssection<br />
and impact <strong>of</strong> different stator slot shapes has been<br />
considered using FEM. In [11], a model based on eddy<br />
current analysis in the magnetic sheets is presented which<br />
is capable to estimate the stray losses for computation <strong>of</strong><br />
high frequency core losses and residual losses in core<br />
sheets <strong>of</strong> electrical machines. Advantages <strong>of</strong> this method<br />
is taking into account the magnetic field distribution<br />
along sheets thickness using one-dimensional non-linear<br />
FEM over non-linear 2D elements and also impact <strong>of</strong><br />
frequency and magnetic flux density on the quantities<br />
related to the core losses. In [12], a model has been<br />
introduced to study the impact <strong>of</strong> PWM supply on the<br />
induction motor core losses. Triple losses <strong>of</strong> the core have<br />
been presented by a combined model using FEM. The<br />
results have been compared to the traditional modeling<br />
and experimental results. This combined model consists<br />
<strong>of</strong> two static and dynamic models in which the impact <strong>of</strong><br />
Control<br />
Strategy<br />
the residual minor loops has been also included. In<br />
addition, the losses distribution over the motor and<br />
separation <strong>of</strong> different components <strong>of</strong> the core losses has<br />
been pointed out. In [13], induction motor performance<br />
has been analyzed using 2D FEM. This is one <strong>of</strong> the few<br />
works in which the impact <strong>of</strong> internal fault <strong>of</strong> induction<br />
motor such as rotor broken bars and eccentricity and also<br />
application <strong>of</strong> the PWM drive upon core and Ohmic<br />
losses have been considered and shown that the rotor<br />
broken bar causes the increase <strong>of</strong> the losses around the<br />
damaged bar; in addition PWM supply also increases the<br />
core losses density. In [14], Permanent magnet (PM)<br />
motor under three static, dynamic and mixed<br />
eccentricities faults have been analyzed using FEM.<br />
Finally, impact <strong>of</strong> these faults on core and Ohmic losses<br />
has been investigated. Figure 2 shows the impact <strong>of</strong><br />
different eccentricities on the eddy current and residual<br />
losses.<br />
III. HYSTERESIS LOOP MODEL BASED METHODS<br />
Precise mathematical model <strong>of</strong> hysteresis loop in<br />
magnetic material could be useful in accurate estimation<br />
<strong>of</strong> the core losses. First, classical hysteresis models such<br />
as [15] have been used to calculate the losses. Recently,<br />
improved hysteresis model such as loss surface model<br />
(LSM) and energy-based hysteresis vector-model have<br />
been employed to estimate the losses which are briefly<br />
described below. The LSM is a numerical and dynamic<br />
model for core losses evaluation which has been applied<br />
to thick magnetic laminations in [16]. This method is<br />
based on the definition <strong>of</strong> the magnetic field as a surface<br />
function <strong>of</strong> magnetic flux density and its rate as follows:<br />
dB dB<br />
S H( B, ) Hstat ( B) Hdyn ( B,<br />
). (2)<br />
dt dt<br />
In fact, this method is combination <strong>of</strong> a static and<br />
dynamic model and is capable to model the static and<br />
dynamic behaviors <strong>of</strong> the hysteresis loop. The static<br />
behavior is modeled using different hysteresis curves and<br />
dynamic behavior using six parameters which depend on<br />
the magnetic flux density and time variation <strong>of</strong> its<br />
derivative. Vector magnetic hysteresis model has been<br />
used in [17] to estimate the core losses. In this method,<br />
magnetic field intensity has been evaluated using the<br />
vertical components <strong>of</strong> the magnetic flux density and then<br />
core losses have been calculated by integration <strong>of</strong> the<br />
product <strong>of</strong> the magnetic flux density and field intensity.
- 133 - 15th IGTE Symposium 2012<br />
Figure 2: (a) Hysteresis losses and (b) eddy current losses for different degrees and types <strong>of</strong> eccentricity [14]<br />
Application <strong>of</strong> two Preisach and Jill-Atherton models<br />
have been compared in three different magnetic materials<br />
in order to obtain an optimal method for precise modeling<br />
<strong>of</strong> the magnetic cores using FEM with reasonable<br />
computation time [18]. In mathematical models the full<br />
hysteresis loop and a series <strong>of</strong> the magnetic parameters <strong>of</strong><br />
the proposed material must be available which<br />
complicated its application [19]<br />
IV. STEINMETZ EQUATIONS-BASED METHODS<br />
Purpose <strong>of</strong> the improved Steinmetz equations is to<br />
estimate core losses for non-sinusoidal magnetic flux<br />
density analytically. This modification is done using<br />
different methods. Improved Steinmetz equations have<br />
been employed to estimate the core losses in switched<br />
reluctance motor (SRM) [20], [21]. First SRM behavior is<br />
analyzed using FEM and then improved core losses<br />
equations for SRM including the impact <strong>of</strong> the minor<br />
hysteresis loops beside <strong>of</strong> current harmonics effect are<br />
considered.<br />
ab. B 1 dB<br />
max<br />
2<br />
Pc kcfChfBmax C ( ) .<br />
2 e avg (3)<br />
2<br />
dt<br />
where Kcf is the modification factor that takes into<br />
account the impact <strong>of</strong> the minor hysteresis loops within<br />
the major loops. Another idea for improving the<br />
Steinmetz classical equations is obtaining an equivalent<br />
frequency for proposed non-sinusoidal signal which have<br />
been used for magnetic sheets [22] and transformer [23]:<br />
2<br />
2<br />
T dB<br />
feq <br />
( ) dt.<br />
2 2<br />
( B 0<br />
max Bmin ) (4)<br />
dt<br />
where Bmax and Bmin are the maximum and minimum <strong>of</strong><br />
the magnetic flux density. In the other words, remagnetizing<br />
frequency is substituted by an equivalent<br />
frequency versus magnetic flux density variations. In<br />
addition the impacts <strong>of</strong> dc upon the core losses have been<br />
considered in [22] and the modified Steinmetz equations<br />
(MSE) have been introduced. Another method <strong>of</strong><br />
modifying Steinmetz equations is introducing the<br />
coefficients in order to take into account the nonsinusoidal<br />
magnetic excitation waveform. In this case, the<br />
distorted magnetic flux density waveform is used to<br />
determine these coefficients. For instance, in [24] the<br />
Steinmetz equations have been changed as such that the<br />
hysteresis losses versus the mean value <strong>of</strong> the rectified<br />
waveform <strong>of</strong> the magnetic flux density and eddy current<br />
losses versus the rms value <strong>of</strong> this waveform have been<br />
expressed. Consequently, these coefficients are estimated<br />
when the magnetic flux density waveforms in two directfed<br />
and PWM-fed are known and the core losses in<br />
abnormal operation are expressed versus the core losses<br />
in the normal operation. In [25], the traditional Steinmetz<br />
equations and their modification have been used to<br />
estimate the magnetic sheets losses under PWM-fed; such<br />
that the impact <strong>of</strong> frequency and magnetic flux density<br />
variations upon the Steinmetz equation have been<br />
included in the coefficients for different materials and<br />
also impact <strong>of</strong> the magnetic flux density waveform<br />
variations due to drive on the core losses. In order to<br />
consider the frequency and magnetic flux density on the<br />
Steinmetz coefficients, these coefficients are considered<br />
as 3 rd order equations versus magnetic flux density.<br />
V. PHYSICS-DEFINED LOSSES BASED METHODS<br />
A long time ago, there was a procedure for core losses<br />
estimation based on the physics definitions <strong>of</strong> various<br />
core losses. For instance in [26], a modeling method was<br />
introduced for magnetic domains in material and then<br />
eddy current, its cause and losses were discussed.<br />
Advantage <strong>of</strong> this model is its capability to use over wide<br />
range <strong>of</strong> the flux density up to the saturation level and<br />
wide frequency band. In [27], forming the eddy current in<br />
the magnetic sheets and external magnetic fields effect<br />
has been considered. Meanwhile, impacts <strong>of</strong> the internal<br />
magnetic fields (adjacent magnetic domains walls) on the<br />
eddy current have been included. In [28], core losses as<br />
non-linear function <strong>of</strong> frequency are expressed as three<br />
types <strong>of</strong> hysteresis, classic and stray losses. The classic<br />
and stray losses are as follows:<br />
( class)<br />
2 2 2 2<br />
P d<br />
I max<br />
fm/6.<br />
(5)<br />
( exc) 2L<br />
( class)<br />
P (1.63 ) P .<br />
(6)<br />
d<br />
where is the conductivity, d is the magnetic sheet<br />
thickness and L is the magnetic domain dimensions. Core<br />
losses have been categorized into two types: 1: Core<br />
losses constant against frequency (hysteresis losses), and<br />
2: core losses depending on frequency (eddy current<br />
losses and abnormal stray losses) [29]. Each category has<br />
been expressed by equations versus frequency and<br />
magnetic flux density. This method is not a precise<br />
method, the main reason is the classification <strong>of</strong> the losses<br />
versus dependency and independency on the<br />
frequency. In addition, this method has appropriate results
Figure 4: various losses in the mains-fed and inverter-fed<br />
induction motor [31]<br />
over particular amplitude <strong>of</strong> the magnetic flux density due<br />
to the simplification <strong>of</strong> the method. In [30], [31], core<br />
losses have been divided into hysteresis losses, eddy<br />
current losses, and stray eddy current losses. Classic eddy<br />
current losses and stray eddy current losses are as follows:<br />
2<br />
d 1 T dB<br />
Wc( ) dt.<br />
f 12m<br />
T (7)<br />
0 dt<br />
v<br />
1 1 T dB<br />
W GV<br />
S dt.<br />
(8)<br />
c<br />
0<br />
fm 0<br />
v T dt<br />
where mv is the magnetic material density. In the stray<br />
eddy currents losses S is the magnetic sheet cross-section,<br />
G is the dimensions factor and V0 is a parameter that<br />
determines the local fields distribution. Then an<br />
equivalent frequency is defined to estimate the losses<br />
based on physics definition taking into account the<br />
harmonics within the core magnetic flux density. Figure 4<br />
shows various losses in the mains-fed and inverter-fed<br />
induction motor. Also in [32], integrally defined <strong>of</strong> triple<br />
subdivision <strong>of</strong> core losses has been used to estimate the<br />
core losses. In this case, losses equations have been<br />
presented versus mmf for different emf (sinusoidal,<br />
triangular and rectangular) waveforms considering<br />
relationship between emf and magnetic flux density and<br />
piecewise-linearized modeling <strong>of</strong> emf waveform. The<br />
relevant equations due to different losses have been given<br />
for each waveform. In addition, impact <strong>of</strong> different<br />
parameters such as duty cycle upon various core losses<br />
has been investigated. In [33], the influence <strong>of</strong> minor<br />
hysteresis loops within the magnetic sheet hysteresis loop<br />
upon core losses estimation has been investigated and<br />
their complicated time variations versus peak magnetic<br />
flux density and magnetic polarization vector for<br />
estimation <strong>of</strong> the triple core losses have been presented.<br />
Figure 5 shows that how these minor hysteresis loops are<br />
generated. Variations in the magnetic polarization<br />
envelop causes minor hysteresis loops within the major<br />
hysteresis loop. Dividing the core losses into three<br />
different losses and integral definitions versus magnetic<br />
flux density, its derivative lead to complicated<br />
computations. An important point in these methods is<br />
their dependency on the coefficients which depend on the<br />
physical and chemical characteristics <strong>of</strong> the proposed<br />
material and its molecular structure; however, they are not<br />
1.5<br />
- 134 - 15th IGTE Symposium 2012<br />
Figure 5: Generating minor hysteresis loops within major<br />
hysteresis loop [33].<br />
Figure 6: dq equivalent circuit considering core losses [36].<br />
available and their computations need some tests,<br />
therefore application <strong>of</strong> these methods is difficult. In<br />
[34], eddy current losses have been evaluated by<br />
application <strong>of</strong> different double-magnetic-excitation on the<br />
magnetic sheets using Maxwell equations. These<br />
computations have been carried out on a steel sheet and<br />
can be extended to the whole electrical machine. In<br />
addition, computations results have large difference with<br />
the test results and no justification has been given. Also<br />
application <strong>of</strong> the complicated Maxwell equations is an<br />
important problem particularly in the selection <strong>of</strong> the<br />
numerical solution method for solution <strong>of</strong> the equations.<br />
In [35], superposition method has been applied to<br />
estimate the eddy current losses in the PWM-fed motor<br />
and transformer. Since different losses do not vary versus<br />
magnetic flux density, application <strong>of</strong> the superposition is<br />
not correct.<br />
VI. EQUIVALENT CIRCUIT-BASED METHODS<br />
Equivalent circuit <strong>of</strong> induction motor has been<br />
frequently used to analyze the motor behavior and find its<br />
core losses. As shown in Figure 6, at this end a simplified<br />
dq model <strong>of</strong> ac motors can be used which does not lead to<br />
accurate results [36]. In addition, in dq model harmonics<br />
are ignored and this decreases the precision <strong>of</strong> the<br />
method. In [37], an equivalent circuit with a<br />
harmonic supply has been introduced to estimate core<br />
losses. In this case, dq model has been used and<br />
parameters <strong>of</strong> the equivalent circuit are calculated using<br />
FEM. In [38], impact <strong>of</strong> unbalanced supplied induction<br />
motor on the motor efficiency has been presented. The<br />
proposed equivalent circuit <strong>of</strong> induction motor is not<br />
accurate, so it has been modified by layered magnetic<br />
core. To do so, a resistance with leakage inductances has<br />
been connected in parallel. The major point in the
Method<br />
Steinmetz Eqn.<br />
MSE<br />
Hysteresis model<br />
Physical Eqn.<br />
FEM<br />
Equivalent Circuit<br />
Control Strategy<br />
application <strong>of</strong> the equivalent circuits for core losses<br />
estimation is its strong dependency on the parameters<br />
which may vary due to the operating conditions.<br />
VII. CONTROL STRATEGY-BASED METHODS<br />
Since efficiency <strong>of</strong> electrical machine is an important<br />
factor beside its life, one <strong>of</strong> the major quantities aiming to<br />
reduce is the core losses <strong>of</strong> drive-fed motor. In [39], [40],<br />
strategies for squirrel-cage induction motor and PM<br />
synchronous motor control have been introduced to<br />
decrease the losses. In these strategies, all equations for<br />
minimizing the losses depend on the motor parameters<br />
and determination <strong>of</strong> these parameters is themselves<br />
critical. In [41], impact <strong>of</strong> the core losses <strong>of</strong> induction<br />
motor on the stator-flux oriented control has been studied<br />
and a control strategy taking into account the induction<br />
motor losses has been introduced. In this method, core<br />
losses have been modeled by a resistance in parallel with<br />
the magnetizing inductance. In [42], a vector controlbased<br />
strategy for PMSM has been presented to maximize<br />
the motor efficiency. At this end, a model has been<br />
suggested for losses. In this method, d-component <strong>of</strong> the<br />
stator current is determined to maximize the IPM motor<br />
efficiency. In this strategy, losses are divided into Ohmic,<br />
core, mechanical and harmonic losses and they are then<br />
calculated.<br />
This test-based method can be extended to different<br />
types <strong>of</strong> PM and reluctance motors. However, they<br />
depend strongly on the motor parameters while these<br />
parameters are not in turn constant and vary by changing<br />
the operating point <strong>of</strong> the motor. Table I summarizes the<br />
core losses estimation for different supplies. In this table,<br />
some factors such as complexity <strong>of</strong> the methods, need for<br />
magnetic material parameters, response to the nonsinusoidal<br />
excitation waveforms and precision <strong>of</strong> the<br />
methods have been compared. As seen in Table 1, some<br />
accurate methods are complicated and need huge data<br />
from the magnetic material. Some methods are simple<br />
with low accurate responses. So, a proper method must be<br />
selected based-on the application.<br />
VIII. CONCLUSION<br />
Different methods were proposed for core losses<br />
estimation in magnetic materials for different excitations.<br />
These methods were classified and studied. Some factors<br />
such as complexity <strong>of</strong> methods for magnetic material<br />
parameters estimation, response to non-sinusoidal<br />
- 135 - 15th IGTE Symposium 2012<br />
TABLE I<br />
COMPARISON OF DIFFERENT CORE LOSSES ESTIMATION METHODS<br />
Complex waveform Complexity Material knowledge<br />
-<br />
Low<br />
Low<br />
+<br />
Low<br />
Low<br />
+<br />
High<br />
High<br />
+<br />
High<br />
High<br />
+<br />
High<br />
Medium<br />
-<br />
Medium<br />
Low<br />
+<br />
Medium<br />
Low<br />
Accuracy<br />
Low<br />
Medium<br />
Good<br />
Good<br />
Good<br />
Low<br />
Medium<br />
excitation waveforms and precision for each method were<br />
investigated and summarized. The methods such as FEM,<br />
hysteresis model, physical equation lead to accurate<br />
results but they are complicated methods and need a huge<br />
data <strong>of</strong> the magnetic material. MSE method is simple and<br />
no need huge data <strong>of</strong> the magnetic material with average<br />
accuracy. Control strategies are not used in direct core<br />
losses estimation. Equivalent electrical circuits <strong>of</strong> motor<br />
parameters also depend on the operating point <strong>of</strong> the<br />
motor and this is considered a difficulty <strong>of</strong> these methods.<br />
ACKNOWLEDGMENT<br />
The authors would like to thank Iran’s National Elites<br />
Foundation (INEF) for financial support <strong>of</strong> the project.<br />
[1] C. D. Graham,<br />
REFERENCES<br />
"Physical origin <strong>of</strong> losses in conducting<br />
[2]<br />
ferromagnetic materials," Journal <strong>of</strong> Applied Physics, vol. 53, no.<br />
11, pp. 8276-8280, Nov. 1982.<br />
M. S. Lancarotte, Jr. Aderbalde, and A. Penteado, "Estimation <strong>of</strong><br />
core losses under sinusoidal or non-sinusoidal induction by<br />
analysis <strong>of</strong> magnetization rate,” IEEE Transactions on Energy<br />
Conversion, vol.16, no. 2, pp. 174-179, June 2001.<br />
[3] M. Amar, and F. Protat, "A simple method for the estimation <strong>of</strong><br />
power losses in silicon iron sheets under alternating pulse voltage<br />
excitation," IEEE Transactions on Magnetics, vol. 30, no. 2, pp.<br />
942-944, March 1994.<br />
[4] W. Arshad, T. Ryckebusch, F. Magnussen, H. Lendenmann, B.<br />
Eriksson, J. Soulard, and B. Malmros, "Incorporating lamination<br />
processing and component manufacturing in electrical machine<br />
design tools," 42nd IAS Annual Meeting <strong>of</strong> Industry Applications<br />
Society, pp. 94-102, 2007.<br />
[5] H. Skarrie, "Design <strong>of</strong> powder core inductors," Licentiate Thesis,<br />
Lund <strong>University</strong>, Sweden, 2001.<br />
[6] Z. Gmyrek, A. Boglietti, and A. Cavagnino, "Estimation <strong>of</strong> iron<br />
losses in induction motors: calculation method, results, and<br />
analysis," IEEE Transactions on Industrial Electronics, vol. 57,<br />
no. 1, pp. 161-171, January 2010.<br />
[7] K. B. Tatis, A. G. Kladas, and J. A.Tegopoulos, "Harmonic iron<br />
loss determination in laminated iron cores by using a particular 3-<br />
D finite-element model," IEEE Transactions on Magnetics, vol.<br />
40, no. 2, pp. 860- 863, March 2004.<br />
[8] A. M. Knight, and Y. Zhan, "Identification <strong>of</strong> flux density<br />
harmonics and resulting iron losses in induction machines with<br />
nonsinusoidal supplies," IEEE Transactions on Magnetics, vol.<br />
44, no. 6, pp.1562-1565, June 2008.<br />
[9] L. Ma, M. Sanada, S. Morimoto, and Y. Takeda, "Iron loss<br />
prediction considering the rotational field and flux density<br />
harmonics in IPMSM and SynRM," IEE <strong>Proceedings</strong> - Electric<br />
Power Applications, vol. 150, no. 6, pp. 747- 751, November<br />
2003.<br />
[10] M. Enokizono, H. Shimoji, and T. Horibe, "Effect <strong>of</strong> stator<br />
construction <strong>of</strong> three-phase induction motors on core loss," IEEE<br />
Transactions on Magnetics, vol. 39, no. 3, pp. 1484- 1487, May<br />
2003.<br />
[11] K. Yamazaki, and N. Fukushima, "Experimental validation <strong>of</strong><br />
iron loss model for rotating machines based on direct eddy current
analysis <strong>of</strong> electrical steel sheets," IEEE International Conference<br />
on Electric Machines and Drives. IEMDC '09, pp. 851-857, 3-6<br />
May 2009.<br />
[12] E. Dlala, and A. Arkkio, "A General model for investigating the<br />
effects <strong>of</strong> the frequency converter on the magnetic iron losses <strong>of</strong> a<br />
squirrel-cage induction motor," IEEE Transactions on Magnetics<br />
, vol. 45, no. 9, pp. 3303-3315, Sept. 2009.<br />
[13] J. F. Bangura, and N. A. Demerdash, "Effects <strong>of</strong> broken bars/endring<br />
connectors and air gap eccentricities on Ohmic and core<br />
losses <strong>of</strong> induction motors in ASDs using a coupled finite<br />
element-state space method," IEEE Transactions on Energy<br />
Conversion, vol. 15, no. 1, pp. 40-47, Mar. 2000.<br />
[14] B. M. Ebrahimi, and J. Faiz, "Diagnosis and performance<br />
analysis <strong>of</strong> three phase permanent magnet synchronous motors<br />
with static, dynamic and mixed eccentricity," Electric Power<br />
Applications, IET, vol. 4, no. 1, pp. 53-66, January 2010.<br />
[15] D. Jiles, and D. Atherton, "Theory <strong>of</strong> ferro-magnetic hysteresis,"<br />
Journal <strong>of</strong> Magnetism and Magnetic Materials, vol. 61, no. 1-2,<br />
pp. 48-60, Sep.1986.<br />
[16] T. Chevalier, A. Kedous Lebouc, B. Cornut, and C. Cester, "A<br />
new dynamic hysteresis model for electrical steel sheet," Physica<br />
B: Condensed Matter, vol. 275, no. 1-3, pp. 197-201, 2000.<br />
[17] M. Enokizono, and T. Horibe, "Loss evaluation <strong>of</strong> induction<br />
motor by using E&S2 model magnetic hysteresis," IEEE<br />
Transactions on Magnetics, vol. 38, no. 5, pp. 2379- 2381.<br />
September 2002.<br />
[18] A. Benabou, S. Clenet, and F. Piriou, "Comparison <strong>of</strong> Preisach<br />
and Jiles–Atherton models to take into account hysteresis<br />
phenomenon for finite element analysis," Journal <strong>of</strong> Magnetism<br />
and Magnetic Materials, no. 261, pp. 139–160, 2003.<br />
[19] A. Krings, and J. Soulard, "Over view and comparison <strong>of</strong> iron<br />
loss models for electrical machines," Ecological Vehicle and<br />
Renewable Energy (EVER), Monaco, 25-28 March 2010.<br />
[20] B. Ganji, J. Faiz, K. Kasper, C. E. Carstensen, and R. W.<br />
DeDoncker, "Core loss model based on finite-element method for<br />
switched reluctance motors," IEE <strong>Proceedings</strong> - Electric Power<br />
Applications, vol. 4, no. 7, pp. 569–577, 2010.<br />
[21] J. Faiz, B. Ganji, B. R. W. De Doncker, and J. O. Fiedler,<br />
"Electromagnetic modeling <strong>of</strong> switched reluctance motor using<br />
finite element method," 32nd Annual Conference on IEEE<br />
Industrial Electronics, IECON, pp. 1557-1562, 6-10 Nov. 2006.<br />
[22] J. Reinert, A. Brockmeyer, and R. W. De Doncker, "Calculation<br />
<strong>of</strong> losses in ferro- and ferromagnetic materials based on the<br />
modified Steinmetz equation," IEEE Transactions on Industry<br />
Applications, vol. 37, no. 4, pp. 1055-1061, Jul./Aug. 2001.<br />
[23] M. Albach, T. Durbaum, and A. Brockmeyer, "Calculating core<br />
losses in transformers for arbitrary magnetizing currents: a<br />
comparison <strong>of</strong> different approaches," 27th Annual IEEE Power<br />
Electronics Specialists Conference, PESC '97, vol. 2, pp. 1463-<br />
1468, 23-27 Jun 1996.<br />
[24] A. Boglietti, A. Cavagnino, M. Lazzari, and M. Pastorelli,<br />
"Predicting iron losses in s<strong>of</strong>t magnetic materials with arbitrary<br />
voltage supply: an engineering approach," IEEE Transactions on<br />
Magnetics, vol. 39, no. 2, pp. 981- 989, Mar. 2003.<br />
[25] D. Ionel, M. Popescu, C. Cossar, M. I. McGilp, A. Boglietti,<br />
and A. Cavagnino, "A general model <strong>of</strong> the laminated steel losses<br />
in electric motors with PWM voltage supply," IEEE Industry<br />
Applications Society Annual Meeting. IAS '08., pp. 1-7, 5-9 Oct.<br />
2008.<br />
[26] R. H. Pry, and C. P. Bean, "Calculation <strong>of</strong> the energy loss in<br />
magnetic sheet materials using a domain model," Journal <strong>of</strong><br />
Applied Physics, vol. 29, no. 3, pp. 532-533, Mar. 1958.<br />
- 136 - 15th IGTE Symposium 2012<br />
[27] G. Bertotti, "Physical interpretation <strong>of</strong> eddy current losses in<br />
ferromagnetic materials. I. Theoretical considerations," Journal <strong>of</strong><br />
Applied Physics, vol. 57, no. 6, pp. 2110-2117, March 1985.<br />
[28] G. Bertotti, "General properties <strong>of</strong> power losses in s<strong>of</strong>t<br />
ferromagnetic materials," IEEE Transactions on Magnetics, vol.<br />
24, no. 1, pp. 621-630, Jan. 1988.<br />
[29] R. Newbury, "Prediction <strong>of</strong> loss in silicon steel from distorted<br />
waveforms," IEEE Transactions on Magnetics, vol. 14, no. 4, pp.<br />
263- 268, Jul. 1978.<br />
[30] M. Amar and R. Kaczmarek, "A general formula for prediction<br />
<strong>of</strong> iron losses under non-sinusoidal voltage waveform," IEEE<br />
Transactions on Magnetics, vol. 31, no. 5, pp. 2504-2509, Sep.<br />
1995.<br />
[31] T. C. Green, C. A. Hernandez-Aramburo, and A. C. Smith,<br />
"Losses in grid and inverter supplied induction machine drives,"<br />
IEE <strong>Proceedings</strong> - Electric Power Applications, vol. 150, no. 6,<br />
pp. 712- 724, 7 Nov. 2003,<br />
[32] W. A. Roshen, "A practical, accurate and very general core loss<br />
model for no sinusoidal waveforms," IEEE Transactions on<br />
Power Electronics, vol. 22, no. 1, pp. 30-40, Jan. 2007.<br />
[33] E. Barbisio, F. Fiorillo, and C. Ragusa, "Predicting loss in<br />
magnetic steels under arbitrary induction waveform with minor<br />
hysteresis loops," IEEE Transactions on Magnetics, vol. 40, no.<br />
4, pp. 1810- 1819, July 2004.<br />
[34] J. Sagarduy, A. J. Moses, and F. J. Anayi, "Eddy current losses in<br />
electrical steels subjected to matrix and classical PWM excitation<br />
waveforms," IEEE Transactions on Magnetics, vol. 42, no. 10,<br />
pp. 2818-2820, Oct. 2006.<br />
[35] Ruifang Liu, C. C. Mi, and D. W. Gao, "Modeling <strong>of</strong> eddycurrent<br />
loss <strong>of</strong> electrical machines and transformers operated by<br />
pulse width-modulated inverters," IEEE Transactions on<br />
Magnetics, vol. 44, no. 8, pp. 2021-2028, Aug. 2008.<br />
[36] M. Popescu, D. G. Dorrell, and D. M. Ionel, "A study <strong>of</strong> the<br />
engineering calculations for iron losses in 3-phase AC motor<br />
models," IEEE 33rd Annual Conference <strong>of</strong> the Industrial<br />
Electronics Society. IECON 2007., pp. 169-174, 5-8 Nov. 2007.<br />
[37] K. Yamazaki, "Torque and efficiency calculation <strong>of</strong> an interior<br />
permanent magnet motor considering harmonic iron losses <strong>of</strong> both<br />
the stator and rotor," IEEE Transactions on Magnetics, vol. 39,<br />
no. 3, pp. 1460- 1463, May 2003.<br />
[38] A. Vamvakari, A. Kandianis, A. Kladas, S. Manias, and J.<br />
Tegopoulos, "Analysis <strong>of</strong> supply voltage distortion effects on<br />
induction motor operation," IEEE Transactions on Energy<br />
Conversion, vol. 16, no. 3, pp. 209-213, Sep. 2001.<br />
[39] S. Lim, and K. Nam, "Loss-minimizing control scheme for<br />
induction motors," IEE <strong>Proceedings</strong> -Electric Power<br />
Applications, vol. 151, no. 4, pp. 385- 397, 7 July 2004.<br />
[40] J. Lee, K. Nam, S. Choi, and S. Kwon, "Loss-minimizing control<br />
<strong>of</strong> PMSM with the use <strong>of</strong> polynomial approximations," IEEE<br />
Transactions on Power Electronics, vol. 24, no. 4, pp. 1071-<br />
1082, April 2009.<br />
[41] S. D. Wee, M. H. Shin, and D. S. Hyun, "Stator-flux-oriented<br />
control <strong>of</strong> induction motor considering iron loss," IEEE<br />
Transactions on Industrial Electronics, vol. 48, no. 3, pp. 602-<br />
608, Jun. 2001.<br />
[42] C. Mademlis, I. Kioskeridis, and N. Margaris, "Optimal<br />
efficiency control strategy for interior permanent-magnet<br />
synchronous motor drives," IEEE Transactions on Energy<br />
Conversion, vol. 19, no. 4, pp. 715- 723, Dec. 2004.
- 137 - 15th IGTE Symposium 2012<br />
Fast Computation <strong>of</strong> Inductances and Capacitances <strong>of</strong><br />
High Voltage Power Transformer Windings<br />
*Župan, Tomislav, *Štih, Željko and *Trkulja, Bojan<br />
*Faculty <strong>of</strong> Electrical Engineering and Computing, <strong>University</strong> <strong>of</strong> Zagreb, Unska 3, 10000 Zagreb, Croatia<br />
E-mail: tomislav.zupan@fer.hr<br />
Abstract— Inductances and capacitances in analysis <strong>of</strong> fast transients in high voltage transformers are usually calculated on<br />
the basis <strong>of</strong> simple analytical approximations for applications in lumped circuit models. This paper presents the application<br />
<strong>of</strong> the boundary element method to calculation <strong>of</strong> capacitances and inductances <strong>of</strong> transformer windings with coils <strong>of</strong><br />
rectangular cross section. Two-dimensional axially symmetric mathematical model <strong>of</strong> electric and magnetic field in<br />
transformer, which is based on integral equations, is introduced. The accuracy and applicability <strong>of</strong> the proposed approach is<br />
illustrated by an example.<br />
Index Terms—boundary element method, capacitance calculation, inductance calculation, rectangular cross section coils.<br />
I. INTRODUCTION<br />
Power transformers are one <strong>of</strong> the most important<br />
segments <strong>of</strong> electric power systems. Due to their long<br />
term running requirements they are faced with numerous<br />
overvoltage strikes during their lifetime. Therefore it is <strong>of</strong><br />
great importance to know how the power transformer<br />
windings will react on such excitations.<br />
Since most <strong>of</strong> the overvoltage strikes are characterized<br />
by high frequency impulse signals (due to lightning<br />
discharges, short circuits, etc.), the capacitances between<br />
windings, which are negligible during normal operational<br />
frequency <strong>of</strong> power grid, become significant. Thus it is<br />
meaningful to have, alongside the strict calculation <strong>of</strong><br />
inductances, a method for precise calculation <strong>of</strong><br />
capacitances <strong>of</strong> high voltage power transformer windings.<br />
Distribution <strong>of</strong> voltage along windings <strong>of</strong> transformers<br />
due to fast excitations (switching, lightning, and testing)<br />
is one <strong>of</strong> the most important inputs to insulation design.<br />
Lumped circuit model <strong>of</strong> winding is most frequently<br />
applied in calculation <strong>of</strong> such distribution [1]-[3]. Parts <strong>of</strong><br />
windings are represented by equivalent capacitances,<br />
inductances and resistances. Depending on the way <strong>of</strong> the<br />
connection <strong>of</strong> the conductors, the equivalent diagram is<br />
made for each <strong>of</strong> the turns or, for the sake <strong>of</strong> faster<br />
calculation, turns are grouped.<br />
There are numerous papers dealing with the presented<br />
problem by using the numerical approach based on the<br />
finite element method (FEM) [1], [4]. In this paper we<br />
introduce the application <strong>of</strong> the boundary element method<br />
(BEM) to calculation <strong>of</strong> capacitances and inductances <strong>of</strong><br />
windings <strong>of</strong> transformers with coils <strong>of</strong> rectangular cross<br />
section. Main advantage <strong>of</strong> this approach is ease <strong>of</strong> use,<br />
because the discretization is done only at material<br />
interfaces and boundaries, thus effectively reducing the<br />
order <strong>of</strong> the mathematical model by one dimension.<br />
II. COMPUTATION OF CAPACITANCES<br />
Typical arrangement inside the high voltage power<br />
transformer can be seen in Fig. 1 (simplified). Windings<br />
usually consist <strong>of</strong> low voltage, high voltage and<br />
regulation voltage parts. Conductors are typically <strong>of</strong><br />
rectangular cross section insulated by paper insulation<br />
and immersed in oil, grouped in discs with radial and<br />
axial canals for heat transfer purposes.<br />
z<br />
r<br />
LV<br />
HV RV<br />
core<br />
Fig. 1. Windings in high voltage power transformers<br />
Usual approach to calculation <strong>of</strong> capacitances is based<br />
on simple analytical parallel-plate approximations [2]:<br />
2 2<br />
Rout Rin<br />
CParallel<br />
0<br />
,<br />
(1)<br />
drc<br />
<br />
<br />
<br />
o p<br />
where o and p are relative permittivities <strong>of</strong> oil and<br />
paper insulation, respectively, drc height <strong>of</strong> the radial<br />
canal between conductors in z axis direction, width <strong>of</strong><br />
insulation between conductors, in R and Rout conductor's<br />
inner and outer radius, respectively,<br />
cylindrical approximations [5]:<br />
or analytical<br />
20<br />
phc CSerial<br />
<br />
,<br />
Rav<br />
<br />
ln 2 <br />
R <br />
av <br />
2 <br />
(2)<br />
oil<br />
paper<br />
insulation<br />
conductor
where c h is conductor's height, and Rav average radius<br />
between two conductors.<br />
These analytical methods result in significant errors and<br />
consequently in unsatisfactory accuracy <strong>of</strong> computation<br />
<strong>of</strong> voltage distribution along the winding.<br />
Electromagnetic field in power transformers is<br />
transverse and the capacitances may be computed on the<br />
basis <strong>of</strong> electrostatic analysis. The electric field in a space<br />
composed <strong>of</strong> conducting regions at known potentials and<br />
regions filled with different dielectrics can be solved by a<br />
pair <strong>of</strong> coupled integral equations [6]. The potential (A)<br />
at any point A on the surface <strong>of</strong> conductor is related to the<br />
charge density (B) on total surface <strong>of</strong> all boundaries<br />
(conductor–dielectric and dielectric–dielectric) by the<br />
equation:<br />
A B G B, A dS 0,<br />
(3)<br />
<br />
<br />
S<br />
where:<br />
1<br />
GB, A<br />
,<br />
(4)<br />
4 d AB<br />
and dAB is the distance between points A and B. The<br />
charge density (D) at any point D on the dielectric–<br />
dielectric boundary is related to the charge density (B)<br />
on total surface <strong>of</strong> all boundaries by the equation:<br />
o <br />
<br />
i<br />
D2 BDNB, DndS 0.<br />
<br />
(5)<br />
o i S<br />
Here, n is the unit vector normal to the surface at the<br />
point D, o is the permittivity <strong>of</strong> the region in the<br />
direction <strong>of</strong> the normal unit vector on the surface at the<br />
point D and i is the permittivity <strong>of</strong> the region in the<br />
opposite direction.<br />
Geometry <strong>of</strong> the windings can be approximated with<br />
axially symmetry, and the problem becomes twodimensional.<br />
Therefore, we may integrate surface<br />
integrals in (3) and (5) with respect to circumferential<br />
direction and reduce them to line integrals. Kernels <strong>of</strong><br />
integral equations are:<br />
1 r<br />
Prr ( , ) kKk 2<br />
r<br />
<br />
DN( r, r) DR( r, r) ar DZ( r, r) az<br />
<br />
<br />
<br />
1<br />
DR( r, r)<br />
<br />
4 r<br />
3<br />
r k<br />
<br />
2 r1k KkEk rr <br />
EkKk 2<br />
<br />
k 2r<br />
<br />
(6)<br />
1<br />
DZ( r, r) <br />
4r 3<br />
r zz k<br />
E k r 2 2r 1<br />
k<br />
2<br />
k <br />
4rr<br />
.<br />
2 2<br />
rr zz <br />
Here, k is the modulus <strong>of</strong> the elliptic integrals <strong>of</strong> the<br />
first and the second kind K(k) and E(k). The vector<br />
<br />
r ra r za<br />
z defines the position <strong>of</strong> the source point<br />
<br />
(B), and the vector r rar zaz<br />
defines the position <strong>of</strong><br />
the point <strong>of</strong> interest (A, D).<br />
- 138 - 15th IGTE Symposium 2012<br />
<br />
Terms Prr ( , ) , DR( r, r)<br />
and DZ( r, r)<br />
represent the<br />
kernels for electric potential and radial and axial<br />
components <strong>of</strong> electrical induction vector <strong>of</strong> uniformly<br />
charged ring with negligible cross section, respectively<br />
[7]. Because the conductors inside power transformers<br />
have rectangular cross section, their surface can be<br />
divided, using BEM approach, into either thin cylinders<br />
or thin discs. As can be seen on Fig.2, both <strong>of</strong> the cases,<br />
for the sake <strong>of</strong> generality, can be represented with<br />
truncated cone.<br />
truncated<br />
cone<br />
disc<br />
cylinder<br />
Fig. 2. General representation <strong>of</strong> the conductor segment<br />
division<br />
<br />
The final expressions for Prr ( , b, re)<br />
, after integrating<br />
(6) over l, are:<br />
1 2<br />
l r() t K( k )<br />
Prr ( , b, re) <br />
dt<br />
q<br />
0<br />
2 2<br />
e b e b<br />
<br />
l z z r r<br />
rt () ( re rb) trb zt () ( ze zb) tzb, where r(t) and z(t) represent parametric notation <strong>of</strong> the<br />
general point on segment l, l is the length <strong>of</strong> the segment<br />
and q and k are:<br />
2<br />
<br />
2 2<br />
q r rt () zzt () 2<br />
rrt ()<br />
4 rr( t)<br />
k .<br />
q<br />
<br />
<br />
From the equation E ,0, <br />
r z<br />
<br />
<br />
obtain the kernels <strong>of</strong> electrical induction vector:<br />
1<br />
l<br />
DR( r, rb, re) r( t)<br />
<br />
<br />
0<br />
2 2<br />
2() rtKk ()(1 k) Ek ()2() rt krrt () <br />
<br />
<br />
2 2 3 2<br />
k (1 k ) q<br />
<br />
1<br />
l rt () zzt () Ek<br />
( )<br />
DZ( r, rb, re) <br />
dt.<br />
2 3 2<br />
(1 k ) q<br />
0<br />
z<br />
z<br />
ze<br />
zb<br />
The system <strong>of</strong> integral equations (3) and (5) is solved<br />
by BEM. Boundaries and interfaces between two<br />
dielectrics are divided into finite segments and the<br />
l<br />
rb re r<br />
dt<br />
r<br />
(7)<br />
(8)<br />
we<br />
(9)
unknown distribution <strong>of</strong> surface charge density on the ith<br />
segment is approximated as linear combination <strong>of</strong><br />
predefined basis functions:<br />
N<br />
<br />
( r) <br />
t ( r).<br />
(10)<br />
i in in<br />
i1<br />
The simplest approach is to use basis functions which<br />
are constant (N=1) on the finite segment. The application<br />
<strong>of</strong> (10) to (3) and (5) results in:<br />
N<br />
<br />
( r) P( r, r, r) dC ; r C<br />
<br />
0 i p k i 0<br />
i1 Ci<br />
N<br />
o i<br />
<br />
i( r) 2 i<br />
DN( r, rp, rk) dCi; o <br />
i i1 Ci<br />
<br />
r Ci.<br />
(11)<br />
Here, C0 is boundary at known potential 0 and Ci is<br />
interface between two dielectrics. A linear equation<br />
system for unknown coefficients i is derived by<br />
enforcing an exact solution at midpoints <strong>of</strong> each finite<br />
segment (point-matching [8]). The integrals in (11) are<br />
Ci,<br />
the integrals become singular. In such a case their vicinity<br />
is treated separately and this contribution is calculated<br />
analytically (logarithmic singularities).<br />
After the determination <strong>of</strong> the surface charge<br />
distribution<br />
calculated by:<br />
on conductors, the capacitances are<br />
Qij<br />
Cij ; i j.<br />
<br />
(12)<br />
i j<br />
Here, i and j are potentials <strong>of</strong> i-th and j-th conductor,<br />
respectively, and Qij is total charge on the j-th conductor<br />
influenced by the charge on the i-th conductor:<br />
j<br />
<br />
<br />
Q dS S<br />
ij j j kj kj<br />
S<br />
k 1<br />
j<br />
N<br />
,<br />
- 139 - 15th IGTE Symposium 2012<br />
(13)<br />
where kj is surface charge density on k-th segment <strong>of</strong> jth<br />
conductor, Nj is the number <strong>of</strong> finite segments on j-th<br />
conductor and Skj is the surface <strong>of</strong> the k-th segment <strong>of</strong> jth<br />
conductor.<br />
We calculate the capacitances by setting the potential <strong>of</strong><br />
the i-th conductor to 1V and the potential <strong>of</strong> all other<br />
conductors to zero. Then, we obtain the total charge on<br />
conductors and use (12) to calculate the capacitances.<br />
III. CAPACITANCE CALCULATION -EXAMPLE<br />
The following procedure has been tested on two<br />
examples, first one showing the calculation <strong>of</strong> turn-toturn<br />
capacitances <strong>of</strong> high voltage winding in a power<br />
transformer and the second one showing a more<br />
"macroscopic" approach, where the conductors in one<br />
row <strong>of</strong> high voltage winding are grouped into disc and<br />
then the disc-by-disc capacitances are observed.<br />
The turn-to-turn capacitances were calculated in three<br />
ways:<br />
BEM approach. Total number <strong>of</strong> unknown<br />
coefficients <strong>of</strong> surface charge distribution was<br />
896.<br />
FEM approach using Ans<strong>of</strong>t Maxwell ®<br />
package.<br />
Total number <strong>of</strong> elements was 36180, and total<br />
number <strong>of</strong> nodes was 2103, which results in 0.1%<br />
error in calculation <strong>of</strong> energy.<br />
Analytical approach based on cylindrical<br />
approximation shown in (2) for calculation <strong>of</strong><br />
serial capacitance between radially neighboring<br />
conductors or parallel-plate approximation shown<br />
in (1) for calculation <strong>of</strong> parallel capacitance<br />
between axially neighboring conductors.<br />
Following proposed BEM approach, graphical<br />
depiction <strong>of</strong> one example where the middle conductor's<br />
potential is set to 1V, illustrating the distribution <strong>of</strong> the<br />
surface charge density on the observed conductor and the<br />
influenced surface charge densities on neighboring<br />
conductors can be seen on Fig. 3.<br />
Fig. 3. BEM solution for calculation <strong>of</strong> turn-to-turn<br />
capacitances<br />
The same example was solved using Ans<strong>of</strong>t Maxwell ®<br />
package, as can be seen on Fig. 4.<br />
Fig. 4. FEM solution for calculation <strong>of</strong> turn-to-turn<br />
capacitances (Ans<strong>of</strong>t Maxwell ® )<br />
Comparison <strong>of</strong> the results for turn-to-turn capacitance<br />
calculation <strong>of</strong> various approaches is given in Table 1. CiS<br />
and CiP represent the serial and parallel capacitance<br />
between two innermost conductors, CS and CP between
two middlemost conductors, and CoS and CoP between<br />
two outermost conductors.<br />
TABLE I<br />
TURN-TO-TURN CAPACITANCE RESULTS<br />
turn-to- CiS CiP CS CP CoS CoP<br />
turn [nF] [nF] [nF] [nF] [nF] [nF]<br />
® Maxwell 1.70 0.09 1.81 0.03 1.93 0.10<br />
Analytical 1.56 0.04 1.66 0.04 1.76 0.05<br />
BEM 1.68 0.09 1.77 0.03 1.89 0.10<br />
The disc-to-disc example results solved by BEM can be<br />
seen in Fig. 5 and the same example solved using Ans<strong>of</strong>t<br />
Maxwell ® package can be seen in Fig. 6.<br />
Fig. 5. BEM solution for calculation <strong>of</strong> disc-to-disc<br />
capacitances<br />
Fig. 6. FEM solution for calculation <strong>of</strong> disc-to-disc<br />
capacitances (Ans<strong>of</strong>t Maxwell ® )<br />
Comparison <strong>of</strong> the results for disc-to-disc capacitance<br />
calculation is given in Table 2.<br />
TABLE II<br />
DISC-TO-DISC CAPACITANCE RESULTS<br />
disc-to- Cbottom Cmid1 Cmid2 Ctop<br />
disc [nF] [nF] [nF] [nF]<br />
®<br />
Maxwell 2.83 2.88 2.90 2.83<br />
Analytical 2.36 2.36 2.36 2.36<br />
BEM 2.81 2.73 2.79 2.75<br />
The cumulative results show significant errors in<br />
analytical approach and justify the necessity <strong>of</strong><br />
application <strong>of</strong> numerical approaches. Even in the case <strong>of</strong><br />
very coarse discretization <strong>of</strong> the BEM approach, the<br />
- 140 - 15th IGTE Symposium 2012<br />
results differ by only a few percents from the results<br />
obtained by FEM.<br />
IV. COMPUTATION OF INDUCTANCES<br />
Using the analogy introduced in capacitance<br />
computation, inductances can be computed on the basis<br />
<strong>of</strong> magnetostatic analysis. The magnetic field in space<br />
composed <strong>of</strong> conducting regions with known currents and<br />
regions filled with different magnetic materials can be<br />
solved by a pair <strong>of</strong> coupled integral equations. Due to the<br />
linearity <strong>of</strong> the computation, by imposing the constant<br />
magnetic permeability, the magnetic vector potential and<br />
magnetic field strength can be written as:<br />
<br />
Ar ( ) AM( r) AS( r)<br />
(14)<br />
Hr ( ) HM( r) HS(<br />
r),<br />
where A <br />
is total magnetic vector potential, AM <br />
is<br />
magnetic vector potential caused by surface<br />
magnetization current density KM <br />
and S A is magnetic<br />
vector potential caused by imposed current density S J .<br />
The same subscripts and definitions are valid for<br />
magnetic field strength.<br />
<br />
Magnetic vector potential AA ( ) at any point A on the<br />
surface that restricts the model is related to the surface<br />
<br />
magnetization current density K( B)<br />
on total surface <strong>of</strong><br />
all boundaries by the equation:<br />
<br />
<br />
AA ( ) KBGBAdS ( ) ( , ) A(<br />
A),<br />
(15)<br />
<br />
S<br />
where GBA ( , ) is written in equation (4). The surface<br />
<br />
magnetization current density K( D)<br />
at any point D on<br />
the boundary <strong>of</strong> two different magnetic materials is<br />
related to the surface magnetization current density<br />
<br />
K( B)<br />
on total surface <strong>of</strong> all the boundaries by the<br />
equation:<br />
o <br />
i<br />
K( D) 2 K( B) HT( B, D) d S<br />
o i<br />
S<br />
(16)<br />
o <br />
i <br />
2 HS( D) n.<br />
<br />
o i<br />
Here, n is the unit vector normal to the surface at the<br />
point D, o is the permeability <strong>of</strong> the region in the<br />
direction <strong>of</strong> the normal unit vector on the surface at the<br />
point D and i is the permeability <strong>of</strong> the region in the<br />
opposite direction.<br />
Assuming the same simplification as in the capacitance<br />
calculation, geometry <strong>of</strong> the windings can be<br />
approximated with axially symmetry, and the problem<br />
becomes two-dimensional so we may integrate surface<br />
integrals in (15) and (16) with respect to circumferential<br />
direction and reduce them to line integrals. Kernels <strong>of</strong><br />
integral equations are:<br />
2<br />
1 r 1 k <br />
<br />
Grr ( , ) 1 Kk<br />
( ) Ek<br />
( ) <br />
rk <br />
2 <br />
<br />
<br />
(17)<br />
<br />
HT ( r, r) HR( r, r) a HZ( r, r) a n,<br />
<br />
S<br />
r z
where Grr ( , ) is the kernel for magnetic vector potential<br />
<br />
<br />
and HR( r, r)<br />
and HZ( r, r)<br />
are kernels for radial and<br />
axial components <strong>of</strong> magnetic field strength:<br />
k zz HR( r, r) K(<br />
k)<br />
<br />
4<br />
rr<br />
r<br />
2 2<br />
2<br />
r r zz <br />
<br />
Ek ( ) <br />
2 2<br />
rr zz <br />
<br />
k<br />
HZ( r, r) K<br />
( k)<br />
<br />
(18)<br />
4<br />
rr<br />
2 2<br />
2<br />
r r zz <br />
<br />
Ek ( ) <br />
2 2<br />
rr zz <br />
<br />
2 4rr<br />
k <br />
.<br />
2 2<br />
rr zz Here, k is the modulus <strong>of</strong> the elliptic integrals <strong>of</strong> the<br />
first and the second kind K(k) and E(k). The vector<br />
<br />
r ra r za<br />
z defines the position <strong>of</strong> the source point<br />
<br />
(B), and the vector r rar zaz<br />
defines the position <strong>of</strong><br />
the point <strong>of</strong> interest (A, D).<br />
The system <strong>of</strong> integral equations (15) and (16) is solved<br />
by BEM using the same technique mentioned in<br />
capacitance calculation section above, dividing the<br />
interfaces between different magnetic materials into finite<br />
segments and approximating the unknown distribution <strong>of</strong><br />
surface magnetization current density with linear<br />
combination <strong>of</strong> predefined basis functions:<br />
N<br />
<br />
Ki( r) Kint in(<br />
r).<br />
(19)<br />
i1<br />
Again, using the simplest adequate approach, the basis<br />
functions are constant on the finite segment (N=1). The<br />
application <strong>of</strong> (19) to (15) and (16) results in:<br />
N<br />
<br />
Ar ( ) K Gr ( , r) dC A( r); rC <br />
K r<br />
<br />
0 i i S<br />
0<br />
i1 Ci<br />
<br />
<br />
<br />
<br />
<br />
HrS ( r) arHzS ( r) azn; r Ci<br />
N<br />
i( ) o i<br />
<br />
<br />
Ki HT( r, r ) dCi<br />
2 o <br />
i i1 Ci<br />
- 141 - 15th IGTE Symposium 2012<br />
(20)<br />
Here, C0 is boundary at known magnetic vector<br />
potential and Ci is interface between two magnetic<br />
materials. Using the point-matching technique, a linear<br />
equation system for unknown coefficients Ki is derived.<br />
The example <strong>of</strong> the distribution <strong>of</strong> surface magnetization<br />
current density on the core <strong>of</strong> the transformer is shown in<br />
Fig. 7.<br />
As can be seen through inspecting equations (15) and<br />
(16), it is still necessary to determine the magnetic vector<br />
<br />
potential contribution <strong>of</strong> imposed current density AS( r)<br />
and their radial and axial components <strong>of</strong> magnetic field<br />
strength HrS ( r) and HzS ( r) .<br />
The calculation for magnetic vector potential and<br />
magnetic field strength <strong>of</strong> circular conductor <strong>of</strong><br />
rectangular cross section have been done in [9] and are<br />
presented here for the completeness <strong>of</strong> proposed method.<br />
Fig. 7. Distribution <strong>of</strong> surface magnetization current density on<br />
transformer core boundaries<br />
Using Fig. 8. for clarity, the equations are:<br />
T1( R1, R2, r, zZ1) T1( R1, R2, r, Z2 z)<br />
<br />
;<br />
z Z1<br />
<br />
TA(<br />
R1, R2, r) T1( R1, R2, r, zZ1) AS<br />
<br />
T1(<br />
R1, R2, r, Z2 z); Z1 z Z2<br />
<br />
<br />
T1( R1, R2, r, Z2 z) T1( R1, R2, r, zZ1) <br />
;<br />
z Z2<br />
(21)<br />
T2( R1, R2, r, zZ1) T2( R1, R2, r, Z2 z)<br />
<br />
;<br />
z Z1<br />
<br />
TB(<br />
R1, R2, r) T2( R1, R2, r, zZ1) H zS <br />
T2(<br />
R1, R2, r, Z2 z); Z1 z Z2<br />
<br />
<br />
T2( R1, R2, r, Z2 z) T2( R1, R2, r, zZ1) <br />
;<br />
z Z2<br />
(22)<br />
H T ( R , R , r, Z z) T ( R , R , r, zZ ).<br />
rS<br />
3 1 2 2 3 1 2 1<br />
R1 R2<br />
r<br />
Fig. 8. Circular conductor with rectangular cross section<br />
The subfunctions TA, TB, T1, T2 and T3 are:<br />
<br />
0Ira<br />
R2 r T1( R1, R2, r, a)<br />
ln<br />
<br />
4 <br />
<br />
<br />
R1r 2 2 <br />
R2 r a<br />
<br />
2 2 <br />
<br />
R1r a<br />
<br />
<br />
3 <br />
2<br />
0IrR2 sin d 2 X( R , r, a, ) aX( R , r, a,<br />
)<br />
<br />
0<br />
z<br />
Z2<br />
Z1<br />
<br />
2 2<br />
I
3 <br />
2<br />
0 1<br />
sin<br />
IrR d <br />
2 <br />
<br />
X( R , r, a, ) a X( R , r, a,<br />
)<br />
<br />
<br />
0<br />
1 1<br />
<br />
0Ia<br />
co s X( R2, r, a, ) 2<br />
<br />
0<br />
X( R1, r, a, ) d<br />
<br />
0Ir rsin arctan<br />
2 <br />
a<br />
0<br />
2<br />
<br />
R2 <br />
<br />
X( R2, r, a, ) 2<br />
R <br />
1<br />
<br />
X( R1, r, a,<br />
)<br />
<br />
<br />
2 <br />
0Ir<br />
a sincossin <br />
<br />
<br />
<br />
cossind <br />
<br />
4 R rcos <br />
X( R , r,<br />
a,<br />
)<br />
0<br />
2 2<br />
2<br />
2 <br />
0Ir<br />
a<br />
1<br />
R <br />
1 d X( R2, r, a, ) 4 <br />
R <br />
1<br />
<br />
X( R<br />
0 1,<br />
r, a,<br />
)<br />
<br />
sin cos sin<br />
,<br />
cos ( , , , ) d<br />
<br />
<br />
<br />
R r X R r a <br />
(23)<br />
1 1<br />
I Ia<br />
T2( R1, R2, r, a) R2 R1 <br />
2 2<br />
<br />
<br />
<br />
<br />
<br />
2<br />
ln<br />
R r <br />
<br />
<br />
R1r 2 2<br />
R2 r a<br />
<br />
2 2 <br />
R1r a<br />
<br />
<br />
Iar<br />
2<br />
<br />
sin <br />
<br />
1 R<br />
0<br />
2 rco s X( R2, r, a, ) <br />
R2<br />
<br />
<br />
X( R2, r, a,<br />
)<br />
<br />
<br />
d<br />
<br />
<br />
sin R1<br />
<br />
1<br />
d<br />
<br />
<br />
<br />
<br />
R<br />
0<br />
1rco s X( R1, r, a, ) X( R1, r, a,<br />
)<br />
<br />
<br />
<br />
2<br />
Ir rsin R2<br />
arctan<br />
2<br />
<br />
a<br />
<br />
<br />
<br />
X(<br />
R<br />
0<br />
2,<br />
r, a,<br />
)<br />
2<br />
R <br />
1<br />
sin d, X( R , r, a,<br />
)<br />
<br />
(24)<br />
1<br />
2 2 2<br />
X( R, r, a, ) R r a 2Rrcos (25)<br />
2 2<br />
Ir R2 r R2 r a<br />
T3( R1, R2, r, a)<br />
ln<br />
<br />
4 2 2 <br />
R1 r R1 r a <br />
<br />
<br />
<br />
<br />
2<br />
I Ir<br />
co s X( R2, r, a, ) X( R1, r, a, ) d<br />
2 <br />
4<br />
0<br />
<br />
sincossin R2<br />
<br />
1 d<br />
R<br />
0<br />
2 rco s X( R2, r, a, ) X( R2, r, a,<br />
)<br />
<br />
<br />
<br />
<br />
<br />
0<br />
<br />
<br />
<br />
<br />
sincos sin<br />
R1<br />
<br />
1d R1rcos X( R1, r, a,<br />
)<br />
X( R1, r, a,<br />
)<br />
<br />
<br />
<br />
<br />
I R2 R1 ; r R1<br />
<br />
T ( R , R , r) I<br />
R r ; R r R<br />
B<br />
1 2 2 1 2<br />
<br />
<br />
0; r R2.<br />
(26)<br />
- 142 - 15th IGTE Symposium 2012<br />
<br />
<br />
0Ir<br />
R2 R1; r R<br />
<br />
1<br />
2<br />
<br />
3 3<br />
0Ir<br />
r R <br />
1<br />
TA( R1, R2, r) R2 r ; R<br />
2 1 r R2<br />
2 <br />
<br />
3r<br />
<br />
<br />
3 3<br />
0I<br />
R2 R <br />
1<br />
<br />
; r R2<br />
2 <br />
<br />
3r<br />
<br />
Finally, the inductance calculation can be done using<br />
the equation for the stored magnetic energy:<br />
1 2 1 <br />
LI <br />
2 2<br />
JS AdV V<br />
(27)<br />
1 <br />
L J ( ) ( ) .<br />
2 S AM r AS r dV<br />
I <br />
V<br />
As can be seen from equation (27), the inductance <strong>of</strong> ith<br />
conductor can be separated into two parts:<br />
Li LiM LiS,<br />
(28)<br />
where LiM is the contribution <strong>of</strong> magnetizing currents<br />
and LiS is the contribution <strong>of</strong> imposed currents<br />
(inductance <strong>of</strong> conductor in free space).<br />
The self and mutual inductance calculations <strong>of</strong> circular<br />
conductors with rectangular cross section have been done<br />
in a couple <strong>of</strong> papers [10]-[13]. Technically, the<br />
equations for L and M are the same with the difference in<br />
the limits <strong>of</strong> integration. The calculation <strong>of</strong> the selfinductance<br />
can be observed as the special case <strong>of</strong> the<br />
mutual-inductance equation.<br />
With the assumption <strong>of</strong> uniform distribution <strong>of</strong> current<br />
on conductor's cross section, the total energy stored in the<br />
magnetic field <strong>of</strong> the conductor is:<br />
2 2<br />
<br />
Z2 Z4 R2 R4<br />
0JJ<br />
1 2<br />
W cos<br />
r<br />
2<br />
<br />
<br />
0zZ1 ZZ3 rR1 RR3 RdRdrdZdzd<br />
<br />
r R 2Rrcos zZ 2<br />
.<br />
(29)<br />
Using the expressions:<br />
1<br />
I<br />
W MI1I2; J (30)<br />
2<br />
S<br />
the equation for the mutual inductance <strong>of</strong> circular<br />
conductor with rectangular cross section is:<br />
0<br />
M <br />
Q,<br />
(31)<br />
( R2 R1)( Z2 Z1)( R4 R3)( Z4 Z3)<br />
where Q represents the above written quintuple integral<br />
in (29).<br />
Using the equivalences:<br />
Z3 Z1; Z4 Z2; R3 R1; R4 R2; I1 I2,<br />
(32)<br />
after analytically solving the four integrals for r, R, z and<br />
Z, according to [13], the final expression for the selfinductance<br />
<strong>of</strong> circular conductor <strong>of</strong> rectangular cross<br />
section in free space is:<br />
2 <br />
0N<br />
L (<br />
2 2<br />
2, 2, , ) ( 1, 1,<br />
, )<br />
2 1 <br />
Q R R H Q R R H <br />
R R H 0<br />
QR ( , R, H, ) QR<br />
( , R, H, ) d<br />
1 2 2 1
4<br />
<br />
2 <br />
h cos QrRh (, , , )<br />
<br />
<br />
30sin <br />
2 hcos<br />
bh arctan<br />
sin<br />
2 2 2<br />
2 2<br />
h cosrRsin3h r R cos<br />
<br />
2<br />
hsin bh 20<br />
<br />
4 2<br />
Rhsincos<br />
hrRcos <br />
arctan <br />
2 2<br />
Rsin bh <br />
<br />
4 2<br />
rhsincos<br />
hRrcos <br />
arctan <br />
2 2<br />
rsin bh <br />
<br />
2<br />
bh <br />
cos<br />
2 4 4 2 2<br />
3cos<br />
R r Rrcosr R <br />
15<br />
2<br />
2 2 <br />
2r R <br />
<br />
2<br />
bh <br />
<br />
bhcos ln <br />
<br />
<br />
2<br />
bh h<br />
<br />
2<br />
bh h<br />
<br />
2<br />
2 2 2 4 4<br />
r R 2cos R r<br />
<br />
8<br />
<br />
5 2 2<br />
R sin cos ln rRcos <br />
5<br />
b<br />
5 2 2<br />
r sin cos ln Rrcos b<br />
5<br />
3 2 2 2 2<br />
R cos 5h 3R sin <br />
ln rRcos <br />
15<br />
(33)<br />
2<br />
bh <br />
3 2 2 2 2<br />
r cos 5h 3r sin <br />
ln Rrcos <br />
15<br />
<br />
2<br />
bh ,<br />
<br />
<br />
2 2<br />
where N is the number <strong>of</strong> turns, b r R 2rRcos, and H Z2 Z1.<br />
The integral over in equation (33)<br />
cannot be written in closed form so it has to be solved<br />
numerically, solving the singularities in 0 and<br />
by using the l'Hôpital's rule.<br />
The above presented method for determining the<br />
inductance matrix <strong>of</strong> the power transformer windings has<br />
been tested for the various conductor positions and<br />
different magnetic permeabilities <strong>of</strong> the transformer core.<br />
Comparison showed that the difference between the<br />
pr<strong>of</strong>essional FEM tools (Ans<strong>of</strong>t Maxwell ®<br />
s<strong>of</strong>tware<br />
package) is way beyond 1%, which proves the accuracy<br />
and usefulness <strong>of</strong> the presented method.<br />
V. CONCLUSION<br />
In this paper we present the method for fast and precise<br />
computation <strong>of</strong> capacitances and inductances <strong>of</strong> high<br />
power transformer windings with coils <strong>of</strong> rectangular<br />
cross section based on the boundary element method.<br />
Geometry <strong>of</strong> the windings is axially symmetric, and the<br />
model may be reduced to two-dimensional axially<br />
symmetric problem. Capacitances are computed from<br />
static electric field solution. Surface charge distribution is<br />
determined by BEM solution <strong>of</strong> a pair <strong>of</strong> coupled integral<br />
equations for static electric fields. Inductances are<br />
computed from static magnetic field solution. Surface<br />
magnetization current distribution is determined by BEM<br />
- 143 - 15th IGTE Symposium 2012<br />
solution <strong>of</strong> a pair <strong>of</strong> coupled integral equations for static<br />
magnetic fields.<br />
Boundaries and interfaces are divided into line and arc<br />
finite segments, and the unknown distribution <strong>of</strong> surface<br />
charge or current density is approximated by piecewise<br />
constant functions. System <strong>of</strong> equations for unknown<br />
coefficients <strong>of</strong> distribution is obtained by “pointmatching”.<br />
The testing shows that even very coarse discretization<br />
results in satisfactory accuracy <strong>of</strong> the computation and<br />
therefore proves the applicability <strong>of</strong> the presented<br />
method.<br />
REFERENCES<br />
[1] E. Bjerkan and H. K. Høidalen, "High frequency FEM-based<br />
power transformer modeling: investigation <strong>of</strong> internal stresses due<br />
to network-initiated overvoltages", International Conference on<br />
Power Systems Transients (IPST'05), Montreal, Canada, June<br />
2005.<br />
[2] Y. Shibuya, T. Matsumoto and T. Teranishi, "Modelling and<br />
analysis <strong>of</strong> transformer winding at high frequencies", International<br />
Conference on Power Systems Transients (IPST'05), Montreal,<br />
Canada, June 2005.<br />
[3] K. Pedersen, M. E. Lunow, J. Holboell and M. Henriksen,<br />
"Detailed high frequency models <strong>of</strong> various winding types in<br />
power transformers", International Conference on Power Systems<br />
Transients (IPST'05), Montreal, Canada, June 2005.<br />
[4] G. Liang, H. Sun, X. Zhang, X. Cui, “Modeling <strong>of</strong> Transformer<br />
Windings Under Very Fast Transient Overvoltages” IEEE<br />
Transactions on Electromagnetic Compatibility, Vol. 48, No 4,<br />
November 2006.<br />
[5] M. Popov, L. van der Sluis, R. P. P. Smeets and J. L. Roldan,<br />
"Analysis <strong>of</strong> very fast transients in layer-type transformer<br />
windings", IEEE Transactions on Power Delivery, Vol. 22, No. 1,<br />
pp. 238-247, January 2007.<br />
[6] Ž. Štih, “High Voltage Insulating System Design by Application<br />
<strong>of</strong> Electrode and Insulator Contour Optimization”, IEEE<br />
Transactions on Electrical Insulation, Vol. EI-21, No.4, August<br />
1986.<br />
[7] P. Zhu, "Field distribution <strong>of</strong> a uniformly charged circular arc",<br />
Journal <strong>of</strong> Electrostatics, Vol. 63, pp. 1035-1047, March 2005.<br />
[8] Z. Haznadar, Ž. Štih, "Electromagnetic Fields, Waves and<br />
Numerical Methods", IOS Press, Amsterdam 2000.<br />
[9] J. T. Conway, "Trigonometric integrals for the magnetic field <strong>of</strong><br />
the coil <strong>of</strong> rectangular cross section", IEEE Transactions on<br />
Magnetics, Vol. 42, No. 5, pp. 1538-1548, May 2006.<br />
[10] S. I. Babic and C. Akyel, "New analytic-numerical solutions for<br />
the mutual inductance <strong>of</strong> two coaxial circular coils with<br />
rectangular cross section in air", IEEE Transactions on Magnetics,<br />
Vol. 42, No. 6, pp. 1661-1669, June 2006.<br />
[11] J. T. Conway, "Inductance calculations for circular coils <strong>of</strong><br />
rectangular cross section and parallel axes using Bessel and Struve<br />
functions", IEEE Transactions on Magnetics, Vol. 46, No. 1, pp.<br />
75-81, January 2010.<br />
[12] D. Yu, K. S. Han, "Self-Inductance <strong>of</strong> Air-Core Circular Coils<br />
with Rectangular Cross Section", IEEE Transactions on<br />
Magnetics, Vol. 23, No. 6, pp. 3916-3921, November 1987.<br />
[13] I. Doležel, "Self-inductance <strong>of</strong> an air cylindrical coil", Acta<br />
, Vol. 34, No. 4, pp. 443-473, 1989.
- 144 - 15th IGTE Symposium 2012<br />
Numerical and Experimental Investigations <strong>of</strong><br />
the Structural Characteristics <strong>of</strong> Stator Core<br />
Stacks<br />
Mathias Mair ∗ , Bernhard Weilharter †‡ , Siegfried Rainer § , Katrin Ellermann ∗ and Oszkár Bíró †§<br />
∗ Institute for Mechanics, <strong>University</strong> <strong>of</strong> <strong>Technology</strong> <strong>Graz</strong>, Austria<br />
† Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical<br />
Machines, Austria<br />
‡ Institute for Electric Drives and Machines, <strong>University</strong> <strong>of</strong> <strong>Technology</strong> <strong>Graz</strong>, Austria<br />
§ Institute for Fundamentals and Theory in Electrical Engineering, <strong>University</strong> <strong>of</strong> <strong>Technology</strong> <strong>Graz</strong>, Austria<br />
E-mail: mair@tugraz.at<br />
Abstract—The response characteristics <strong>of</strong> two stator core stacks are investigated by experimental modal analysis.<br />
Furthermore, the modal parameters like the eigenfrequencies and eigenvectors are calculated from a numerical modal<br />
analysis. Afterwards, their frequency response functions are computed and compared with the measured frequency response<br />
functions. In order to achieve a set <strong>of</strong> material parameters, the computed response characteristics are adjusted to match<br />
the measured response characteristics.<br />
Index Terms—experimental modal analysis, homogeneous material model, numerical modal analysis, stator core stack<br />
I. INTRODUCTION<br />
The development <strong>of</strong> electrical machines requires an accurate<br />
dynamical analysis in order to reduce side effects<br />
<strong>of</strong> vibration, e.g. noise and material damage. Since the<br />
electrical machine consists <strong>of</strong> many complex and heterogeneous<br />
parts, like the stator core stack, the mechanical<br />
modeling <strong>of</strong> an electrical machine is a complicated task.<br />
Especially for the noise computation <strong>of</strong> electrical machines,<br />
the structural behavior <strong>of</strong> the stator core stack is<br />
<strong>of</strong> interest. It is mainly influenced by forces caused by<br />
the magnetic field in the air gap acting on the stator teeth<br />
[1]–[4]. An analytical method proposed by [5] allows<br />
for the computation <strong>of</strong> the stator vibration with a twodimensional<br />
ring model. However, the disadvantage <strong>of</strong><br />
this method is that it is not possible to consider the<br />
response characteristics along the axial direction. Other<br />
approaches, considering the stator core stack as a thin<br />
cylinder, have been investigated in [6]–[11].<br />
With numerical methods, e.g. the finite element method<br />
(FEM), the three dimensional behavior <strong>of</strong> the stator can<br />
be determined [12], [13]. The main problem is to set up<br />
an appropriate material model for the numerical analysis,<br />
which considers the heterogeneous composition <strong>of</strong> the<br />
stator core stack consisting <strong>of</strong> laminated sheets coated<br />
with resin [14].<br />
Experimental investigations <strong>of</strong> models consisting <strong>of</strong><br />
laminated iron sheets and a comparison with numerical<br />
simulations using three-dimensional homogeneous models<br />
has been presented in [15]. Another investigation <strong>of</strong> a<br />
stator core stack has been done by [16] in order to acquire<br />
isotropic material parameters. With this approach, it is<br />
possible to calculate modes with pure radial displacement<br />
adequately. A similar investigation has been presented by<br />
[17] with the difference that the used FEM-model has<br />
been computed with a material model <strong>of</strong> transversally<br />
isotropic elasticity. However, the measurement points<br />
have not been uniformly distributed on the outer ring<br />
surface. As a consequence, the measurement result <strong>of</strong> the<br />
response characteristics <strong>of</strong> the stator core stack is limited.<br />
In this paper, the three dimensional structural vibration<br />
behavior <strong>of</strong> stator core stacks is investigated by using<br />
the finite element method in conjunction with a modal<br />
analysis. The influence <strong>of</strong> the lamination along the axial<br />
direction will be considered by using a homogeneous material<br />
model with transversally isotropic elasticity. For the<br />
identification <strong>of</strong> the corresponding material parameters,<br />
two finite element models <strong>of</strong> stator core stacks have been<br />
set up, one with teeth and one without teeth. A modal<br />
analysis is carried out to determine the eigenfrequencies<br />
and eigenforms (modes) <strong>of</strong> the finite element models.<br />
Thereafter the frequency response characteristics <strong>of</strong> the<br />
two structures are computed with a reduced order model<br />
by applying a modal reduction.<br />
Acceleration measurements for which the structures<br />
have been excited with an electrodynamic shaker in a<br />
frequency range <strong>of</strong> 0 − 6000 Hz have been performed<br />
at 60 points on the stator core stacks. An experimental<br />
modal analysis then provides the measured response<br />
characteristics and eigenfrequencies and eigenforms [18].<br />
Finally, the results <strong>of</strong> the numerical investigation are<br />
compared with the results from the experimental modal<br />
analysis. The material parameters are adjusted step by<br />
step until the response characteristics <strong>of</strong> the numerical<br />
model approximate the measured one sufficiently. This<br />
way a set <strong>of</strong> material parameters for homogeneous material<br />
models for the stator core stacks is obtained, which<br />
describes the structural behavior adequately.
II. EXPERIMENTAL MODAL ANALYSIS (EMA)<br />
An experimental modal analysis allows for the determination<br />
<strong>of</strong> the response characteristics <strong>of</strong> the stator core<br />
stacks excited by a shaker. For this, acceleration sensors<br />
are recording the vibration at distinct measurement points<br />
on the stator core stacks. Then, the characteristic response<br />
behavior can be derived, transforming the resulting time<br />
signals into the frequency domain.<br />
A. Test stand for experimental modal analysis<br />
In order to measure the vibration on the stator core<br />
stacks, a test stand as shown in Fig. 1 has been built.<br />
Ropes composed <strong>of</strong> an elastic material are connected<br />
to a portal frame and suspend the stator core stack,<br />
additionally the table is mounted on air bearings. This<br />
reduces the influence <strong>of</strong> the adjacent structure to a minimum.<br />
In order to excite the structure, an electromagnetic<br />
shaker is mounted on the table. The connection between<br />
structure and shaker is realized by a push rod and a<br />
force sensor affixed to the test structure with a twocomponent<br />
adhesive. This way, the measurement setup<br />
can be built up and disassembled easily without machine<br />
tools. For the measurement, the shaker is controlled by<br />
a measurement system which also records the signals <strong>of</strong><br />
the acceleration sensors.<br />
Fig. 1: Test stand<br />
B. Measurements procedure<br />
The measurement points are located on the outer<br />
surface <strong>of</strong> the structure at sixty defined positions, see<br />
Fig. 2. The excitation is applied to point no. 1 for all<br />
measurements. At this point, the structure is excited by<br />
the shaker with a so-called periodic chirp signal in a<br />
frequency range from 2 Hz to 6400 Hz. The applied signal<br />
is repeated ten times with the entire frequency range<br />
passed through in each sequence within a short time<br />
period <strong>of</strong> 2.5 s.<br />
In the course <strong>of</strong> the measurement procedure, the frequency<br />
response function (FRF) in reference to the excitation<br />
point is determined for each measurement point.<br />
Finally, the arithmetically averaged acceleration and force<br />
signals <strong>of</strong> these ten measurement runs are used to derive<br />
the FRFs corresponding each <strong>of</strong> the measurement points.<br />
- 145 - 15th IGTE Symposium 2012<br />
55<br />
43<br />
31<br />
19<br />
7<br />
58<br />
46<br />
34<br />
22<br />
10<br />
49<br />
37<br />
25<br />
13<br />
Fig. 2: Defined measurement points<br />
C. Identification <strong>of</strong> modal parameters<br />
After all FRFs are measured, the next step is to<br />
identify the modal parameters. This is done by the so<br />
called PolyMAX frequency-domain method [19], which<br />
is a curve fitting technique. In a first step a least-squares<br />
method fits the polynomials<br />
p<br />
<br />
p<br />
−1 V0(Ω) =<br />
(1)<br />
z<br />
i=0<br />
i βi z<br />
i=0<br />
i αi<br />
to the measured FRFs. Thereby, Ω denotes the excitation<br />
frequency and V0(Ω) is called the frequency response<br />
matrix. βi are the coefficient numerator matrices, αi are<br />
the coefficient denominator matrices, zi are the complex<br />
basis vectors in the discrete frequency domain and p is<br />
the order <strong>of</strong> the polynomials.<br />
With the known polynomial functions, the eigenvalues<br />
λi and the modal participation vectors li can be calculated.<br />
To determine the eigenvectors ri as well as the<br />
lower and upper residual matrices LR and UR, a further<br />
least square method, based on the pole residual model<br />
q<br />
<br />
ril<br />
V(Ω) =<br />
i=0<br />
T i<br />
λi − jΩ + rilH <br />
i<br />
+<br />
λi − jΩ<br />
LR<br />
+UR , (2)<br />
Ω2 is applied. The dashed symbols mark the conjugate<br />
complex variables. The determined frequency response<br />
matrix V(Ω)<br />
⎡<br />
⎤<br />
f11 f12 ··· f1m<br />
⎢<br />
.<br />
⎢<br />
.<br />
⎥<br />
f21 f22<br />
V(Ω) = ⎢<br />
. ⎥<br />
⎢ .<br />
⎣<br />
. ⎥<br />
(3)<br />
.<br />
.. ⎦<br />
fn1 ··· fnm<br />
is filled with the frequency response functions fkl between<br />
the excitation point l and the measuring point k.<br />
The size <strong>of</strong> V(Ω) is therewith defined by m excitation<br />
points times n measuring points.<br />
The identified modal parameters and frequency response<br />
functions fkl allow a convenient description <strong>of</strong><br />
the measured response characteristics for the later comparison<br />
with numerical results.<br />
D. Measurement results<br />
In order to determine material parameters for a specific<br />
numerical model, the mode-shapes determined with the<br />
1
EMA, corresponding to the estimated eigenvalues, must<br />
be identified. To distinguish the different mode-shapes,<br />
a numbering system is established. The numbering comprises<br />
three numbers and refers to a cylindrical coordinate<br />
system. The first digit describes the number <strong>of</strong> maxima<br />
<strong>of</strong> the mode in radial direction along the azimuthal<br />
coordinate axis, see Fig. 3. The second digit represents<br />
the number <strong>of</strong> zero crossings <strong>of</strong> the mode in radial<br />
direction along the z-axis. The third digit is a counter to<br />
differentiate modes with the same deformation properties<br />
regarding the first two digits.<br />
z<br />
2<br />
y<br />
Fig. 3: Example for a mode (3,2,0)<br />
1) Results <strong>of</strong> the stator core stack without teeth:<br />
The sum <strong>of</strong> all measured as well as the sum <strong>of</strong> all<br />
approximated frequency response functions fkl, the latter<br />
calculated by the identified modal parameters, are depicted<br />
in Fig. 4, with the different mode patterns indicated<br />
by the introduced numbering system.<br />
magnitude [ m N ]<br />
10 -5<br />
10 -6<br />
10 -7<br />
(2,1,0)<br />
(2,0,0)<br />
(2,2,3)<br />
(2,2,2)<br />
(2,2,1)<br />
(2,2,0)<br />
(3,2,2)<br />
(3,2,1)<br />
(3,2,0)<br />
(3,1,0)<br />
(3,0,0)<br />
(3,1,1)<br />
(3,2,3)<br />
500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000<br />
(4,0,0)<br />
(4,0,1)<br />
(3,3,0)<br />
(3,4,0)<br />
(3,3,1)<br />
(3,4,1)<br />
(3,3,2)<br />
(3,4,2) (4,2,0)<br />
(3,3,3) (3,4,3) (4,2,1)<br />
(3,3,4)<br />
(4,2,2)<br />
(3,3,5)<br />
(4,4,0)<br />
(4,4,1)<br />
3<br />
Sum <strong>of</strong> all approximated FRF‘s, using modal parameters<br />
Sum <strong>of</strong> all measured FRF‘s<br />
frequency [Hz]<br />
x<br />
(5,0,0)<br />
(5,4,0)<br />
(5,4,1)<br />
(5,4,2)<br />
(5,4,3)<br />
Fig. 4: Sum <strong>of</strong> all frequency response functions <strong>of</strong> the<br />
stator core stack without teeth<br />
As Fig. 4 shows, with the increase <strong>of</strong> the excitation<br />
frequency the mean <strong>of</strong> the magnitude decreases. This<br />
corresponds to mass dominated dynamical behavior. [20,<br />
p.294]<br />
- 146 - 15th IGTE Symposium 2012<br />
The modes (2,0,0), (2,1,0), (3,0,0), (3,1,0), (4,0,0),<br />
(4,0,1) and (5,0,0) have the most distinctive magnitude in<br />
the investigated frequency domain. Some <strong>of</strong> these modes,<br />
(2,0,0), (3,0,0), (4,0,0) and (5,0,0) could be calculated by<br />
analytical two-dimensional methods, see [7], [5] or [10].<br />
Other modes like (2,2,0) or (3,1,1) with non-uniform<br />
radial displacements along the z-axis can only be treated<br />
by three-dimensional approaches.<br />
Table I summarizes the measurement results for the<br />
stator core stack without teeth and lists modes with their<br />
appropriate eigenfrequencies and damping ratios.<br />
TABLE I: Modes, eigenfrequency and damping ratio <strong>of</strong><br />
stator core stack without teeth<br />
mode num. eigenfrequency damping ratio<br />
1/(2, 0, 0) 769,22 Hz 0,411387 %<br />
2/(2, 1, 0) 795,03 Hz 0,959535 %<br />
3/(2, 2, 0) 1282,95 Hz 1,197700 %<br />
4/(2, 2, 1) 1353,39 Hz 1,086200 %<br />
5/(3, 2, 0) 1728,14 Hz 1,442430 %<br />
6/(3, 1, 0) 2092,07 Hz 0,218612 %<br />
7/(3, 0, 0) 2109,96 Hz 0,093046 %<br />
8/(3, 1, 1) 2192,46 Hz 1,122050 %<br />
9/(3, 2, 0) 2278,98 Hz 0,691529 %<br />
10/(3, 3, 0) 2509,73 Hz 0,897709 %<br />
11/(3, 3, 1) 2569,24 Hz 1,057550 %<br />
12/(3, 3, 2) 2631,76 Hz 1,108720 %<br />
13/(4, 0, 0) 3860,04 Hz 0,043218 %<br />
14/(4, 0, 1) 3871,37 Hz 0,050271 %<br />
15/(3, 4, 0) 3947,18 Hz 0,496143 %<br />
16/(3, 4, 1) 4037,01 Hz 1,110130 %<br />
17/(4, 2, 0) 4414,03 Hz 0,606060 %<br />
18/(4, 2, 1) 4482,12 Hz 0,590154 %<br />
19/(4, 4, 0) 4756,58 Hz 0,348687 %<br />
20/(4, 4, 1) 4773,74 Hz 0,360285 %<br />
21/(5, 4, 0) 4940,08 Hz 0,930774 %<br />
22/(5, 0, 0) 5927,25 Hz 0,127039 %<br />
2) Results <strong>of</strong> the stator core stack with teeth: Similarly<br />
the results given in Fig. 4, the sum <strong>of</strong> all measured and<br />
the sum <strong>of</strong> all approximated frequency response function<br />
fkl for the stator core stack with teeth, are plotted in Fig.<br />
5.<br />
magnitude [ m N ]<br />
10 -5<br />
10 -6<br />
10 -7<br />
1e-8<br />
(2,0,1)<br />
(2,0,0)<br />
(2,1,0)<br />
(2,1,1)<br />
(2,2,1)<br />
(2,2,0)<br />
(3,0,0)<br />
(3,1,0)<br />
(0,0,1)<br />
(0,0,2)<br />
(3,2,0)<br />
(4,0,0)<br />
(4,3,2)<br />
(4,3,1)<br />
(4,3,0)<br />
Sum <strong>of</strong> all approximated FRF‘s, using modal parameters<br />
Sum <strong>of</strong> all measured FRF‘s<br />
(4,0,1)<br />
500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000<br />
frequency [Hz]<br />
(4,1,0)<br />
(4,3,7)<br />
(4,3,6)<br />
(4,3,3)<br />
(4,3,4)<br />
(4,3,5)<br />
(5,1,0)<br />
(5,0,0) (6,1,0)<br />
(6,0,0)<br />
(0,1,0)<br />
(5,2,0)<br />
(5,4,0)<br />
(5,2,1)<br />
(5,4,1)<br />
(5,4,2)<br />
(5,4,3)<br />
Fig. 5: Sum <strong>of</strong> all frequency response functions <strong>of</strong> the<br />
stator core stack with teeth
Comparing the FRFs depicted in Fig. 4 and Fig. 5,<br />
the number <strong>of</strong> distinct modes in Fig. 5 is greater. Due<br />
to the higher mass, the corresponding eigenfrequencies<br />
<strong>of</strong> the stator core stack with teeth are lower than those<br />
<strong>of</strong> the stator core stack without teeth. Furthermore, the<br />
frequency spacing <strong>of</strong> distinct modes, e.g. (3,0,0) and<br />
(3,1,0) increases.<br />
It is impossible to identify modes which have higher<br />
eigenfrequencies than 5400 Hz, because the used measurement<br />
grid is too coarse to detect the appropriate<br />
eigenforms. In this frequency range, one would expect<br />
to see modes comprising seven maxima or more with a<br />
radial displacement along the azimuthal axis. This is not<br />
possible to identify with only twelve measurement points<br />
in the azimuthal direction.<br />
Table II lists measurement results for the stator core<br />
stack with teeth.<br />
TABLE II: Modes, eigenfrequency and damping ratio <strong>of</strong><br />
stator core stack with teeth<br />
mode num. eigenfrequency damping ratio<br />
1/(2, 0, 0) 661.27 Hz 0.054903 %<br />
2/(2, 0, 1) 667.84 Hz 0.055824 %<br />
3/(2, 1, 0) 720.08 Hz 0.337501 %<br />
4/(2, 1, 1) 727.85 Hz 0.332884 %<br />
5/(2, 2, 0) 1365.85 Hz 1.005810 %<br />
6/(2, 2, 1) 1416.91 Hz 0.939553 %<br />
7/(3, 0, 0) 1767.43 Hz 0.047843 %<br />
8/(3, 1, 0) 1851.83 Hz 0.205125 %<br />
9/(3, 2, 0) 2313.17 Hz 0.640636 %<br />
10/(0, 0, 1) 2372.92 Hz 0.654953 %<br />
11/(0, 0, 2) 2376.10 Hz 0.608697 %<br />
12/(4, 3, 0) 2755.43 Hz 0.382039 %<br />
13/(4, 0, 0) 3107.52 Hz 0.097675 %<br />
14/(4, 0, 1) 3116.28 Hz 0.152128 %<br />
15/(4, 1, 0) 3190.75 Hz 0.180018 %<br />
16/(4, 3, 3) 3288.28 Hz 0.743811 %<br />
17/(0, 1, 0) 3955.81 Hz 0.306280 %<br />
18/(5, 2, 0) 4074.92 Hz 0.188232 %<br />
19/(5, 0, 0) 4423.63 Hz 0.046342 %<br />
20/(5, 1, 0) 4484.19 Hz 0.173786 %<br />
21/(5, 4, 0) 4655.18 Hz 0.661100 %<br />
22/(5, 4, 1) 4745.87 Hz 0.237239 %<br />
23/(6, 0, 0) 5314.37 Hz 0.031471 %<br />
24/(6, 1, 0) 5356.55 Hz 0.039125 %<br />
III. NUMERICAL MODAL ANALYSIS<br />
As a means for the numerical simulation <strong>of</strong> the dynamical<br />
behavior <strong>of</strong> the stator core stacks, the finite element<br />
method is used. Therefore, an adequate simulation model<br />
has to be set up.<br />
For the numerical model based on the FEM - model,<br />
some simplifications are made:<br />
• the contacts between the laminations are neglected<br />
• the grain orientation <strong>of</strong> the cold rolled silicon-ironalloy<br />
is neglected<br />
• a linear and homogeneous material model is assumed<br />
• the FEM model is assumed to be linear<br />
- 147 - 15th IGTE Symposium 2012<br />
Using an adequate FEM-model and performing a numerical<br />
modal analysis, the modal parameters (eigenfrequencies,<br />
eigenforms, and damping coefficients) which<br />
can be directly related to the modal parameters from the<br />
measurements are obtained.<br />
A. Material model<br />
By considering the above limitations, a material model<br />
with transversally isotropic elasticity is implemented.<br />
z<br />
Fig. 6: Coordinate system <strong>of</strong> the stator core stack<br />
The used transversally isotropic elasticity <strong>of</strong> the material<br />
model corresponds to the coordinate system, depicted<br />
in Fig. 6. The flexibility matrix S for a material model<br />
with a transversally isotropic elasticity is given by<br />
⎡<br />
⎢<br />
S = ⎢<br />
⎣<br />
1<br />
Ex<br />
νxy<br />
− Ex<br />
1<br />
Ex<br />
νzy<br />
− Ex<br />
νxz<br />
− Ez<br />
νyz<br />
− Ez<br />
1<br />
Ez<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
0<br />
2(1+νxy)<br />
Ex<br />
0<br />
0<br />
0<br />
1<br />
Gxz<br />
0<br />
0<br />
0<br />
1<br />
Gxz<br />
− νyx<br />
Ex<br />
− νzx<br />
Ex<br />
x<br />
y<br />
⎤<br />
⎥ , (4)<br />
⎥<br />
⎦<br />
where E is Young’s modulus, G is the shear modulus and<br />
ν is Poisson’s ratio. The material model and therefore<br />
the flexibility matrix S <strong>of</strong> the transversally isotropic<br />
elasticity has rotationally symmetrical properties. Due to<br />
the symmetric properties <strong>of</strong> S, the following conditions<br />
are valid:<br />
νxy = νyx<br />
νxz<br />
Ez<br />
= νzx<br />
Ex<br />
B. Modal description<br />
Based on the finite element method, the equations <strong>of</strong><br />
motion <strong>of</strong> a structural damped system can be formulated<br />
as the linear system <strong>of</strong> differential equations<br />
(5)<br />
(6)<br />
M ¨ û + D ˙ û + Kû = f . (7)<br />
Thereby, M is the mass matrix, K is the stiffness matrix,<br />
f is the excitation force and D is the proportional
damping matrix defined by the Rayleigh damping model<br />
[12, p.950ff]<br />
D = α M + β K . (8)<br />
as a linear combination <strong>of</strong> the mass matrix M and the<br />
stiffness matrix K with the scalar coefficients α and β.<br />
Solving the eigenvalue problem <strong>of</strong> the undamped<br />
system leads to the eigenvalues λi and eigenvectors<br />
ri. The modal matrix R consisting <strong>of</strong> mass-normalized<br />
eigenvectors defines the transformation<br />
û = Rz (9)<br />
<strong>of</strong> the displacement in the global state space û to the<br />
displacement in the modal state space z.<br />
Using (9) and multiplying (7) with the transformed<br />
modal matrix R from the left, the proportional damped<br />
equations <strong>of</strong> motion are transformed into a noninteracting<br />
system <strong>of</strong> the dimension q<br />
˜M ¨z + ˜ D ˙z + ˜ Kz = ˜ f . (10)<br />
Since, the eigenvectors are mass-normalized, the transformation<br />
<strong>of</strong> the mass matrix M into the modal state<br />
space leads to the identity matrix I<br />
˜M = R T MR= I . (11)<br />
The transformed stiffness matrix becomes<br />
˜K = R T KR= Λ = diag ω 2 i , (12)<br />
where ωi is the undamped angular eigenfrequency <strong>of</strong> the<br />
i−th mode. The damping matrix yields a diagonal matrix<br />
expressed as<br />
˜D = R T DR= diag [2ζiωi] , (13)<br />
Thereby ζi denotes the modal damping ratio <strong>of</strong> the i−th<br />
mode [20, p.63ff].<br />
This approach yields simultaneously a modal reduction<br />
<strong>of</strong> the equation system. This reduction has the advantage<br />
that, without it, the computing effort increases rapidly.<br />
The disadvantage is that the disregarded modes create<br />
an error in the response characteristics. However, the<br />
error resulting from the material model with transversally<br />
isotropic elasticity is expected to be much higher than this<br />
error and thus the latter is neglected.<br />
Assuming a harmonic excitation, the excitation force<br />
can be expressed as<br />
f = ˇ f e jΩt<br />
(14)<br />
in the frequency domain. Here, ˇ f is the amplitude<br />
<strong>of</strong> the excitation force and Ω describes the excitation<br />
frequency. This leads to a harmonic ansatz for the modal<br />
displacement:<br />
z = ˇz e jΩt<br />
(15)<br />
where ˇz denotes the amplitude <strong>of</strong> the displacement in the<br />
modal state space. Hence, (10) becomes<br />
−Ω 2 I ˇz + jΩ ˜ Dˇz + ˜ Kˇz = ˇ f . (16)<br />
- 148 - 15th IGTE Symposium 2012<br />
Finally, the backward transformation in the global state<br />
space leads to<br />
ǔ = R<br />
<br />
−Ω 2 I + jΩ ˜ D + ˜ −1 K<br />
R T ˇ f = ˜V ˇ f . (17)<br />
Therewith, the numerically estimated frequency response<br />
matrix ˜V is<br />
<br />
˜V (Ω) = R −Ω 2 I + jΩ ˜ D + ˜ −1 K R T<br />
(18)<br />
Using (11), (13) and (12), the entries <strong>of</strong> ˜V (Ω) can be<br />
expressed by the frequency response functions ˜ fkl<br />
˜fkl(Ω) =<br />
q<br />
i=1<br />
ir ∗ k ir ∗ l<br />
−Ω 2 + ω 2 i + jΩ 2ζi ωi<br />
, (19)<br />
which can be related directly to the corresponding frequency<br />
response functions fkl(Ω) estimated by the measurement.<br />
Thereby ir∗ k denotes the k-th entry <strong>of</strong> the i-th<br />
mass-normalized eigenvector ri.<br />
IV. INFLUENCE OF MATERIAL PARAMETERS ON<br />
TRANSMISSION BEHAVIOR<br />
In order to estimate the influence <strong>of</strong> each material<br />
parameter, simulations as explained in section III-B are<br />
carried out for the stator model without teeth. The<br />
material parameters, except for the density, are varied<br />
in a distinct range and their influence on the structural<br />
behavior is investigated.<br />
The density is determined by measurements. With a<br />
mass <strong>of</strong> 149.8kg and a volume from 0.019905 m 3 , a<br />
density <strong>of</strong> 7525 kg/m 3 results. This value is used for all<br />
calculations <strong>of</strong> this study.<br />
The material parameters <strong>of</strong> interest for the influence<br />
on the dynamical behavior are the Young’s moduli Ex<br />
and Ez, the shear module Gxz and the Poisson ratios<br />
νxy and νxz. The initial set <strong>of</strong> material parameters is<br />
shown in Table III. Based on this set, each parameter is<br />
TABLE III: Initial dataset <strong>of</strong> material parameters for a<br />
variation <strong>of</strong> each parameter<br />
material parameters values<br />
Ex<br />
190 · 109 N/m 2<br />
Ez<br />
25 · 109 N/m 2<br />
Gxz<br />
10 · 109 N/m 2<br />
νxy<br />
0.3<br />
νxz<br />
0.3<br />
ϱ 7525 kg/m 3<br />
varied to a lower and a higher value. The influence <strong>of</strong><br />
each parameter on the frequency response functions is<br />
shown below. The results <strong>of</strong> these investigations are the<br />
basis on which the adjustment <strong>of</strong> the simulated response<br />
characteristics on the measured response characteristics<br />
is done.
A. Variation <strong>of</strong> the Poisson ration ν<br />
In order to analyse the influence <strong>of</strong> the Poisson ratios<br />
on the response characteristics, the values are varied from<br />
0.2 to 0.4 . The Poisson ratios νxy and νxz are set equal<br />
for the calculations. Fig. 7 shows the sum <strong>of</strong> all frequency<br />
response functions for the varied Poisson ratios.<br />
magnitude [ m N ]<br />
-4<br />
10 v = 0.2<br />
v = 0.3<br />
v = 0.4<br />
10 -5<br />
10 -6<br />
10 -7<br />
500 1000 1500 2000 2500 3000<br />
frequency [Hz]<br />
Fig. 7: Sum <strong>of</strong> all frequency response functions <strong>of</strong> the<br />
stator core stack without teeth for varied Poisson ratio<br />
This comparison evidences that the influence <strong>of</strong> the<br />
Poisson ratios on the response characteristics is insignificantly<br />
small. Therefore, for νxy and νxz a value <strong>of</strong> 0.3<br />
is chosen.<br />
B. Variation <strong>of</strong> the elastic modulus Ex<br />
In Fig. 8, the sum <strong>of</strong> the calculated frequency response<br />
functions for the variation Ex is depicted. It can be ob-<br />
magnitude [ m N ]<br />
10 -4<br />
10 -5<br />
10 -6<br />
10 -7<br />
1<br />
2<br />
4<br />
5<br />
11 11a<br />
500 1000 1500<br />
frequency [Hz]<br />
2000 2500 3000<br />
7<br />
8<br />
9 2<br />
E x = 170· 10 N/m<br />
9 2<br />
E x = 190· 10 N/m<br />
9 2<br />
E x = 210· 10 N/m<br />
Fig. 8: Sum <strong>of</strong> all frequency response functions <strong>of</strong><br />
the stator core stack without teeth for varied Young’s<br />
modulus Ex<br />
served that some eigenfrequencies, for example for mode<br />
4, 5 or 11, are not influenced by the Young’s modulus Ex.<br />
Other modes, such as 7, 8, or 11a, are heavily affected by<br />
it. A small variation <strong>of</strong> the Young’s modulus Ex yields<br />
a large shift <strong>of</strong> distinct eigenfrequencies. If the value <strong>of</strong><br />
Ex decreases, the eigenfrequencies corresponding to the<br />
modes 7, 8, or 11a are declining and vice versa. Also<br />
- 149 - 15th IGTE Symposium 2012<br />
the eigenfrequencies <strong>of</strong> the corresponding modes 1 and<br />
2 depend on the Young’s modulus Ex but not as much<br />
as the previous ones.<br />
C. Variation <strong>of</strong> the elastic modulus Ez<br />
The variation <strong>of</strong> the material parameter Ez, depicted<br />
in Fig. 9, leads to a different dynamical behavior than<br />
the variation <strong>of</strong> Ex. The eigenfrequencies corresponding<br />
to the modes 1, 2 and 7 are not influenced by a variation<br />
magnitude [ m N ]<br />
10 -4<br />
10 -5<br />
10 -6<br />
10 -7<br />
2<br />
1<br />
3<br />
4<br />
500 1000 1500<br />
frequency [Hz]<br />
2000 2500 3000<br />
7<br />
9 2<br />
E z = 20· 10 N/m<br />
9 2<br />
E z = 25· 10 N/m<br />
9 2<br />
E = 30· 10 N/m<br />
Fig. 9: Sum <strong>of</strong> all frequency response functions <strong>of</strong><br />
the stator core stack without teeth for varied Young’s<br />
modulus Ez<br />
<strong>of</strong> the Young’s modulus Ez. When increasing the value<br />
<strong>of</strong> Ez, all other eigenfrequencies are shifted upwards<br />
in the considered frequency range and when decreasing<br />
Ez, these eigenfrequencies are shifted downwards. It is<br />
interesting to note, that the eigenfrequencies which are<br />
independent <strong>of</strong> the Young’s modulus Ez (mode 1, 2, 7),<br />
depend on the Young’s modulus Ex.<br />
D. Variation <strong>of</strong> the shear modulus Gxz<br />
Finally, theresults for the variation <strong>of</strong> the shear modulus<br />
Gxz is shown in Fig. 10. It can be seen that the<br />
magnitude [ m N ]<br />
10 -4<br />
10 -5<br />
10 -6<br />
10 -7<br />
1<br />
2<br />
500 1000 1500<br />
frequency [Hz]<br />
2000 2500 3000<br />
6<br />
9<br />
z<br />
10<br />
9 2<br />
G xz = 8· 10 N/m<br />
9 2<br />
G xz = 10· 10 N/m<br />
9 2<br />
G = 12· 10 N/m<br />
Fig. 10: Sum <strong>of</strong> all frequency response functions <strong>of</strong> the<br />
stator core stack without teeth by varied shear modulus<br />
Gxz<br />
5<br />
xz<br />
11<br />
12
eigenfrequencies corresponding to the modes 1 and 2 are<br />
independent <strong>of</strong> the variation <strong>of</strong> Gxz. Other modes, like<br />
5, 6, 9, 11 or 12, are heavily affected by the variation<br />
<strong>of</strong> the shear modulus. These eigenfrequencies are shifted<br />
downwards by decreasing and upwards by increasing the<br />
value <strong>of</strong> Gxz.<br />
Summing up, the influence <strong>of</strong> the material properties<br />
on the eigenfrequencies is strongly influenced by the<br />
corresponding eigenmode occurring at these frequencies.<br />
Depending on the characteristics (radial, azimuthal or axial<br />
bending) <strong>of</strong> the mode, Ex, Ez and Gxz have different<br />
influences. This background is important to know for the<br />
latter adjustment <strong>of</strong> the response characteristics.<br />
V. ADJUSTMENT OF MATERIAL PARAMETERS<br />
The dynamical behavior <strong>of</strong> the numerical model<br />
strongly depends on the chosen material parameters.<br />
An iterative process optimizes the eigenfrequencies and<br />
eigenvectors based on the known influence <strong>of</strong> the material<br />
parameters as discussed in section IV. Material<br />
parameters can be found by an adequate adjustment <strong>of</strong><br />
the measured and simulated response characteristics.<br />
A. Stator core stack without teeth<br />
First, a dataset <strong>of</strong> material parameters is chosen which<br />
describes the behavior <strong>of</strong> isotropic elasticity. Thereafter,<br />
the material parameters <strong>of</strong> the transversally isotropic<br />
elasticity are identified for the stator core stack without<br />
teeth.<br />
1) Comparison <strong>of</strong> measured and calculated frequency<br />
response functions by using the isotropic dataset: The<br />
results from the numerical simulation using isotropic<br />
material parameters (Table IV) are compared with the<br />
results from the experimental investigation, see Fig. 11.<br />
It can be seen that there is no analogy between the simu-<br />
TABLE IV: Initial dataset <strong>of</strong> material parameters <strong>of</strong><br />
isotropic elasticity<br />
material parameters values<br />
Ex<br />
210 · 109 N/m 2<br />
Ez<br />
210 · 109 N/m 2<br />
Gxz<br />
Ex<br />
2(1+ν)<br />
ν 0.3<br />
ϱ 7525 kg/m 3<br />
lated and measured response characteristics. Only the first<br />
eigenfrequency <strong>of</strong> the measured and simulated FRFs are<br />
similar. Furthermore, in the investigated frequency range<br />
less eigenfrequencies arise for the simulation model. This<br />
comparison shows that a material model with isotropic<br />
elasticity is clearly unsuitable.<br />
2) Adjustment <strong>of</strong> the measured and simulated frequency<br />
response function by using transversal isotropic<br />
elasticity: In a next step, a material model with transversally<br />
isotropic elasticity is used and the needed material<br />
parameters are adjusted. Table V lists these material<br />
parameters used as an initial dataset for the simulation.<br />
- 150 - 15th IGTE Symposium 2012<br />
magnitude [ m N ]<br />
10 -5<br />
10 -6<br />
10 -7<br />
10 -8<br />
Sum <strong>of</strong> simulated FRF‘s in radial direction<br />
Sum <strong>of</strong> measured FRF‘s in radial direction<br />
500 1000 1500 2000 2500 3000<br />
frequency [Hz]<br />
Fig. 11: Comparison <strong>of</strong> the sum <strong>of</strong> all measured and<br />
calculated FRFs with isotropic material model <strong>of</strong> the<br />
stator core stack without teeth<br />
Thereby, the Young’s modulus Ez and the shear modulus<br />
Gxz are chosen considerably lower with 40·10 9 N/m 2 and<br />
15 · 10 9 N/m 2 . This shifts the eigenfrequencies downward<br />
in the considered frequency range as explained in section<br />
IV.<br />
Looking at Fig. 12, it can be seen that in the regarded<br />
frequency range the measured and simulated eigenfrequencies<br />
<strong>of</strong> the modes (2,0,0) and (3,0,0) are close<br />
together.<br />
TABLE V: Dataset <strong>of</strong> material parameters <strong>of</strong> transversal<br />
isotropic elasticity<br />
magnitude [ m N ]<br />
10 -5<br />
10 -6<br />
10 -7<br />
material parameters values<br />
Ex<br />
210 · 109 N/m 2<br />
Ez<br />
40 · 109 N/m 2<br />
Gxz<br />
15 · 109 N/m 2<br />
νxy<br />
0.3<br />
νxz<br />
0.3<br />
ϱ 7525 kg/m 3<br />
mode (2,0,0)<br />
Sum <strong>of</strong> simulated FRF‘s in radial direction<br />
Sum <strong>of</strong> measured FRF‘s in radial direction<br />
mode (3,0,0)<br />
10<br />
500 1000 1500 2000 2500 3000<br />
-8<br />
frequency [Hz]<br />
Fig. 12: Comparison <strong>of</strong> the sum <strong>of</strong> all measured and calculated<br />
FRFs with transversally isotropic material model<br />
<strong>of</strong> the stator core stack without teeth<br />
Now the modes are adjusted by decreasing the elastic
modulus Ex. The value is optimized manually step-bystep<br />
until an adequate match is attained. For the Young’s<br />
modulus Ex, a value could be found which aligns the<br />
measured and simulated modes (2,0,0) and (3,0,0). The<br />
next step in the optimization is to adjust the mode<br />
(3,1,0). Therefore the shear modulus Gxz is reduced.<br />
Finally, with the Young’s modulus Ez, the other modes<br />
between (2,0,0) and (3,0,0) can be influenced. A stepwise<br />
reduction <strong>of</strong> this value yields an adequate correlation <strong>of</strong><br />
the other modes.<br />
The resulting material parameters <strong>of</strong> this optimization<br />
<strong>of</strong> the stator core stack without teeth are listed in Table<br />
VI.<br />
TABLE VI: Resulting dataset <strong>of</strong> material parameters<br />
for the stator core stack without teeth <strong>of</strong> transversally<br />
isotropic elasticity<br />
material parameters values<br />
Ex<br />
191, 8 · 109 N/m 2<br />
Ez<br />
24, 7 · 109 N/m 2<br />
Gxz<br />
11 · 109 N/m 2<br />
νxy<br />
0, 3<br />
νxz<br />
0, 3<br />
ϱ 7525 kg/m 3<br />
Fig. 13 depicts the response characteristics <strong>of</strong> the<br />
simulation results, using optimized material parameters<br />
and the measurement results for the stator core stack<br />
without teeth. A good approximation for the simulated<br />
magnitude [ m N ]<br />
10 -5<br />
10 -6<br />
10 -7<br />
mode (2,0,0)<br />
Mode mode (2,0,0) (2,1,0)<br />
Mode mode (2,2,0)<br />
mode (2,2,1)<br />
mode (2,2,3)<br />
mode (3,1,0)<br />
mode (3,2,1)<br />
Sum <strong>of</strong> simulated FRF‘s in radial direction<br />
Sum <strong>of</strong> measured FRF‘s in radial direction<br />
mode (3,0,0)<br />
mode (3,2,3)<br />
mode (3,3,0)<br />
mode (3,3,4)<br />
10<br />
500 1000 1500 2000 2500 3000<br />
-8<br />
frequency [Hz]<br />
Fig. 13: Comparison <strong>of</strong> the sum <strong>of</strong> all measured and<br />
calculated FRFs <strong>of</strong> the stator core stack without teeth,<br />
by using a material model with transversally isotropic<br />
elasticity and optimized parameters<br />
response characteristics can be observed. Hence, the<br />
identified material parameters for a linear and homogeneous<br />
material model can represent a similar dynamical<br />
behavior as the real structure <strong>of</strong> the stator core without<br />
teeth in a frequency range from 0Hzto 3000 Hz.<br />
The coincident eigenfrequencies and their corresponding<br />
modes are listed in Table VII.<br />
- 151 - 15th IGTE Symposium 2012<br />
TABLE VII: Coincident measured and simulated eigenfrequencies<br />
and modes <strong>of</strong> the stator core stack without<br />
teeth, resulting from adjustment<br />
modes measured eigenfreq. simulated eigenfreq.<br />
(2, 0, 0) 769.22 Hz 749.11 Hz<br />
(2, 1, 0) 795.03 Hz 757.16 Hz<br />
(2, 2, 0) 1280.95 Hz 1278.38 Hz<br />
(2, 2, 1) 1353.39 Hz 1362.88 Hz<br />
(2, 2, 3) 1606.08 Hz 1607.62 Hz<br />
(3, 2, 1) 1881.54 Hz 1869.87 Hz<br />
(3, 1, 0) 2092.07 Hz 2095.87 Hz<br />
(3, 0, 0) 2109.96 Hz 2097.67 Hz<br />
(3, 2, 3) 2278.98 Hz 2313.00 Hz<br />
(3, 3, 0) 2509.72 Hz 2512.20 Hz<br />
(3, 3, 4) 2826.32 Hz 2830.84 Hz<br />
B. Stator core stack with teeth<br />
As an initial dataset for the stator core stack with teeth,<br />
the resulting material parameters <strong>of</strong> the stator core stack<br />
without teeth have been chosen.<br />
For the investigation <strong>of</strong> the stator core stack with<br />
teeth the density has to be determined. With a weight <strong>of</strong><br />
196.4kgand a volume <strong>of</strong> 2.61335·10−2 m3 the density <strong>of</strong><br />
the stator core stack with teeth results in 7515.3 kg/m 3 .<br />
This is 0.14 % less than the density <strong>of</strong> the stator core<br />
stack without teeth. Hence, all simulations for the stator<br />
core stack with teeth use the newly found density.<br />
1) Comparison <strong>of</strong> measured and calculated frequency<br />
response functions by using the initial dataset: The identified<br />
material parameters are validated by a comparison<br />
<strong>of</strong> the measured and the calculated response characteristics<br />
<strong>of</strong> the stator core stack with teeth, see Fig. 14. The<br />
comparison shows that the match <strong>of</strong> the measured data<br />
with the simulation results using the material parameters<br />
in Table VI for the stator core stack without teeth is not<br />
satisfactory. Only the modes (2,0,0) and (3,0,0) have a<br />
smaller deviation than the other modes. Therefore, the<br />
material parameters <strong>of</strong> the stator core stack with teeth<br />
are determined again.<br />
magnitude [ m N ]<br />
10 -5<br />
mode (2,0,0)<br />
10 -6<br />
10 -7<br />
10 -8<br />
mode (2,1,0)<br />
mode (3,0,0) mode (3,1,0)<br />
mode (2,2,x)<br />
Sum <strong>of</strong> simulated FRF‘s in radial direction<br />
Sum <strong>of</strong> measured FRF‘s in radial direction<br />
mode (3,2,x)<br />
mode (3,3,x)<br />
500 1000 1500<br />
frequency [Hz]<br />
2000 2500 3000<br />
Fig. 14: Validation <strong>of</strong> the identified material parameters<br />
by comparing results <strong>of</strong> a the simulation and measurement<br />
<strong>of</strong> the stator core stack with teeth
2) Adjustment <strong>of</strong> measured and simulated frequency<br />
response function by using transversal isotropic elasticity:<br />
The procedure <strong>of</strong> the adjustment <strong>of</strong> the material<br />
parameters is the same as for the stator core stack without<br />
teeth. For the initial dataset, the material parameters<br />
identified in section V-A are used. The optimization<br />
yields a set <strong>of</strong> material parameters listed in Table VIII.<br />
TABLE VIII: Dataset <strong>of</strong> material parameters <strong>of</strong> transversally<br />
isotropic elasticity with optimized Ez<br />
material parameters values<br />
Ex<br />
199.8 · 109 N/m 2<br />
Ez<br />
20.1 · 109 N/m 2<br />
Gxz<br />
9.9 · 109 N/m 2<br />
νxy<br />
0.3<br />
νxz<br />
0.3<br />
ϱ 7515.3 kg/m 3<br />
Fig. 15 shows that eigenfrequencies exist in the simulation<br />
which do not occur in the measurement. Furthermore<br />
the linear and homogeneous model does not adjust<br />
the modes (2,1,0), (3,1,0) and (4,1,0) to the measured<br />
eigenfrequencies <strong>of</strong> the equivalent simulated modes. The<br />
measured distance in the frequency range <strong>of</strong> about 100 Hz<br />
between the modes (2,0,0) and (2,1,0) or (3,0,0) and<br />
(3,1,0) etc. could not be represented.<br />
magnitude [ m N ]<br />
10 -5<br />
10 -6<br />
10 -7<br />
10 -8<br />
mode (2,0,0)<br />
mode (2,1,0)<br />
mode (3,0,0)<br />
mode (2,3,0)<br />
mode (2,2,x)<br />
mode (2,2,0)<br />
mode (3,1,0)<br />
Sum <strong>of</strong> simulated FRF‘s in radial direction<br />
Sum <strong>of</strong> measured FRF‘s in radial direction<br />
mode (3,2,x)<br />
mode (3,3,0)<br />
mode (4,3,2)<br />
mode (4,3,1)<br />
mode (4,0,0)<br />
500 1000 1500 2000<br />
frequency [Hz]<br />
2500 3000<br />
Fig. 15: Comparison <strong>of</strong> measured and simulated frequency<br />
response functions <strong>of</strong> the stator core stack with<br />
teeth in a frequency domain from 500 Hz to 3300 Hz<br />
Table IX lists modes and their corresponding measured<br />
and simulated eigenfrequencies which could be approximately<br />
adjusted in a frequency range from 500 Hz to<br />
3300 Hz.<br />
VI. SUMMARY AND CONCLUSION<br />
For the investigation <strong>of</strong> electrical machines, the dynamical<br />
behavior <strong>of</strong> the stator core is <strong>of</strong> high interest. The<br />
mechanical characterization is difficult since the structure<br />
<strong>of</strong> the stator core is inhomogeneous. In this paper, an<br />
approach has been presented which yields a linear and<br />
homogeneous description <strong>of</strong> a stator core stack.<br />
For the investigation <strong>of</strong> the dynamical behavior, two<br />
stator core stacks, one without stator teeth and the other<br />
- 152 - 15th IGTE Symposium 2012<br />
TABLE IX: Coincident measured and simulated eigenfrequencies<br />
and modes <strong>of</strong> the stator core stack with teeth,<br />
resulting from adjustment<br />
modes measured eigenfreq. simulated eigenfreq.<br />
(2, 0, 0) 661.27 Hz 662.79 Hz<br />
(2, 1, 0) 720.08 Hz 611.87 Hz<br />
(2, 2, 0) 1365.85 Hz 1358.87 Hz<br />
(3, 0, 0) 1767.43 Hz 1777.50 Hz<br />
(3, 1, 0) 1854.83 Hz 1782.64 Hz<br />
(4, 3, 1) 2819.37 Hz 2955.25 Hz<br />
(4, 3, 2) 2962.95 Hz 2955.25 Hz<br />
(4, 0, 0) 3107.52 Hz 3134.49 Hz<br />
with stator teeth, have been chosen. The experimental<br />
modal analysis have been carried out on both stator core<br />
stacks. The measurement results have been used for the<br />
adjustment <strong>of</strong> the simulation data.<br />
The numerical modal analysis has been applied in<br />
conjunction with the finite element method. For that,<br />
adequate models had to be chosen. The inhomogeneous<br />
structure has been represented by a linear and homogeneous<br />
FEM model and the lamination <strong>of</strong> the stator cores<br />
has been considered by a transversally isotropic material<br />
model.<br />
A study <strong>of</strong> the influence <strong>of</strong> the transversally isotropic<br />
material parameters has been carried out. Thereby, each<br />
material parameter has been varied and the resulting<br />
FRFs have been compared. It could be identified,<br />
which material parameter influences which mode. This<br />
knowledge is the basis <strong>of</strong> the adjustment <strong>of</strong> the simulated<br />
response characteristics <strong>of</strong> the stator cores.<br />
Before adjusting the response characteristics, a comparison<br />
<strong>of</strong> the simulated results using a material model<br />
<strong>of</strong> isotropic elasticity with the measurement results has<br />
been carried out for the stator core stack without teeth.<br />
It could be shown that a material model with isotropic<br />
elasticity is not appropriate.<br />
The stepwise adjustment <strong>of</strong> the simulated FRFs to<br />
the measured have shown a good match <strong>of</strong> the response<br />
characteristic <strong>of</strong> the stator core stack without teeth up to<br />
a frequency <strong>of</strong> 3kHz. However, some measured modes<br />
could not be identified with the used linear and homogeneous<br />
numerical model. The approximated material<br />
parameters have then been validated by a comparison <strong>of</strong><br />
the measured and simulated dynamical behavior <strong>of</strong> the<br />
stator core stack with teeth. This validation shows that<br />
the response characteristic <strong>of</strong> the measured and simulated<br />
results did not coincide except for some simulated<br />
eigenfrequencies.<br />
In order to improve the numerical model <strong>of</strong> the stator<br />
core stack with teeth, a further optimization <strong>of</strong> the material<br />
parameters has been carried out. The match <strong>of</strong> the<br />
measured and simulated response characteristics is not<br />
as good as for the stator core stack without teeth. In the<br />
investigated frequency range <strong>of</strong> the measured data some<br />
eigenfrequencies and eigenmodes could not be identified<br />
in the simulation.<br />
However, a working method has been introduced,
which describes the three dimensional dynamical behaviour<br />
<strong>of</strong> a stator core stack by using a linear and<br />
homogeneous numerical model.<br />
REFERENCES<br />
[1] B. Weilharter, O. Biro, H. Lang, G. Ofner, and S. Rainer, “Validation<br />
<strong>of</strong> a comprehensive analytic noise computation method for<br />
induction machines,” Industrial Electronics, IEEE Transactions<br />
on, vol. 59, no. 5, pp. 2248 –2257, may 2012.<br />
[2] J. Li, X. Song, D. Choi, and Y. Cho, “Research on reduction<br />
<strong>of</strong> vibration and acoustic noise in switched reluctance motors,”<br />
in Advanced Electromechanical Motion Systems Electric Drives<br />
Joint Symposium, 2009. ELECTROMOTION 2009. 8th International<br />
Symposium on, july 2009, pp. 1 –6.<br />
[3] S. Rainer, O. Biro, B. Weilharter, and A. Stermecki, “Weak coupling<br />
between electromagnetic and structural models for electrical<br />
machines,” Magnetics, IEEE Transactions on, vol. 46, no. 8, pp.<br />
2807 –2810, aug. 2010.<br />
[4] B. Weilharter, O. Bi? andro? and, S. Rainer, and A. Stermecki,<br />
“Computation <strong>of</strong> rotating force waves in skewed induction machines<br />
using multi-slice models,” Magnetics, IEEE Transactions<br />
on, vol. 47, no. 5, pp. 1046 –1049, may 2011.<br />
[5] H. Jordan, Geräuscharme Elektromotoren. Essen: Verlag W.<br />
Girardet, 1951.<br />
[6] R. N. Arnold and G. B. Warburton, “Flexural vibrations <strong>of</strong> the<br />
walls <strong>of</strong> thin cylindrical shells having freely supported ends,” <strong>Proceedings</strong><br />
<strong>of</strong> the Royal Society <strong>of</strong> London. Series A, Mathematical<br />
and Physical Sciences, vol. 197, no. 1049, pp. 238–256, Jun. 1949.<br />
[7] H. Frohne, “Über die primären bestimmungsgrößen er<br />
lautstärke bei asynchronmotoren,” Ph.D. dissertation, Technische<br />
Hochschule Hannover, 1959.<br />
[8] E. C. Naumann and J. L. Sewall, “An experimental and analytical<br />
vibration study <strong>of</strong> thin cylindrical shells with and without longitudinal<br />
stiffeners,” NASA Langley Research Center, Tech. Rep.<br />
NASA-TN-D-4705, Sep 1968.<br />
[9] Y. Stavsky and R. Loewy, “On vibrations <strong>of</strong> heterogenous orthotropic<br />
cylindrical shells,” Journal <strong>of</strong> Sound and Vibration,<br />
vol.18,no.3,pp.i–i,1971.<br />
[10] P. L. Tímár, A. Fazecas, J. Kiss, A. Miklós, and S. J. Yang,<br />
Noise and Vibration <strong>of</strong> Electrical Machines. New York: Elsevier<br />
Science Publishing Company, Inc., 1989.<br />
[11] C. Wang and J. C. S. Lai, “Prediction <strong>of</strong> natural frequencies <strong>of</strong><br />
finite length circular cylindrical shells,” Applied Acoustics, vol. 59,<br />
no. 4, pp. 385 – 400, 2000.<br />
[12] K.-J. Bathe, Finite-Elemente-Methoden, 2nd ed. Berlin Heidelberg:<br />
Springer, 2002, aus dem Englichen übersetzt von Peter<br />
Zimmermann.<br />
[13] R. Gasch and K. Knothe, Strukturdynamik: Band 1: Diskrete<br />
Systeme. Springer, 1987.<br />
[14] W. Yixuan and W. Ying, “Research on dynamic characteristics<br />
<strong>of</strong> stator core <strong>of</strong> large turbo-generator,” in Power and Energy<br />
Engineering Conference (APPEEC), 2010 Asia-Pacific, march<br />
2010, pp. 1 –4.<br />
[15] H. Wang and K. Williams, “Effects <strong>of</strong> laminations on the vibrational<br />
behaviour <strong>of</strong> electrical machine stators,” Journal <strong>of</strong> Sound<br />
and Vibration, vol. 202, no. 5, pp. 703–715, May 1997.<br />
[16] M. van der Giet and K. Hameyer, “Identification <strong>of</strong> homogenized<br />
equivalent materials for the modal analysis <strong>of</strong> composite structures<br />
in electrical machines,” in the 9th International Conference on<br />
Vibrations in Rotating Machinery, VIRM 2008, Exceter, UK, 2008,<br />
pp. 437–448.<br />
[17] J. Roivainen, “Unit-wave response-based modeling <strong>of</strong> electromechanical<br />
noise and vibration <strong>of</strong> electromechanical machines,”<br />
Ph.D. dissertation, Helsinki <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Helsinki,<br />
2009.<br />
[18] S. Verma and A. Balan, “Experimental investigations on the<br />
stators <strong>of</strong> electrical machines in relation to vibration and noise<br />
problems,” Electric Power Applications, IEE <strong>Proceedings</strong> -, vol.<br />
145, no. 5, pp. 455 –461, sep. 1998.<br />
[19] B. Peeters, H. Van der Auweraer, P. Guillaume, and J. Leuridan,<br />
“The polymax frequency-domain method: a new standard for<br />
modal parameter estimation?” Shock and Vibration, vol. 11, no. 3,<br />
pp. 395–409, Jan. 2004.<br />
- 153 - 15th IGTE Symposium 2012<br />
[20] D. J. Ewins, Modal Testing: Theory, Practice and Applications,<br />
2nd ed. Letchworth, Hertfordshire, UK: Research Studies Press,<br />
2000.
- 154 - 15th IGTE Symposium 2012<br />
Proper Location <strong>of</strong> the Regulating Coil in Transformers<br />
from Short-Circuit Forces Point <strong>of</strong> View<br />
*, O. Sonmez, * B. Duzgun, * G. Komurgoz<br />
* Istanbul Technical <strong>University</strong> Electrical and Electronics Faculty, 80626 Istanbul, Turkey<br />
Abstract—A transformer has complicated network <strong>of</strong> internal forces acting on and stressing the conductors, support and<br />
insulation structures. These forces are fundamental to the interaction <strong>of</strong> current-carrying conductors within magnetic fields<br />
involving an alternating-current source. Location <strong>of</strong> the regulating coil in transformer determines electrodynamic forces<br />
effect on the operational behavior <strong>of</strong> the transformer. This paper presents design principles <strong>of</strong> the regulating coil in<br />
transformers and shows the electrodynamics forces and their deformation results by using finite element method.<br />
Index Terms—Electrodynamic Forces, Deformation Analysis, FEA, Regulating Coil.<br />
I. INTRODUCTION<br />
The transformer is a very critical and costly important<br />
component in power generation and transmission systems<br />
as regarding reliable and performance. The capacity <strong>of</strong><br />
transformers is increasing with the rapid development. As<br />
the voltage level is higher, the time needed to design a<br />
transformer is <strong>of</strong> great importance. One <strong>of</strong> the important<br />
problems in the design <strong>of</strong> transformers is radial and axial<br />
forces, being proportional to the square <strong>of</strong> the short<br />
circuit current. By the interaction <strong>of</strong> leakage field and<br />
the short circuit current, which makes the windings be<br />
published or pulled, the huge short circuit force is<br />
generated in the windings (large power). The leakage<br />
flux not only causes the additional losses and forces, but<br />
also creates heating to the internal components. Short<br />
circuit current is 8 to 10 times the rated current in larger<br />
transformers and 20 to 25 times in smaller units. Forces<br />
arising during short-circuit may be as high as ten<br />
thousand to million N. By the effect <strong>of</strong> so large forces<br />
and thermal expansion <strong>of</strong> wires, the insulation <strong>of</strong><br />
transformer windings can be distorted, even collapsed,<br />
short circuit error occurs or damage to the clamping<br />
structures. Furthermore, the location <strong>of</strong> the tapings has<br />
the predominant effect on the axial forces since it<br />
controls the residual ampere turn. Failure <strong>of</strong> transformers<br />
due to short circuits is major concern for power utilities<br />
and manufactures. These hazards can be avoided by<br />
proper design <strong>of</strong> windings structure against thermal and<br />
mechanical strains to prevent permanent deformations<br />
and movement <strong>of</strong> windings if forces can be calculated<br />
correctly.<br />
In the past, many technical papers have been published<br />
which give equations for calculation the electromagnetic<br />
forces acting on the windings in transformers [1-9].<br />
Electromagnetic force computations methods have been<br />
proposed in the literature mainly based on static and<br />
transient formulations [10]. Classical methods can be<br />
used to compute the short circuit forces in windings [11].<br />
In these methods, it is used simplified configurations<br />
with some assumptions. Furthermore these methods are<br />
simple, fast and easy, but not accurate and not suitable<br />
for predicting the performance <strong>of</strong> special types <strong>of</strong><br />
Sönmez Transformer Company, 41410 Kocaeli, Turkey<br />
e-mail: komurgoz@itu.edu.tr<br />
transformers, especially the axial length <strong>of</strong> windings is<br />
not equal [12]. It is, however, obvious that by using<br />
modern computerized methods, sophisticated methods, it<br />
is possible to calculate forces acting on the elements <strong>of</strong><br />
winding, the effect <strong>of</strong> any arrangements <strong>of</strong> parts and<br />
asymmetries. If magnetic field is calculated accurately, it<br />
is possible to define electromagnetic forces in the<br />
detailed transformer model by using numerical methods,<br />
Finite Element Methods (FEM), Finite Difference<br />
Methods (FDM) and Boundary elements (BEM) etc. In<br />
recent years, a significant development <strong>of</strong> FEM s<strong>of</strong>tware<br />
has enabled the force calculation to be accomplished<br />
easily in where the winding and tapping arrangement is<br />
complex.<br />
This paper concentrates on the use <strong>of</strong> FEM to models.<br />
This method provides a comprehensive view <strong>of</strong> the<br />
overall transformer mechanic and electromagnetic<br />
behavior under normal and disturbance conditions. The<br />
effect <strong>of</strong> tap winding configurations is also analyzed. The<br />
results obtained from FEM <strong>of</strong> transformers using<br />
MAXWELL® and ANSYS® are validated by the<br />
mathematical models.<br />
II. FORCES ACTING ON THE TRANSFORMER<br />
When the electromagnetic force becomes greater than the<br />
strength <strong>of</strong> the windings, the windings will fail. The types<br />
<strong>of</strong> failure,Electromagnetic forces, acting on transformer<br />
can be classified as “radial forces” which develop in the x<br />
direction and “axial forces” develop in the y direction.<br />
For the calculation <strong>of</strong> these forces, both analytical and<br />
numerical methods are presented such as residual<br />
ampere-turn method, Robin’s solution, Smythe’s<br />
solution, calculation using Fourier series, two<br />
dimensional method <strong>of</strong> images, FEM, image method with<br />
discrete conductors etc. [13].<br />
Axial forces creates slipping or breakdown <strong>of</strong> windings as<br />
a whole standing-up <strong>of</strong> part <strong>of</strong> windings, tilting and<br />
deformation <strong>of</strong> coils. Radial forces creats buckling<br />
phenomena <strong>of</strong> inner windings, excessive elongation <strong>of</strong><br />
outer windings.<br />
A. Axial Forces<br />
One <strong>of</strong> the elementary and simplest methods, residual
ampere-turn method, gives closer approximations and<br />
reliable results for the calculation <strong>of</strong> axial forces.<br />
Concentric windings are separated into two groups and<br />
each group has balanced ampere-turns. The radial<br />
ampere-turns produce radial flux which causes axial<br />
force in the windings as it seen in Figure 1. This<br />
assumption allows calculation <strong>of</strong> the axial forces.<br />
Figure 1: Axial and radial forces in concentric axially nonsymmetrical<br />
windings [13].<br />
The algebraic sum <strong>of</strong> the ampere-turns <strong>of</strong> low voltage<br />
and high voltage windings at any point and at end <strong>of</strong> the<br />
windings gives the radial ampere-turns at that point in the<br />
winding. A curve is plotted for every points called<br />
residual or unbalanced ampere-turn diagram which the<br />
method derives its name [12]. It is clear that windings<br />
without axial displacement and windings have the same<br />
length have no residual ampere-turns or forces between<br />
windings. However, there are some internal compressive<br />
forces and forces on the end coils, although there is no<br />
axial thrust between windings.<br />
Figure 2 gives the methodology for the determining<br />
distribution <strong>of</strong> radial ampere-turns. ‘a’ is the length<br />
tapped out at the end <strong>of</strong> the outer windings. Summation<br />
<strong>of</strong> I and II shown in Figure 2(b) are both balanced<br />
ampere-turn groups. If these groups are superimposed,<br />
they produce the given ampere-turn arrangement. The<br />
triangle as shown in Figure 2(c) presents the diagram <strong>of</strong><br />
the radial-ampere turns. This diagram plotted against<br />
distance along the winding. a(NImax) is the maximum<br />
value, where (NImax) represents the ampere-turns <strong>of</strong> either<br />
the low voltage or high voltage winding.<br />
Figure 2: Determination <strong>of</strong> residual ampere-turns [12].<br />
- 155 - 15th IGTE Symposium 2012<br />
Tapings location on the winding has a great effect on the<br />
axial forces since it controls the residual ampere-turn<br />
diagram.<br />
B. Radial Forces<br />
The radial forces develop due to interaction <strong>of</strong> coil<br />
currents with the axial component <strong>of</strong> its own magnetic<br />
flux. In a transformer with concentric windings, radial<br />
forces considered insignificant because, the radial<br />
strength <strong>of</strong> the winding is high. Most problems occur<br />
because <strong>of</strong> axial forces and axial movement results more<br />
damage to the winding and insulation than radial<br />
movements.<br />
The inner coil is subjected a pressure tends to collapse<br />
to the core. At the same time, the outer coil is under a<br />
pressure to extend the diameter <strong>of</strong> the coil which<br />
produces a stress as shown in Figure 1. Preferable choice<br />
in a transformer is circular coils, because they are the<br />
strongest shape to withstand the radial pressure<br />
mechanically [14].<br />
III. CALCULATION OF ELECTROMAGNETIC FORCES<br />
A. Short-Circuit Current<br />
Short-circuit currents on the windings have a<br />
significant effect on calculation <strong>of</strong> electromagnetic<br />
forces. Generally, the short-circuit current is calculated<br />
for different situations by considering [15];<br />
Tapping arrangement<br />
Fault position<br />
Short-circuit power combination (network and<br />
transformer)<br />
Short-circuit type (e.g. three phase symmetrical)<br />
To see the effects <strong>of</strong> the short-circuit current on power<br />
transformers, the simplest fault scenario, three phase<br />
short-circuit scenario is investigated. Symmetrical shortcircuit<br />
current can be calculated according IEC 60076-5<br />
as [16];<br />
I <br />
U<br />
Z Z <br />
<br />
3 t s<br />
9 And the amplitude<br />
is;<br />
Imax <strong>of</strong> the first peak <strong>of</strong> the current<br />
I Ik 2 10<br />
max<br />
<br />
The factor k is the initial <strong>of</strong>fset <strong>of</strong> the current and<br />
2 stands for the peak to r.m.s. value <strong>of</strong> sinusoidal wave.<br />
This k 2 factor depends on the X/R ratio and the<br />
values <strong>of</strong> k are shown in standards IEC 60076-5 [16].<br />
This current is based on the following expression for<br />
the peak factor;<br />
R/ X<br />
2 <br />
k 2 1<br />
<br />
e<br />
<br />
sin<br />
2 11
Y1 [kA]<br />
25.00<br />
12.50<br />
0.00<br />
-12.50<br />
Curve Inf o<br />
InputCurrent(Winding_LV_A)<br />
Setup1 : Transient<br />
InputCurrent(Winding_LV_B)<br />
Setup1 : Transient<br />
InputCurrent(Winding_LV_C)<br />
Setup1 : Transient<br />
Name X Y<br />
Phase C_sc 131.5000 21.6972<br />
Phase A_sc 118.5000 21.6972<br />
Phase B_sc 105.0000 21.7270<br />
Input Current LV Model2D_coils ANSOFT<br />
Phase B_sc<br />
Phase A_sc<br />
Phase C_sc<br />
-25.00<br />
75.00 87.50 100.00<br />
Time [ms]<br />
112.50 125.00 135.00<br />
Figure 3: Input currents <strong>of</strong> low voltage windings.<br />
The given short-circuit has two components as steady<br />
state and exponentially unidirectional component. In<br />
Figure 3, applied steady-state and short-circuit currents<br />
on the windings <strong>of</strong> the power transformer in Maxwell<br />
s<strong>of</strong>tware is shown. The exponentially unidirectional<br />
component is ignored to make calculations simpler.<br />
B. Electromagnetic Forces<br />
Transient analysis allows calculating electromagnetic<br />
forces for every time step by calculating the leakage flux<br />
and full field in winding region. Fully coupled dynamic<br />
physics solution is;<br />
A<br />
AJs V Hc vA t<br />
The differential equation and the boundary conditions<br />
<strong>of</strong> transient axial symmetric electromagnetic field can be<br />
expressed in the cylindrical coordinate as;<br />
- 156 - 15th IGTE Symposium 2012<br />
12 rA rA rA <br />
<br />
<br />
: v' Z Z <br />
v' r r <br />
Js <br />
<br />
'<br />
t<br />
13<br />
S1: rA rA0<br />
14 rA <br />
S2: v' Ht<br />
n<br />
15 For 2D analysis, the radial and axial components <strong>of</strong> the<br />
magnetic flux density can be expressed as;<br />
A<br />
Br<br />
<br />
z<br />
B<br />
0<br />
16 1 rA<br />
Bz<br />
<br />
r r<br />
17 When the magnetic flux density is decomposed into its<br />
radial and axial components;<br />
<br />
F J ˆ B rˆB zˆ d F rˆF zˆ<br />
18<br />
<br />
<br />
<br />
<br />
r z r z<br />
In brief, the force on the power transformer is<br />
expressed by the Lorentz force as<br />
<br />
dF idlB And the radial force <strong>of</strong> unit length<br />
F B I dl<br />
x y<br />
max<br />
The axial force <strong>of</strong> unit length<br />
F B I dl<br />
y x<br />
max<br />
19 20 21 IV. RESULTS &DISCUSSIONS<br />
A. Model<br />
Electrical machines require an accurate mathematical<br />
model for system simulation and performance evaluation.<br />
Detailed knowledge <strong>of</strong> the flux distribution <strong>of</strong> a<br />
transformer plays a very important role in a safe<br />
estimation <strong>of</strong> the forces <strong>of</strong> the transformer. Complex<br />
computer programs are required to obtain a reasonable<br />
representation <strong>of</strong> the field in different parts <strong>of</strong> the<br />
windings. Using the above models for determinate forces,<br />
a numerical application (FEM) has been implemented for<br />
a 25 MVA power transformer. 3-D model <strong>of</strong> the general<br />
structure is shown in Figure 4. To reduce computing time<br />
and avoid excessive use <strong>of</strong> ram, the insulating materials<br />
and supporting structure are neglected, besides analyses<br />
were done in 2-D structure.<br />
Figure 4: 3-D model <strong>of</strong> analyzed power transformer.<br />
The characteristics <strong>of</strong> the studied transformer are<br />
presented in Table I and geometry details <strong>of</strong> the analyzed<br />
transformer are shown in Figure 5.<br />
TABLE I<br />
TRANSFORMER DATA<br />
Rated Power 25 [MVA]<br />
Rated Frequency 50 [Hz]<br />
Rated Voltages 120 / 11 [kV]<br />
Rated Currents 120 / 1310 [A]<br />
Turns Ratio 1000 / 159<br />
Connection Yd11<br />
Tap setting ± 15 %<br />
Transformer short circuit voltage (%) 9<br />
Figure 5: Geometry details <strong>of</strong> analyzed transformer.
Figure 6: 2-D model <strong>of</strong> analyzed transformer tapped at upper side.<br />
B. Electromagnetic Results<br />
Transformers require an accurate mathematical model<br />
for system simulation and performance evaluation. In this<br />
study, magnetic analysis <strong>of</strong> the designed machines has<br />
been investigated using Maxwell 2D program and total<br />
deformations have been investigated using ANSYS®<br />
program (Figure 6). The simulations were completed<br />
using the following steps;<br />
1) Geometric model creation,<br />
2) The appointment <strong>of</strong> the materials that make up<br />
the structure <strong>of</strong> the machine,<br />
3) Boundary conditions and mesh process,<br />
4) The appointment <strong>of</strong> currents in windings,<br />
5) Analyze,<br />
6) Examination <strong>of</strong> the results.<br />
In Figure 7 and 8 leakage flux distributions are shown<br />
for +15% tapping position and -15% tapping position. As<br />
the leakage flux increases, electromagnetic forces are<br />
occurring rapidly.<br />
Figure 7: Leakage flux distribution at +15% tapping position <strong>of</strong> HV<br />
windings.<br />
Figure 8: Leakage flux distribution at -15% tapping position <strong>of</strong> HV<br />
windings.<br />
- 157 - 15th IGTE Symposium 2012<br />
The graphs <strong>of</strong> distribution <strong>of</strong> radial magnetic flux<br />
density along the transformer window are shown in<br />
Figure 9 and 10 for +15% tapping, -15% tapping at upper<br />
part <strong>of</strong> HV windings, respectively.<br />
Mag_B [tesla]<br />
1.50<br />
1.25<br />
1.00<br />
0.75<br />
0.50<br />
0.25<br />
Axial Flux Density Distribution Model2D_coils ANSOFT<br />
0.00<br />
0.00 0.25 0.50 0.75 1.00 1.25 1.50 1.75<br />
Distance [meter]<br />
Curve Inf o<br />
Mag_B<br />
Setup1 : Transient<br />
Time='115000000ns'<br />
Figure 9: Axial flux distribution at +15% tapping position <strong>of</strong> HV<br />
windings.<br />
Figure 9 shows axial flux distribution with respect to<br />
height <strong>of</strong> the winding for at +15% tapping position <strong>of</strong> HV<br />
windings. To determine the axial forces, it is necessary to<br />
find the radial flux produced by the radial ampere-turns.<br />
As seen from figure, axial flux density is approximately<br />
constant along the winding due to symmetrical windings<br />
(with fully balanced ampere-turns)<br />
Mag_B [tesla]<br />
3.50<br />
3.00<br />
2.50<br />
2.00<br />
1.50<br />
1.00<br />
0.50<br />
Axial Flux Density Distribution Model2D_coils ANSOFT<br />
0.00<br />
0.00 0.25 0.50 0.75 1.00 1.25 1.50 1.75<br />
Distance [meter]<br />
Figure 10: Axial flux distribution at -15% tapping position <strong>of</strong> HV<br />
windings.<br />
Curve Inf o<br />
Mag_B<br />
Setup1 : Transient<br />
Time='115000000ns'<br />
If there is an asymmetry in the winding heights due to the<br />
tap position or for some other reasons such as failure,<br />
flux distribution changes as shown in Figure 10. Flux<br />
density distribution makes maximum in one place along<br />
the height <strong>of</strong> the winding.<br />
The electromagnetic forces in the winding <strong>of</strong> the<br />
power transformer are calculated with the leakage flux<br />
and transient currents. The radial and axial forces <strong>of</strong> each<br />
conductor coil in the HV windings are given in Figure<br />
11-14. Figure 11 and 12 shows radial and axial forces at<br />
+15% tapping position <strong>of</strong> HV windings.<br />
Radial Forces<br />
x 105<br />
3.5<br />
3<br />
2.5<br />
2<br />
1.5<br />
1<br />
0.5<br />
0<br />
Radial Forces on the HV Coils for +15% tapping at 118.5 ms<br />
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15<br />
Coil Numbers<br />
Figure 11: Radial Forces at +15% tapping position <strong>of</strong> HV windings.
Axial Forces<br />
x 104<br />
8<br />
6<br />
4<br />
2<br />
0<br />
-2<br />
-4<br />
-6<br />
-8<br />
Axial Forces on the HV Coils for +15% tapping at 118.5 ms<br />
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15<br />
Coil Numbers<br />
Figure 12: Axial Forces at +15% tapping position <strong>of</strong> HV windings.<br />
Axial and radial forces in windings when the windings<br />
are axially non-symmetrical are calculated as given in<br />
Figure 13 and 14.<br />
Due to the symmetry <strong>of</strong> winding and regular<br />
distribution <strong>of</strong> flux, forces values are smaller than<br />
asymetrical winding arrangement. If there is an<br />
asymmetry in the winding heights due to the tap position<br />
(or for some other reasons), the ampere-turn unbalance<br />
increases and gives rise to forces, and result <strong>of</strong> this,<br />
tending to break the winding.<br />
Radial Forces<br />
x 105<br />
6<br />
5<br />
4<br />
3<br />
2<br />
1<br />
0<br />
Radial Forces on the HV Coils for -15% tapping at 118.5 ms<br />
1 2 3 4 5 6 7 8 9 10 11 12<br />
Coil Numbers<br />
Figure 13: Radial Forces at %-+15 tapping position <strong>of</strong> HV windings.<br />
Axial Forces<br />
x 105<br />
1<br />
0<br />
-1<br />
-2<br />
-3<br />
-4<br />
-5<br />
-6<br />
-7<br />
-8<br />
-9<br />
Axial Forces on the HV Coils for -15% tapping at 118.5 ms<br />
1 2 3 4 5 6 7 8 9 10 11 12<br />
Coil Numbers<br />
Figure 14: Axial Forces at -15% tapping position <strong>of</strong> HV windings.<br />
The total body force density and total deformation are<br />
determined by using ANSYS program and shown in<br />
Figure 15-18. Figure 15 and 17 shows the effect <strong>of</strong> forces<br />
on winding at +15% tapping position <strong>of</strong> HV windings.<br />
- 158 - 15th IGTE Symposium 2012<br />
Figure 15: Total body force density at -15% tapping position <strong>of</strong> HV<br />
windings (115 ms)<br />
Figure 16: Total body force density at -15% tapping position <strong>of</strong> HV<br />
windings (115 ms)<br />
Deformations in windings when the windings are<br />
axially non-symmetrical are obtained as given in Figure<br />
16 and18. Total deformation depends on the tap position<br />
and at +15% tapping position <strong>of</strong> HV windings, they are<br />
bigger than which at -15% tapping position <strong>of</strong> HV<br />
windings. The location <strong>of</strong> forces shifts to the upper side<br />
<strong>of</strong> the winding.<br />
Figure 17: Total deformations at +15% tapping position <strong>of</strong> HV windings<br />
(115 ms).<br />
V. CONCLUSION<br />
In this paper, leakage magnetic field and electrodynamic<br />
force <strong>of</strong> the 25 MVA power transformers were analyzed<br />
under short circuit conditon <strong>of</strong> the low voltage windings<br />
<strong>of</strong> the transformer by using ANSYS® and MAXWELL®
Figure 18 Total deformations at -15% tapping position <strong>of</strong> HV windings<br />
(115 ms).<br />
based on the FEM.Two different conditions when the<br />
power transformer is under mximum tap are analyzed.<br />
The location <strong>of</strong> the regulating coil is changed.<br />
Afterwards, deformation result is showed by using<br />
calculated force values. Undesirable stresses values can<br />
be prevented on the transformers by making appropriate<br />
coil arrangements. The insertation <strong>of</strong> tap sections in the<br />
windings, which produces asymetries between LV and<br />
HV windings, tends to cause an inrease <strong>of</strong> radial and<br />
axial forces annd then damages in transformers. The<br />
method <strong>of</strong> calculation <strong>of</strong>feres a reference to the design <strong>of</strong><br />
transformer.<br />
LIST OF PRINCIPLES SYMBOLS<br />
a fractional difference in winding heights<br />
A magnetic vector potential<br />
Br, B , Bz components <strong>of</strong> the flux density <strong>of</strong> (in Tesla)<br />
dl<br />
<br />
F<br />
unit length <strong>of</strong> wire<br />
force<br />
h axial height <strong>of</strong> the winding<br />
Hc coercive magnetic field strength <strong>of</strong> the PM<br />
Ht tangential component <strong>of</strong> magnetic intensity<br />
Imax maximum current<br />
Js current source density<br />
J - directional short-circuit current density<br />
ˆr , ˆ and ˆz unit vectors in cylindrical coordinate<br />
S1 parallel boundary condition<br />
S2 vertical boundary condition<br />
U rated voltage<br />
v velocity<br />
V electric scalar potential<br />
Zs short-circuit impedance <strong>of</strong> the system<br />
Zt short-circuit impedance <strong>of</strong> the transformer<br />
phase angle<br />
conductivity<br />
studied domain<br />
v ' reluctivity<br />
' conductance<br />
REFERENCES<br />
[1] K. Karsai, D. Kerenyi, and L. Kiss, "Large Power Transformers,"<br />
Elsevier Science, December 1987.<br />
[2] L. Jiao, "The calculation <strong>of</strong> ampere force on electric power<br />
transformer under the short circuit situation," Electrical Machines<br />
- 159 - 15th IGTE Symposium 2012<br />
And Systems, 2008. ICEMS 2008. International Conference On,<br />
17-20 October 2008.<br />
[3] V. Behjat, A. Vahedi, A. Setayeshmehr, and et all. "Identification<br />
<strong>of</strong> the most sensitive frequency response measurement technique<br />
for diagnosis <strong>of</strong> interturn faults in power transformers,"<br />
Measurement Science And <strong>Technology</strong>, volume 21, No. 7, 2010.<br />
[4] L. Jiao, B. Bai, and H. Li, "The calculation <strong>of</strong> ampere force on<br />
electric power transformer under the short circuit situation,"<br />
International conference on electrical machines and systems, pp.<br />
4423-4426, October 2008.<br />
[5] A. C. de Azevedo, I. Rezende, and A. C. Delaiba, "Investigation <strong>of</strong><br />
transformer electromagnetic forces caused by external faults using<br />
FEM," Transmission & distribution conference and exposition:<br />
Latin America, August 2006.<br />
[6] K. Kurita, T. Kuriyama, K. Hiraishi, and et all. "Mechanical<br />
Strength <strong>of</strong> Transformer Windings under Short-Circuit<br />
Conditions," IEEE Transactions on Power Apparatus and Systems,<br />
volume 88, No. 3, March 1969.<br />
[7] K. Hiraishi, Y. Hori and S. Shida, "Mechanical Strength Of<br />
Transformer Windings under Short-Circuit Conditions," IEEE<br />
Transactions on Power Apparatus and Systems, pages 2381-2390,<br />
October 1971.<br />
[8] R. E. Ayres, G. O. Usry, M R Patel, and et all. "Dynamic<br />
Measurement during Short Circuit Testing <strong>of</strong> Transformers, Part<br />
2- Test Results And Analysis," IEEE Transactions on Power<br />
Apparatus and Systems, volume PAS-94, pages 198-206,<br />
March/April 1975.<br />
[9] M. Heathcote and D.P. Franklin, "The J&P Transformer Book"<br />
Reed Educational and Pr<strong>of</strong>essional Publishing Ltd, England, 12th<br />
edition, 1998.<br />
[10] G.B. , Kumbhar, and S.V. Kulkarni, "Analysis <strong>of</strong> short-circuit<br />
performance <strong>of</strong> split-winding transformer using coupled fieldcircuit<br />
approach," Power Delivery, IEEE Transactions On, Issue:<br />
2, pages 936-943, April 2007.<br />
[11] M. Waters, "The Measurement and Calculation <strong>of</strong> Axial<br />
Electromagnetic Forces in Concentric Transformer Windings,"<br />
<strong>Proceedings</strong> <strong>of</strong> the IEE - Part II: Power Engineering, volume 101,<br />
pages 35-46, February 1954.<br />
[12] M. F. Beavers and C. M. Adams, "The Calculations and<br />
Measurement <strong>of</strong> Axial Electromagnetic Forces on Concentric<br />
Coils in Transformers," Power Apparatus and Systems, Part III.<br />
Transactions <strong>of</strong> the American Institute <strong>of</strong> Electrical Engineers,<br />
volume 78, pages 467-477, August 1959.<br />
[13] M. S. A. Minhas, "Dynamic Behaviour <strong>of</strong> Transformer Winding<br />
under Short-Circuits," Ph.D. Thesis, <strong>University</strong> <strong>of</strong> the<br />
Witwatersrand, Johannesburg, November 2007.<br />
[14] M. G. Say, "The Performance and Design <strong>of</strong> Alternating Current<br />
Machines," Sir Issak Pitman & Sons Ltd, London, 3rd edition,<br />
1958.<br />
[15] N. Mahomed, "Electromagnetic Forces in Transformers under<br />
Short-Circuit Conditions," Energize Online, pp. 36-40, March<br />
2011.<br />
[16] IEC Standard 60076-5: Power Transformers-Part 5: "Ability to<br />
withstand short circuit”, 2006.
- 160 - 15th IGTE Symposium 2012<br />
Robust Design <strong>of</strong> IPM motors using<br />
Co-Evolutionary Algorithms<br />
*Min Li, † André S. Ruela, † Frederico G. Guimarães, † Jaime A. Ramírez and *David A. Lowther<br />
*McGill <strong>University</strong>, 3480 <strong>University</strong>, H3A 2K6 Montreal, Canada<br />
† Federal <strong>University</strong> <strong>of</strong> Minas Gerais, Belo Horizonte, MG 31270-010, Brazil<br />
E-mail: *david.lowther@mcgill.ca, † jramirez@ufmg.br<br />
Abstract—A robust design formulation is developed considering the minimization <strong>of</strong> the torque ripples <strong>of</strong> an interior<br />
permanent magnet (IPM) machine in the presence <strong>of</strong> uncertainties in the values <strong>of</strong> the design variables. This optimization<br />
problem is first solved using the worst vertex prediction and a deterministic search. In addition, a competitive co-evolutionary<br />
algorithm is applied to the minimax optimization problem to find a robust solution, in which one population evolves values for<br />
the design variables and the other one evolves values in the uncertainty set. Through a worst case analysis, the result from the<br />
co-evolutionary algorithm is proven to have a more robust performance than that <strong>of</strong> the non-robust optimization if<br />
manufacturing tolerance is taken into account. The computation time <strong>of</strong> the co-evolutionary algorithm may be largely<br />
reduced through the use <strong>of</strong> parallel computing environments.<br />
Index Terms—Evolutionary algorithms, IPM motor, Minimax optimization, Robust design.<br />
I. INTRODUCTION<br />
In recent years, interior permanent magnet (IPM)<br />
motors have become popular for many applications that<br />
require variable speed and torque. As an alternative to the<br />
traditional induction motor, IPM motors have the<br />
advantages <strong>of</strong> higher efficiencies and lower noise. The<br />
design <strong>of</strong> an IPM motor is a complicated task that<br />
involves the consideration <strong>of</strong> many different aspects,<br />
such as the size and the weight <strong>of</strong> the machine, the<br />
desired output torque, the cost <strong>of</strong> the permanent magnet,<br />
etc. In this paper, we focus on the robust design <strong>of</strong> IPM<br />
machines for which the objective is the reduction <strong>of</strong><br />
vibrations and noise <strong>of</strong> the device caused by errors in<br />
manufacturing, in order to improve the quality and to<br />
extend the life <strong>of</strong> product.<br />
The idea <strong>of</strong> robust design was introduced to electrical<br />
machine design over two decades ago. Robustness is<br />
<strong>of</strong>ten defined in terms <strong>of</strong> the performance <strong>of</strong> the device<br />
being less sensitive to manufacturing errors and<br />
variations <strong>of</strong> the operation conditions. Dr. Taguchi, with<br />
his statistical based methods, is considered as one <strong>of</strong> the<br />
pioneers <strong>of</strong> engineering robust design as he developed<br />
the foundations <strong>of</strong> robust design to meet the challenges <strong>of</strong><br />
producing high-quality products. A Taguchi-based<br />
optimization method has been applied to the design <strong>of</strong><br />
brushless DC motors in [1], where the signal-to-noise<br />
ratio was used to estimate the robustness <strong>of</strong> the product.<br />
A robust shape optimization was applied by Yoon to the<br />
design <strong>of</strong> electromagnetic devices in [2], where the mean<br />
and the standard deviation <strong>of</strong> the performance were<br />
treated as multi-objectives for the design problem. This<br />
paper also employed a sensitivity based approach to<br />
compute the approximation <strong>of</strong> the standard deviation and<br />
took feasibility robustness into account. Another useful<br />
formulation <strong>of</strong> robust design is to apply the worst case<br />
analysis and to optimize the worst performance <strong>of</strong> the<br />
objective function in the presence <strong>of</strong> uncertainties [3] [4].<br />
The robustness measure is integrated into the<br />
optimization process by using a robust target function<br />
defined on the uncertainty set <strong>of</strong> the design variables; and<br />
the vertices <strong>of</strong> the uncertainty set were used to predict the<br />
worst value <strong>of</strong> the objective function. Several different<br />
robust design formulations were reviewed and discussed<br />
in [5] and the authors proposed that the standard<br />
deviation can be approximated using the difference<br />
between the worst performance and the nominal<br />
performance and the computation cost could be largely<br />
reduced. In the recent development <strong>of</strong> sensitivity based<br />
robust design optimization, the authors defined a gradient<br />
index (GI) using the sensitivity <strong>of</strong> the performance<br />
function with respect to some critical uncertainty<br />
variables [6]. This simple and efficient algorithm was<br />
illustrated with an example <strong>of</strong> MEMS devices where<br />
robustness is crucial for high yield rate but information<br />
on uncertainties is hard to obtain. This gradient index<br />
based robust design method was also tested with the<br />
TEAM workshop problem 22 in [7]. The worst case<br />
analysis and the robust target function have also been<br />
applied to topological design problems [8], where a<br />
robust topological gradient (TG) was used to evaluate the<br />
robustness for a certain topological design.<br />
In addition to deterministic optimization approaches,<br />
the worst case analysis based robust design problems (i.e.<br />
minimax optimization) can also be solved using genetic<br />
algorithms and evolutionary algorithms [9] [10]. In<br />
particular, co-evolutionary algorithms have been used to<br />
solve constrained optimization problems formulated as<br />
the minimax optimization problem [11, 12]. In this case,<br />
one population evolves solutions for the problem while<br />
the second one evolves the terms for the Lagrange<br />
penalty function. Co-evolutionary methods have been<br />
applied to robust design as well [13, 14]. In [13] a coevolutionary<br />
algorithm is used to design a robust<br />
nonlinear control under uncertainties. In [14], the authors<br />
have also reviewed a few different formulations <strong>of</strong><br />
competitive co-evolutionary genetic algorithms.<br />
In this paper, a robust design problem is defined for<br />
minimizing the torque ripples <strong>of</strong> an IPM motor while<br />
considering the uncertainties in the design variables. Two<br />
different approaches based on the worst case scenario are<br />
being considered. The first one employs a deterministic<br />
search and uses the computed sensitivity information to
predict the worst performance <strong>of</strong> the design. The second<br />
algorithm is based on a competitive co-evolutionary<br />
strategy. It introduces a competition between two<br />
populations, one evolves values for the design variables<br />
and the other one evolves values for the uncertainty<br />
variables. Details <strong>of</strong> the two approaches are presented in<br />
the second section <strong>of</strong> the paper and the results from the<br />
robust design are tested against a non-robust design in<br />
section III. In the last section, the performance and the<br />
limitations <strong>of</strong> the two robust design approaches are<br />
discussed.<br />
II. ROBUST TOPOLOGY OPTIMIZATION<br />
A. Robust Design formulation<br />
A practical way to treat the robust design problem is to<br />
use the worst case scenario. A robust objective function<br />
can be defined as:<br />
min max f ( ) , (1)<br />
x U<br />
( x)<br />
where f(x) is the nonrobust objective function and U(x) is<br />
an uncertainty set containing all the possible variations <strong>of</strong><br />
the design variable x,<br />
n<br />
U ( x)<br />
{ R : ( 1<br />
i<br />
) xi<br />
( 1<br />
i<br />
) xi}<br />
. (2)<br />
Then the worst performance <strong>of</strong> the objective function f<br />
can be approximated using the value <strong>of</strong> f evaluated at one<br />
<strong>of</strong> the vertices <strong>of</strong> U, i.e. the worst vertex.<br />
max<br />
U<br />
( x)<br />
f ( ) f ( x<br />
pred<br />
Nevertheless, in a constrained optimization problem, if<br />
a nominal optimal solution x* is located close to the<br />
boundary <strong>of</strong> the feasible region, which may happen in<br />
some cases, some perturbed solutions, due to the<br />
variations <strong>of</strong> the design variables, will no longer be<br />
feasible. In such a situation, the robust solution must be<br />
placed away from the boundary <strong>of</strong> the feasible region to<br />
make sure that the entire uncertainty set <strong>of</strong> x* stays in the<br />
feasible region. To ensure feasibility robustness, a robust<br />
constraint function is defined as:<br />
max<br />
U<br />
( x )<br />
)<br />
- 161 - 15th IGTE Symposium 2012<br />
(3)<br />
g ( ) 0 , (4)<br />
where gi(x) are the original constraints for the problem.<br />
Finally, a robust design formulation using a robust<br />
objective function and a robust constraint function is<br />
given as:<br />
min<br />
x<br />
max<br />
U<br />
( x )<br />
L<br />
i<br />
f ( )<br />
max g i ( ) 0 , (5)<br />
s.<br />
t.<br />
U<br />
( x )<br />
X x X<br />
U<br />
B. Topological gradient<br />
Applications <strong>of</strong> topological shape optimization to the<br />
design <strong>of</strong> electromagnetic devices are relatively new [15]<br />
[16]. Unlike the classical shape optimization for which<br />
only the size and boundary <strong>of</strong> the design object is<br />
allowed to vary, in a topological shape design process,<br />
the topology <strong>of</strong> the domain can change as well, for<br />
instance, by drilling an air hole in the domain or filling<br />
this hole with a different material than the rest <strong>of</strong> the<br />
domain.<br />
The topological gradient (TG) is defined as the<br />
derivative <strong>of</strong> an objective function with respect to an<br />
infinitely small hole Q as:<br />
obj<br />
( \ Q(<br />
x,<br />
r))<br />
obj<br />
TG ( x)<br />
lim<br />
. (6)<br />
r<br />
0 ( )<br />
where is an arbitrary objective function, x is the center<br />
<strong>of</strong> the hole Q, r is the radius <strong>of</strong> Q, \Q(x,r) is the new<br />
topology after the small hole is present and () is the<br />
volume change <strong>of</strong> the domain , which is the volume <strong>of</strong><br />
Q but with a negative sign. Thus a positive value <strong>of</strong><br />
TG(x) means a negative change <strong>of</strong> the values <strong>of</strong> the<br />
objective function after the small hole Q is created.<br />
Hence the topological gradient can provide information<br />
on whether a topology change (creating a small hole in<br />
the system) will result in a decrease <strong>of</strong> the objective<br />
function.<br />
Now we can define a robust objective function based<br />
on the worst performance <strong>of</strong> a non-robust function J due<br />
to the perturbation <strong>of</strong> the design variable x as:<br />
f<br />
w<br />
J ( ) , (7)<br />
max<br />
U<br />
( x )<br />
and U(x) is the uncertainty set similar to (2), while in a<br />
topological design using TG, the design variables are the<br />
three dimensional coordinates, x, <strong>of</strong> the center <strong>of</strong> the<br />
potential topology change. Hence n = 3 and the vector <br />
= {1 2 3} represents the largest variation to the<br />
nominal value <strong>of</strong> x <strong>of</strong> the three dimensional coordinates.<br />
This robust objective function fw can be easily estimated<br />
using the worst vertex <strong>of</strong> the rectangular uncertainty set<br />
U.<br />
If a topology change is taking place in the design<br />
domain , the scalar objective function J can be<br />
approximated near the point using a first order local<br />
expansion, as:<br />
2<br />
J ( ) (<br />
\ Q(<br />
, r))<br />
(<br />
)<br />
TG(<br />
) <br />
( r)<br />
o(<br />
r ) .(8)<br />
Since () and (r) are both constants with respect to<br />
(note that (r) is the volume <strong>of</strong> Q with a negative sign),<br />
J() has the largest value where TG() is the smallest.<br />
Therefore, the worst performance <strong>of</strong> J due to the<br />
perturbation <strong>of</strong> the design variables x is determined by<br />
the point in the uncertainty set, where TG() has the<br />
smallest value. Hence we can obtain a robust topological<br />
gradient as:
TG<br />
R<br />
( x)<br />
min TG ( ) . (9)<br />
U<br />
( x )<br />
Figure 1 is used to illustrate the robustness <strong>of</strong> a<br />
topology. There exist two areas for a potential<br />
topological change in the design domain . However, the<br />
first area, which has the highest TG value, is close to a<br />
large area which has the lowest TG value, i.e. the TG<br />
value drops drastically in the neighborhood <strong>of</strong> the first<br />
area. Therefore the second area, which has the second<br />
largest TG values, is superior to the first one for a<br />
topological change in terms <strong>of</strong> the topological robustness.<br />
This is, in fact, equivalent to the robust topological<br />
design using second-order sensitivity analysis.<br />
Figure 1. Robustness <strong>of</strong> topology<br />
C. Worst vertex prediction using sensitivity<br />
In robust topology optimization, first, we use the<br />
robust TG to determine the topology change in the<br />
problem domain. After we “drilled” a hole in the system,<br />
the boundary <strong>of</strong> the hole is parameterized and is<br />
optimized using a shape optimizer. Thus a new<br />
uncertainty set is defined for the new design variables,<br />
which are the coordinates <strong>of</strong> the controlling points on the<br />
boundary <strong>of</strong> the hole. However, the robust objective<br />
function remains the same through the entire design<br />
process.<br />
In order to find the worst vertex, we can use the<br />
information <strong>of</strong> the gradient computed at the point x. The<br />
following figure gives an example <strong>of</strong> the worst vertex<br />
prediction in R 2 , where the opposite direction <strong>of</strong> the<br />
gradient <strong>of</strong> the objective function points to the worst<br />
vertex <strong>of</strong> the uncertainty set.<br />
- 162 - 15th IGTE Symposium 2012<br />
Figure 2. Worst vertex prediction using gradient<br />
D. Algorithm<br />
Finally, an algorithm for robust topology optimization<br />
based on topological shape optimization is described as<br />
follows:<br />
1. Set the iteration number k =0.<br />
2. Calculate the robust topological gradient TGR at the<br />
center <strong>of</strong> each element.<br />
3. Define the new domain k where the topology<br />
changes take place by removing the material in the<br />
elements where TGR is greater than zero.<br />
4. Apply standard shape optimization method with a<br />
robust objective function to determine the shape <strong>of</strong> the<br />
boundary.<br />
5. Check convergence and exit if the optimality<br />
condition is satisfied.<br />
6. Set k=k+1 and go to 2.<br />
Note that uncertainties related to both topology and<br />
shape are being handled throughout the entire design<br />
process.<br />
III. COMPETITIVE CO-EVOLUTIONARY ALGORITHMS<br />
Co-evolutionary algorithms are suitable for solving<br />
minimax optimization problems. The robust design<br />
formulation based on the worst case analysis defined in<br />
(1) can be generalized as<br />
min max f ( x,<br />
u)<br />
, (10)<br />
xX uU<br />
where f(·,·) is an objective or fitness function, x is a<br />
vector <strong>of</strong> the design variables and u is a vector <strong>of</strong> the<br />
uncertainty variables. Equation (1) is a special case <strong>of</strong><br />
(10) where = x + u. This formulation presents a<br />
competitive relationship between the two players, where<br />
the leader selects a value in X and the follower chooses a<br />
value in U in correspondence.<br />
In a worst case analysis based robust design, the<br />
system seeks for the best design under its worst case<br />
scenario. Thus it is possible to decompose this design<br />
process into two tasks, to find the design with the best<br />
performance and to find the worst performance <strong>of</strong> a
design subject to small variations <strong>of</strong> the design<br />
parameters. Similarly, in a competitive co-evolutionary<br />
algorithm, one population competes with the other<br />
leading to an “arms race”. The first population (denoted<br />
as population A in the rest <strong>of</strong> the paper) represents<br />
candidate solutions in the design space, which are<br />
evolving to minimize the objective; while the second<br />
population (population B) represents disturbances in U,<br />
an uncertainty set applied to the design variables in order<br />
to maximize the objective function. In other words, the<br />
former population provides a solution and the second<br />
population tries to attack the solution in the worst case<br />
scenario. Through the evolutions <strong>of</strong> the two populations,<br />
a more robust solution, i.e. with the best possible worst<br />
case performance, can be found after several generations.<br />
Therefore, this competitive model will guide the<br />
evolution towards robust solutions.<br />
A. Fitness computation<br />
In co-evolutionary algorithms, the two populations, A<br />
and B, evolve independently, but the fitness evaluations<br />
<strong>of</strong> the populations are related to each other. The fitness <strong>of</strong><br />
an individual in one population is evaluated against<br />
values <strong>of</strong> the individuals from the other population. For<br />
instance, the fitness <strong>of</strong> an individual in population A is<br />
defined as,<br />
F( x)<br />
f ( x,<br />
u*)<br />
, (11)<br />
where u* is the current best solution from population B.<br />
The goal <strong>of</strong> evolution <strong>of</strong> population A is to minimize the<br />
fitness function F(x).<br />
The fitness <strong>of</strong> an individual in population B is assigned<br />
against the value <strong>of</strong> the current best individual x* in<br />
population A. Therefore the fitness function for<br />
population B is defined as,<br />
G( u)<br />
f ( x*,<br />
u)<br />
. (12)<br />
The goal <strong>of</strong> evolution <strong>of</strong> population B is to maximize the<br />
fitness function G(u).<br />
B. Alternating co-evolutionary GA<br />
One typical co-evolutionary approach that can be<br />
applied to the continuous minimax problem is called the<br />
alternating co-evolutionary GA (ACGA). Figure 3 shows<br />
a diagram <strong>of</strong> the ACGA. The two populations (A and B)<br />
are initialized randomly. The finesses <strong>of</strong> the individuals<br />
in one population are evaluated against the other<br />
population using the functions defined in (11) and (12).<br />
For instance, after the initialization <strong>of</strong> population A (i.e. a<br />
set <strong>of</strong> random values is assigned to the design variables x<br />
between the lower bound and the upper bound), the<br />
algorithm fixes the values <strong>of</strong> the uncertainty variables u<br />
and evolves population A for several generations to<br />
minimize the fitness function F. Then the algorithm<br />
switches to the evolution <strong>of</strong> population B, while the<br />
values <strong>of</strong> the design variables archived for the best fitness<br />
are kept and the values <strong>of</strong> the uncertainty variables are<br />
- 163 - 15th IGTE Symposium 2012<br />
updated towards the maximization <strong>of</strong> the fitness function<br />
G. This alternating process repeats until the stopping<br />
criterion is met, e.g. the maximum number <strong>of</strong><br />
generations.<br />
Figure 3 Alternating Co-Evolutionary GA<br />
In the algorithm implemented in this paper, both<br />
populations have a total <strong>of</strong> = 100 individuals each. At<br />
the beginning <strong>of</strong> the execution, these individuals are<br />
randomly generated, respecting the bounds. The<br />
candidate solutions are represented by a one-dimensional<br />
array <strong>of</strong> real values. An individual has three genes<br />
representing the values <strong>of</strong> the design variables or the<br />
uncertainty variables. Individuals are selected for<br />
reproduction by means <strong>of</strong> a binary tournament where two<br />
individuals are randomly selected and their fitness values<br />
are compared, and that individual with the best fitness is<br />
selected for reproduction.<br />
The crossover operator used in the algorithm is a<br />
combination <strong>of</strong> an extrapolation method with a one-point<br />
crossover method [17]. Each pair <strong>of</strong> the selected<br />
individuals undergoes crossover with a recombination<br />
rate r = 1.0, and produces two <strong>of</strong>fspring. The operator<br />
performs a blend crossover <strong>of</strong> the gene at the crossing<br />
point, with a random factor within the interval [0, 1].<br />
After crossover, a mutation operator is applied to the<br />
<strong>of</strong>fspring, with a mutation rate m = 0.2. The mutation<br />
operator is very simple. If a gene is under mutation, the<br />
algorithm randomly generates a new real value within the<br />
bounds. The genetic algorithm implemented is<br />
generational, i.e. all <strong>of</strong>fspring replace their parents in the<br />
next generation.<br />
All the <strong>of</strong>fspring are then evaluated and the best<br />
individual is stored and is used as a population<br />
representative and passed as argument for the opponent’s<br />
evaluation, as described in equations (11) and (12).
The algorithm runs for a maximum <strong>of</strong> 100 generations<br />
and returns the best stored pair (x, u).<br />
C. Parallel co-evolutionary GA<br />
The parallel co-evolutionary GA (PCGA), shown in<br />
figure 4, is very similar to the ACGA, except that the two<br />
competitive populations evolve simultaneously. As a<br />
parallel model, this can be implemented easily for a<br />
parallel computing environment and the computational<br />
time will be reduced to half in theory.<br />
Figure 4 Parallel Co-Evolutionary GA<br />
Several other methods <strong>of</strong> co-evolutionary algorithms<br />
using different schemes <strong>of</strong> the fitness assignment can be<br />
seen in [18]–[20]. In this paper, the alternating coevolutionary<br />
GA is used to solve the robust design<br />
problem.<br />
IV. RESULTS<br />
Figure 5 A simulation model <strong>of</strong> an IPM motor.<br />
- 164 - 15th IGTE Symposium 2012<br />
A. Numerical model<br />
Figure 5 shows a quarter <strong>of</strong> a 3-phase 4-pole IPM<br />
machine. The quarter <strong>of</strong> the rotor core has one slot in the<br />
center and the rest <strong>of</strong> the core is made <strong>of</strong> steel. A<br />
permanent magnet bar made <strong>of</strong> NdFeB magnet is inserted<br />
in the center <strong>of</strong> the slot. The goal <strong>of</strong> the design is to find<br />
the optimal shape <strong>of</strong> the permanent magnet bar and the<br />
flux barriers which minimize the torque ripples <strong>of</strong> this<br />
motor, while maintaining an adequate average torque.<br />
The objective function can be defined, without<br />
considering the manufacturing uncertainties, as,<br />
Ti<br />
Tavg<br />
minmax<br />
F ( )<br />
x u<br />
i Tavg<br />
. (13)<br />
s.<br />
t.<br />
minT<br />
0.<br />
4Nm<br />
u<br />
avg<br />
The design variables chosen for the optimization are the<br />
length <strong>of</strong> the permanent magnet, L, the width <strong>of</strong> the<br />
permanent magnet, h and the distance from the<br />
permanent magnet to the surface <strong>of</strong> the rotor, d.<br />
This numerical model is solved using a 2-D nonlinear<br />
finite element solver (MagNet [21]). At each iteration <strong>of</strong><br />
the optimization, torques are evaluated at different<br />
positions <strong>of</strong> the rotor. The rotor mesh is regenerated after<br />
a new geometry <strong>of</strong> the rotor is archived during the<br />
optimization process.<br />
B. Results from RTO<br />
The robust topological optimization method is applied<br />
to a rotor core filled only with iron [22]. The topological<br />
gradient is evaluated in the design region in order to find<br />
potential topological changes which reduce the value <strong>of</strong><br />
the cost function. The permanent magnet and air<br />
materials are created in the region according to the TG<br />
values, as shown in figure 6.<br />
Figure 6 Topology <strong>of</strong> the rotor generated by RTO<br />
This shows a rough topology with one permanent<br />
magnet block and two air flux barriers <strong>of</strong> the design,<br />
which serves as the starting point <strong>of</strong> the shape<br />
optimization process. The system then optimizes the<br />
shape <strong>of</strong> the boundaries between different materials in<br />
order to achieve more accurate values <strong>of</strong> the geometries.<br />
The value <strong>of</strong> the design variables <strong>of</strong> the robust optimal is:<br />
H = 1.616 mm, L = 19.879 mm and d = 12.243 mm.
C. Results from ACGA<br />
It is not practical to combine the topology optimization<br />
with the alternating co-evolutionary GA due to the huge<br />
computational cost. Thus the ACGA is applied to the<br />
model shown in figure 5 to find the robust optimal values<br />
<strong>of</strong> the design variables. The manufacturing tolerances <strong>of</strong><br />
the design variables are considered as the uncertainties <strong>of</strong><br />
the problem. The optimal results, from the robust<br />
formulation, are given in table I.<br />
TABLE I<br />
VALUES OF DESIGN VARIABLES OF THE NOMINAL AND ROBUST OPTIMA<br />
Design<br />
variables<br />
Unit Nominal<br />
optimal<br />
Robust<br />
Optimal<br />
(by RTO)<br />
Robust<br />
Optimal<br />
(by CGGA)<br />
H mm 1.588 1.616 1.453<br />
L mm 18.33 19.879 19.345<br />
D mm 13.426 12.243 12.695<br />
Nominal<br />
Performance<br />
Nm 0.2358 0.2583 0.2547<br />
Worst<br />
Performance<br />
Nm 0.2709 0.2791 0.3039<br />
Feasibility<br />
robustness<br />
No Yes Yes<br />
Table 1 shows the values <strong>of</strong> the non-robust optimal and<br />
the robust optimal. The worst cases <strong>of</strong> the performances<br />
are evaluated. The uncertainty is set to be 5% <strong>of</strong> the<br />
design variables.<br />
V. CONCLUSION<br />
This paper discusses robust design issues and<br />
formulations for IPM design problems. Two different<br />
methods have been applied, and they can both find robust<br />
solutions for the problem.<br />
The robust topology optimization method employs a<br />
deterministic search based on the topological gradient<br />
and the shape sensitivity. The algorithm requires two<br />
FEM solutions per evaluation <strong>of</strong> the robust objective<br />
function (one FEM solution for the nominal cost function<br />
value and sensitivity calculation, and one for the worst<br />
performance). In the deterministic search, the maximum<br />
number <strong>of</strong> objective function evaluation is set to 200 and<br />
the total time <strong>of</strong> execution is around a few hours. Thus<br />
the method is very efficient and fast to converge.<br />
However, the robust objective function defined in [1] is<br />
not necessarily partially differentiable and this may pose<br />
some difficulties to the optimization. Also, convexity <strong>of</strong><br />
the objective function is not guaranteed, thus the worst<br />
performance point may be found inside the uncertainty<br />
set instead <strong>of</strong> on the corner. The worst performance<br />
prediction is only an approximation.<br />
On the other hand, the co-evolutionary GA does not<br />
rely on the sensitivity information. The algorithm<br />
maintains a population <strong>of</strong> the uncertainty variables and<br />
seeks for the exact worst performance point in the<br />
uncertainty set. The algorithm maintains two populations<br />
with 100 individuals for each population. The maximum<br />
number <strong>of</strong> generations <strong>of</strong> evolution is set to 100. The coevolutionary<br />
GA requires a total number <strong>of</strong> 20000 FEM<br />
- 165 - 15th IGTE Symposium 2012<br />
solutions and the total execution time for the algorithm is<br />
around 5 days. Although a huge computation time is<br />
required for the co-evolutionary GA, this algorithm is<br />
parallelizable, thus the time may be reduced by choosing<br />
an appropriate scheme <strong>of</strong> parallel computing. Also,<br />
depending on the nature <strong>of</strong> the optimization problems,<br />
some modifications can be applied to the GA to reduce<br />
the number <strong>of</strong> the function evaluations.<br />
[1]<br />
REFERENCES<br />
H. T.Wang, Z. J. Liu, S. X. Chen, and J. P.Yang, “Application <strong>of</strong><br />
Taguchi method to robust design <strong>of</strong> BLDC motor performance,”<br />
IEEE Trans.Magn., vol. 35, pp. 3700–3702, Sept. 1999.<br />
[2] Y. Sang-Baeck, et al., "Robust shape optimization <strong>of</strong><br />
[3]<br />
electromechanical devices," Magnetics, IEEE Transactions on,<br />
vol. 35, pp. 1710-1713, 1999.<br />
C. M. Piergiorgio Alotto, Werner Renhart, Andreas Weber, Gerald<br />
Steiner, "Robust target functions in electromagnetic design,"<br />
COMPEL: The International Journal for Computation and<br />
Mathematics in Electrical and Electronic Engineering, vol. 22, pp.<br />
549 - 560, 2003.<br />
[4] G. Steiner, et al., "Managing uncertainties in electromagnetic<br />
design problems with robust optimization," Magnetics, IEEE<br />
Transactions on, vol. 40, pp. 1094-1099, 2004<br />
[5] F. G. Guimaraes, et al., "Multiobjective approaches for robust<br />
electromagnetic design," Magnetics, IEEE Transactions on, vol.<br />
42, pp. 1207-1210, 2006.<br />
[6] J. S. Han and B. M. Kwak, "Robust optimization using a gradient<br />
index: MEMS applications," Structural and Multidisciplinary<br />
Optimization, vol. 27, pp. 469-478, 2004.<br />
[7] K. Nam-Kyung, et al., "Robust Optimization Utilizing the Second-<br />
Order Design Sensitivity Information," Magnetics, IEEE<br />
[8]<br />
Transactions on, vol. 46, pp. 3117-3120, 2010.<br />
Min Li, David A. Lowther, "A robust objective function for<br />
topology optimization", COMPEL: The International Journal for<br />
Computation and Mathematics in Electrical and Electronic<br />
Engineering, Vol. 30 Iss: 6, pp.1829 – 1841, 2011<br />
[9] G. Spagnuolo, "Worst case tolerance design <strong>of</strong> magnetic devices<br />
by evolutionary algorithms," Magnetics, IEEE Transactions on,<br />
vol. 39, pp. 2170-2178, 2003.<br />
[10] M. Ci<strong>of</strong>fi, et al., "Stochastic handling <strong>of</strong> tolerances in robust<br />
magnets design," Magnetics, IEEE Transactions on, vol. 40, pp.<br />
1252-1255, 2004.<br />
[11] H. J. C. Barbosa, A coevolutionary genetic algorithm for<br />
constrained optimization. <strong>Proceedings</strong> <strong>of</strong> the 1999 Congress on<br />
Evolutionary Computation, CEC 99. vol. 3, 1999.<br />
[12] J. Kim, Co-evolutionary computation for constrained min-max<br />
problems and its applications for pursuit-evasion games.<br />
<strong>Proceedings</strong> <strong>of</strong> the IEEE Congress on Evolutionary Computation,<br />
CEC 2001, vol. 2, pp. 1205-1212, 2001.<br />
[13] J. M. Claverie, Robust nonlinear control design using competitive<br />
coevolution, <strong>Proceedings</strong> <strong>of</strong> the IEEE Congress on Evolutionary<br />
Computation, CEC 2000, vol. 1, pp. 403-409, 2000.<br />
[14] A. M. Cramer, et al., "Evolutionary Algorithms for Minimax<br />
Problems in Robust Design," Evolutionary Computation, IEEE<br />
Transactions on, vol. 13, pp. 444-453, 2009.<br />
[15] K. Dong-Hun, et al., "Smooth Boundary Topology Optimization<br />
for Electrostatic Problems Through the Combination <strong>of</strong> Shape and<br />
Topological Design Sensitivities," Magnetics, IEEE Transactions<br />
on, vol. 44, pp. 1002-1005, 2008.<br />
[16] D. H. Kim, et al., "The Implications <strong>of</strong> the Use <strong>of</strong> Composite<br />
Materials in Electromagnetic Device Topology and Shape<br />
Optimization," Magnetics, IEEE Transactions on, vol. 45, pp.<br />
1154-1157, 2009<br />
[17] Haupt, Randy L. Practical genetic algorithms / Randy L. Haupt,<br />
Sue Ellen Haupt.—2nd ed. p. cm. Red. ed. <strong>of</strong>: Practical genetic<br />
algorithms. c1998. “A Wiley-Interscience publication.” ISBN 0-<br />
471-45565-2.<br />
[18] Y. Shi and R. A. Krohling, “Co-evolutionary particle swarm<br />
optimization to solve min-max problems,” in Proc. 2002 Cong.<br />
Evol. Comput., vol. 2, pp. 1682–1687<br />
[19] M. T. Jensen, “A new look at solving minimax problems with<br />
coevolution,” in Applied Optimization, Vol. 86, Metaheuristics:
Computer Decision-Making,M.G. C. Resende and J. Pinho de<br />
Sousa, Eds. Boston, MA: Kluwer, 2004, pp. 369–384.<br />
[20] J. Hur, H. Lee, and M.-J. Tahk, “Parameter robust control design<br />
using bimatrix co-evolution algorithms,” Eng. Optim., vol. 35, no.<br />
4, pp. 417–426, Aug. 2003.<br />
[21] MagNet user’s manual 2012, http://www.infolytica.ca<br />
[22] M. Li, and D. A. Lowther, “Robust Topology Optimization <strong>of</strong> an<br />
IPM Motor using Topological Analysis,” proceeding <strong>of</strong><br />
CompuMag2011, 2011<br />
- 166 - 15th IGTE Symposium 2012
IGTE Symposium, TU <strong>Graz</strong> 2012<br />
- 167 - 15th IGTE Symposium 2012<br />
Free-form Optimization for Magnetic Design<br />
Z. Andjelić 1 , S. Sadović 2<br />
1 ABB Corporate Research, Baden, Switzerland;<br />
2 Sadovic Consulting, Paris, France<br />
E-mail: zoran.andjelic@ch.abb.com<br />
Abstract— The paper presents an approach for free-form optimization <strong>of</strong> the magnetic problems. The approach is based on<br />
the novel simple sensitivity analysis and does not require the calculation <strong>of</strong> the adjoint problem. The solution engine in the<br />
background is IEM. The developed approach is illustrated on some typical benchmark problems.<br />
Index Terms— Free-form optimization, IEM, Sensitivity analysis<br />
Also, in free-form optimization the meshing <strong>of</strong> the<br />
I. INTRODUCTION<br />
analysed objects using mesh generator is performed<br />
When speaking about free-form optimization <strong>of</strong> industrial<br />
only in the first iteration. For all further iterations the<br />
problems we distinguish two different approaches: direct<br />
mesh is updated directly in the Analysis module.<br />
and indirect approach. In direct approach we try to As we use the non-gradient approach, the calculation<br />
minimize the maximal field quantities laying directly on time is much faster than with the gradient approach,<br />
the interface between different media by changing the requiring the costly calculation <strong>of</strong> the gradients.<br />
form <strong>of</strong> those interfaces in the normal direction. Typical As mentioned above we distinguish between the direct<br />
applications are optimization <strong>of</strong> the structural problems and indirect approaches for free-form optimization. One<br />
[1], or dielectric design <strong>of</strong> electrical apparatus [2], [3], <strong>of</strong> the additional key differences between direct and<br />
[4]. In indirect approach we are searching for the indirect approach is that for the optimization problems<br />
prescribed distribution <strong>of</strong> the objective function in the following the direct approach it is not necessary to<br />
space <strong>of</strong> interest by changing the shape <strong>of</strong> the structures calculate any sensitivity function [2], [3]. In the current<br />
outside <strong>of</strong> such space <strong>of</strong> interest. In this paper we discuss contribution we shall focus us on the indirect approach<br />
in more details the second approach, illustrated by some illustrated by some applications in magnetic design. It is<br />
typical benchmark problems.<br />
important to note that the proposed approach is<br />
independent <strong>of</strong> the application class and can be used for<br />
optimization <strong>of</strong> not only magnetic but also dielectric,<br />
acoustic or similar class <strong>of</strong> problems. It also has a generic<br />
character and can be used having FEM or other numerical<br />
method as the numerical engine in the background. In the<br />
present contribution we use IEM (Integral Equation<br />
Method) for the solution <strong>of</strong> the magnetostatic field<br />
problems.<br />
II. FREE-FORM OPTIMIZATION<br />
For automatic shape optimization we follow a nonparametric,<br />
non-gradient approach, which in<br />
combination with IEM (Integral Equation Method)<br />
enables fast and robust optimization <strong>of</strong> the real-world 3D<br />
problems [5]. The main benefits <strong>of</strong> such an approach<br />
comparing to the standard parametric, gradient-based<br />
approaches are:<br />
The applied procedure usually leads to the global<br />
optimum contrary to the parametric optimization<br />
approach where the optimum can be searched only<br />
within the “parametric space” defined by the design<br />
parameters (radii, distances, etc.).<br />
Due to the fact that we don’t need as input any design<br />
parameter it is not necessary to “communicate” with<br />
the CAD system during the optimization iterative<br />
procedure. As shown in Figure 1 the iterative<br />
framework in free-form optimization requires<br />
communication only between Analysis and<br />
Optimization module, whereby by parametric<br />
optimization in each iteration a new set <strong>of</strong> parameters<br />
has to be generated in CAD tool, meshed in mesh<br />
generator and then passed to Analysis module for<br />
further processing.<br />
Figure 1: Free-form vs. parametric optimization framework<br />
III. IEM FORMULATION<br />
The analysis <strong>of</strong> the non-linear problems in<br />
magnetostatic by IEM is performed using the improved<br />
procedure described initially in [6], and more detailed<br />
elaborated recently in [7]. The magnetic field in any space<br />
point can be found as:<br />
J M<br />
H H H<br />
J<br />
where H is a field component produced by the excitation<br />
M<br />
current in free space and H is a field produced by the<br />
magnetic charges. The first field component can be easily<br />
calculated by Bio-Savarot law. For the calculation <strong>of</strong> the<br />
second one we use the formula:<br />
M 1 1<br />
H J 1dSJ N<br />
2dVN<br />
(2)<br />
4 J K 1 dS <br />
4<br />
N 2 2dV<br />
N (2)<br />
4 4<br />
<br />
K<br />
S<br />
J<br />
VN<br />
where J and N are the fictitious surface and volume<br />
magnetic charges, and 1 K and K 2 are the kernels <strong>of</strong> the<br />
3<br />
type r / r . The surface charges are obtained by solving<br />
second Fredholm integral equation:<br />
(1)
IGTE Symposium, TU <strong>Graz</strong> 2012<br />
1 1<br />
2 (3)<br />
<br />
(3)<br />
2 <br />
1<br />
<br />
I<br />
J<br />
J GdS 1 I 2 I I N NGdV 2 2d<br />
N<br />
I 2 2 2<br />
s V N<br />
JGdS 1 2H<br />
n<br />
Here 1 G and 2<br />
12 <br />
<br />
1 2<br />
G are the kernels <strong>of</strong> the type<br />
3<br />
rn/ r and<br />
where the 1 and 2 are the relative<br />
permeabilities <strong>of</strong> the surrounding media and magnetic<br />
materials. When solving the linear problems the last term<br />
on the right-hand side <strong>of</strong> equation (3) is equal to zero.<br />
Here is important to stress some <strong>of</strong> the main features<br />
<strong>of</strong> IEM when solving the non-linear magnetostatic<br />
problem. In spite <strong>of</strong> the fact that it is necessary to mesh<br />
the volume <strong>of</strong> the non-linear magnetic parts, the number<br />
<strong>of</strong> unknowns for the non-linear problem is same as the<br />
number <strong>of</strong> unknowns for the linear one. This is due to the<br />
fact that the non-linear contribution - second term on the<br />
right-hand side <strong>of</strong> (3) - appears just as the correction term<br />
and is calculated throughout the iteration procedure from<br />
the previous iteration. Also, as the material parameter<br />
appears only in the diagonal term, the matrix<br />
calculation is performed only during the first iteration. In<br />
other iterations only the diagonal term is changed together<br />
with the RHS term taking into account the contributions<br />
due to volume charges.<br />
IV. INDIRECT APPROACH<br />
Contrary to the direct approach where it is not necessary<br />
to calculate any sensitivity functions, in indirect approach<br />
this function has to be established. To establish such<br />
function we use the analogy to the sensitivity analysis<br />
typically used in the signal-processing (SP) problems. In<br />
SP the objective <strong>of</strong> controller design is to keep the error<br />
between the controlled output and the external input as<br />
small as possible. In signal processing the sensitivity<br />
function S(s) is typically calculated as:<br />
Es ()<br />
Ss () ; Es () Rs () Ys<br />
() (4)<br />
Rs () ds ()<br />
where E(s) is feedback error, R(s) and d(s) are the<br />
external input and disturbance.<br />
To calculate the sensitivity for our optimization tasks we<br />
use the analogy to the above SP scheme. Here we take as<br />
example the quantities from the magnetic problem,<br />
Figure 2.<br />
Figure 2: Sensitivity calculation scheme for optimization<br />
tasks<br />
In magnetic problems the magnetic field in the space<br />
point <strong>of</strong> interest can be calculated using BEM [5] as:<br />
- 168 - 15th IGTE Symposium 2012<br />
1<br />
H(<br />
j) ( i) K( i, j) d<br />
(5)<br />
4 <br />
<br />
Sensitivity <strong>of</strong> changing the field H in the space <strong>of</strong> interest<br />
with the changes <strong>of</strong> the geometry <strong>of</strong> the magnetized body<br />
can then be obtained as:<br />
H H<br />
S <br />
H H<br />
G C<br />
G C<br />
max<br />
In the above case the external input H G is a given i.e.<br />
prescribed (desired) field distribution in the space <strong>of</strong><br />
interest, H C is a calculated field in the same space. The<br />
C<br />
disturbance H is calculated as:<br />
max<br />
C C<br />
max max<br />
(6)<br />
H ( i) max[ H ( i, j), j1, N ]; (7)<br />
The displacement vector D <strong>of</strong> the moving <strong>of</strong> the mesh<br />
nodes can then be calculated as:<br />
D Sn More information on the calculation <strong>of</strong> the sensitivity<br />
function for free-form optimization tasks can be found in<br />
[8].<br />
For illustration the above procedure has been used to<br />
optimize the Die mold problem, Example 1 and Field<br />
homogenization problem, Example 2.<br />
V. EXAMPLE 1: DIE MOLD OPTIMIZATION<br />
This is a TEAM benchmark problem No. 25 used up to<br />
now for the benchmarking <strong>of</strong> the codes dealing with 2D<br />
parametric optimization [9], Figure 3.<br />
Figure 3: TEAM benchmark problem No. 25<br />
Here we use the same model as a 3D problem adding the<br />
extrusion in y-direction <strong>of</strong> 200 mm, Figure 4. The<br />
objective is to obtain the homogeneous radial field<br />
distribution within the cavity shown in Figure 3. One <strong>of</strong><br />
the die molds is keept fix (inner cylinder) and the other<br />
one is in our approach subjected to the free-optimization<br />
process in order to get the radial field distribution in the<br />
j<br />
8
IGTE Symposium, TU <strong>Graz</strong> 2012<br />
cavity. 3D model is shown in Figure 4 and the detailed<br />
2D view in Figure 5.<br />
Figure 4: 3D model <strong>of</strong> the Team problem No. 25<br />
Figure 5: Details <strong>of</strong> the Team problem No. 25<br />
Applying module for free-form optimization governed by<br />
the above given approach for sensitivity calculation we<br />
have after 24 iterations obtained the optimal form <strong>of</strong> the<br />
magnetic poles, Figure 6 (in red).<br />
Figure 6: Outer magnetic mold before and after optimization<br />
Such new form <strong>of</strong> magnetic poles has provided a desired<br />
radial field distribution in the cavity. Figure 7 shows the<br />
form <strong>of</strong> the magnetic poles befor otpimization, after 10 th<br />
iteration and as the optimal form after 24 iterations. The<br />
field vectors illustrate the changes <strong>of</strong> the field during the<br />
optimization process. Only at the end <strong>of</strong> the die molds<br />
some deviations are observed caused by the end-region<br />
field disturbances.<br />
- 169 - 15th IGTE Symposium 2012<br />
Figure 7: Magnetic field homogenization during the<br />
optimization process.<br />
VI. EXAMPLE 2: AIR GAP FIELD HOMOGENIZATION<br />
In this example the objective function is to achieve the<br />
homogeny field distribution over the prescribed space <strong>of</strong><br />
interest lying in the air gap between the magnetic poles,<br />
Figure 8.<br />
Figure 8: Model <strong>of</strong> the magnetic structure<br />
The core is made <strong>of</strong> the material with 1500<br />
, and is<br />
excited by the current-carrying coil with I=12240A.<br />
Before doing any optimization the magnetic field<br />
distribution over the space <strong>of</strong> interest is shown in Figure 9<br />
and Figure 11, a.). The field over the space <strong>of</strong> interest<br />
varies from 35737 A/m to 57555 A/m. As the<br />
optimization objective we define here the desired value <strong>of</strong><br />
the homogeneous field over the space <strong>of</strong> interest<br />
(50x50mm) as H d =50000 A/m. After applying the<br />
optimization modus governed by the above sensitivity<br />
calculation, we have obtained after 37 iterations the<br />
optimal form <strong>of</strong> the magnetic pole shoes that deliver the<br />
desired field distribution within the error less than 10%,<br />
Figure 10.
IGTE Symposium, TU <strong>Graz</strong> 2012<br />
Figure 9: Magnetic field distribution over the space <strong>of</strong><br />
interest before any optimization<br />
Figure 10: Magnetic field distribution over the space <strong>of</strong><br />
interest after 37 iterations. The magnetic poles have changed<br />
the form in order to provide prescribed homogeneous field <strong>of</strong><br />
50000A/m.<br />
Figure 11 shows in more details the field distribution over<br />
the space <strong>of</strong> interest before (a) and after optimization (b).<br />
Figure 11: Detailed view on the field distribution over the<br />
space <strong>of</strong> interest before (a) and after (b) optimization<br />
The field variation for optimal design (with threshold<br />
error <strong>of</strong> 10%) is between Hmin = 46489A/m and<br />
Hmax=55027 A/m.<br />
VII. CONCLUSION<br />
The paper elaborates the procedure for free-form<br />
optimization <strong>of</strong> magnetic problems. The procedure is<br />
- 170 - 15th IGTE Symposium 2012<br />
based on the novel approach for the simple sensitivity<br />
calculation. The proposed approach does not require<br />
calculation <strong>of</strong> the adjoint problem and has a generic<br />
character with respect to both the classes <strong>of</strong> the<br />
application (magnetic, dielectric, acoustics...) and the<br />
numerical methods used within the simulation engine<br />
(BEM, FEM).<br />
REFERENCES<br />
[1] R. Meske: “Non-parametric gradient-less shape optimization in<br />
solid mechanics”, Shaker Verlag,2007, ISBN 978-3-8322-6373-7<br />
[2] Z. Andjelic, S. Sadovic: “Reduction <strong>of</strong> breakdown appearance by<br />
automatic geometry optimization”, IEEE Conf. on El. Insulation<br />
and Dielectric Phenomena, Vancouver BC, Canada, 2007<br />
[3] Z. Andjelic, D. Pusch, T. Schoenemann, S. Sadovic: “Multi-load<br />
optimization in electrical engineering design, Part 1: Simulation,<br />
EngOpt 2008- Int. Conf. on Engineering Optimization, Rio de<br />
Janeiro, Brazil, 01-05. June 2008<br />
[4] Z. Andjelic, S. Sadovic, Jean-Claude Mauroux: “Preventing<br />
breakdown by direct optimization approach”, IEEE Int. Power<br />
Modulator and High Voltage Conf, San Diego, CA-June 3-7,<br />
2012<br />
[5] Z. Andjelic at al: “BEM-based simulations in engineering<br />
design”, In Boundary Element Analysis, Mathematical Aspects<br />
and Applications, Springer Verlag 2007, ISBN: 3-540-47465-X<br />
[6] B. Krstajic, Z. Andjelic, S. Milojkovic, S. Babic, S. Salon:<br />
“Nonlinear 3D magnetostatic field calculation by the integral<br />
equation method with surface and volume magnetic charges”,<br />
IEEE Tran. on Mag., vol.28, No.2, March 199<br />
[7] Z. Andjelic, G. Of, O. Steinbach, P. Urthaler: “Fast BEM for<br />
industrial applications in magnetostatic”, in Lecture Nodes in<br />
Applied and Computational Mechanics, Springer-Verlag, Vol. 63,<br />
2012<br />
[8] Z. Andjelic: “Simple sensitivity approach for optimization tasks in<br />
electrical engineering”, OIPE Workshop, Gent, Belgium, 2012<br />
[9] N. Takahashi, M. Natsumeda, M. Otoshi and K. Muramatsu:<br />
“Examination <strong>of</strong> optimal design method using die press model<br />
(problem 25)”, COMPEL 17 5/6, 1982
- 171 - 15th IGTE Symposium 2012<br />
Optimization for ECT treatment planning<br />
1 P. Di Barba, 3 L.G. Campana, 2 F. Dughiero, 3 C.R. Rossi, 2 E. Sieni<br />
1 Department <strong>of</strong> Industrial and Information Engineering, Pavia <strong>University</strong>, via Ferrata 1, 27100 Pavia (Italy)<br />
2 Department <strong>of</strong> Industrial Engineering, Padova <strong>University</strong>, via Gradenigo 6/A, 35131 Padova (Italy)<br />
3 Melanoma and Sarcoma Unit, Istituto Oncologico Veneto (IOV),Via Gattamelata 64, 35128 Padova (Italy)<br />
E-mail: paolo.dibarba@unipv.it,{fabrizio.dughiero, carlor.rossi, elisabetta.sieni}@unipd.it, luca.campana@ioveneto.it<br />
Abstract—Treatment planning <strong>of</strong> Electrochemotherapy (ECT) is designed by means <strong>of</strong> a genetic multi-objective optimization<br />
method: the needle position maximizing the electric field in the treated volume is searched for. NSGA algorithm is coupled<br />
with penalty function technique in order to identify the constrained Pareto front to select the best compromise solutions and<br />
discard the unfeasible ones.<br />
Index Terms—Electrochemotherapy, conduction field, Finite Element, Pareto front, NSGA.<br />
I. INTRODUCTION<br />
ECT uses pulses <strong>of</strong> electric field in order to improve the<br />
delivery <strong>of</strong> chemotherapeutic drugs into cancer cells [1]-<br />
[2]. A suitable electric field intensity is able to induce cell<br />
membrane permeabilization that improves the<br />
chemotherapy drug delivery. However, a high electric<br />
field intensity in healthy tissues, and in some critical<br />
regions like e.g. large vessels, is to be prevented. A<br />
conduction electric field is applied to tumor tissues by<br />
means <strong>of</strong> needle electrodes suitably positioned in the<br />
target volume. In order to improve the therapy success,<br />
the positioning <strong>of</strong> electrodes is considered an<br />
optimization problem. The research group in Ljubljana<br />
<strong>University</strong> has proposed some solutions to optimal<br />
electrode positioning in deep-seated tumor like in [3-6].<br />
In this paper a multiobjective optimization method, based<br />
on a modified NSGA-II algorithm, that includes<br />
constraints and penalty functions in order to prevent<br />
unfeasible solutions, is proposed for the optimal<br />
positioning <strong>of</strong> needles in the tumor mass [7-10]. The<br />
optimization problem is solved using a 2D model <strong>of</strong><br />
steady conduction field.<br />
II. CLINICAL ECT<br />
ECT is a medical therapy based on cell electroporation<br />
for patients with cutaneous and subcutaneous tumor<br />
nodules on the basis <strong>of</strong> the synergistic association <strong>of</strong><br />
locally applied brief electrical currents (reversible<br />
electroporation) and low permeant anticancer agents [11-<br />
13]. Electroporation is a local electric treatment that uses<br />
a physical behavior <strong>of</strong> cells when a pulsed electric field is<br />
applied in order to open some pores on the cell<br />
membrane. Those opening can be used as channels as a<br />
delivery system to enhance the penetration <strong>of</strong> drugs,<br />
genes, or molecular probes into cancer cells. This is an<br />
applied electrical fields with suitable intensity that<br />
increase cell membrane permeability [14-17]. Figure 1<br />
shows the most important phase <strong>of</strong> chemotherapy drug<br />
administration using ECT technique: In the phase I <strong>of</strong> the<br />
treatment the clinician injects the drug (e.g. bleomicine),<br />
then during phase II he applies the electric pulse, and<br />
finally the drug penetrate the membrane cells.<br />
Since its development at the Institute Gustave Roussy,<br />
this technique has been quickly tested in the clinical<br />
setting and recently is entered in the clinical practice [11-<br />
12], [18-21]. At Melanoma and Sarcoma unit <strong>of</strong> the<br />
“Istituto Oncologico Veneto” (IOV) in Padova, Italy,<br />
clinical application <strong>of</strong> ECT using standard electrodes [23]<br />
has shown yet satisfying results [22]. Standard electrodes<br />
are a set <strong>of</strong> 7 needles with a length between 10 and 30<br />
mm on a rigid support, [23]. The ECT equipment<br />
manufacturer produces also a long needles machine that<br />
can be used to treat with ECT some deep-seated tumors<br />
like sarcoma [23-25]. In this case the clinician implants<br />
single 20 cm length electrodes on the tumor mass<br />
accordingly to medical image <strong>of</strong> the tumor and clinical<br />
practice. So, in the case <strong>of</strong> flexible long-needle<br />
equipment, it is <strong>of</strong> interest to improve the therapy success<br />
studying the electric field produced by some<br />
configurations <strong>of</strong> electrodes implanted on the tumor mass<br />
using optimization algorithms.<br />
ECT electrode<br />
E<br />
Skin surface<br />
Figure 1: Description <strong>of</strong> the ECT application.<br />
III. DIRECT PROBLEM: ELECTRIC FIELD ANALYSIS<br />
In general, the case study models three regions: the<br />
tumor, T, with an average radius <strong>of</strong> 3 cm, the<br />
surrounding healthy tissue, H, and a region close to the<br />
treated region that might be a critical one, C. Each<br />
region is attributed the relevant electric conductivity [3].<br />
The needle electrodes are represented as a set <strong>of</strong> nine<br />
points. In particular, the fixed main electrode is located in<br />
the center <strong>of</strong> the lesion whereas the other eight electrodes,<br />
the ones that can be moved, are around the central one.<br />
The ECT process forces a sequence <strong>of</strong> voltages in the<br />
range 1 to 3 kV for each electrode pair. The imposed<br />
voltages represent the boundary conditions <strong>of</strong> the field<br />
problem. Then, the electric field is computed by means <strong>of</strong><br />
the finite-element method (FEM) solving a steady<br />
conduction problem for each electrode pair: specifically,<br />
16 field analyses on the same grid are needed to compute<br />
the electric field for each needle configurations [26]. The
solved equation is:<br />
V<br />
0<br />
(1)<br />
imposing Neumann condition on electric scalar potential<br />
on the domain boundary:<br />
V<br />
n<br />
0<br />
And finally the electric potential has been fixed to a<br />
constant value, U, in two <strong>of</strong> the ne electrodes in the<br />
following way<br />
V U<br />
i<br />
0 V U<br />
(<br />
i,<br />
j)<br />
i j i 1,...<br />
n<br />
j<br />
e<br />
U<br />
i<br />
Then, given an electrode configuration, solving the direct<br />
problem implies to repeat the field analysis, i.e. solving<br />
(1), for all possible (i,j) pairs <strong>of</strong> electrodes.<br />
<br />
H<br />
C<br />
T<br />
Electrode<br />
U i<br />
Main electrode<br />
Figure 1: Geometry <strong>of</strong> the 2D conduction field.<br />
Given all the ne field analyses considering the mesh<br />
nodes <strong>of</strong> each problem region the highest value <strong>of</strong> the<br />
electric field is searched for each node <strong>of</strong> the problem<br />
domain and recorded in sets named Emax(i), one for each<br />
<strong>of</strong> examined region.<br />
IV. INVERSE PROBLEM: OPTIMAL ELECTRODE<br />
POSITIONING<br />
The therapy efficacy depends on the electric field<br />
intensity applied to the cells. In some practical cases the<br />
proximity to a prescribed therapeutic value <strong>of</strong> the<br />
temperature is searched for [27-28], whereas in our case<br />
the overcoming <strong>of</strong> a given electric field threshold is to be<br />
controlled. The ideal configuration <strong>of</strong> needle electrodes is<br />
the one that maximizes the sub-volume <strong>of</strong> the tumor<br />
region covered with an electric field intensity over the<br />
electropermeabilization threshold [3], ERE, and<br />
simultaneously minimizes the volume <strong>of</strong> healthy tissues or<br />
critical organs that have an electric field higher than ERE<br />
[29-30]. Accordingly, the following objective functions,<br />
to be minimized, have been defined:<br />
<br />
<br />
N E ( E E<br />
f1(<br />
E)<br />
100<br />
<br />
1<br />
N E,<br />
tot<br />
RE<br />
) <br />
<br />
<br />
<br />
that represents the complementary sub-volume <strong>of</strong> the<br />
tumor region for which the electric field is under ERE,<br />
evaluated as the number <strong>of</strong> nodes, NE, in which the<br />
U j<br />
(2)<br />
(3)<br />
(4)<br />
- 172 - 15th IGTE Symposium 2012<br />
electric field intensity is higher than ERE. NE,tot is the total<br />
number <strong>of</strong> nodes in which the electric field is evaluated in<br />
the tumor region. The design criterion considered<br />
evaluates the nodes <strong>of</strong> the healthy tissue region or the<br />
region <strong>of</strong> a critical organ (e.g. large vessel), in which the<br />
electric field exceeds a prescribed threshold ETH:<br />
g(<br />
E,<br />
E<br />
TH<br />
)<br />
N ( E E<br />
N<br />
E<br />
TH<br />
100<br />
(5)<br />
E,<br />
tot<br />
)<br />
Starting from (5) three objective functions have been<br />
generated. Namely:<br />
(a) g(<br />
E,<br />
E )<br />
f (6)<br />
2 IRE<br />
in which the threshold <strong>of</strong> electric field is fixed to the<br />
irreversible electroporation value, EIRE = 10 5 V/m, and is<br />
computed in the tumor region T;<br />
(b) g E,<br />
E )<br />
f (7)<br />
3 ( ETH 1<br />
in which the threshold <strong>of</strong> electric field is fixed to the<br />
reversible electroporation, ETH1 = 410 4 V/m, computed<br />
on the healthy tissue H. In this case it is desirable that<br />
the electric field does not exceed the threshold ETH1 in<br />
order to preserve healthy tissue; and finally:<br />
(c) g E,<br />
E )<br />
f (8)<br />
4 ( ETH 2<br />
In this case the electric field cannot exceed the ETH2 = 10 3<br />
V/m in the critical region C to prevent the damage <strong>of</strong><br />
critical organs. Generally this threshold is chosen lower<br />
than electroporation threshold in order to ensure an<br />
electric field lower the ERE.<br />
All the objective functions (4) and (6)-(8) are computed<br />
using the Emax(i) set <strong>of</strong> values in the corresponding<br />
region <strong>of</strong> the computation domain.<br />
A1<br />
A2 A2<br />
Figure 2: f1 and f3 optimization goal.<br />
Accordingly, a sequence <strong>of</strong> bi-objective optimization<br />
problems have been considered and solved: find the<br />
Pareto front minimizing the couple <strong>of</strong> functions (f1, fk)<br />
with k=2,3 and 4 subject to the solution <strong>of</strong> the direct<br />
problem (1) and a set <strong>of</strong> geometrical constraints on the<br />
electrode position. For instance the minimum distance<br />
between two electrodes must be greater than 10 mm.<br />
Constraints have been incorporated in the objectives<br />
functions by means <strong>of</strong> a penalty term as in [7]. For<br />
instance Figure 2 shows the f1 and f3 optimization goal: f1<br />
tends to maximize the area A1, whereas f3 tends to<br />
A1
minimize the area A2. Moreover, Figure 3 shows the<br />
penalty constraint effect: if the non-penalty algorithm is<br />
used, two electrodes can be at a distance lower than the<br />
prescribed minimum (10 mm) like the one marked with a<br />
circle in Figure 3 (a). In contrast, if the penalty algorithm<br />
is used, too near electrodes configuration are discarded<br />
and a possible configuration is like the one in Figure 3<br />
(b).<br />
(a) (b)<br />
Figure 3: Penalty algorithm effect.<br />
V. RESULTS<br />
Results <strong>of</strong> some optimized configurations are here<br />
presented.<br />
A. Case 1: penalty vs non-penalty<br />
The optimization problem considers the electric field in<br />
the tumor region T that must exceed the electroporation<br />
threshold ERE (f1) and the electric field in the healthy<br />
tissue region, T, that must be lower than the<br />
electroporation threshold ETH1=ERE (f3).<br />
In this case, results obtained using penalty algorithm are<br />
compared with results obtained using non-penalty<br />
algorithm. Figure 4 reports the two Pareto fronts obtained<br />
starting from the same initial population and using the<br />
two algorithm: the Pareto front is reshaped.<br />
Figure 4: Pareto Front for the case 1 using penalty and<br />
non-penalty algorithm.<br />
200103 E [V/m]<br />
15010 3<br />
10010 3<br />
5010 3<br />
0,00<br />
Not feasible<br />
Figure 5: Optimized electrodes configurations: Electric<br />
field in the examined region using (a) penaltyand (b) nonpenalty<br />
algorithm.<br />
- 173 - 15th IGTE Symposium 2012<br />
In Figure 5 the highest value <strong>of</strong> the electric field obtained<br />
at each domain point applying the whole sequence <strong>of</strong><br />
electrodes discharges during an ECT treatment is reported<br />
for the tumor region, Emax(T), and the healthy tissue,<br />
Emax(H). The corresponding electrodes position is also<br />
indicated by dots.<br />
A. Case 2: preventing irreversible electroporation<br />
The optimization problem considers the electric field in<br />
the tumor region T that must exceed the electroporation<br />
threshold ERE (f1) and must be lower than the irreversible<br />
threshold, EIRE (f2).<br />
Figure 6 reports the Pareto front obtained starting from an<br />
initial population and using the penalty algorithm.<br />
Figure 6: Pareto Front for the case 2 using penalty<br />
algorithm.<br />
In Figure 7 the highest value <strong>of</strong> the electric field obtained<br />
at each domain point applying the whole sequence <strong>of</strong><br />
electrodes discharges during an ECT treatment is reported<br />
for the tumor region, Emax(T). The corresponding<br />
electrodes position is also indicated by dots.<br />
200103 E [V/m]<br />
15010 3<br />
10010 3<br />
5010 3<br />
0,00<br />
Figure 7: Optimized electrodes configurations: Electric<br />
field in the examined region using penalty algorithm.<br />
In this case electrodes are positioned in the healthy tissue<br />
because the irreversible electroporation is avoided in<br />
order to prevent cells necrosis.<br />
A. Case 3: preserving critical organ (blood vessel)<br />
The optimization problem considers the electric field in<br />
the tumor region T that must exceed the electroporation<br />
threshold ERE (f1) and the electric field in the critical<br />
region, C, that must be lower than the threshold ETH1<br />
(f4).<br />
Figure 8 reports the Pareto front obtained starting from an<br />
initial population and using the penalty algorithm. In<br />
Figure 9 the highest value <strong>of</strong> the electric field obtained at<br />
each domain point applying the whole sequence <strong>of</strong><br />
electrodes discharges during an ECT treatment is reported
for the tumor region, Emax(T) and the critical region,<br />
Emax(C). The corresponding electrodes position is also<br />
marked by black and green dots.<br />
Figure 8: Pareto Front for the case 3 using penalty<br />
algorithm.<br />
200103 E [V/m]<br />
15010 3<br />
10010 3<br />
5010 3<br />
Figure 9: Optimized electrodes configurations: Electric<br />
field in the examined region using penalty algorithm.<br />
In this case the electrode are far from the critical region,<br />
whereas in Figure 5 are close and even inside the critical<br />
region. Then different objective functions allow to<br />
identify different electrodes configurations depending on<br />
the problem constraints.<br />
VI. CONCLUSIONS<br />
The NSGA-II algorithm modified including constraints<br />
and penalty function has been applied to design ECT<br />
electrodes positioning. Various objective function pairs<br />
have been implemented in order to compare results<br />
obtained considering different problem targets.<br />
VII. ACKNOWLEDGES<br />
This project has been developed in the frame <strong>of</strong> a Post-<br />
Doc granted by the Padova <strong>University</strong>, Italy.<br />
REFERENCES<br />
[1] G. Sersa, D. Miklavcic, M. Cemazar, Z. Rudolf, G. Pucihar, M.<br />
Snoj, “Electrochemotherapy in treatment <strong>of</strong> tumours”, European<br />
J. Surgical Oncology, 34 (2), 232–240, 2008.<br />
[2] S. Corovic, A. Zupanic, D. Miklavcic, “Numerical modeling and<br />
optimization <strong>of</strong> electric field distribution in subcutaneous tumor<br />
treated with electrochemotherapy using needle electrodes”, IEEE<br />
Trans. on Plasma Science, 36(4), 1665–1672, 2008.<br />
[3] B. Kos, A. Zupanic, T. Kotnik, M. Snoj, G. Sersa, D. Miklavcic,<br />
“Robustness <strong>of</strong> treatment planning for electrochemotherapy <strong>of</strong><br />
deep-seated tumors”, J. Membrane Biol., 236(1), 147–153, 2010.<br />
[4] D. Sel, A. M. Lebar, D. Miklavcic, “Feasibility <strong>of</strong> employing<br />
model-based optimization <strong>of</strong> pulse amplitude and electrode<br />
distance for effective tumor electropermeabilization”, Biomedical<br />
Engineering, IEEE Trans, 54(5), 773–781, 2007.<br />
[5] D. Miklavcic, M. Snoj, A. Zupanic, B. Kos, M. Cemazar, M.<br />
Kropivnik, M. Bracko, T. Pecnik, E. Gadzijev, G. Sersa,<br />
“Towards treatment planning and treatment <strong>of</strong> deep-seated solid<br />
tumors by electrochemotherapy”, BioMedical Eng. On Line, 9(1),<br />
0,00<br />
- 174 - 15th IGTE Symposium 2012<br />
10, 2010.<br />
[6] A. Županič, S. Čorović, D. Miklavčič, “Optimization <strong>of</strong> electrode<br />
position and electric pulse amplitude in electrochemotherapy”,<br />
Radiology and Oncology, vol. 42(2), 93–101, 2008.<br />
[7] P. Di Barba, L.G. Campana, F. Dughiero, C.R. Rossi, E. Sieni,<br />
“Optimal needle positioning for electrochemotherapy: a<br />
constrained multiobjective strategy”, to appear in Proc. CEFC<br />
2012.<br />
[8] K. Deb, A. Pratap, S. Agarwal, T. Meyarivan, “A fast and elitist<br />
multiobjective genetic algorithm: NSGA-II”, IEEE Trans.<br />
Evolutionary Computation, vol. 6(2), 182–197, 2002.<br />
[9] P. Di Barba, Multiobjective Shape Design in Electricity and<br />
Magnetism. Springer, 2010.<br />
[10] P. Di Barba, F. Dughiero, E. Sieni, “Field synthesis for the<br />
optimal treatment planning in Magnetic Fluid Hyperthermia”,<br />
Archives <strong>of</strong> Electrical Engineering, vol. 61(1), 57–67, 2012.<br />
[11] L.M. Mir, “Terapeutic perspectives <strong>of</strong> in vivo cell<br />
electropermeabilization”, Bioelecrtochemistry, 53, 1–10, 2000.<br />
[12] Belehradek M, Domenge C, Luboinski B, et al.<br />
“Electrochemotherapy, a new antitumor treatment. First clinical<br />
phase I–II trial”, Cancer, 72, 3694–700, 1993.<br />
[13] L.M. Mir, S. Orlowski. “Mechanisms <strong>of</strong> electrochemotherapy”.<br />
AdvDrug Del Rev. 35,107–18, 1999.<br />
[14] C. Chen, S.W. Smye, M.P. Robinson, et al. “Membrane<br />
electroporation theories: a review”, Med Biol Eng Comput. 44, 5–<br />
14, 2006.<br />
[15] S. Somiari, J. Glasspool-Malone, J.J. Drabick, et al. “Theory and<br />
in vivo application <strong>of</strong> electroporative gene delivery”, Mol Ther., 3,<br />
178–87, 2000.<br />
[16] R. Heller, M.J. Jaroszeski, A. Atkin, et al. “In vivo gene<br />
electroinjection and expression in rat liver”, FEBS Lett.,389, 225–<br />
8, 1996.<br />
[17] M. Cemazar, G. Sersa, “Electrotransfer <strong>of</strong> therapeutic molecules<br />
into tissues”, Curr Opin Mol Ther., 9, 554–62, 2007.<br />
[18] B.J. Mossop, R.C. Barr, W. Henshaw, et al. “Electric fields in<br />
tumors exposed to external voltage sources: implication for<br />
electric field mediated drug and gene delivery”, Ann Biochem<br />
Eng., 34, 1564–72, 2006.<br />
[19] A. Gothelf, L.M. Mir, J. Gehl, “Electrochemotherapy: results <strong>of</strong><br />
cancer treatment using enhanced delivery <strong>of</strong> bleomycin by<br />
electroporation”, Cancer Treat Rev., 29, 371–87, 2003.<br />
[20] G. Sersa, “The state-<strong>of</strong>-the-art <strong>of</strong> electrochemotherapy before the<br />
ESOPE study; advantages and clinical use”, EJC Suppl., 4, 52–9,<br />
2006.<br />
[21] M. Snoj, M. Cemazar, T. Srnovrsnik, S. P. Kosir, G. Sersa, “Limb<br />
sparing treatment <strong>of</strong> bleeding melanoma recurrence by<br />
electrochemotherapy”, Tumori, 95(3), 398–402, 2009.<br />
[22] L. Campana, S. Mocellin, M. Basso, O. Puccetti, G. De Salvo, V.<br />
Chiarion-Sileni, A. Vecchiato, L. Corti, C. Rossi, D. Nitti,<br />
“Bleomycin-Based Electrochemotherapy: Clinical Outcome from<br />
a Single Institution’s Experience with 52 Patients”, Annals <strong>of</strong><br />
Surgical Oncology, 16(1), 191–199, 2009.<br />
[23] Cliniporator, Igea: http://www.igea.it (last visited October 2012).<br />
[24] B. Kos, A. Zupanic, T. Kotnik, M. Snoj, G. Sersa, D. Miklavcic,<br />
“Robustness <strong>of</strong> treatment planning for electrochemotherapy <strong>of</strong><br />
deep-seated tumors”, J. Membrane Biol., 236(1), 147–153, 2010.<br />
[25] I. Edhemovic, E. M. Gadzijev, E. Brecelj, D. Miklavcic, B. Kos,<br />
A. Zupanic, B. Mali, T. Jarm, D. Pavliha, M. Marcan, G.<br />
Gasljevic, V. Gorjup, M. Music, T. P. Vavpotic, M. Cemazar, M.<br />
Snoj, G. Sersa, “Electrochemotherapy: a new technological<br />
approach in treatment <strong>of</strong> metastases in the liver”, Technol. Cancer<br />
Res. Treat., vol. 10(5), 475–485, 2011.<br />
[26] Cedrat: http://www.cedrat.com/ (last visited October 2012).<br />
[27] I. M. V. Caminiti, F. Ferraioli, A. Formisano, R. Martone,<br />
“Adaptive Ablation Treatment Based on Impedance Imaging”,<br />
IEEE Tran, Magn., 46(8), 3329–3332, 2010.<br />
[28] I. M. V. Caminiti, F. Ferraioli, A. Formisano, R. Martone, “Three<br />
dimensional optimal current patterns for radi<strong>of</strong>requency ablation<br />
treatments”, COMPEL, 31(3), 985–995, 2012.<br />
[29] P. Neittaanmaki, M. Rudnicki, A. Savini Inverse problems and<br />
optimal design in electricity and magnetism, Oxford Science<br />
Pub., 1996.<br />
[30] P. Di Barba, F. Dughiero, E. Sieni, “Synthesizing Distributions <strong>of</strong><br />
Magnetic Nanoparticles for Clinical Hyperthermia”, IEEE Trans.<br />
Magn., 48(2), 263–266, 2012.
- 175 - 15th IGTE Symposium 2012<br />
Investigation <strong>of</strong> the Electroporation Effect<br />
in a Single Cell<br />
Jaime A. Ramirez ∗ , William P.D. Figueiredo ∗ , Joao Francisco C. Vale ∗ , Isabela D. Metzker ∗ , Rafael G. Santos ∗ ,<br />
Matheus S. de Mattos ∗ Elizabeth R.S. Camargos ∗ , and David A. Lowther †<br />
∗ Federal <strong>University</strong> <strong>of</strong> Minas Gerais, Belo Horizonte, Brazil<br />
† McGill <strong>University</strong>, Montreal, Canada<br />
E-mail: jramirez@ufmg.br<br />
Abstract—This paper investigates the electroporation phenomenon in a single cell exposed to ultra short (μs) and high voltage (kV/m)<br />
electric pulses. The problem is addressed by two complementary approaches. First, numerical simulations based on an asymptotic<br />
approximation derived from the Smoluchowski theory are used to calculate the pore generation, growth and size evolution at the<br />
membrane <strong>of</strong> a spherical cell model, immersed in a suspension medium and consisting <strong>of</strong> cytoplasm and membrane. The numerical<br />
calculations are solved using the finite difference method. Second, an in vitro experiment with LLC-MK2 cells is carried out in which<br />
electroporation was monitored with molecules <strong>of</strong> propidium iodide. This part also comprehended the design and manufacturing <strong>of</strong> a<br />
portable electric pulse generator capable <strong>of</strong> providing rectangular pulses with amplitude <strong>of</strong> 1,000V and duration in the range <strong>of</strong> 1-μs<br />
to 100-μs. The pulse generator is composed <strong>of</strong> three modules: a high voltage dc source, a control module, and an energy storage and<br />
high voltage switching. The numerical simulations considered a 5-μm radius cell submitted to a 500kV/m rectangular electric pulse<br />
for 1-μs. The results indicate the formation <strong>of</strong> ∼3,500 pores at the cell membrane, most <strong>of</strong> them, ∼950, located at poles <strong>of</strong> the cell<br />
aligned to the applied electric pulse, with radii sizes varying from 0.5-nm to 13-nm. The in vitro experiment considered expositon<br />
<strong>of</strong> LLC-MK2 cells to pulses <strong>of</strong> 200V, 500V, and 700V, and 1-μs. Images from fluorescence microscopy exhibit the LLC-MK2 cells<br />
with intense red, a strong evidence <strong>of</strong> the electroporation.<br />
Index Terms—Electroporation, electric fields, finite difference method.<br />
I. INTRODUCTION<br />
Electroporation is the process <strong>of</strong> applying pulsed electric<br />
fields to biological cells to induce the formation <strong>of</strong> transient<br />
“pores” in the cell membrane. Depending on the magnitude<br />
and duration <strong>of</strong> the electric pulse, the membrane may recover<br />
to its original state (the pores reseal) -areversible process;<br />
otherwise, the cell dies - an irreversible process. This phenomenon<br />
was first reported by [1] and is well discussed in the<br />
contributions [2]- [4].<br />
Earlier studies have focused on relatively low external fields,<br />
i.e. less than a kilovolt per centimeter, applied over time periods<br />
ranging from several tens <strong>of</strong> microseconds to milliseconds.<br />
Recently, the use <strong>of</strong> high electric fields (∼ 100kV/cm), or<br />
higher, with pulse durations in the nanosecond range has been<br />
employed and opened a new area <strong>of</strong> research in bioelectrics<br />
[5]. A controlled electroporation process can, therefore, be<br />
used to deliver substances to the cell cytoplasm in a wide range<br />
<strong>of</strong> applications, including gene therapy, drug delivery, nonthermal<br />
inactivation <strong>of</strong> micro-organisms and cancer treatment,<br />
see for instance [4].<br />
From the practical point <strong>of</strong> view, controlling the electroporation<br />
process involves two complementary challenges. First,<br />
a comprehensive simulation analysis is required. The time<br />
dependent electric field, induced at the cell membrane by<br />
the external pulse, need to be obtained. It is this field that<br />
provides the dynamic driving force for the physical process.<br />
In addition, the dynamical evolution <strong>of</strong> the pores at the<br />
cell membrane under the influence <strong>of</strong> this field need to be<br />
adequately treated. Second, a detailed experimental laboratory<br />
test is necessary to confirm the simulation. This involves the<br />
building <strong>of</strong> an electric pulse generator in which the pulse<br />
width and electric field strength are controlled. Moreover,<br />
appropriate microscopy techniques are required to confirm the<br />
pore formation at the cell membrane.<br />
In terms <strong>of</strong> numerical simulations, most works that consider<br />
the dynamics aspects <strong>of</strong> the electroporation phenomenon are<br />
based on the Smoluchowski theory. Krassowska et al. [6]- [9]<br />
employ an asymptotic approximation <strong>of</strong> the Smoluchowski<br />
theory in a single cell model to determine the formation <strong>of</strong><br />
pores in a spherical cell submitted to electric pulses <strong>of</strong> a<br />
few kV/m in the millisecond range. Schoenbach et al. [10]-<br />
[13], [5], use the full equations <strong>of</strong> Smoluchowski theory to<br />
establish the nucleation <strong>of</strong> pores in spherical cell models<br />
exposed to high intensity (thousands <strong>of</strong> kV/m) and ultra short<br />
(nano seconds) electric pulses. The approaches used by both<br />
groups yield acceptable results that are used in a wide range<br />
<strong>of</strong> applications, which in the first case are confirmed indirectly<br />
and in the second case are verified experimentally; however<br />
there is the need to address the electroporation phenomenon<br />
for electric pulses in the micro second range.<br />
This work investigates the electroporation phenomenon in<br />
a single cell when submitted to electric pulses <strong>of</strong> magnitude<br />
in the order <strong>of</strong> 1kV/mm and duration <strong>of</strong> 1-μs. The<br />
phenomenon is addressed by two complementary approaches,<br />
numerical simulations and an in vitro experiment. The Material<br />
and Methods is divided into three subsection. First, the<br />
mathematical modeling <strong>of</strong> electroporation describes how the<br />
numerical simulations can be used to assess the dynamics<br />
<strong>of</strong> the pore formation process, i.e. pore generation, growth<br />
and size-evolution at the cell membrane. This is based on
an asymptotic approximation based on the Smoluchowski<br />
theory and is solved using the finite difference method. The<br />
formulation is capable <strong>of</strong> providing the voltage induced across<br />
the cell membrane and important features for the practical<br />
application <strong>of</strong> electroporation, i.e. the number <strong>of</strong> pores and the<br />
distribution <strong>of</strong> pore radii as a functions <strong>of</strong> time and position<br />
on the cell membrane. A detailed description on how the<br />
numerical calculations are made is also given. Second, the<br />
electric pulse generator subsection discusses the theory used<br />
to design and build the generator. The first module is a high<br />
voltage d.c. source, the second is a control module and the<br />
third is responsible for the energy storage and high voltage<br />
switching. The generator is capable <strong>of</strong> providing retangular<br />
pulses with amplitude <strong>of</strong> 1,000V and duration in the range<br />
<strong>of</strong> 1μs to 100μs, with resting intervals <strong>of</strong> 10μs between<br />
the pulses. Third, the cell culture subsection describes the<br />
procedures used to prepare the LLC-MK2 cells for the in<br />
vitro experiment with molecules <strong>of</strong> propidium iodide. Finally,<br />
the Results presents the numerical analysis in a spherical cell<br />
<strong>of</strong> 5-μm radius, exposed to an electric pulse <strong>of</strong> 500-kV/m,<br />
duration <strong>of</strong> 1-μs; and the in vitro exposition <strong>of</strong> LLC-MK2 cells<br />
to electric pulses <strong>of</strong> <strong>of</strong> 200-kV/m, 500-kV/m, and 700-kV/m,<br />
duration <strong>of</strong> 1-μs, and fluorescence microscopy analysis.<br />
II. MATERIAL AND METHODS<br />
A. Mathematical Modeling <strong>of</strong> Electroporation<br />
For the mathematical description <strong>of</strong> the electroporation at<br />
the cell membrane, let us consider the model consisting <strong>of</strong><br />
a cell in a suspension medium within the parallel plates <strong>of</strong><br />
a cuvette, as indicated in Fig.1. For convenience, the cell<br />
is considered spherical and composed only by a membrane<br />
and cytoplasm, which are characterized by a conductivity and<br />
permittivity (σm, εm) and (σc, εc), respectively. The outer<br />
region, or suspension medium, is also characterized by its<br />
conductiviy and permittivity (σo, εo).<br />
<br />
<br />
<br />
<br />
σ , ε <br />
<br />
Fig. 1. Cell model - not to scale<br />
σ , ε <br />
<br />
<br />
<br />
<br />
σ, ε <br />
- 176 - 15th IGTE Symposium 2012<br />
The dynamics <strong>of</strong> the electroporation process, i.e. behavior<br />
<strong>of</strong> pore generation, growth and size-evolution at the cell membrane,<br />
can be calculated using the continuum Smoluchowski<br />
theory [6], [11], with the following governing equation for the<br />
pore density distribution function n (r, t),<br />
<br />
∂n ∂<br />
+ D −<br />
∂t ∂r<br />
n∂E<br />
<br />
1 ∂n<br />
− = S(r); (1)<br />
∂r kT ∂r<br />
where S(r) is the source, or pore formation term; D is a pore<br />
diffusion constant; r is the pore radius; T is the temperature;<br />
kB is the Boltzmann constant; and E(r) is the energy. This<br />
expression can be simplified by an asymptotic approximation<br />
based on [6]- [9]. This is discussed next.<br />
1) The formation <strong>of</strong> pores: It is assumed that the pores<br />
are hydrophilic and, thus, able to conduct current, and created<br />
with an initial radius r∗ at a rate determined by,<br />
<br />
dn<br />
= αe(Vm/Vep)2 1 −<br />
dt n<br />
<br />
(2)<br />
neqVm<br />
where n(t, θ) is the pore density and neq is the equilibrium<br />
pore density for a given transmembrane voltage Vm,<br />
neq(Vm) =n0e q(Vm/Vep)2<br />
2) The evolution <strong>of</strong> pore radii: The rate <strong>of</strong> change <strong>of</strong> the<br />
pore radii is given by<br />
drj<br />
dt = U(rj,Vm,σeff ),j =1, 2, ..., K (4)<br />
where U is the advection velocity given by<br />
U(r, Vm,σeff )= D<br />
<br />
kT<br />
V 2 mFmax<br />
1+rh/(r + rt) +4β<br />
r∗<br />
r<br />
<br />
4 1<br />
r<br />
+ D<br />
kT [−2πγ +2πσeff r]<br />
The last term represents the effective tension <strong>of</strong> the membrane,<br />
σeff , which is a function <strong>of</strong> Ap, the combined area <strong>of</strong> all pores<br />
existing on the cell,<br />
σeff (Ap) =2σ ′ − 2σ′ − σ0<br />
(6)<br />
1 − Ap/A<br />
where σ0 is the tension <strong>of</strong> the membrane without pores, σ ′ is<br />
the energy per area <strong>of</strong> the hydrocarbon-water interface, Ap =<br />
k j=1 πr2 j<br />
(3)<br />
(5)<br />
, and A is the surface area <strong>of</strong> the cell. In this work<br />
it is assumed that changes <strong>of</strong> cell shape, area, and volume,<br />
can be ignored for microsecond pulses.<br />
3) The voltage in the cell membrane: The voltage in the<br />
cell membrane Vm can be calculated as the difference between<br />
Vi and Ve, i.e. the difference between the potential at the<br />
interfaces <strong>of</strong> the cell membrane, as indicated in the next Fig.2.<br />
Vm(t, θ) =Vi(t, R2,θ) − Ve(t, R1,θ) (7)<br />
The potential Vi at the inner interface (cytoplasm) and Ve<br />
at the outer interface (outer region) <strong>of</strong> the cell membrane can<br />
be obtained by two systems defined by Laplace’s equations,<br />
∇ 2 Vi =0 and ∇ 2 Ve =0 (8)
σ , ε <br />
<br />
σ , ε <br />
<br />
<br />
<br />
σ , ε <br />
Fig. 2. Cell membrane interface - not to scale<br />
The applied electric pulse is imposed as a boundary condition<br />
on the outer region Ve,<br />
Ve(t, r, θ) =−Ercos(θ) as r →∞ (9)<br />
where r is the distance from the center <strong>of</strong> the cell and θ<br />
is the polar angle measured with respect to the direction <strong>of</strong><br />
the applied pulse E. In terms <strong>of</strong> numerical calculation, it is<br />
sufficient to set r ≥ 3R1, i.e. the outer region at least three<br />
times greater than the cell radius. This will be discussed in<br />
detail in the numerical calculation subsection.<br />
The other boundary conditions can be defined for t
TABLE I<br />
PARAMETERS FOR THE CELL NUMERICAL MODEL [9]<br />
Parameter Value<br />
σc(Sm −1 ) 3.0 × 10 −1<br />
σo(Sm −1 ) 3.0 × 10 −1<br />
σm(Sm −1 ) 3.0 × 10 −7<br />
εo(AsV −1 ) 6.4 × 10 −10<br />
εc(AsV −1 ) 6.4 × 10 −10<br />
εm(AsV −1 ) 4.4 × 10 −11<br />
R1(m) 5.0000 × 10 −6<br />
R2(m) 4.9995 × 10 −6<br />
CM (Fm −1 ) 8.8 × 10 −3<br />
n0(m −2 ) 1.5 × 10 9<br />
α(m −2 s −1 ) 1 × 10 9<br />
Vep(V ) 0.258<br />
r∗(m) 0.51 × 10 −9<br />
rm(m) 0.8 × 10 −9<br />
rh(m) 0.97 × 10 −9<br />
rt(m) 0.31 × 10 −9<br />
T (K) 310<br />
q 2.4606<br />
D(m 2 s −1 ) 5 × 10 −14<br />
γ(Jm −1 ) 1.8 × 10 −11<br />
β(J) 1.4 × 10 −19<br />
Fmax(NV −2 ) 0.7 × 10 −9<br />
σ ′ (Jm −2 ) 2 × 10 −2<br />
σ0(Jm −2 ) 1 × 10 −6<br />
Vrest(V ) −0.08<br />
It is capable <strong>of</strong> providing rectangular pulses with amplitude<br />
<strong>of</strong> 1,000V and duration in the range <strong>of</strong> 1μs to100μs with<br />
resting intervals <strong>of</strong> 10μs between the pulses. The modules are<br />
discussed in detail next.<br />
Fig. 4. Pulse generator modules.<br />
1) The high voltage d.c. source: The high voltage source<br />
module is based on [17] and uses a six capacitor doubler<br />
stage, as indicated in Fig.5. It is capable <strong>of</strong> converting a.c.<br />
voltage into d.c. voltage up to 2kV. It uses MUR1560 diodes<br />
and 330μF capacitors. A 1:1 transformer is used to provide<br />
insulation between the electrical grid and the module; in<br />
addition, a variable autotransformer is used to control the a.c.<br />
input and the d.c. output.<br />
2) The control module: The control circuit uses a<br />
PIC18F4550 microcontroller that operates at 12 Mips. The<br />
control driver is based in a push-pull amplifier and composed<br />
<strong>of</strong> a pair <strong>of</strong> Mosfets models ZVN2106a and ZVP2106a, both<br />
- 178 - 15th IGTE Symposium 2012<br />
Fig. 5. High voltage dc source.<br />
capable <strong>of</strong> working with voltages up to 60V and currents<br />
up to 500mA, as indicated in Fig.6. The control driver is<br />
responsible for adjusting the control signal produced by the<br />
microcontroller to the ratings required to fast charge the<br />
capacitances <strong>of</strong> the switching circuit, which can be achieved<br />
by increasing the output voltage and the maximum current.<br />
Fig. 6. Driver circuit.<br />
3) High voltage storage and switching: The high voltage<br />
switching is based on a IGBT (IRGPS60B120KD) that operates<br />
with voltages up to 1,2kV and current pulses up to 240A.<br />
It has a 45ns rise time, 58ns fall time and 400ns turn <strong>of</strong>f<br />
delay that provides pulses with amplitude and width in the<br />
range required in this work. The IGBT is connected in series<br />
to a 73μF capacitor and the load, as indicated in Fig.7. The<br />
capacitor charges through a 1kΩ resistor until it reaches the<br />
same voltage <strong>of</strong> the high voltage source. When the IGBT is<br />
activated, it inverts the voltage on the capacitor and a negative<br />
voltage, that equals the one that charged the capacitor, appears<br />
on the load.<br />
Fig. 7. Switching circuit.
4) The pulse generator: The pulse generator built is shown<br />
in Fig.8. For safety reasons, the high voltage source and the<br />
high voltage storage and switching circuit are inside an acrylic<br />
box; the control module is outside the box.<br />
Fig. 8. Pulse generator.<br />
C. In vitro experiment<br />
1) Cell culture procedures: LLC-MK2 (monkey kidney<br />
epithelial cell line) were maintained in DMEM (Dulbecco’s<br />
modified Eagle’s medium - Invitrogen) with 10% fetal bovine<br />
serum (FBS, Invitrogen), 1% penicillin-streptomycin (Invitrogen)<br />
and 1% glutamine (Sigma-Aldrich) at 37C under 5%<br />
CO2 atmosphere. Cell culture was kept at an optimal density<br />
through weekly passages. Briefly, LLC-MK2 cells were seeded<br />
until 70-90% confluent cell monolayer. To perform subculture,<br />
the cell culture medium was removed and the cells were<br />
rinsed twice with PBS -/-. Trypsin was used to remove<br />
adherent cells (0,5ml/25cm2 surface area). The cells were then<br />
resuspended in 4,5ml <strong>of</strong> fresh serum-containing medium for<br />
trypsin inactivation. Cell viability was assessed directly by<br />
Trypan Blue staining. Cultures were split 1:10 and placed in<br />
a new flask with DMEM 10%.<br />
2) Exposure to electroporation: LLC-MK2 cells were collected<br />
from culture media and suspended in HBS (Hepes<br />
Buffered Saline) at a concentration <strong>of</strong> 2, 6 × 106 cells/ml<br />
in rectangular electroporation cuvettes with 1-mm electrode<br />
separation. All the manipulations were done in sterile condition<br />
in a vertical laminar flow cabinet Veco. Electroporation<br />
was monitored with molecules <strong>of</strong> propidium iodide, 1,5-nm<br />
× 2-nm, (25 μg/ml, Sigma-Aldrich), a fluorochrome that is<br />
excluded from cells with intact membrane.<br />
3) Fluorescence microscopy: Aliquots <strong>of</strong> control and<br />
pulsed cells were placed into glass slides and observed in<br />
Axioplan 2 Zeiss fluorescence microscope (UV emission 630<br />
nm).<br />
III. RESULTS AND DISCUSSION<br />
A. Numerical simulations<br />
The simulations considered a 5-μm radius cell submitted to<br />
a 500kV/m rectangular electric pulse for 1-μs. The results for<br />
the total number <strong>of</strong> pores, number <strong>of</strong> pores at the polarized<br />
poles and the maximum radii <strong>of</strong> the pores are indicated at<br />
- 179 - 15th IGTE Symposium 2012<br />
Figs.9-11. It can be seen from Figs.9-11 that the pore nucleation<br />
starts at approximately 0.8μs, when Vm is approximately<br />
1.25V, an reaches ∼3,500 pores at the cell membrane, most <strong>of</strong><br />
them, ∼950, located at poles <strong>of</strong> the cell aligned to the applied<br />
electric pulse (θ =0 ◦ and θ = 180 ◦ ), with radii sizes varying<br />
from 0.5-nm to 13-nm. After the initial stage, the number <strong>of</strong><br />
pores increases until around 1-μs, when the pulse ends. In the<br />
final stage, the radii <strong>of</strong> the pores decrease very fast but the<br />
number <strong>of</strong> pores stays stable for a longer period. This is in<br />
agreement with the literature, large pores tend to decay faster<br />
but the complete resealing <strong>of</strong> all pores take a longer period.<br />
Fig. 9. Total number <strong>of</strong> pores.<br />
Fig. 10. Number <strong>of</strong> pores in the 1st sector (θ =0 ◦ ) and in the 180th sector<br />
(θ = 180 ◦ ).<br />
B. Experiments<br />
A series <strong>of</strong> in vitro experiments were carried out with the<br />
LLC-MK2 cells following the methodology described in the<br />
previous section. After exposure to 1-μs pulse, the LLC-MK2<br />
cells exhibited intense red fluorescence mostly in the cell<br />
nuclei where propidium iodide binds to double-stranded DNA,<br />
as illustrated in Fig.13. This is a strong evidence that pores<br />
with radii <strong>of</strong> at least 2-nm (the size <strong>of</strong> the propidium iodide<br />
molecule) were created at the cell membrane. The numerical
Fig. 11. Maximum radii evolution.<br />
calculation predicts the creation <strong>of</strong> most pores at poles <strong>of</strong><br />
the cell (θ=180 and θ=0) with radii sizes varying from 0.5nm<br />
to 13-nm; the fluoresce microscopy can only partially<br />
confirm this. Further tests with other microscopy techniques<br />
are required to confirm the region in the cell membrane where<br />
the pores are created, their sizes and how long they last.<br />
Fig. 12. Pulse <strong>of</strong> 500V × 1μs measured at the cuvette.<br />
Fig. 13. Propidium iodide influx into LLC-MK2 cell exposed to pulses <strong>of</strong><br />
1-μs, 200V (a), 500V (b) and 700V (c).<br />
IV. CONCLUSIONS<br />
This paper considered the study <strong>of</strong> the electroporation<br />
phenomenon in a single cell, using numerical simulations and<br />
- 180 - 15th IGTE Symposium 2012<br />
an in vitro experiment. The numerical simulations considered<br />
a5-μm radius cell submitted to a 500kV/m rectangular electric<br />
pulse for 1-μs, which was addressed using an asymptotic<br />
approximation based on the Smoluchowski theory and the<br />
finite difference method. The results indicate the formation<br />
<strong>of</strong> ∼3,500 pores at the cell membrane, most <strong>of</strong> them, ∼950,<br />
located at poles <strong>of</strong> the cell aligned to the applied electric pulse,<br />
with radii sizes varying from 0.5-nm to 13-nm. The in vitro<br />
experiment considered expositon <strong>of</strong> LLC-MK2 cells to electric<br />
pulses <strong>of</strong> 200kV/m, 500kV/m, and 700kV/m, and 1-μs. Images<br />
from fluorescence microscopy confirm the electroporation at<br />
the LLC-MK2 cells. The methodology employed is adequate<br />
to investigate the electroporation phenomenon in simple cells<br />
exposed to electric pulses <strong>of</strong> kV/m and in the μs range.<br />
V. ACKNOWLEDGMENT<br />
This work was supported by CNPq, Brazil, under<br />
Grants 306910/2006-3, 482185/2010-4, 507810/2010-4,<br />
504978/2010-1; by FAPEMIG, Brazil, under Grants Pronex:<br />
TEC 01075/09 and TEC-PPM-489/10; by CAPES, Brazil and<br />
DFAIT, Canada.<br />
REFERENCES<br />
[1] R. Stampfli, “Reversible electrical breakdown <strong>of</strong> the excitable membrane<br />
<strong>of</strong> a Ranvier node,” An. Acad. Brasil. Ciens., vol.30, pp.57-63, 1958.<br />
[2] T.Y. Tsong, “Electroporation <strong>of</strong> cell membrane,” Biophy. J., vol.60,<br />
pp.297-306, 1991.<br />
[3] J.C. Weaver and Y.A. Chizmadzhev, “Theory <strong>of</strong> electroporation: a review,”<br />
Bioelectroch. Bioenergetics, vol.41, pp.135-160, 1996.<br />
[4] T. Kotnik, P. Kramar, G. Pucihar, D. Miklavcic, M. Tarek, “Cell<br />
membrane electroporation. Part 1: The phenomenon,” IEEE Elec. Ins.<br />
Magazine, vol.28, pp.14-23, 2012.<br />
[5] R.P. Joshi and K.H. Schoenbach, “Bioelectric effects <strong>of</strong> intense ultrashort<br />
pulses,” Critic. Rev. in Biom. Eng., vol.38, pp.255-304, 2010.<br />
[6] J.C. Neu and W. Krassowska, “Asymptotic model <strong>of</strong> electroporation,”<br />
Phys. Rev. E, 59:3471-3482, 1999.<br />
[7] K.A. DeBruin and W. Krassowska, “Modeling electroporation in a single<br />
cell. I: Effects <strong>of</strong> field strength and rest potential,” Biophy. J., vol.77,<br />
pp.1213-1224, 1999.<br />
[8] K.C. Smith, J.C. Neu, W. Krassowska, “Model <strong>of</strong> creation and evolution<br />
<strong>of</strong> stable electropores for DNA delivery,” Biophy. J., vol.86, pp.2813-<br />
2826, 2004.<br />
[9] W. Krassowska and P.D. Filev, “Modeling electroporation in a single cell,”<br />
Biophy. J., vol.92, pp.404-417, 2007.<br />
[10] R.P. Joshi and K.H. Schoenbach, “Electroporation dynamics in biological<br />
cells subjected to ultrafast electrical pulses: a numerical simulation<br />
study,” Phys. Review E, vol.62, pp.1025-1033, 2000.<br />
[11] R.P. Joshi, Q. Hu, K.H. Schoenbach, “Dynamical modeling <strong>of</strong> cellular<br />
response to short duration, high intensity electric fields,” IEEE Trans. on<br />
Dielec. Elec. Ins., vol.10, pp.778-787, 2003.<br />
[12] K.H. Schoenbach, R.P. Joshi, J.F. Kolb, N. Chen, M. Stacey, P.F.<br />
Blackmore, E.S. Buescher, S.J. Beebe, “Ultrashort electrical pulses open a<br />
new gateway into biological cells,” <strong>Proceedings</strong> <strong>of</strong> IEEE, vol.92, pp.1122-<br />
1137, 2004.<br />
[13] Q. Hu, R.P. Joshi, K.H. Schoenbach, “Simulations <strong>of</strong> nanopore formation<br />
and phosphatidylserine externalization in lipid membrane subjected to a<br />
high intensity, ultrashort electric pulse,” Phys. Review E, vol.72, 031902,<br />
2005.<br />
[14] J. Mankowski and M. Kristiansen, “A review <strong>of</strong> short pulse generator<br />
technology,” IEEE Trans. on Plasma Science, vol.28, pp.102-108, 2000.<br />
[15] A. Chaney and R. Sundararajan, “Simple mosfet-based high-voltage<br />
nanosecond pulse circuit,” IEEE Trans. on Plasma Science, vol.32,<br />
pp.1919-1924, 2004.<br />
[16] J. R. Grenier and M. Kazerani, “Mosfet-based pulse power supply for<br />
bacterial transformation,” IEEE Trans. on Industry Application, vol.44,<br />
pp.25-31, 2008.<br />
[17] E. Kuffel, W.S. Zaengl, J. Kuffel. High Voltage Engineering Fundamentals.<br />
Second Edition, Butterworth Heinemann, Oxford, UK, 2000.
- 181 - 15th IGTE Symposium 2012<br />
Anisotropic Model for the Numerical<br />
Computation <strong>of</strong> Magnetostriction in<br />
Grain-Oriented Electrical Steel Sheets<br />
M. Kaltenbacher∗ ,A.Volk † , and M. Ertl ‡<br />
∗Institute <strong>of</strong> Mechanics and Mechatronics, Vienna <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Austria<br />
† Department <strong>of</strong> Sensor <strong>Technology</strong>, <strong>University</strong> <strong>of</strong> Erlangen-Nuremberg, Germany<br />
‡ Siemens Energy Sector, Nuremberg, Germany<br />
E-mail: manfred.kaltenbacher@tuwien.ac.at<br />
Abstract—We present a recently developed physical model for magnetostriction in transformer cores and its efficient<br />
numerical computation by applying the Finite Element (FE) method. Thereby, we fully take the anisotropic behavior <strong>of</strong> the<br />
material into account, both in the computation <strong>of</strong> the nonlinear electromagnetic field as well as the induced magnetostrictive<br />
strains. Numerical computations demonstrate the importance <strong>of</strong> modeling the anisotropy <strong>of</strong> grain oriented electrical steel<br />
sheets as used in electric transformers. Both the magnetic field along the joint regions, and furthermore the mechanical<br />
vibrations especially in thickness direction differ strongly as compared to computations with an isotropic material model.<br />
Index Terms—magnetostriction, finite element method, anisotropic material behavior, nonlinearity<br />
I. INTRODUCTION<br />
Magnetostrictive materials are widely used for actuator<br />
and sensor applications. However, <strong>of</strong>ten the magnetostrictive<br />
behavior <strong>of</strong> these alloys is an undesirable effect, as<br />
e.g. in electric machines and transformers, where it is one<br />
<strong>of</strong> the main sources for noise generation. Unfortunately,<br />
these materials exhibit nonlinear behavior for the magnetic<br />
properties as well as the mechanical characteristics<br />
leading to the well-known magnetic hysteresis loop and<br />
the magnetostrictive hysteresis loop (so-called butterfly<br />
curve), respectively (see, e.g., [1], [2], [3]). A quite<br />
important aspect – especially for grain-oriented electrical<br />
steel as used in transformers – is the anisotropic material<br />
behavior both concerning the magnetic properties as well<br />
as the induced mechanical strains [4].<br />
The modeling <strong>of</strong> magnetostrictive effects is a topic <strong>of</strong><br />
intensive research. Among the huge amount <strong>of</strong> publications<br />
one can find three main approaches. The first one,<br />
which is widely used, is based on introducing a magnetostrictive<br />
strain tensor, where the entries depend on the<br />
magnetic induction (see, e.g., [5], [1]). Thereby, these<br />
additional strains result in mechanical forces modeled as<br />
a right hand side term in the partial differential equation<br />
(PDE) for mechanics. In a second approach, a free<br />
energy as a tensor function depending on the mechanical<br />
strain and magnetic induction is used (see, e.g., [6], [7]).<br />
Thereby, a fully coupled constitutive relation between<br />
mechanical and magnetic quantities is achieved. The last<br />
approach is based on a thermodynamic consistent model,<br />
where the mechanical strain and magnetic induction is<br />
decomposed in a reversible and an irreversible part [2],<br />
[3]. Furthermore, the full constitutive model is based on a<br />
free energy function. Whereas in [2] the irreversible part<br />
is modeled by a switching criterion using inner variables,<br />
[3] uses hysteresis operators. Common to all models<br />
is the current restriction to isotropic and / or uniaxial<br />
behavior.<br />
Our goal is the precise investigation <strong>of</strong> the magnetic<br />
field and resulting mechanical vibrations caused by magnetostriction<br />
along the joint regions <strong>of</strong> electric transformers.<br />
Therefore, we cannot apply any homogenization<br />
technique and fully resolve each individual steel sheet.<br />
This is clearly not possible for a whole transformer<br />
core, and so we restrict our investigation to some few<br />
steel sheets. To reduce the complexity, we choose an<br />
ansatz, in which we neglect the reaction <strong>of</strong> the mechanical<br />
stresses and strains on the magnetic properties and<br />
therefore decouple the computation <strong>of</strong> the magnetic and<br />
mechanical field. By help <strong>of</strong> an Epstein frame and a SST<br />
(Single Sheet Tester), we measure the magnetic as well<br />
as the mechanical hysteresis curves <strong>of</strong> the grain-oriented<br />
electrical steel sheets with different orientations (w.r.t the<br />
rolling direction). From these curves we then extract for<br />
each orientation the corresponding commutation curve,<br />
so that the hysteretic behavior is simplified to a nonlinear<br />
one. This approach is then applied to a stack <strong>of</strong> six<br />
electrical steel sheets with a 90 o joint region, excited<br />
by two current loaded coils. We compare this anisotropic<br />
model to an isotopic one, where the nonlinear magnetic<br />
and mechanical material parameter are just used from the<br />
rolling direction.<br />
The rest <strong>of</strong> the paper is organized as follows. In Sec.<br />
II we describe our physical model and its in-cooperation<br />
into the magnetic and mechanical PDE as well as their Finite<br />
Element (FE) formulation. The measurement setups,<br />
which provide us the nonlinear curves, are discussed in<br />
Sec. III. In Sec. IV the numerical results are presented,<br />
demonstrating the importance <strong>of</strong> taking anisotropy for<br />
grain-oriented electrical steel sheets as used in transformers<br />
into account. Finally, Sec. V summarizes our<br />
achievements.
II. PHYSICAL MODELING AND FE DISCRETIZATION<br />
Magnetostrictive materials are characterized by the<br />
magnetic hysteresis between the magnetic induction B<br />
and magnetic field intensity H as well as the mechanical<br />
hysteresis between the mechanical strain S and magnetic<br />
induction B (see Fig. 1).<br />
Fig. 1. Magnetic and mechanical hysteresis (butterfly curve).<br />
According to a thermodynamically consistent model,<br />
we decompose the physical quantities magnetic induction<br />
and mechanical strain into a reversible and an irreversible<br />
part1 (indicated by the superscripts r and i, respectively)<br />
S = S r + S i , B = B r + B i . (1)<br />
To allow for the history <strong>of</strong> the driving magnetic field<br />
intensity, the irreversible magnetic induction Bi is set to<br />
be equal to the magnetization M, which is modeled, e.g.,<br />
by a Preisach hysteresis operator [3]<br />
B i = M = H[H] eM . (2)<br />
The irreversible strain can be, e.g., expressed by the<br />
following polynomial ansatz [3]<br />
i<br />
S = 3<br />
2 (β1 ·H[H]+β2 · (H[H]) 2 + ···<br />
+βn · (H[H]) n <br />
) eM ⊗ e t M − 1<br />
3 [I]<br />
<br />
, (3)<br />
while the parameters β1, ···,βn need to be fitted to<br />
measurement data and [I] denotes the identity tensor.<br />
Now, magnetostriction is a property <strong>of</strong> ferromagnetic<br />
materials and can be described as a coupling between the<br />
mechanical and the magnetic field. This relation is described<br />
by the well-known magnetostrictive constitutive<br />
equations modeling the linear coupling <strong>of</strong> the magnetic<br />
and the mechanical deformation [2]<br />
σ = [c H ]S r − [e] t H (4)<br />
B r = [e]S r +[μ S ]H . (5)<br />
In (4), (5) σ denotes the Cauchy stress tensor in Voigt notation,<br />
[cH ] the tensor <strong>of</strong> mechanical moduli (at constant<br />
magnetic field intensity), [e] the piezomagnetic coupling<br />
tensor and [μS ] the tensor <strong>of</strong> magnetic permeability (at<br />
constant mechanical strain).<br />
By using these constitutive relations, we have presented<br />
in [3] a formulation based on the magnetic<br />
1 With [S] we denote the tensor <strong>of</strong> mechanical strain and with S<br />
the algebraic vector containing the three normal and three shear strains<br />
according to Voigt notation.<br />
- 182 - 15th IGTE Symposium 2012<br />
scalar potential, and in [8] have even extended it for<br />
the magnetic vector potential to also take eddy current<br />
effects into account. However, both models are currently<br />
restricted to scalar hysteresis operators, and do not take<br />
into account the anisotropic behavior, which is a crucial<br />
point for grain-oriented electrical steel used in transformer<br />
cores [4]. Furthermore, both models are quite<br />
expensive concerning computational time. Therefore, we<br />
have developed a dedicated physical model for grainoriented<br />
electrical steel. In doing so, we first assume that<br />
the entries <strong>of</strong> the piezomagnetic coupling tensor [e] are<br />
small, and we are allowed to neglect this coupling in (4),<br />
(5). Next the anisotropic and nonlinear magnetic behavior<br />
<strong>of</strong> the steel sheets is modeled by its vector relation<br />
between the magnetic induction B and field intensity H<br />
B = B (H) =Bϕ(H)eB ; eB = B<br />
. (6)<br />
B<br />
Here, we compute the unit vector eB and evaluate the<br />
magnetic commutation curve Bϕ for which the orientation<br />
fits best with eB. Therefore, the defining partial<br />
differential equation (PDE) for the magnetic field reads<br />
as<br />
γ ∂A<br />
∂t −∇×ν(Bϕ)∇×A = Ji (7)<br />
with A the magnetic vector potential, Ji the impressed<br />
current density, ν the magnetic reluctivity depending on<br />
Bϕ (see (6)) and γ the electric conductivity.<br />
The PDE for mechanics is given by<br />
ρ ∂2u ∂t2 −Btσ =0 (8)<br />
with u the mechanical displacement, ρ the density, and<br />
B = ∇s the differential operator. As in (3), we assume the<br />
conservation <strong>of</strong> volume for the irreversible strain. However,<br />
we now model instead <strong>of</strong> the hysteretic behavior<br />
a nonlinear, anisotropic behavior, and denote it by [Sm ]<br />
(magnetostrictive induced strain tensor), which computes<br />
as follows<br />
[S m ]= 3<br />
2<br />
<br />
eB × e t B − 1<br />
3 I<br />
<br />
S m ϕ (B) . (9)<br />
Here, we compute the direction <strong>of</strong> B and evaluate the<br />
magnetostrictive commutation curve S m ϕ (B) for which<br />
the orientation fits best with eB. Now, we can express<br />
the reversible mechanical strain S r by the difference <strong>of</strong><br />
the total strain S = Bu and the irreversible (magnetostrictive)<br />
strain S i = S m via<br />
S r = Bu − S m . (10)<br />
This relation in combination with (4) by neglecting [e]<br />
results for (8) into<br />
ρ ∂2 u<br />
∂t 2 −Bt [c H ]Bu = −B t [c H ]S m . (11)<br />
The Finite Element (FE) formulation <strong>of</strong> (7) and (11)<br />
is straight forward. For (7) we use edge finite elements<br />
and solve the arising algebraic system <strong>of</strong> equations by<br />
an efficient Newton scheme utilizing a two level solver
[9]. For (11) we apply nodal finite elements (for details,<br />
see e.g., [10]).<br />
Summarizing, the developed magnetostrictive model<br />
has the following features:<br />
• Decoupling <strong>of</strong> magnetic and mechanical PDEs; so<br />
both PDEs can be solved separately with optimal<br />
conditions.<br />
• Anisotropy and eddy currents are taken into account<br />
• No hysteresis considered; instead it uses commutation<br />
curves computed from measured hysteresis<br />
curves.<br />
• Change <strong>of</strong> magnetic properties due to the mechanical<br />
field within a working point is neglected<br />
(working point can be determined by pre-stressing<br />
<strong>of</strong> measured samples).<br />
III. MEASUREMENT SETUPS<br />
First <strong>of</strong> all, to obtain reliable measurement data for the<br />
magnetic behavior, we have constructed an Epstein frame<br />
according to IEC 60404-2 (see Fig. 2). The 25 cm Epstein<br />
Steel sheets (overlap at the corners)<br />
Coil to compensate flux in air<br />
Excitation and<br />
measurement<br />
coils<br />
Fig. 2. 25 cm Epstein frame for measuring the magnetic properties<br />
<strong>of</strong> grain-oriented electrical steel sheets.<br />
apparatus consists <strong>of</strong> 4 coils with primary windings,<br />
secondary windings, a compensation coil and the material<br />
sample as core. The sheets are stratified in stripes. The<br />
measurement setup represents in this way a transducer,<br />
whose characteristics are specified. The primary outer<br />
windings are used to magnetize the material and the<br />
secondary inner windings are needed for magnetic flux<br />
density determination over the induced voltage. We have<br />
performed measurements for steel sheets, which have<br />
been cut out at different angles according to the rolling<br />
direction. Thereby, for each stack <strong>of</strong> steel sheets, we<br />
have measured the outer and all inner hysteresis loops, as<br />
demonstrated in Fig. 3. Out <strong>of</strong> all the hysteresis loops,<br />
we compute for each angle a commutation curve (see<br />
Fig. 4), which we then use in our numerical computation<br />
for the magnetic field.<br />
To measure the mechanical hysteresis <strong>of</strong> the electrical<br />
steel sheets a second measurement setup was constructed<br />
on the basis <strong>of</strong> a Single Sheet Tester (SST) as displayed<br />
in Fig. 5. This extended setup also captures<br />
- 183 - 15th IGTE Symposium 2012<br />
B (T)<br />
H (kA/m)<br />
Angle 0 o<br />
Angle 15 o<br />
Angle 30 o<br />
Angle 45 o<br />
Angle 60 o<br />
Angle 75 o<br />
Angle 90 o<br />
Fig. 3. Magnetic hysteresis curves for grain-oriented electrical steel<br />
sheets being cut out at different angles according to the rolling direction<br />
(0 o corresponds to the rolling direction).<br />
B ( (T) )<br />
H (kA/m)<br />
Angle 0 o<br />
Angle 15 o<br />
Angle 30 o<br />
Angle 45 o<br />
Angle 60 o<br />
Angle 75 o<br />
Angle 90 o<br />
Fig. 4. Nonlinear BH curves for different angles (0 o corresponds to<br />
the rolling direction).<br />
the magnetic induction as well as the magnetic field<br />
intensity. However we did not use the SST to determine<br />
magnetic hysteresis since it has to be calibrated to a<br />
certified setup to obtain reliable measurement results. To<br />
capture the mechanical hysteresis the SST was extended<br />
by a lifting mechanism to unload the sample sheet to<br />
ensure its stress-less vibration. The mechanical vibration<br />
due to magnetic excitation <strong>of</strong> the SST is measured by<br />
Ferrite core<br />
Excitation n and<br />
Steel sheet measurement ment coil<br />
(material under test)<br />
Laser-<br />
vibrometer<br />
Fig. 5. Single sheet tester as used to obtain the mechanical hysteresis<br />
(principle and manufactured setup).
a laser vibrometer, which compared to strain gauges<br />
provides high accuracy without electromagnetic crosssensitivity<br />
and is contact-free. The measurement <strong>of</strong> the<br />
mechanical strain as a function <strong>of</strong> the magnetic field<br />
results in the magnetostrictive hysteresis loop (so-called<br />
butterfly curve). Additionally the extended SST permits<br />
pre-stressing <strong>of</strong> the steel sheets in order to capture the<br />
reaction on the magnetic properties which corresponds to<br />
a working point that is used in the simulation. To consider<br />
anisotropy, again a series <strong>of</strong> measurement is performed<br />
with different electrical steel sheets which have been cut<br />
out with varying cutting angles with respect to the grain<br />
orientation <strong>of</strong> the steel. As for the magnetic hysteresis, we<br />
also convert the mechanical hysteresis in a single commutation<br />
curve, which leads to angle-dependent nonlinear<br />
magnetostriction curves, as displayed in Fig. 6.<br />
S (μm/m)<br />
Angle 0 o<br />
Angle 15 o<br />
Angle 30 o<br />
Angle 45 o<br />
Angle 60 o<br />
Angle 75 o<br />
Angle 90 o<br />
B (T)<br />
Fig. 6. Nonlinear SB curves for different angles (0 o corresponds to<br />
the rolling direction).<br />
IV. NUMERICAL RESULTS<br />
For the numerical investigation, we choose a setup <strong>of</strong><br />
six stacked electrical steel sheets with a 90 degree joint<br />
and an excitation coil along each yoke as displayed in<br />
Fig. 7. We model just a quarter symmetry by applying<br />
Steel sheets<br />
Excitation coils<br />
Zoomed and scaled<br />
in thickness direction<br />
Fig. 7. Computational model: quarter symmetry is considered.<br />
appropriate boundary conditions at the symmetry planes.<br />
Our main goal is to study the difference between an<br />
isotropic and anisotropic magnetostrictive computation.<br />
- 184 - 15th IGTE Symposium 2012<br />
Thereby, we choose for the isotropic computation the<br />
measured material curves along the rolling direction<br />
(angle <strong>of</strong> zero degree), whereas for the anisotropic computation<br />
we use all measured material curves (see Fig. 4<br />
and 6).<br />
In a first step, we compute the magnetic field and<br />
compare the flux lines at the joints. Figure 8 displays the<br />
flux lines for the isotropic and Fig. 9 for the anisotropic<br />
case at the time step <strong>of</strong> maximal magnetic induction<br />
(about 1.7 T). We display the flux lines just for the two<br />
Fig. 8. Magnetic flux lines for the two upper steel sheets in case<br />
<strong>of</strong> isotropic computation. For better visualization we have scaled the<br />
thickness direction by a factor <strong>of</strong> ten.<br />
last layers and zoom into the joint region. Comparing<br />
the results, one can clearly see the difference. For the<br />
Fig. 9. Magnetic flux lines for the two upper steel sheets in case<br />
<strong>of</strong> anisotropic computation. For better visualization we have scaled the<br />
thickness direction by a factor <strong>of</strong> ten.<br />
isotropic case, the amplitude <strong>of</strong> the magnetic induction<br />
immediately drops to a low one at the beginning <strong>of</strong> the<br />
joint due to the increased effective cross section when<br />
turning the flux direction in the rectangular joint region.<br />
Since the magnetic material properties are homogeneous<br />
and independent <strong>of</strong> direction, the change <strong>of</strong> the magnetic<br />
flux direction itself is continuously across the joint region.<br />
The transition <strong>of</strong> the magnetic flux between the<br />
two vertical stacked steel sheets is mainly limited when<br />
entering and leaving the joint region. Accordingly the<br />
magnetic flux density reaches its full value just at the<br />
end <strong>of</strong> the joint when entering the opposite yoke.<br />
In the anisotropic case, the guiding effect <strong>of</strong> the<br />
preferred magnetic direction in the grain orientation <strong>of</strong><br />
the electrical sheet keeps the amplitude and direction <strong>of</strong><br />
the magnetic flux for some distance in the joint region.
In the area <strong>of</strong> the central diagonal <strong>of</strong> the joint region (at<br />
45 o ), the reduced magnetic permeability perpendicular<br />
to the grain orientation forces the magnetic flux to a<br />
vertical transition into neighbouring steel sheets. This<br />
x-Displacement (nm)<br />
y-Displacement (nm)<br />
10<br />
0<br />
-25<br />
20<br />
0<br />
-50<br />
0 20 40 60<br />
Time (ms)<br />
0 20 40 60<br />
Time (ms)<br />
z-Displacement (nm)<br />
2.5<br />
0<br />
-2.5<br />
Evaluation point<br />
0 20 40 60<br />
Time (ms)<br />
Isotropic case<br />
Anisotropic case<br />
Fig. 10. Mechanical displacement at an observation point along the<br />
yoke.<br />
behavior <strong>of</strong> the magnetic field has a strong impact on the<br />
mechanical vibrations. In a second step we use the computed<br />
magnetic induction and calculate the mechanical<br />
deformation according to the additional magnetostrictive<br />
strain. In Figs. 10 and 11 we display all three components<br />
<strong>of</strong> the mechanical displacement over time at<br />
two different observation points. In general, we observe<br />
that the displacement in plane direction (x− and y−<br />
displacement) show almost no difference. However, the<br />
displacement in thickness direction (z− displacement) is<br />
quite different both concerning amplitude and frequency<br />
content. Especially at the joint region the amplitude <strong>of</strong> the<br />
mechanical vibration is a factor <strong>of</strong> about 1000 larger in<br />
the anisotropic case as in the isotropic case. Furthermore,<br />
we can state that the computation for the isotropic<br />
material model exhibits mainly the 100 Hz component<br />
(current excitation is at 50 Hz). In the anisotropic case<br />
higher harmonics are predominant. The related frequency<br />
spectrum in vibration and noise is typical what can be<br />
measured at real transformers.<br />
V. CONCLUSION AND OUTLOOK<br />
We have presented a magnetostrictive constitutive<br />
model which fully takes the anisotropy <strong>of</strong> grain-oriented<br />
electrical steel sheets as used in electrical transformers<br />
into account. The model itself is simplified in this<br />
sense that the magnetic as well as mechanical hysteretic<br />
behavior is reduced to a nonlinear one by computing<br />
commutation curves out <strong>of</strong> the corresponding hysteresis<br />
measurements. Furthermore, we neglect the impact <strong>of</strong><br />
the mechanical field on the magnetic properties within a<br />
working point, which can be determined by pre-stressing<br />
the measured sample sheets. However, the model needs<br />
measurements provided by an Epstein frame and a SST<br />
- 185 - 15th IGTE Symposium 2012<br />
x-Displacement (nm)<br />
y- Displacement (nm)<br />
0<br />
-45<br />
0<br />
0 20 40 60<br />
Time (ms)<br />
-45<br />
0 20 40 60<br />
Time (ms)<br />
z - Displacement (nm)<br />
8<br />
0<br />
-6<br />
Evaluation point<br />
Scaled by 1000<br />
0 20 40 60<br />
Time (ms)<br />
Isotropic case<br />
Anisotropic case<br />
Fig. 11. Mechanical displacement at an observation point at the joint<br />
region.<br />
(Single Sheet Tester). The computations show strong differences<br />
both in the magnetic field as well as mechanical<br />
vibrations when comparing this anisotropic model to an<br />
isotropic one, which just uses measured curves in rolling<br />
direction <strong>of</strong> the steel sheets.<br />
Currently we are working on an experimental validation<br />
setup, where we can study different joint techniques,<br />
especially step-lap joints.<br />
REFERENCES<br />
[1] L. Vandevelde, J. A. Melkebeek. Modeling <strong>of</strong> Magnetoelastic<br />
Material. IEEE Trans. on Magnetics, 38(2), 2002.<br />
[2] K.Linnemann, S. Klinkel, W. Wagner. A constitutive model for<br />
magnetostrictive and piezoelectric materials. International Journal<br />
<strong>of</strong> Solids and Structures 46, 2009.<br />
[3] M. Kaltenbacher, M. Meiler, M. Ertl. Physical modeling and<br />
numerical computation <strong>of</strong> magnetostriction. Compel, 28(4), 2009.<br />
[4] B. Weiser, H. Pfützner, J. Anger. Relevance <strong>of</strong> Magnetostriction<br />
and Forces for the Generation <strong>of</strong> Audible Noise <strong>of</strong> Transformer<br />
Cores IEEE Trans. on Magnetics, 36(5), 2000.<br />
[5] K. Delaere, W. Heylen, K. Hameyer, R. Belmans. Local Magnetostriction<br />
Forces for Finite Element Analysis. IEEE Trans. on<br />
Magnetics, 36(5), 2000.<br />
[6] A. Dorfmann and R. W. Ogden. Magneto-elastic modeling <strong>of</strong><br />
elastomers. Eur. J. Mechanics and Solids, 22, 2003.<br />
[7] K. Fonteyn, A. Belahcen, R. Kouhia, P. Rasilo, A. Arkkio. FEM<br />
for Directly Coupled Magneto-Mechanical Phenomena in Electrical<br />
Machines. IEEE Trans. on Magnetics, 46(8), 2010.<br />
[8] A. Volk, M. Kaltenbacher, A. Hauck, M. Ertl, R. Lerch. Finite Element<br />
Scheme based on Magnetic Vector Potential and Mechanical<br />
Displacement for Modeling Magnetostriction. <strong>Proceedings</strong> <strong>of</strong> the<br />
8th International Conference on Computation in Electromagnetics<br />
CEM, 2011.<br />
[9] A. Hauck, M. Ertl, J. Schöberl, M. Kaltenbacher. Accurate Simulation<br />
<strong>of</strong> Transformer Step-Lap Joints using Anisotropic Higher<br />
Order FEM. 15 th IGTE Symposium, <strong>Graz</strong>, Austria, 2012.<br />
[10] M. Kaltenbacher. Numerical Simulation <strong>of</strong> Mechatronic Sensors<br />
and Actuators. Springer, 2nd edition, 2007.
- 186 - 15th IGTE Symposium 2012<br />
Analytic Approximation Solution for the<br />
Schwarz-Christ<strong>of</strong>fel Parameter Problem<br />
Norbert Eidenberger∗ and Bernhard G. Zagar∗ ∗Institute for Measurement <strong>Technology</strong>, Altenberger Strasse 69, A-4040 Linz, Austria<br />
E-mail: norbert.eidenberger@jku.at<br />
Abstract—We present a novel analytic approximation method for the Schwarz-Christ<strong>of</strong>fel parameter problem based on<br />
linearization. The modeling requirements for successful linearization are discussed. The linearization introduces a mapping<br />
error which can be virtually eliminated by applying an optimization method. Thus, the proposed method yields conformal<br />
mapping functions which can provide solutions for inverse problems and are suited for sensitivity analyses.<br />
Index Terms—Schwarz-Christ<strong>of</strong>fel parameter problem, Schwarz-Christ<strong>of</strong>fel transform, conformal mapping, potential<br />
problem.<br />
I. INTRODUCTION<br />
Conformal mapping methods provide useful tools for<br />
the analysis <strong>of</strong> many physical phenomena. In particular,<br />
these methods can be utilized to solve two dimensional<br />
potential problems which appear e. g. in electromagnetics,<br />
fluid dynamics, or heat transfer [1], [2]. The<br />
general idea consists in transforming a potential problem<br />
bounded by a complicated geometry to a simpler one,<br />
for which the solution can be computed more easily.<br />
The transformation is performed by a conformal mapping<br />
function. Subsequently, this function also transforms the<br />
solution <strong>of</strong> the simpler problem to the complicated one<br />
which provides the solution <strong>of</strong> the original problem.<br />
Some recent examples for the application <strong>of</strong> conformal<br />
mapping methods are [3], [4], and [5].<br />
For the purpose <strong>of</strong> conformal mapping the coordinates<br />
<strong>of</strong> two dimensional problems are interpreted as the real<br />
and imaginary parts <strong>of</strong> complex numbers. Thus, the<br />
mapping functions represent complex valued functions<br />
in complex variables [6]. Different methods are available<br />
for the construction <strong>of</strong> suitable mapping functions [2].<br />
One <strong>of</strong>ten utilized method, the Schwarz-Christ<strong>of</strong>fel transform<br />
(SCT), constructs mapping functions for polygonal<br />
geometries. Many relevant technical problems involve<br />
polygon shaped boundaries therefore the SCT plays an<br />
important role in many applications.<br />
In order to employ the SCT, its problem dependant<br />
parameters need to be computed. It is not possible to<br />
compute the parameters for polygons with more than<br />
three corners analytically, although a unique solution for<br />
the SCT parameters exists. This constitutes the so-called<br />
SCT parameter problem [7]. Nowadays numerical methods<br />
are routinely employed to solve the SCT parameter<br />
problem [8]. However, numerical methods yield solutions<br />
which are disconnected from the original problem geometry.<br />
This prevents further analysis with respect to the<br />
geometric parameters <strong>of</strong> the original problem.<br />
In this paper we propose a solution method for the SCT<br />
parameter problem based on a series expansion <strong>of</strong> the<br />
SCT base function (1). The method yields an approximate<br />
analytic solution for the SCT parameters containing the<br />
geometry parameters. Due to the approximation error<br />
the resulting mapping function produces mapping errors.<br />
We show that the mapping errors can be eliminated by<br />
prewarping the geometric parameters appropriately. This<br />
is achieved through an optimization method which minimizes<br />
the mapping error. The advantage <strong>of</strong> the proposed<br />
method over the standard numerical solution consists in<br />
the presence <strong>of</strong> the geometry parameters in the mapping<br />
function which permits further analysis <strong>of</strong> the problem,<br />
e. g. sensitivity analyses or solving inverse problems.<br />
This paper consists <strong>of</strong> three main parts. The first part<br />
gives a short introduction to the SCT together with<br />
the corresponding parameter problem. The second part<br />
describes the approximation method for the solution <strong>of</strong><br />
the SCT parameter problem. The third part presents the<br />
minimization method which eliminates the mapping error.<br />
Finally, the conclusion sums up the properties <strong>of</strong> the<br />
proposed method and highlights its potential advantages<br />
and applications.<br />
II. THE SCHWARZ-CHRISTOFFEL TRANSFORM<br />
The SCT represents a widely utilized method for<br />
constructing conformal mapping functions. The SCT base<br />
equation,<br />
<br />
n<br />
z = f(w) =A (w − wi) αi π −1<br />
dw + B, (1)<br />
i=1<br />
maps the upper half <strong>of</strong> the image (w-) plane to the<br />
inside <strong>of</strong> a polygon in the object (z-) plane [9] which<br />
is illustrated in Fig. 1. It contains several unknown<br />
parameters. Parameter A represents a scaling and rotation<br />
factor, parameter B represents a translation, and the<br />
parameters wi represent the corner coordinates in the<br />
w-plane with the known parameters αi representing the<br />
corresponding interior angles.<br />
The unknown parameters A, B and wi are computed<br />
by comparing the polygon corners coordinates in both<br />
planes via the relation z = f(w). The resulting number <strong>of</strong><br />
equations equals the number <strong>of</strong> unknowns, which means<br />
that a unique solution exists. However, for polygons with<br />
more than three corners the integral in (1) yields special
- 187 - 15th IGTE Symposium 2012<br />
Fig. 1. Example setup for the Schwarz-Christ<strong>of</strong>fel transform showing the geometric parameters <strong>of</strong> a rectangle in the z-plane and its image in the<br />
w-plane.<br />
functions for which no inverse functions exist. This<br />
prohibits the analytic computation <strong>of</strong> the SCT parameters<br />
even though it is known that a unique solution exists. This<br />
constitutes the SC parameter problem [7].<br />
Nowadays, the parameter problem is usually solved<br />
numerically. A thorough discussion <strong>of</strong> numerical solution<br />
methods for the SC parameter problem is presented in<br />
[8]. The authors <strong>of</strong> [8] also provide a Matlab toolbox<br />
[10] which permits an easy application <strong>of</strong> the SCT.<br />
For quadrilaterals such as rectangles the SCT produces<br />
mapping functions which consist <strong>of</strong> elliptic functions.<br />
In these cases the elliptic modulus can be utilized to<br />
efficiently compute the parameters numerically [7]. An<br />
application example for this approach is presented in<br />
[11].<br />
III. ANALYTIC APPROXIMATION OF THE PARAMETER<br />
PROBLEM<br />
The disadvantage <strong>of</strong> purely numerical solutions <strong>of</strong> the<br />
SC parameter problem consists in the missing relation to<br />
the original geometry. This prevents subsequent analyses<br />
<strong>of</strong> problems with respect to their geometry. Preserving<br />
this relation requires an analytic approach which is<br />
developed below.<br />
The proposed method consists <strong>of</strong> several steps. The<br />
first step consists in constructing the SCT for the problem<br />
at hand. The evaluation <strong>of</strong> the integral in (1) then yields<br />
a mapping function for which the SC parameters need to<br />
be computed. In order to compute them analytically, the<br />
mapping function is linearized. Then the SC parameter<br />
problem is solved for the linearized mapping function.<br />
The results are inserted back into the original nonlinear<br />
mapping function which then contains geometry parameters<br />
<strong>of</strong> the original problem.<br />
Unfortunately, the procedure is not quite that straight<br />
forward. Several problems which may occur during linearization<br />
need to be addressed, before the method can<br />
be applied successfully.<br />
A. Method Development<br />
An analysis <strong>of</strong> (1) shows that the SCT lends itself to<br />
Taylor series expansion. Equation (1) can be rewritten as<br />
<br />
z = A g(w)dw + B (2)<br />
where<br />
g(w) =<br />
n<br />
i=1<br />
(w − wi) α i<br />
π −1<br />
which represents the transformation core <strong>of</strong> the SCT<br />
defining the general polygon shape. Equation (2) indicates,<br />
that beginning with the first order term, the Taylor<br />
series consists only <strong>of</strong> the transformation core g(w) and<br />
its derivatives. Because g(w) represents a product, this<br />
means that the series expansion consists mainly <strong>of</strong> simple<br />
functions.<br />
Expanding (2) as a Taylor series yields<br />
(3)<br />
z = f(w0)+Ag(w0)(w − w0)+O(w 2 ) (4)<br />
where O(w2 ) represents the higher order terms <strong>of</strong> the<br />
series. In order to be able to solve the parameter problem,<br />
the SC parameters must not appear within non-invertible<br />
functions. The non-invertible special functions contained<br />
in f(w) disappear in the first and higher order terms<br />
<strong>of</strong> the series. However, there remains a special function<br />
within the zeroth order term<br />
<br />
<br />
f(w0) =A g(w)dw<br />
+ B. (5)<br />
w=w0<br />
The special function vanishes if the result <strong>of</strong> the integral<br />
at w0 equals zero. Parameter B represents a translation<br />
<strong>of</strong> the mapped polygon in the z-plane, thus it can be set<br />
to B =0without loss <strong>of</strong> generality. This corresponds<br />
to mapping the origins <strong>of</strong> the w- and z-plane onto each<br />
other, so the Taylor series expansion is centered at w0 =<br />
0 and f(w0) =0. If a translation <strong>of</strong> the mapping result is<br />
truly required, this can be achieved by a simple additional<br />
mapping function.<br />
Truncating (4) after the first order term and incorporating<br />
the above considerations yields<br />
z = Ag(w0)(w − w0) (6)<br />
which represents a mapping function linearized with<br />
respect to the image coordinates w. Note, that (6) usually<br />
will not be linear with respect the geometric parameters.<br />
The condition developed in the previous step requires<br />
that the origins <strong>of</strong> the planes are mapped onto each<br />
other. This can only be guaranteed if the coordinates <strong>of</strong><br />
a point are known in both planes. This is the case for the<br />
polygon corners which are present in the transformation<br />
core g(w).
- 188 - 15th IGTE Symposium 2012<br />
Fig. 2. Example for a symmetric polygon which features a point besides the corners for which its coordinates are known in both planes.<br />
The transformation core consists <strong>of</strong> a product <strong>of</strong> terms<br />
which contain the coordinates <strong>of</strong> the polygon corners<br />
in the w-plane. However, for a corner at w0 = 0 the<br />
transformation core evaluates either to zero or infinity,<br />
depending on the angle αi in the exponent corresponding<br />
to the corner.<br />
g(0) = 0 if α>π (7)<br />
g(0) = ∞ if α
Fig. 3. Illustration <strong>of</strong> the mapping error introduced into the mapping<br />
function by the linearization.<br />
In this case the mapping error can be defined as the<br />
distances between the ideal and the actual mapped corner<br />
positions in the z-plane<br />
i=1<br />
Δz = f<br />
<br />
w, <br />
d − z, (13)<br />
where the components <strong>of</strong> w represent the coordinates<br />
wi <strong>of</strong> the polygon corners in the w-plane, and z and<br />
Δz consist <strong>of</strong> the corresponding ideal polygon corner<br />
positions and mapping errors in the z-plane.<br />
Equation (13) is not suited as an objective function<br />
for minimizing the mapping error because the sign <strong>of</strong><br />
the mapping error may change. In order to avoid this,<br />
the sum <strong>of</strong> the square <strong>of</strong> the real and imaginary part <strong>of</strong><br />
the mapping errors is utilized as the objective function<br />
<br />
n<br />
<br />
min Re(Δzi) 2 +Im(Δzi) 2<br />
, (14)<br />
where n represents the number <strong>of</strong> polygon corners.<br />
Equation (14) can be reformulated as<br />
min Δz T Δz ∗ , (15)<br />
where Δz ∗ represents the vector containing the complex<br />
conjugate mapping error. The objective function in (15)<br />
forms a convex function with a global minimum at 0<br />
which is known to exist. In addition, (15) consists <strong>of</strong> a<br />
sum <strong>of</strong> squares. For this type <strong>of</strong> functions exist specialized<br />
optimization algorithms [12], [13], which ensures<br />
that a solution can be computed easily.<br />
V. CONCLUSION<br />
In this paper we have presented a method for computing<br />
an analytic approximation solution <strong>of</strong> the SC<br />
parameter problem. The proposed method takes advantage<br />
<strong>of</strong> the structure <strong>of</strong> the SCT base equation and<br />
shows that a linearization <strong>of</strong> the problem is possible<br />
under certain conditions. These conditions are defined<br />
and their consequences regarding the modeling process<br />
are discussed.<br />
The resulting conformal mapping function contains<br />
approximation errors. We presented a formulation <strong>of</strong> the<br />
corresponding mapping error which permits its minimization<br />
to arbitrarily small dimensions by varying the<br />
geometric parameters <strong>of</strong> the original polygon. Thus, the<br />
proposed method yields a conformal mapping function<br />
which produces the desired map.<br />
- 189 - 15th IGTE Symposium 2012<br />
The advantage <strong>of</strong> the proposed analytic over the conventional<br />
numeric approximation consists in the presence<br />
<strong>of</strong> the geometry parameters in the mapping function.<br />
Together with the minimization procedure this permits<br />
• the solution <strong>of</strong> inverse problems e.g. in capacitive<br />
sensing applications,<br />
• the sensitivity analysis <strong>of</strong> sensor setups with respect<br />
to their geometry [14].<br />
ACKNOWLEDGMENT<br />
The authors gratefully acknowledge the partial financial<br />
support for the work presented in this paper by the<br />
Austrian Research Promotion Agency and the Austrian<br />
COMET program supporting the Austrian Center <strong>of</strong><br />
Competence in Mechatronics (ACCM).<br />
REFERENCES<br />
[1] P. M. Morse and H. Feshbach, Methods <strong>of</strong> theoretical physics.<br />
McGraw-Hill, 1953, vol. 1.<br />
[2] R. Schinzinger and A. Laura, Conformal Mapping: Methods and<br />
Applications. Elsevier, 1991.<br />
[3] A. Verh<strong>of</strong>f, “Generalized poisson integral formula applied to<br />
potential flow solutions for free and confined jets with secondary<br />
flow,” Computers & Fluids, vol. 54, pp. 18–38, 2012.<br />
[4] A. J. Davidson and N. J. Mottram, “Conformal mapping techniques<br />
for the modelling <strong>of</strong> liquid crystal devices,” European<br />
Journal <strong>of</strong> Applied Mathematics, vol. 23, no. 01, pp. 99–119,<br />
2012.<br />
[5] M. Schwarz, T. Holtij, A. Kloes, and B. Iñíguez, “Analytical<br />
compact modeling framework for the 2D electrostatics in lightly<br />
doped double-gate mosfets,” Solid-State Electronics, vol. 69, pp.<br />
72–84, 2012.<br />
[6] P. Henrici, Applied and Computational Complex Analysis. John<br />
Wiley & Sons, 1974, vol. 1.<br />
[7] ——, Applied and Computational Complex Analysis. John Wiley<br />
& Sons, 1986, vol. 3.<br />
[8] T. A. Driscoll and L. N. Trefethen, Schwarz–Christ<strong>of</strong>fel Mapping,<br />
P. Ciarlet, A. Iserles, R. Kohn, and M. Wright, Eds. Cambridge<br />
<strong>University</strong> Press, 2002.<br />
[9] V. I. Smirnov, Lehrbuch der höheren Mathematik, 14th ed. Harri<br />
Deutsch, 1995.<br />
[10] T. Driscoll. (2012, Sep.) Schwarz-Christ<strong>of</strong>fel toolbox<br />
for MATLAB. <strong>University</strong> <strong>of</strong> Delaware, Department<br />
<strong>of</strong> Mathematical Sciences. [Online]. Available:<br />
http://www.math.udel.edu/ driscoll/s<strong>of</strong>tware/SC/<br />
[11] R. Igreja and C. Dias, “Extension to the analytical model <strong>of</strong> the<br />
interdigital electrodes capacitance for a multi-layered structure,”<br />
Sensors and Actuators A: Physical, vol. 172, no. 2, pp. 392–399,<br />
2011.<br />
[12] R. Fletcher, Practical methods <strong>of</strong> optimization, 2nd ed. Wiley,<br />
2000.<br />
[13] P. E. Gill, W. Murray, and M. H. Wright, Practical optimization.<br />
Academic Press, 1981.<br />
[14] N. Eidenberger and B. G. Zagar, “Sensitivity <strong>of</strong> capacitance<br />
sensors for quality control in blade production,” in ICST 2011,<br />
Dec. 2011.
- 190 - 15th IGTE Symposium 2012<br />
Additional Eddy Current Losses in Induction<br />
Machines Due to Interlaminar Short Circuits<br />
P. Handgruber∗ , A. Stermecki∗ ,O.Bíró∗ , and G. Ofner †<br />
∗Institute for Fundamentals and Theory in Electrical Engineering (IGTE),<br />
Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical Machines,<br />
Inffeldgasse 18/I, A-8010 <strong>Graz</strong>, Austria<br />
† ELIN Motoren GmbH, Elin-Motoren-Straße 1, A-8160 Preding/Weiz, Austria<br />
E-mail: paul.handgruber@tugraz.at<br />
Abstract—A novel three-dimensional eddy current model to account for the additional losses caused by interlaminar short<br />
circuits is presented and applied to the loss estimation <strong>of</strong> an induction machine. The method is based on a single sheet model<br />
with appropriate boundary conditions on the interlaminar contact surface avoiding cumbersome full three-dimensional<br />
models with multiple short circuited laminations. The results <strong>of</strong> the single sheet model without short circuits are compared<br />
to measured no-load iron loss data. The interlaminar model is validated by means <strong>of</strong> full models comprising several<br />
interconnected sheets and used for the quantification <strong>of</strong> extra losses caused by conductive joints and shearing burrs. It has<br />
been found that particularly burrs occurring on the tooth edges lead to a significant loss increase.<br />
Index Terms—AC-machines, eddy currents, electric machines, electromagnetic modeling, finite element methods, magnetic<br />
losses<br />
I. INTRODUCTION<br />
Active iron parts in electrical machines are commonly<br />
built <strong>of</strong> thin steel laminations. In an ideally assembled<br />
core, the individual laminates are isolated from each<br />
other in order to minimize the effects <strong>of</strong> eddy currents.<br />
During the cutting process, mechanical deformations lead<br />
to microscopic shearing burrs on the cutting edges. This<br />
edge burrs can break down the insulation resulting in<br />
conductive connections between the stacked sheets. If<br />
the burr-induced short circuits cover several laminations,<br />
high currents begin to circulate leading to a significant<br />
loss increase and hence to local overheatings [1].<br />
The core stacks are frequently held together by conductive<br />
joints, such as bolts, welds or clamping bars.<br />
In most cases, these fixations as well as the shaft and<br />
parts <strong>of</strong> the frame are mounted uninsulated on the core,<br />
short-circuiting a large number <strong>of</strong> laminations. Further<br />
interlaminar contacts are induced by small insulation<br />
faults on the lamination surfaces inside the core middle.<br />
However, the probability <strong>of</strong> such inner short circuits is<br />
very low and stochastic [2]. Therefore, their effects are<br />
not considered in this work.<br />
Up to now, the complex problem <strong>of</strong> interlaminar short<br />
circuits has been chiefly treated by statistical and/or empirical<br />
methods accompanied by vast measurement series<br />
[3], [4]. Attempts to quantify the resulting additional<br />
losses are mostly based on analytical approaches. For<br />
instance, in [5], [6] and [7] artificial burrs have been applied<br />
in a controlled manner to a distribution transformer.<br />
Comparisons <strong>of</strong> the performed measurements to an analytical<br />
eddy current model showed only poor correlation<br />
due to the simplifications made. In [8], [9] and [10] the<br />
losses have been evaluated using a resistance network<br />
analogy. The circuit parameters have been derived from<br />
small-scale models based on simple geometries.<br />
In order to enable studies on more complicated structures<br />
like an electrical machine, a novel method based<br />
on a three-dimensional (3-D) finite element analysis<br />
is proposed in this work. The method is capable to<br />
compute the true paths <strong>of</strong> the eddy currents and values<br />
<strong>of</strong> the ensuing losses. A full 3-D model with multiple<br />
joined laminations is avoided by introducing appropriate<br />
boundary conditions on the interlaminar contact surface<br />
<strong>of</strong> a single sheet model.<br />
This paper is organized as follows: In section II a<br />
novel 3-D eddy current model considering the effects <strong>of</strong><br />
interlaminar short circuits is introduced. In section III<br />
the method presented is applied to the loss estimation<br />
<strong>of</strong> a megawatt rated slip-ring induction machine. First,<br />
the single sheet model without short circuits is compared<br />
to no-load iron loss measurements. The validation <strong>of</strong><br />
the interlaminar contact model is performed using full<br />
models with several short circuited laminations. Finally,<br />
the effects <strong>of</strong> conductive joints and shearing burrs are<br />
subjected to an in-depth analysis.<br />
II. 3-D EDDY CURRENT MODEL<br />
The proposed method can be subdivided into two steps:<br />
first, a transient two-dimensional (2-D) field analysis is<br />
carried out for the whole machine. In a second step, a<br />
transient nonlinear 3-D eddy current problem is solved<br />
separately for an individual stator and rotor sheet.<br />
The 3-D model is excited by time-dependent boundary<br />
conditions obtained from the 2-D field analysis. This<br />
allows separate treatment <strong>of</strong> the stator and rotor sheets<br />
and avoids cumbersome and computationally expensive
transient 3-D finite element simulations including rotor<br />
motion [11]. Nevertheless, all loss relevant effects, like<br />
high-order field harmonics and the rotor movement are<br />
included in the analysis.<br />
Only a single sheet is considered in the 3-D model,<br />
the effects <strong>of</strong> interlaminar short circuits are taken into<br />
account by boundary conditions on the contact surface.<br />
This approach is valid ins<strong>of</strong>ar as, for multiple interconnected<br />
laminates, the electromagnetic quantities in the<br />
sheets within the core become periodic.<br />
A nodal based A,V -A formulation is employed for the<br />
analysis <strong>of</strong> the 2-D problem. The 2-D iron regions are assumed<br />
to be non-conductive. The massive rotor windings<br />
are modeled as eddy current domains but not the stator<br />
conductors. The starting transients <strong>of</strong> the machine are<br />
bypassed by using the initial steady state solution from<br />
a time harmonic analysis. The 3-D eddy current problem<br />
is solved solely in the conductive laminations using the<br />
A,V formulation based on isoparametric hexahedral edge<br />
elements with quadratic shape functions [12]. Introducing<br />
the magnetic vector potential A, in the whole problem<br />
domain Ω and the electric scalar potential V , in the eddy<br />
current region Ωc as<br />
B = ∇×A,<br />
E = − ∂ ∂<br />
A −<br />
∂t ∂t ∇V<br />
⎫<br />
⎬<br />
⎭ in Ωc, (1)<br />
B = ∇×A in Ω − Ωc<br />
(2)<br />
where B is the magnetic flux density, E is the electric<br />
field intensity and t is time, the Maxwell equations under<br />
quasistatic approximation can be written as<br />
∇×ν∇×A + σ ∂ ∂<br />
A + σ ∇V = 0,<br />
∂t ∂t<br />
∇· σ ∂ ∂<br />
A + σ<br />
∂t ∂t ∇V<br />
⎫<br />
⎪⎬<br />
in Ωc, (3)<br />
=0 ⎪⎭<br />
∇×ν∇×A = J0 in Ω − Ωc. (4)<br />
Herein ν denotes the reluctivity and σ the conductivity <strong>of</strong><br />
the sheet material, J0 represents the given current density<br />
<strong>of</strong> the stranded coils in the 2-D model.<br />
Fig. 1 shows the specifications <strong>of</strong> the boundary conditions<br />
for the 3-D model <strong>of</strong> a stator sheet. The boundary<br />
conditions are prescribed anew for every time step. On<br />
the boundaries along the lamination thickness, the model<br />
is excited by the normal component <strong>of</strong> B derived form<br />
the 2-D field analysis. The prescription <strong>of</strong> the normal<br />
component <strong>of</strong> B is equivalent to specifying the tangential<br />
component <strong>of</strong> A. This component is obtained from the<br />
2-D field vector potential A2-D and is assumed to be<br />
constant along the thickness. The boundaries along the<br />
sheet thickness are hereinafter called axial boundaries.<br />
Since flux in the axial direction is neglected, the tangential<br />
component <strong>of</strong> A is set to zero on the boundaries<br />
parallel to the laminations.<br />
Setting the electric scalar potential free on the boundaries<br />
<strong>of</strong> Ωc denoted by Γi ensures the satisfaction <strong>of</strong> the<br />
- 191 - 15th IGTE Symposium 2012<br />
Fig. 1. Specification <strong>of</strong> the boundary conditions for a 3-D stator sheet<br />
model. The sheet thickness is increased for better visibility. Symmetry<br />
boundary conditions on the bottom surface allow to consider only half<br />
<strong>of</strong> the thickness.<br />
natural boundary condition <strong>of</strong> vanishing normal component<br />
<strong>of</strong> the current density J here. At the interlaminar<br />
contact surface Γc, the current flow is assumed to be<br />
normal to the face, resulting in a constant and unknown<br />
electric scalar potential. Furthermore, it has to be guaranteed<br />
that the exchanged current through the contact spots<br />
between the sheets is zero by introducing the surface<br />
integral relation<br />
<br />
Γc<br />
σ<br />
<br />
∂ ∂<br />
A +<br />
∂t ∂t ∇V<br />
<br />
· n dΓ = 0 (5)<br />
as an additional equation into the finite element equation<br />
system [13]. The symbol n stands for the outer normal<br />
vector.<br />
For interlaminar contacts occurring on the sheet boundaries<br />
like shearing burrs, the effects <strong>of</strong> the high circulating<br />
eddy currents on the given magnetic boundary field<br />
cannot be neglected. In such cases, the magnetic field<br />
density is not prescribed directly on the axial sheet<br />
boundaries, but on an additional outer finite element layer<br />
surrounding the lamination. This layer is modeled as nonconductive<br />
extending the 3-D problem to an A,V -A one.<br />
Thus, the required independence <strong>of</strong> the prescribed field<br />
from the eddy currents is ensured again.<br />
III. APPLICATION AND RESULTS<br />
The method presented has been applied to the loss<br />
estimation <strong>of</strong> a megawatt rated, 50 Hz, 690 V, deltaconnected<br />
three-phase, four pole slip-ring induction machine<br />
fed by sinusoidal voltage. In the case <strong>of</strong> a healthy<br />
machine without short circuits present in the core, the<br />
computed total iron losses are compared to no-load<br />
measurements. After validating the proposed interlaminar<br />
contact model against full models with many short<br />
circuited sheets, the additional losses due to conductive<br />
joints and shearing burrs will be quantified.
- 192 - 15th IGTE Symposium 2012<br />
(a) (b)<br />
Fig. 2. Eddy current loss density distribution in a stator (a) and rotor (b) sheet at rated no-load current and a specific time instant.<br />
A. No-load iron losses<br />
According to the statistical loss theory, the total iron<br />
losses are composed <strong>of</strong> eddy current, hysteresis and<br />
excess losses [14]. The computed 3-D eddy current<br />
loss distribution for a healthy stator and rotor sheet<br />
is presented in Fig. 2. The losses are quite uniformly<br />
distributed in the stator sheet and mainly attributable to<br />
the 50 Hz fundamental frequency component. The rotor<br />
losses are concentrated in the vicinity <strong>of</strong> the air-gap<br />
and primarily evoked by the first stator slot harmonic at<br />
1800 Hz. In order to cover all relevant harmonic effects,<br />
the time step size Δt was fixed to Δt = T/500 for the stator<br />
and Δt = T/1000 for the rotor sheet simulation; T is the<br />
time period <strong>of</strong> the fundamental frequency. The total eddy<br />
current losses are obtained by integrating the product <strong>of</strong><br />
E and J over the sheet volume, averaged over time and<br />
multiplied by the number <strong>of</strong> laminations. The proportion<br />
<strong>of</strong> the losses in the rotor core is remarkable and nearly<br />
as high as in the stator (see Fig. 3(a)).<br />
The measured and computed no-load iron losses as<br />
a function <strong>of</strong> the supply current are compared in Fig.<br />
3(b). During the measurements, the rotor windings were<br />
(a) (b)<br />
Fig. 3. Simulated no-load eddy current losses (a) and separated total<br />
iron losses (b) as a function <strong>of</strong> the supply current as well as measured<br />
losses.<br />
kept open in order to avoid additional rotor currents<br />
and hence further losses. The test machine was driven<br />
by a second machine at synchronous speed. Thereby,<br />
the friction losses are covered by the driving machine.<br />
The measured losses on the stator terminals <strong>of</strong> the test<br />
machine correspond to the iron losses and stator winding<br />
losses. For the computation, the hysteresis and excess<br />
losses are obtained by a method developed in [15] based<br />
on the evaluation <strong>of</strong> the shape <strong>of</strong> dynamic magnetization<br />
curves. The hysteresis losses are calculated by a static<br />
vector Preisach model [16], [17], the excess losses using<br />
the statistical loss theory [14]. The good agreement<br />
between the measurement and simulation results confirms<br />
that the methods used are able to cope with the complex<br />
electromagnetic phenomena arising in an induction machine,<br />
i. e. rotating flux and high-order field harmonics.<br />
The stator eddy current losses for no-load can even be<br />
evaluated correctly using time harmonic analyses, since<br />
they are almost exclusively caused by the fundamental<br />
field component. The methodology <strong>of</strong> the proposed twostep<br />
approach can be adopted in a straightforward way<br />
to time harmonic problems. For rated no-load current<br />
(I0=288 A), the losses obtained from a transient analysis<br />
are 901.1 W, the time harmonic analysis yields 903.7 W.<br />
When load is getting applied, the field harmonics increase<br />
strongly and transient analyses are required [18]. In<br />
order to shorten the computation time, the following<br />
investigations on the effects <strong>of</strong> interlaminar short circuits<br />
have been performed for the stator sheet at rated noload<br />
current using time harmonic calculations. However,<br />
all relevant factors influencing the behavior underlying<br />
an interlaminar short circuit are still incorporated in the<br />
computation. The use <strong>of</strong> time harmonic analyses constitutes<br />
no restriction <strong>of</strong> the introduced approach which is<br />
easily expandable for transient simulations.<br />
B. Validation <strong>of</strong> the interlaminar model<br />
The proposed single sheet model combined with<br />
boundary conditions considering the interlaminar interaction<br />
has been validated using two different examples:<br />
a simple conductive ring and a stator sheet sector <strong>of</strong> the<br />
induction machine investigated.<br />
1) Conductive ring: As shown in Fig. 4(a), the 2-D<br />
model <strong>of</strong> the ring is excited by a conductor arranged<br />
symmetrically in the bore separated from the core by<br />
an air-gap. The 3-D model (see Fig. 4(b)) consists <strong>of</strong><br />
ten and a half stacked lamination quarters short circuited<br />
by two conductive joints through the entire core stack.<br />
The joint material is the same as those <strong>of</strong> the laminates.<br />
The isolation between the sheets is modeled as a nonconductive<br />
A-region with a relative permeability <strong>of</strong> one<br />
and a thickness <strong>of</strong> one hundredth <strong>of</strong> those <strong>of</strong> the laminations.<br />
On the curved boundaries, the 3-D problem is
(a) (b)<br />
Fig. 4. Validation geometry <strong>of</strong> the conductive ring: 2-D (a) and 3-D<br />
(b) model.<br />
excited by boundary conditions derived form the 2-D<br />
field solution. Periodic boundary conditions have been<br />
applied on the left and right boundaries. On the top<br />
surface, the normal component <strong>of</strong> B and that <strong>of</strong> J are set<br />
to zero; on the bottom surface, the problem is mirrored<br />
using symmetric boundary conditions. The 2-D problem<br />
is treated with the time harmonic A formulation, the<br />
3-D one with the A,V -A formulation. Linear material<br />
properties are assumed for the sake <strong>of</strong> simplicity.<br />
Figs. 5(a) and 5(b) show the current and flux density<br />
distributions computed by the full model. The electromagnetic<br />
quantities in the center sheets repeat periodically<br />
suggesting the applicability <strong>of</strong> the reduced<br />
approach. Consequently, the currents circulating through<br />
the contact spots affect the original 2-D field distribution<br />
(see Figs. 5(c) and 5(d)). The undermost half lamination<br />
serves as a validation reference for the reduced model<br />
- 193 - 15th IGTE Symposium 2012<br />
with a single sheet. Fig. 6 compares the obtained field<br />
solutions. The good agreement is also verified in terms<br />
<strong>of</strong> losses wich are 10.93 mW for the full model and<br />
11.37 mW for the reduced one.<br />
2) Stator sheet sector: Since 3-D simulations with<br />
multiple short circuited stator laminations are hardly<br />
feasible, only a small sheet sector is considered in the<br />
full model. Fig. 7(a) shows the used 2-D model <strong>of</strong><br />
the previously examined induction machine, Fig. 7(b)<br />
the 3-D model comprising hundred and a half laminations<br />
interconnected by shearing burrs over two teeth<br />
throughout the stack. A continuous burr width <strong>of</strong> 100 μm<br />
(a) (b)<br />
Fig. 7. Validation geometry <strong>of</strong> the stator sheet sector: 2-D (a) and<br />
3-D (b) model.<br />
has been selected requiring a rather fine mesh near the<br />
burred region. Results for different burr widths will be<br />
discussed in section III-C2. The burr material properties<br />
are the same as that <strong>of</strong> the sheets. The isolation thickness<br />
is specified as one fiftieth <strong>of</strong> the lamination thickness<br />
(a) (b)<br />
(c) (d)<br />
Fig. 5. Current density (a) and flux density (b) at a specific time instant for the full model as well as the corresponding vector plots (c,d) for the<br />
framed sectors.
- 194 - 15th IGTE Symposium 2012<br />
(a) (b)<br />
(c) (d)<br />
Fig. 6. Current density and flux density distribution computed for the bottom lamination <strong>of</strong> the full model (a,b) and the reduced one (c,d).<br />
(d=0.5 mm). The 2-D field solution is prescribed on an<br />
outer non-conductive layer enclosing the teeth and on the<br />
stator back. The normal component <strong>of</strong> both B and J is<br />
set to zero on the top surface and on the boundaries intersecting<br />
the yoke. Again, the problem region is mirrored<br />
at the bottom surface and solved in the frequency domain<br />
using the A,V -A formulation. Linear media are used for<br />
validation purposes, nonlinearity is considered in the next<br />
sections by means <strong>of</strong> well established methods.<br />
The eddy current distribution for rated no-load current<br />
calculated by the full model is shown in Fig. 8. In<br />
the burr region, high currents are closing over the teeth<br />
(see also Fig. 12). The induced currents increase with<br />
Fig. 8. Current density distribution at rated no-load current and a<br />
specific time instant for the full model.<br />
the number <strong>of</strong> short circuited laminations and approach<br />
asymptotically a final value. Accordingly, the reduced<br />
model presents a worst case scenario for a suitably<br />
large number <strong>of</strong> interconnected sheets. The required sheet<br />
number depends on various factors, such as size and<br />
position <strong>of</strong> the short circuits, sheet dimensions or material<br />
properties. The undermost lamination <strong>of</strong> the full model<br />
and the reduced method again give similar current density<br />
distributions as shown in Fig. 9. The resulting losses<br />
for the full model are 392.0 mW, the reduced approach<br />
predicts 363.9 mW.<br />
C. Effects <strong>of</strong> interlaminar short circuits<br />
Two examples <strong>of</strong> interlaminar short circuits will be<br />
addressed in more detail. The first one involves conductive<br />
joints represented by clamping bars. Second, the<br />
impact <strong>of</strong> shearing burrs will be discussed by means <strong>of</strong><br />
parametric studies. All <strong>of</strong> the following simulations have<br />
been carried out for rated no-load current in the frequency<br />
domain using the proposed interlaminar model. Nonlinearity<br />
is taken into account by an effective reluctivity<br />
approach [19].<br />
1) Conductive joints: The iron stacks <strong>of</strong> mid-power<br />
range machines are <strong>of</strong>ten axially fixed by clamping bars<br />
installed after pressing the core. These clamping plates<br />
are attached uninsulated on the core back leading to<br />
a conductive connection among the sheets. Different<br />
numbers <strong>of</strong> bars installed along the stator back circumference<br />
have been investigated using the method presented.<br />
Fig. 10 demonstrates the computed current and flux<br />
density distribution when four clamping bars per pole<br />
pitch are considered in the simulation. On the one hand,<br />
the induced currents in axial direction cause additional<br />
losses. On the other hand, they oppose the original field<br />
distribution increasing the magnetic flux density and<br />
hence the losses near the clamping areas. A detailed view<br />
<strong>of</strong> the electromagnetic quantities near a contact area is<br />
shown in Fig. 11. The eddy current losses in the stator<br />
core increase by 3.93 % for one installed connection bar<br />
per pole. For four bars they rise by 13.65 %.
- 195 - 15th IGTE Symposium 2012<br />
(a) (b)<br />
Fig. 9. Current density distribution for the bottom lamination <strong>of</strong> the full (a) and reduced (b) model.<br />
(a) (b)<br />
Fig. 10. Maximal flux density (a) and loss density (b) distribution for four clamping bars per pole installed.<br />
(a) (b)<br />
(c) (d)<br />
Fig. 11. Current density (a) and flux density (b) in a contact area at a specific time instant as well as the corresponding vector plots in the bar<br />
section (c,d).<br />
2) Shearing burrs: The quantification <strong>of</strong> the extra<br />
losses caused by edge burrs is carried out performing<br />
simulations for different numbers <strong>of</strong> teeth afflicted by<br />
burrs and various burr widths. Contrary to the former<br />
validation example, a complete stator sheet quarter has<br />
been considered in the computations. Fig. 12 shows the<br />
eddy current paths in the burr layer for different numbers<br />
<strong>of</strong> teeth burred. The high magnetizing flux in the tooth<br />
area induces interlaminar currents enclosing the teeth and<br />
resulting in a steep rise <strong>of</strong> the losses. The trends in Fig.<br />
13 indicate a cubic dependence <strong>of</strong> the additional losses on<br />
the number <strong>of</strong> burred teeth. The application <strong>of</strong> burrs on<br />
the stator back led to no significant loss increase owing<br />
to the small flux density near the outermost rim.
- 196 - 15th IGTE Symposium 2012<br />
(a) (b)<br />
(c) (d)<br />
Fig. 12. Current density vector plots for one (a), two (b) three (c) and five (d) burred teeth at a specific time instant and a burr width <strong>of</strong> 100 μm.<br />
The burr region only is considered in the plot.<br />
Eddy Current Losses per Sheet Pcl in W<br />
10<br />
9<br />
8<br />
7<br />
6<br />
5<br />
4<br />
3<br />
2<br />
1<br />
bburr =10μm<br />
bburr =20μm<br />
bburr =40μm<br />
bburr =60μm<br />
bburr =80μm<br />
bburr = 100 μm<br />
0<br />
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5<br />
Number <strong>of</strong> Teeth Burred<br />
Fig. 13. Losses as a function <strong>of</strong> the number <strong>of</strong> burred teeth for different<br />
burr widths.<br />
The width <strong>of</strong> the burr layer has been varied in a<br />
practically relevant range from 10 to 100 μm. Even<br />
broader interlaminar contacts can be present in laser cut<br />
sheets due to the heat induced insulation burn-<strong>of</strong>f. As<br />
shown in Fig. 14, the losses rise linearly with the burr<br />
width. The loss behavior for different no-load supply<br />
currents can be seen in Fig. 15. The losses increase<br />
quadratically for lower values, whereas at higher currents,<br />
saturation effects occur limiting the maximal attainable<br />
flux densities and thus losses.<br />
IV. DISCUSSION AND CONCLUSIONS<br />
The method presented enables to compute the true 3-<br />
D eddy current distribution underlying an interlaminar<br />
Eddy Current Losses per Sheet Pcl in W<br />
10<br />
9<br />
8<br />
7<br />
6<br />
5<br />
4<br />
3<br />
2<br />
1<br />
1 tooth burred<br />
2 teeth burred<br />
3 teeth burred<br />
4 teeth burred<br />
5 teeth burred<br />
0<br />
10 20 30 40 50 60 70 80 90 100<br />
Burr Width bburr in μm<br />
Fig. 14. Losses as a function <strong>of</strong> the burr width for different numbers<br />
<strong>of</strong> burred teeth.<br />
short circuit allowing a quantitative assessment <strong>of</strong> the<br />
arising additional losses. In order to avoid full models<br />
comprising multiple short circuited laminations, only a<br />
single sheet is considered. The interlaminar interaction is<br />
taken into account by boundary conditions on the contact<br />
surface using a generalized A,V -A formulation.<br />
First, the method has been applied to the no-load<br />
iron loss estimation <strong>of</strong> a slip-ring induction machine.<br />
Therefore, simulations for a healthy machine with no<br />
short circuits present in the core have been carried out.<br />
The transient 3-D eddy current problem has been excited<br />
by boundary conditions derived from a classical 2-D field<br />
analysis and solved separately for a stator and rotor sheet.<br />
The hysteresis and excess losses required for comparisons
Eddy Current Losses per Sheet Pcl in W<br />
2<br />
1.5<br />
1<br />
0.5<br />
bburr =10μm<br />
bburr =20μm<br />
bburr =40μm<br />
bburr =60μm<br />
bburr =80μm<br />
bburr = 100 μm<br />
0<br />
50 100 150 200 250 300 350 400<br />
No-Load Current I0 in A<br />
Fig. 15. Losses as a function <strong>of</strong> the supply current for two burred<br />
teeth and different burr widths.<br />
to no-load iron loss measurements have been evaluated<br />
by a static Preisach vector model and the statistical<br />
loss theory, respectively. Good agreement was obtained<br />
between measured and simulated results.<br />
The effects <strong>of</strong> interlaminar short circuits have been<br />
studied for the stator sheet at rated no-load using timeharmonic<br />
computations, since it was found that the occurring<br />
eddy current losses are almost completely caused<br />
by the fundamental field component. Comparisons <strong>of</strong> the<br />
interlaminar contact model against full models with many<br />
laminations joined confirmed the validity <strong>of</strong> the reduced<br />
method as long as the electromagnetic quantities in the<br />
short-circuited sheets stay periodic. For a low number <strong>of</strong><br />
interconnected laminations, the full model yields lower<br />
losses than the reduced one. Consequently, the reduced<br />
model constitutes a worst-case approximation.<br />
Applications <strong>of</strong> the proposed approach revealed a<br />
significant loss increase for conductive paths introduced<br />
by shearing burrs on the tooth edges. It should be noted<br />
that the burrs studied will not be present to such an extent<br />
in a healthy machine, but the trends and findings will<br />
still apply, indicating the strong necessity to minimize<br />
bur-induced short circuits as far as practicable.<br />
The magnetic and electric properties have been assumed<br />
to be unaffected by the manufacturing process. In<br />
[20], [21] and [22] it was shown that especially the magnetic<br />
permeability near the edges can vary considerably<br />
due to the mechanical stress applied during punching.<br />
The incorporation <strong>of</strong> these effects as well as combinations<br />
<strong>of</strong> the developed method with measurement-based<br />
statistics have to be carried out in future work.<br />
V. ACKNOWLEDGMENT<br />
This work has been supported by the Christian<br />
Doppler Research Association (CDG) and by the ELIN<br />
Motoren GmbH.<br />
REFERENCES<br />
[1] P. Beckley, Electrical Steels for Rotating Machines. The Institution<br />
<strong>of</strong> Engineering and <strong>Technology</strong>, 2002.<br />
- 197 - 15th IGTE Symposium 2012<br />
[2] M. C. Marion-Pera, A. Kedous-Lebouc, T. Waeckerle, and B. Comut,<br />
“Characterization <strong>of</strong> SiFe Sheet Insulation,” IEEE Transactions<br />
on Magnetics, vol. 31, no. 4, pp. 2408–2415, 1995.<br />
[3] A. C. Beiler and P. L. Schmidt, “Interlaminar Eddy Current Losses<br />
in Laminated Cores,” Transactions <strong>of</strong> the American Institute <strong>of</strong><br />
Electrical Engineers, vol. 66, pp. 872–78, 1947.<br />
[4] C. A. Schulz, S. Duchesne, D. Roger, and J.-N. Vincent, “Capacitive<br />
short circuit detection in transformer core laminations,”<br />
Journal <strong>of</strong> Magnetism and Magnetic Materials, vol. 320, pp.<br />
e911–e914, 2008.<br />
[5] A. J. Moses and M. Aimoniotis, “Effects <strong>of</strong> Artificial Edge Burrs<br />
on the Properties <strong>of</strong> a Model Transformer Core,” Physica Scripta,<br />
vol. 39, pp. 391–393, 1989.<br />
[6] R. Mazurek, P. Marketos, A. Moses, and J.-N. Vincent, “Effect<br />
<strong>of</strong> Artificial Burrs on the Total Power Loss <strong>of</strong> a Three-Phase<br />
Transformer Core,” IEEE Transactions on Magnetics, vol. 46,<br />
no. 2, pp. 638–641, 2010.<br />
[7] R. Mazurek, H. Hamzehbahmani, A. Moses, P. I. Anderson, F. J.<br />
Anayi, and B. Thierry, “Effect <strong>of</strong> Artificial Burrs on Local Power<br />
Loss in a Three-Phase Transformer Core,” IEEE Transactions on<br />
Magnetics, vol. 48, no. 4, pp. 1653–1656, 2012.<br />
[8] D. A. Jones and W. S. Leung, “A theoretical and analogue<br />
approach to stray eddy-current loss in laminated magnetic cores,”<br />
<strong>Proceedings</strong> <strong>of</strong> the IEE - Part C: Monographs, vol. 108, no. 14,<br />
pp. 509–519, 1961.<br />
[9] C. A. Schulz, D. Roger, S. Duchesne, and J.-N. Vincent, “Experimental<br />
Characterization <strong>of</strong> Interlamination Shorts in Transformer<br />
Cores,” IEEE Transactions on Magnetics, vol. 46, no. 2, pp. 614–<br />
617, 2010.<br />
[10] J.-P. Bielawski, S. Duchesne, D. Roger, C. Demian, and T. Belgrand,<br />
“Contribution to the Study <strong>of</strong> Losses Generated by Interlaminar<br />
Short-Circuits,” IEEE Transactions on Magnetics, vol. 48,<br />
no. 4, pp. 1397–1400, 2012.<br />
[11] K. Yamazaki and N. Fukushima, “Iron-Loss Modeling for Rotating<br />
Machines: Comparison Between Bertotti’s Three-Term Expression<br />
and 3-D Eddy-Current Analysis,” IEEE Transactions on<br />
Magnetics, vol. 46, no. 8, pp. 3121–3124, 2010.<br />
[12] O. Bíró, “Edge element formulations <strong>of</strong> eddy current problems,”<br />
Computer Methods in Applied Mechanics and Engineering, vol.<br />
169, no. 3-4, pp. 391–405, 1999.<br />
[13] I. Bakhsh, O. Bíró, and K. Preis, “Skin effect problems with<br />
prescribed current condition,” in <strong>Proceedings</strong> <strong>of</strong> the 14 th Int. IGTE<br />
Symp. on Numerical Field Calculation in Electrical Engineering,<br />
2010.<br />
[14] G. Bertotti, Hysteresis in Magnetism. Academic Press, 1998.<br />
[15] E. Dlala, “A Simplified Iron Loss Model for Laminated Magnetic<br />
Cores,” IEEE Transactions on Magnetics, vol. 44, no. 11, pp.<br />
3169–3172, 2008.<br />
[16] E. Dlala, J. Saitz, and A. Arkkio, “Inverted and Forward Preisach<br />
Models for Numerical Analysis <strong>of</strong> Electromagnetic Field Problems,”<br />
IEEE Transactions on Magnetics, vol. 42, no. 8, pp. 1963–<br />
1973, 2006.<br />
[17] E. Dlala, “Efficient Algorithms for the Inclusion <strong>of</strong> the Preisach<br />
Hysteresis Model in Nonlinear Finite-Element Methods,” IEEE<br />
Transactions on Magnetics, vol. 47, no. 2, pp. 395–408, 2011.<br />
[18] E. Dlala, O. Bottauscio, M. Chiampi, M. Zucca, A. Belahcen, and<br />
A. Arrkio, “Numerical Investigation <strong>of</strong> the Effects <strong>of</strong> Loading and<br />
Slot Harmonics on the Core Losses <strong>of</strong> Induction Machines,” IEEE<br />
Transactions on Magnetics, vol. 48, no. 2, pp. 1063–1066, 2012.<br />
[19] G. Paoli and O. Bíró, “Time harmonic eddy currents in nonlinear<br />
media,” COMPEL: The International Journal for Computation<br />
and Mathematics in Electrical and Electronic Engineering,<br />
vol. 17, no. 5/6, pp. 567–575, 1998.<br />
[20] F. Ossart, E. Hug, C. Hubert, Olivier Buvat, and R. Billardon,<br />
“Effect <strong>of</strong> punching on electrical steels: Experimental and numerical<br />
coupled analysis,” IEEE Transactions on Magnetics, vol. 36,<br />
no. 5, pp. 3137–3140, 2000.<br />
[21] A. Schoppa, J. Schneider, and J.-O. Roth, “Influence <strong>of</strong> the cutting<br />
process on the magnetic properties <strong>of</strong> non-oriented electrical<br />
steels,” Journal <strong>of</strong> Magnetism and Magnetic Materials, vol. 215-<br />
216, pp. 100–102, 2000.<br />
[22] K. Fujisaki, R. Hirayama, T. Kawachi, S. Satou, C. Kaidou,<br />
M. Yabumoto, and T. Kubota, “Motor Core Iron Loss Analysis<br />
Evaluating Shrink Fitting and Stamping by Finite-Element<br />
Method,” IEEE Transactions on Magnetics, vol. 43, no. 5, pp.<br />
1950–1954, 2007.
- 198 - 15th IGTE Symposium 2012<br />
Evaluating the influence <strong>of</strong> manufacturing<br />
tolerances in permanent magnet synchronous<br />
machines<br />
I. Coenen, T. Herold, C. Piantsop Mbo’o, and K. Hameyer<br />
Institute <strong>of</strong> Electrical Machines, RWTH Aachen <strong>University</strong>, Schinkelstrasse 4, D-52056 Aachen, Germany<br />
E-mail: isabel.coenen@iem.rwth-aachen.de<br />
Abstract—Manufacturing tolerances can result in an unwanted behavior <strong>of</strong> electrical machines. Undesired parasitic effects<br />
such as torque ripples may be increased. A quality control <strong>of</strong> machines subsequent to manufacturing is therefore required<br />
in order to test whether the machines comply with its specifications. This is useful to ensure a high reliability <strong>of</strong> the<br />
manufactured machines. This paper describes the consideration <strong>of</strong> rotor tolerances due to non-ideal manufacturing processes.<br />
The idea is to estimate the influence <strong>of</strong> the manufacturing tolerances for realization <strong>of</strong> a reliable quality control. To study<br />
various fault scenarios numerical field simulations are employed which are parameterized by measurements.<br />
Index Terms—electrical machines, Finite Element Analysis (FEA), manufacturing tolerances, quality control.<br />
I. INTRODUCTION<br />
The reliability <strong>of</strong> electrical drives [1] is an important<br />
aspect to ensure a high availability. In industrial applications,<br />
permanent magnet excited synchronous machines<br />
(PMSM) are widely employed as they <strong>of</strong>fer advantages<br />
in efficiency and power density. However, especially the<br />
rotor <strong>of</strong> PMSMs is susceptible to tolerances caused during<br />
mass production. Variations from the ideal machine<br />
influence its operational behavior [2]. Therefore, it is<br />
important to verify the machine’s quality prior to its<br />
installation.<br />
Reliable and widely used diagnostic methods are vibration<br />
and current monitoring [3]. In this study, electrical<br />
quantities are focused because this <strong>of</strong>fers the advantage<br />
that no additional sensors need to be installed [4].<br />
A. Proposed monitoring setup<br />
The most <strong>of</strong>ten proposed end-<strong>of</strong>-line test is back-EMF<br />
monitoring [5], [6]. Fig. 1 shows a possible setup for its<br />
realization. Here, the motor under test is driven under<br />
open-circuit conditions. For attenuation <strong>of</strong> the drive’s<br />
influence, a flywheel is employed between drive and<br />
motor under test.<br />
<br />
<br />
<br />
<br />
<br />
<br />
Fig. 1. Back-EMF monitoring setup.<br />
<br />
<br />
This approach presents a non invasive monitoring<br />
method being benefical for diagnosis. However, such a<br />
setup is very expensive. It is cost-expensive because a<br />
drive is required and a certain device is needed to damp<br />
possible influences <strong>of</strong> the drive. Above all, it is timeexpensive<br />
due to the fact that the motor under test is<br />
mechanically coupled to the drive. This is not an efficient<br />
solution when a large number <strong>of</strong> machines needs to be<br />
tested during mass production.<br />
In this study, an additional approach is investigated<br />
where the current is being monitored. The corresponding<br />
setup is shown in Fig. 2. Here, a start-up <strong>of</strong> the motor<br />
up to a certain speed is performed in such a way that the<br />
current is measured at various speeds. The benefit <strong>of</strong> this<br />
method is its time- and cost-saving setup. When compared<br />
to the back-EMF setup, less hardware is needed.<br />
No mechanical coupling to a drive is required, simply<br />
the motor is connected to the inverter.<br />
However, for evaluating the current, it needs to be considered<br />
that the current is a controlled quantity. Impacts<br />
caused by the control system or the inverter supply might<br />
lead to misinterpretation <strong>of</strong> the results.<br />
<br />
<br />
<br />
<br />
Fig. 2. Current monitoring setup.<br />
<br />
<br />
In the following, the back-EMF and current characteristic<br />
<strong>of</strong> a PMSM is determined. In order to study various<br />
fault scenarios, numerical field calculation is employed<br />
considering tolerance affected rotor components. The aim<br />
is to evaluate the influence <strong>of</strong> such tolerances to be able to<br />
determine distinguishing characteristics. This information
can be helpful to develop an appropriate end-<strong>of</strong>-line test<br />
and to reveal which <strong>of</strong> the proposed setups is most<br />
qualified.<br />
II. MOTOR UNDER STUDY<br />
The machine studied within this work is a three-phase<br />
permanent magnet synchronous machine with tooth-coil<br />
winding system. It presents six stator slots and four pole<br />
pairs p. The eight magnets <strong>of</strong> the rotor are arranged in a<br />
spoke configuration.<br />
III. INFLUENCE OF ROTOR TOLERANCES<br />
During the manufacturing process material dependant<br />
failures, geometrical or shape deviations may occur. Such<br />
tolerances influence the machine’s behavior. For instance,<br />
increased torque ripples are caused [7].<br />
The considered tolerances within this paper concern<br />
the magnet’s material and its dimensions. The magnetization<br />
faults are illustrated Fig. 3. Possible deviations<br />
affect the magnitude <strong>of</strong> the remanence flux density BR<br />
and the angle β <strong>of</strong> the magnetization direction. Further<br />
Fig. 3. Magnetization faults.<br />
BR<br />
examples <strong>of</strong> rotor tolerances, not considered within this<br />
study, would be a displacement <strong>of</strong> the magnet and rotor<br />
eccentricity.<br />
A. Theoretical analysis<br />
Within this study, the influence <strong>of</strong> rotor tolerances onto<br />
electrical signals is focused. According to [5], for nonideal<br />
rotor components new harmonic orders nrf are<br />
expected to appear in the back-EMF spectrum which are<br />
a function <strong>of</strong> the pole pair number p:<br />
nrf =1± k<br />
with k ∈ N. (1)<br />
p<br />
In the following, this relation shall be approved and<br />
specialized for the certain machine investigated.<br />
The back-EMF Vi is the induced voltage at no load<br />
condition (open circuit). For a coil with w numbers <strong>of</strong><br />
turns Vi is defined as follows:<br />
Vi = −w dφ d<br />
= −w<br />
dt dt (<br />
<br />
Bd A). (2)<br />
Applied to a machine’s winding, it means that the<br />
back-EMF in one coil <strong>of</strong> the winding is determined by the<br />
air gap flux density B. Therefore the back-EMF presents<br />
the same harmonic orders which appear in the spectrum<br />
<strong>of</strong> the flux density. The latter will be considered for an<br />
order analysis.<br />
β<br />
- 199 - 15th IGTE Symposium 2012<br />
The magnetic flux density at the air gap <strong>of</strong> the machine<br />
is a rotating wave which is a function <strong>of</strong> relative position<br />
at the air gap α and time t [8]. It is given as the product<br />
<strong>of</strong> the magnetomotive force Θ (MMF) and the air gap<br />
permeance Λ:<br />
B(α, t) =Θ(α, t) · Λ(α, t). (3)<br />
The functions <strong>of</strong> permeance, magnetomotive force and<br />
flux density can generally be represented by a series <strong>of</strong><br />
space and time harmonics [9]:<br />
Λ(α, t) = <br />
Λyl,zl · cos(yl · α − zl · t), (4)<br />
yl,zl<br />
Θ(α, t) = <br />
yt,zt<br />
B(α, t) = <br />
yb,zb<br />
Θyt,zt · cos(yt · α − zt · t), (5)<br />
Byb,zb · cos(yb · α − zb · t). (6)<br />
At this, ω1 is the supplying angular frequency. For<br />
reasons <strong>of</strong> illustration the phase angle is neglected.<br />
For derivation <strong>of</strong> the new harmonic orders caused by<br />
non-ideal rotor components only the rotor fundamental<br />
component <strong>of</strong> the MMF is considered, meaning yt = p<br />
and zt = ω1. Furthermore, a constant air gap width is<br />
considered, meaning yl =0and zl =0. For the faultless<br />
case, this implies:<br />
Bp(α, t) =Bp,ω1 · cos(p · α − ω1 · t). (7)<br />
The above mentioned rotor tolerances lead to an<br />
asymmetrical distribution <strong>of</strong> the air gap field. With the<br />
described approach, this means a modulation <strong>of</strong> the<br />
MMF caused by the rotor magnets. New space and time<br />
harmonics k with k ∈ N appear resulting in the following<br />
expression for the flux density considering non-ideal rotor<br />
components:<br />
Brf(α, t) =<br />
<br />
Bk · cos((p ± k) · α − (ω1 ± kωm) · t).<br />
k<br />
Here, ωm is the rotational speed with ωm = ω1<br />
p . Hence,<br />
Brf can be expressed as follows:<br />
Brf(α, t) =<br />
<br />
Bk · cos((p ± k) · α − (1 ± k<br />
) · ω1t).<br />
p<br />
k<br />
This expression indicates the new harmonic orders appearing<br />
in the spectrum <strong>of</strong> the flux density and equally<br />
in the back-EMF spectrum due to deviations at the machine’s<br />
rotor as predicted in expression (1). However, it<br />
is only valid considering one single coil [10]. Derivating<br />
the harmonics for one phase, the coil configuration needs<br />
to be considered. One phase <strong>of</strong> the investigated machine<br />
contains two coils which are displaced by 180 ◦ .<br />
According to (9) the back-EMF <strong>of</strong> the first coil in<br />
one phase assuming faulty rotor components can be<br />
(8)<br />
(9)
determined as follows:<br />
Virf1(α, t) =<br />
<br />
Vik · cos((p ± k) · 0 ◦ − (1 ± k<br />
) · ω1t).<br />
p<br />
k<br />
(10)<br />
Similarly, the back-EMF <strong>of</strong> the second coil in the same<br />
phase is:<br />
Virf2(α, t) =<br />
<br />
Vik · cos((p ± k) · 180 ◦ − (1 ± k (11)<br />
) · ω1t).<br />
p<br />
k<br />
The resulting back-EMF Virfph for one phase can be<br />
determined by the superposition <strong>of</strong> the two coils:<br />
Virfph = Virf1 + Virf2 =<br />
<br />
Vikcos((1 ±<br />
k<br />
k<br />
p )ω1t) · [1 + cos((p ± k) · 180 ◦ )].<br />
(12)<br />
For odd numbers (p ± k), (12) is equal to zero. With<br />
p =4it is obvious that only even numbers <strong>of</strong> k appear<br />
in the back-EMF spectrum.<br />
Finally, (1) can be specialized for the analyzed machine,<br />
indicating the new harmonic orders appearing in<br />
the spectrum <strong>of</strong> back-EMF in case <strong>of</strong> non-ideal rotor<br />
components:<br />
n ′ rf =1± 2k′<br />
p with k′ ∈ N. (13)<br />
For the faultless case where the air gap field is symmetrical,<br />
the appearing orders are determined by the winding<br />
arrangement [5]. Considering a three phase winding,<br />
these harmonic orders are:<br />
n =6m ± 1 with m ∈ N. (14)<br />
The mentioned new harmonic orders caused by faults<br />
appear in addition to (14).<br />
For the current, the harmonic orders can be derived<br />
analogously. Ampere’s law reveals the general relation<br />
between electrical current I and magnetic flux density<br />
B:<br />
<br />
μ0 · I = Bds. (15)<br />
In practice, the current may additionally be affected by<br />
the control system and by the inverter supply. These<br />
impacts need to be considered in order to avoid wrong<br />
interpretation <strong>of</strong> the measured signals.<br />
IV. METHODOLOGY<br />
To determine the influence <strong>of</strong> the rotor tolerances<br />
onto the back-EMF and current characteristic, numerical<br />
field simulations are used. Reliable analysis requires a<br />
sufficiently large number <strong>of</strong> experiments which means<br />
less effort performing with simulation instead <strong>of</strong> measurements.<br />
In addition, the interpretation <strong>of</strong> measurement<br />
results is difficult within this context, as for a certain<br />
prototype the real existing deviations are unknown. The<br />
intentional construction <strong>of</strong> tolerances is very difficult to<br />
realize.<br />
S<br />
- 200 - 15th IGTE Symposium 2012<br />
Back−EMF [p.u.]<br />
A. Finite Element Analysis<br />
To calculate the back-EMF, a two-dimensional timestepping<br />
Finite Element Analysis (FEA) is applied. Noload<br />
operation at a speed <strong>of</strong> 3000 rpm is assumed and the<br />
voltage is calculated by use <strong>of</strong> the time derivative <strong>of</strong> the<br />
magnetic flux, as in equation (2).<br />
For analysis, a discrete Fourier transform (DFT) is performed<br />
which yields the spectrum <strong>of</strong> back-EMF as shown<br />
in Fig. 4. For the ideal faultless case with symmetrical<br />
air gap field, harmonic orders appear according to (14).<br />
1<br />
0.9<br />
0.8<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
0.2<br />
0.1<br />
0<br />
0 2 4 6 8<br />
Harmonic order<br />
Back−EMF [p.u.]<br />
0.05<br />
0.04<br />
0.03<br />
0.02<br />
0.01<br />
0<br />
0 2 4 6 8<br />
Harmonic order<br />
Fig. 4. Back-EMF spectrum assuming faultless case.<br />
1) Parameterization by statistical measurements: For<br />
parameterization <strong>of</strong> the FEA model, a statistical verification<br />
is performed. The back-EMF is measured for ten<br />
prototypes <strong>of</strong> the machine. The resulting first order is<br />
evaluated in form <strong>of</strong> a histogram shown in Fig. 5. Based<br />
on this results, the magnets material properties (BR)<br />
within the model are adjusted in order that simulated<br />
value <strong>of</strong> first order and mean value <strong>of</strong> measured first<br />
order agree.<br />
Absolute Frequency<br />
5<br />
4<br />
3<br />
2<br />
1<br />
0<br />
0.99 1 1.01<br />
First order back−EMF [p.u.]<br />
Fig. 5. Measured Back-EMF histogram.<br />
B. Extended d-q model<br />
A d-q model is a common way to describe the PMSM’s<br />
dynamical behavior considering the control system. Here,<br />
the application <strong>of</strong> such a model is studied to calculate<br />
the current. Since the common d-q model only takes the<br />
fundamental wave into account, it is not able to consider<br />
non-ideal behavior such as local defects studied within<br />
this work. Hence, an extended d-q model [11] is applied
to calculate the current. Here, FEA is used to extract<br />
additional elements to extend the common d-q equations.<br />
In the following, a start-up <strong>of</strong> the machine from zero to<br />
3000 rpm is simulated and the stator current is analyzed<br />
by use <strong>of</strong> a short-time Fourier transform (STFT). This<br />
yields the spectrum including the frequency distribution<br />
over time <strong>of</strong> the non-stationary current signal. Fig. 6<br />
shows the result for the faultless case. The value <strong>of</strong><br />
current is represented by a color range, where light colors<br />
mean a high and dark colors a low value. It can be seen<br />
that the harmonic orders are the same as for the back-<br />
EMF.<br />
Fig. 6. Current spectrum assuming faultless case.<br />
Here, sine-wave excitation is assumed. With the presented<br />
model it is also possible to simulate inverter<br />
operation. However, modeling the inverter leads to a<br />
computationally expensive model. Fig. 7 shows the result<br />
for the faultless case assuming inverter supply. Due to<br />
the high intensity <strong>of</strong> computation it is illustrated with<br />
lower resolution. Besides the main harmonic orders some<br />
new orders appear. However, these do not interfere with<br />
the orders which are expected to appear correspending to<br />
(13) due to tolerances. Therefore, inverter supply is not<br />
considered within this study because <strong>of</strong> the computational<br />
costs.<br />
Fig. 7. Current spectrum assuming faultless case and inverter supply.<br />
V. SIMULATION RESULTS<br />
To develop a reliable end-<strong>of</strong>-line check, the most<br />
common and important fault modes should be captured.<br />
In the following, different approaches are applied to<br />
- 201 - 15th IGTE Symposium 2012<br />
Back−EMF [p.u.]<br />
simulate various fault scenarios. The choice <strong>of</strong> the corresponding<br />
approach depends on the particular fault, the<br />
prior knowledge <strong>of</strong> the fault and the available data.<br />
A. Worst-case analysis<br />
For PMSMs cogging torque is an undesired effect as it<br />
leads to rotational oscillations <strong>of</strong> the drive train. Cogging<br />
torque is strongly influence by deviations caused by<br />
the manufacturing process [2],[7]. However, measuring<br />
cogging torque is very time- and cost-expensive [12] and<br />
therefore no appropiate method for an end-<strong>of</strong>-line control.<br />
In the following it shall be studied how back-EMF and<br />
current are influenced at a faulty machine presenting a<br />
high value <strong>of</strong> cogging torque due to magnetization faults.<br />
Considering a deviation in the magnitude <strong>of</strong> the magnets’<br />
remanence flux density BR, the amount <strong>of</strong> variation<br />
<strong>of</strong> cogging torque is depending on which and how<br />
many permanent magnets are affected. In [13] Design<strong>of</strong>-Experiments<br />
is applied to find out the worst-case<br />
configuration <strong>of</strong> magnetization faults concerning cogging<br />
torque. Applied to the studied machine, the configuration<br />
shown in Fig. 8 presents the highest value <strong>of</strong> peak-to-peak<br />
cogging torque. Thereby, the filled magnets represent the<br />
ones that are defective. Considering a deviation at BR<br />
<strong>of</strong> -10%, the value <strong>of</strong> cogging torque is about seventimes<br />
higher compared to the reference value <strong>of</strong> the ideal<br />
machine.<br />
Fig. 8. Worst-case configuration <strong>of</strong> magnetization faults.<br />
Fig. 9 and Fig. 10 show the results for back-EMF and<br />
current in case <strong>of</strong> the described worst-case magnetization<br />
fault.<br />
1<br />
0.9<br />
0.8<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
0.2<br />
0.1<br />
Faulty case<br />
Faultless case<br />
0<br />
0 2 4 6 8<br />
Harmonic order<br />
Back−EMF [p.u.]<br />
0.05<br />
0.04<br />
0.03<br />
0.02<br />
0.01<br />
Faulty case<br />
Faultless case<br />
0<br />
0 2 4 6 8<br />
Harmonic order<br />
Fig. 9. Back-EMF spectrum assuming worst-case magnetization fault.<br />
The spectra show new harmonic orders, especially<br />
n ′ rf =0.5 and n′ rf =2.5 are apparent according to (1).<br />
Compared to the faultless case the first harmonic order<br />
is reduced.
Back−EMF [p.u.]<br />
Fig. 10. Current spectrum assuming worst-case magnetization fault.<br />
B. Sample cases<br />
For the studied machine the height <strong>of</strong> the magnet can<br />
vary between 97% and 100% <strong>of</strong> its desired value and<br />
the width can vary by ± 1.5%. The dimensions <strong>of</strong> some<br />
sample magnets have been measured which are used<br />
as input data for this approach. None <strong>of</strong> the measured<br />
dimensions exceed the allowed tolerance range. Five<br />
cases are created where every magnet is subjected to<br />
the given tolerances. Fig. 11 and Fig. 12 exemplarily<br />
show the back-EMF and current spectrum <strong>of</strong> one faulty<br />
case. The first harmonic order is reduced compared to<br />
the faultless case and the new ordinal numbers appear<br />
corresponding to (13).<br />
1<br />
0.9<br />
0.8<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
0.2<br />
0.1<br />
Faulty case<br />
Faultless case<br />
0<br />
0 2 4 6 8<br />
Harmonic order<br />
Back−EMF [p.u.]<br />
0.05<br />
0.04<br />
0.03<br />
0.02<br />
0.01<br />
Faulty case<br />
Faultless case<br />
0<br />
0 2 4 6 8<br />
Harmonic order<br />
Fig. 11. Back-EMF spectrum assuming faulty magnet dimensions.<br />
Fig. 12. Current spectrum assuming faulty magnet dimensions.<br />
When compared to the results from the worst-case<br />
magnetization fault studied in V-A, one can see that the<br />
- 202 - 15th IGTE Symposium 2012<br />
influence <strong>of</strong> deviations at the magnets’ dimensions within<br />
the allowed tolerance range is not significant. The spectra<br />
show the same specific charateristics but less amounts.<br />
For all five studied cases the simulated spectra do not<br />
differ considerably from the faultless case.<br />
C. Stochastic analysis<br />
In [14] the influence <strong>of</strong> varying qualities <strong>of</strong> the permanent<br />
magnet has been investigated applying a stochastic<br />
analysis. This is applied here to compare the influences<br />
<strong>of</strong> deviations in the magnetization magnitude and magnetization<br />
direction. Overall, 60 failure configurations are<br />
studied. For 20 cases the remanence flux density BR<br />
is assumed to be Gaussian distributed with a standard<br />
deviation <strong>of</strong> 3σ equal to 10% <strong>of</strong> the nominal value.<br />
For 20 other cases the magnetization direction is also<br />
assumed to be normally distributed with a standard<br />
deviation <strong>of</strong> 5 ◦ . The other 20 cases present both kind<br />
<strong>of</strong> deviations. Applying FEA, cogging torque and back-<br />
EMF are calculated and evaluated employing histograms.<br />
The distribution <strong>of</strong> the first harmonic order <strong>of</strong> the back-<br />
EMF is shown in Fig. 13. It shows the range in which<br />
the back-EMF is influenced because <strong>of</strong> the different<br />
magnetization deviations.<br />
Absolute Frequency<br />
Absolute Frequency<br />
Magnetization magnitude<br />
7<br />
6<br />
5<br />
4<br />
3<br />
2<br />
1<br />
0<br />
0.98 1 1.02 1.04<br />
First order back−EMF [p.u.]<br />
7<br />
6<br />
5<br />
4<br />
3<br />
2<br />
1<br />
Magnitude and direction<br />
0<br />
0.98 1 1.02 1.04<br />
First order back−EMF [p.u.]<br />
Absolute Frequency<br />
20<br />
15<br />
10<br />
5<br />
Magnetization direction<br />
0<br />
0.98 1 1.02 1.04<br />
First order back−EMF [p.u.]<br />
Fig. 13. Back-EMF histogram considering magnetization faults.<br />
It can be seen that the influence <strong>of</strong> the deviations<br />
concerning the magnetization direction is very small. The<br />
magnitude fault is prevailing. This is to be expected for<br />
the studied machine as it presents interior magnets.<br />
The same analysis is performed for the peak-to-peak<br />
cogging torque, which is presented in Fig. 14. Again<br />
it can be concluded that the influence <strong>of</strong> magnetization<br />
direction is small when compared to the deviation in<br />
magnitude as the corresponding distribution shows.
Absolute Frequency<br />
Absolute Frequency<br />
8<br />
6<br />
4<br />
2<br />
Magnetization magnitude<br />
0<br />
0 2 4 6<br />
Peak−to−peak cogging torque [p.u.]<br />
8<br />
6<br />
4<br />
2<br />
Magnitude and direction<br />
Absolute Frequency<br />
0<br />
0 2 4 6<br />
Peak−to−peak cogging torque [p.u.]<br />
20<br />
15<br />
10<br />
5<br />
Magnetization direction<br />
0<br />
0 2 4 6<br />
Peak−to−peak cogging torque [p.u.]<br />
Fig. 14. Cogging torque histogram considering magnetization faults.<br />
Generally, magnetization faults influence cogging<br />
torque and back-EMF characteristic simultaneously [15].<br />
For both quantities new harmonic orders arise depending<br />
on the caused asymmetry in the air gap field. Based<br />
on the presented studies, it becomes apparant that the<br />
influence <strong>of</strong> magnetization tolerances on cogging torque<br />
is more significant as on back-EMF. This means, that<br />
on the one hand the machine’s behavior is strongly<br />
influenced in general. But on the other hand, as a cogging<br />
torque test is excluded for an end-<strong>of</strong>-line concept, it<br />
implies high functional requirement <strong>of</strong> the measurement<br />
devices to detect faults by use <strong>of</strong> back-EMF analysis.<br />
This shows the importance <strong>of</strong> an influence analysis such<br />
as presented in this study. It is required to study the<br />
impact <strong>of</strong> tolerances to be able to separate it towards<br />
measurement inaccuracy.<br />
VI. CONCLUSION<br />
In this work, the influence <strong>of</strong> non-ideal manufactured<br />
rotor components <strong>of</strong> a PMSM on its back-EMF and<br />
current characteristic is studied. It has been shown that<br />
electrical quantities are applicable to realize tolerance<br />
diagnosis. Especially the stator current approves to be<br />
a promising approach due to its time- and cost-saving<br />
setup. The new harmonic orders caused by rotor faults<br />
are derived within a theoretical analysis and confirmed<br />
by the simulation results.<br />
An end-<strong>of</strong>-line check could be realized in such a<br />
way that all machines presenting a certain level in these<br />
specific characteristics are rejected. With the presented<br />
methods, the range <strong>of</strong> these distinguishing characteristics<br />
can be evaluated to detect the corresponding levels for<br />
rejection. At this, measurement accuracy should be taken<br />
into account.<br />
- 203 - 15th IGTE Symposium 2012<br />
Comparing the different rotor tolerances, all present<br />
the same characteristics but with different amounts depending<br />
on the fault’s intensity and arrangement. As a<br />
feedback for manufacturing, a differentiation <strong>of</strong> various<br />
tolerances would be gainful but can not be achieved with<br />
the presented analysis. However, the focus <strong>of</strong> the end<strong>of</strong>-line<br />
check is to verify the machines’ quality which<br />
can be realized by the suggested approach. The results<br />
<strong>of</strong> this study evince to be valueable for application <strong>of</strong> an<br />
accurate quality control for PMSMs finally improving its<br />
reliability.<br />
REFERENCES<br />
[1] S. Nandi, H.A. Toliyat, and X. Li, ”Condition Monitoring and Fault<br />
Diagnosis <strong>of</strong> Electrical Motors - A Review,” IEEE Transactions on<br />
Energy Conversion, vol. 20, no. 4, pp. 710-729, December 2005.<br />
[2] L. Gasparin, A. Cernigoj, S. Markic, and R. Fiser, ”Additional<br />
Cogging Torque Components in Permanent-Magnet Motors Due<br />
to Manufacturing Imperfections,” IEEE Transactions on Magnetics,<br />
vol. 45, no. 3, pp. 1210-1213, March 2009.<br />
[3] P.J. Tavner, ”Review <strong>of</strong> condition monitoring <strong>of</strong> rotating electrical<br />
machines,” IET Electric Power Applications, vol. 2, no. 4, pp.<br />
215247, 2008.<br />
[4] W. le Roux, R. G. Harley, and T. G. Habetler, ”Detecting Rotor<br />
Faults in Low Power Permanent Magnet Synchronous Machines,”<br />
IEEE Transactions on Power Electronics, vol. 22, no. 1, pp. 322-<br />
328, January 2007.<br />
[5] D. Casadei, F. Filippetti, C. Rossi, A. Stefani, and D.J. Ewins,<br />
”Magnets faults characterization for Permanent Magnet Synchronous<br />
Motors,” IEEE International Symposium on Diagnostics<br />
for Electric Machines, Power Electronics and Drives, pp. 1-6, 2009.<br />
[6] A. Flach, F. Drager, M. Ayeb, and L. Brabetz, ”A New Approach to<br />
Diagnostics for Permanent-Magnet Motors in Automotive Powertrain<br />
Systems,” IEEE International Symposium on Diagnostics for<br />
Electrical Machines, Power Electronics and Drives, pp. 234-239,<br />
September 2011.<br />
[7] G. Heins, T. Brown, and M. Thiele, ”Statistical Analysis <strong>of</strong> the<br />
Effect <strong>of</strong> Magnet Placement on Cogging Torque in Fractional Pitch<br />
Permanent Magnet Motors ,” IEEE Transactions on Magnetics, vol.<br />
47, no. 8, pp. 2142-2148, August 2011.<br />
[8] J.R. Cameron, W.T. Thomson, and A.B. Dow, ”Vibration and current<br />
monitoring for detecting airgap eccentricity in large induction<br />
motors,” IEE <strong>Proceedings</strong> B Electric Power Applications, vol. 133,<br />
no. 3, pp. 155 - 163, May 1986.<br />
[9] B.M. Ebrahimi, J. Faiz, and M.J. Roshtkhari, ”Static-, Dynamicand<br />
Mixed-Eccentricity Fault Diagnoses in Permanent-Magnet<br />
Synchronous Motors,” IEEE Transactions on Industrial Electronics,<br />
vol. 56, no. 11, pp. 4727-4739, November 2009.<br />
[10] J. Urresty, J. Riba Ruiz, and L. Romeral, ”A Back-emf Based<br />
Method to Detect Magnet Failures in PMSMs ,” IEEE Transactions<br />
on Magnetics, July 2012.<br />
[11] T. Herold, D. Franck, E. Lange, and K. Hameyer, ”Extension<br />
<strong>of</strong> a D-Q Model <strong>of</strong> a Permanent Magnet Excited Synchronous<br />
Machine by Including Saturation, Cross-Coupling and Slotting<br />
Effects,” International Electric Machines and Drives Conference<br />
(IEMDC), pp. 1379-1383, 2011.<br />
[12] C. Schlensok, D. van Riesen, B. Schmülling, M. Schöning, and<br />
K. Hameyer, ”Cogging Torque Analysis on Permanent Magnet<br />
Machines by Simulation and Measurement,” tm - Technisches<br />
Messen, vol. 74, no. 7-8, pp. 393-401, August 2007.<br />
[13] I. Coenen, M. van der Giet, and K. Hameyer, ”Manufacturing Tolerances:<br />
Estimation and Prediction <strong>of</strong> Cogging Torque Influenced<br />
by Magnetization Faults,” IEEE Transactions on Magnetics, vol.<br />
48, no. 5, pp. 1932-1936, May 2012.<br />
[14] I. Coenen, M. Herranz Gracia, and K. Hameyer, ”Influence and<br />
evaluation <strong>of</strong> non-ideal manufacturing process on the cogging<br />
torque <strong>of</strong> a permanent magnet excited synchronous machine,”<br />
COMPEL, vol. 30, no. 3, pp. 876-884, 2011.<br />
[15] K. Kim, S. Lim, D. Koo, and J. Lee, ”The Shape Design <strong>of</strong><br />
Permanent Magnet for Permanent Magnet Synchronous Motor<br />
Considering Partial Demagnetization,” IEEE Transactions on Magnetics,<br />
vol. 42, no. 10, October 2006.
- 204 - 15th IGTE Symposium 2012<br />
<br />
<br />
Hai Van Jorks, Erion Gjonaj and Thomas Weiland<br />
TU Darmstadt, Institute <strong>of</strong> Computational Electromagnetics, Schloßgartenstraße 8, 64289 Darmstadt, Germany<br />
Abstract— High frequency eddy currents are investigated and the Common Mode Input Impedance <strong>of</strong> a PWM controlled<br />
induction motor is calculated from finite element simulations. In order to determine machine parameters accurately, two<br />
modelling approaches are compared. The first is a two-dimensional simulation approach where iron core lamination effects<br />
are included by means <strong>of</strong> an equivalent material approximation. The second approach consists in fully three-dimensional<br />
analysis taking into account explicitly the eddy currents induced in the laminations. It is shown that homogenised equivalent<br />
material models may lead to large errors in the calculation <strong>of</strong> machine inductances, especially at high frequencies. However,<br />
the Common Mode Input Impedance, which is the final parameter <strong>of</strong> interest, seems to be less affected by the lamination<br />
modelling.<br />
Index Terms—eddy currents, finite element, lamination, PWM<br />
I. INTRODUCTION<br />
In modern drive systems fast switching inverters are<br />
the source <strong>of</strong> high frequency common mode voltages at<br />
the motor terminals. Due to stray capacitances between<br />
windings and grounded iron parts <strong>of</strong> the machine, a<br />
common mode current is excited and for its part may<br />
cause circulating bearing currents which may damage the<br />
bearing [1]. The phenomena can be described by<br />
equivalent circuit representation which employs the<br />
frequency dependent Common Mode Input Impedance<br />
being the ratio <strong>of</strong> common mode voltage and current<br />
<br />
Common Mode Input Impedance can be computed<br />
using <br />
<br />
<br />
Figure 1: Lumped parameter model <strong>of</strong> a two conductor system.<br />
Parameters can be gathered in the impedance and capacitance matrix<br />
Firstly, the stray capacitances are extracted from<br />
electrostatic and winding impedances from<br />
magnetoquasistatic simulations. Ohmic conductivity <strong>of</strong><br />
winding insulation is negligible.<br />
Secondly, the winding scheme is taken into account to<br />
match the corresponding voltages and currents at the<br />
front and rear end <strong>of</strong> the machine. At this point, endwinding<br />
inductances can be included in the model, but<br />
require distinct modelling approaches and are, therefore,<br />
neglected in the present analysis.<br />
Considering the laminated middle part <strong>of</strong> the motor, it<br />
is common to use two-dimensional (2D) models <strong>of</strong> the<br />
motor cross-section to compute the self and coupling<br />
impedances. In earlier literature it is proposed to apply<br />
the finite element (FE) method within a single stator slot<br />
while the magnetic field was assumed to be zero outside<br />
the slot perimeter [2]. However, as shown in a recent<br />
investigation [3], strong inductive coupling at high<br />
frequencies (several to ) may occur even<br />
between distant slots. The effect is caused by the core<br />
lamination, which, despite the small skin depth (7),<br />
promotes the spreading <strong>of</strong> high frequency magnetic<br />
fluxes over the iron sheets’ surfaces. In [4] the lamination<br />
was approximated by an additional impedance.<br />
Moreover, it is possible to include the lamination already<br />
in the FE model. Therefore, we consider the entire motor<br />
cross section in the magnetoquasistatic simulations.<br />
In the case <strong>of</strong> 2D analysis, modelling the laminated<br />
core in a plane requires the application <strong>of</strong><br />
homogenization techniques. We investigated the accuracy<br />
<strong>of</strong> the widely used formulation (2) for a broad frequency<br />
range . The reference solution was<br />
obtained from fully three-dimensional (3D) simulations.<br />
Since resolving the small skin depth in motor laminations<br />
in a 3D mesh is computationally very costly, general<br />
purpose simulation s<strong>of</strong>tware cannot be utilized. On that<br />
account, we developed a specialised 3D FE simulation<br />
tool, which takes advantage <strong>of</strong> the periodicity <strong>of</strong> the<br />
lamination stack, but otherwise does not use any<br />
approximation on motor geometry or on the material<br />
properties <strong>of</strong> the laminated core.<br />
II. 2D LAMINATION MODELLING<br />
A well-known homogenization model for laminated<br />
cores [5] utilizes a frequency dependent equivalent<br />
permeability for the iron core given by,<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
where 0r is the permeability <strong>of</strong> iron, 2b the thickness <strong>of</strong><br />
the plate and the skin depth at a given frequency. The<br />
magnetic field problem for the homogenized core is<br />
reduced to a planar problem. While this approach allows<br />
for efficient 2D FE modelling, accuracy at higher<br />
frequencies may not be sufficient. Figure 2 shows the Bfield<br />
plot <strong>of</strong> the motor model obtained from simulations
with “FEMM” [6], a 2D open source s<strong>of</strong>tware which<br />
employs the approach (2). In order to obtain self and<br />
mutual impedances <strong>of</strong> the multi-conductor system, only<br />
one conductor was excited by a current. The spreading <strong>of</strong><br />
the flux over the lamination as well as across the periodic<br />
boundary <strong>of</strong> the computational model can be observed.<br />
The impedance matrix was extracted and will be<br />
compared to the 3D reference in Section IV.B.<br />
Figure 2: 2D model <strong>of</strong> cross-sectional motor geometry (60° section)<br />
with magnitude <strong>of</strong> magnetic flux density at 1 MHz<br />
III. FULLY 3D FE ANALYSIS<br />
A. 3D FE formulation<br />
Maxwell’s equations in frequency domain expressed<br />
by a magnetic vector potential yield<br />
<br />
<br />
<br />
where is the permeability, the conductivity, is a<br />
voltage gradient used for excitation. Two types <strong>of</strong><br />
boundary conditions on are applied. Firstly,<br />
<br />
where is the unit normal vector and is the<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
triangular prims are chosen, because they allow for an<br />
efficient discretisation <strong>of</strong> the very thin iron sheet (Fig. 3).<br />
Applying Galerkin’s method to (3) the matrix equation <br />
<br />
is generated, where the complex matrix combines the<br />
discrete operators corresponding to the left hand side <strong>of</strong><br />
(3), vector holds the degrees <strong>of</strong> freedom (DOFs) <strong>of</strong><br />
the vector potentials and vector the exciting currents.<br />
In the case <strong>of</strong> voltage excitation <strong>of</strong> individual conductors<br />
the relation<br />
<br />
- 205 - 15th IGTE Symposium 2012<br />
can be employed [7], where <br />
, is the<br />
<br />
angular frequency and is the coupling matrix for the<br />
vector <strong>of</strong> the wire voltages . The<br />
important case where a single conductor n is exited can<br />
be obtained by setting in (6) and for all<br />
other conductors (see also Section IV.A).<br />
B. Reduction <strong>of</strong> the problem size<br />
A high frequency 3D FE model <strong>of</strong> the complete motor<br />
geometry is still far beyond today’s computing capacities.<br />
But even if we consider just a slice <strong>of</strong> the motor<br />
with the thickness <strong>of</strong> half a lamination sheet ,<br />
an appropriate 3D discretisation will lead to several<br />
million mesh cells. In order to model eddy currents at<br />
frequencies up to the discretisation has to<br />
resolve the small skin depth in the high-permeability iron<br />
<br />
<br />
<br />
In the analysis, the following parameters were used:<br />
,<br />
,<br />
where is is the electrical conductivity and the<br />
permeability <strong>of</strong> iron, respectively.<br />
Parallelization <strong>of</strong> our 3D FEM code is a key feature,<br />
nevertheless, further reduction <strong>of</strong> the problem size is<br />
particularly important. In the case <strong>of</strong> common mode<br />
excitation <strong>of</strong> a 3-phase 4-pole induction machine, the<br />
field pattern in the motor cross section is periodic with<br />
respect to a 60° rotation around the longitudinal axis <strong>of</strong><br />
the motor. This reduces the computational domain to a<br />
60° section while periodic boundary conditions are<br />
applied to the cut planes (Fig. 3).<br />
Figure 3: Simulation mesh and magnitude <strong>of</strong> the magnetic flux density<br />
at 1 MHz for the 3D motor model. For better visibility, the model is<br />
scaled by a factor 100 in the transversal direction.<br />
IV. IMPEDANCE MATRIX CALCULATION<br />
A. Extraction procedure<br />
The standard procedure to extract the impedances <strong>of</strong><br />
the cross-sectional conductors from FE analysis, is to<br />
excite the -th conductor with a current <strong>of</strong> and set all<br />
the other conductors to . After running the simulation,<br />
the induced voltages in all conductors have to be<br />
computed from the magnetic vector potential solution.
This procedure has to be repeated for all <br />
conductors. A section <strong>of</strong> the analyzed <br />
induction motor holds 120 stator and 8 rotor conductors.<br />
If a general purpose FEM s<strong>of</strong>tware is used, this leads to a<br />
large computational overhead, which makes the method<br />
inconvenient. However, the extraction procedure can be<br />
condensed into a single simulation cycle. In this way,<br />
common steps like loading <strong>of</strong> the mesh, setup <strong>of</strong> the curlcurl<br />
matrix and its LU decomposition have to be<br />
performed only once (see Fig. 4). Referring to the 3D<br />
simulation <strong>of</strong> the motor cross section, a speedup factor <strong>of</strong><br />
could be obtained. As was shown in [7],<br />
impedance matrix can be computed from<br />
<br />
Still, in order to avoid the explicit inversion <strong>of</strong> the sparse<br />
matrix , the equation system (5) has to be solved <br />
times, with being the number <strong>of</strong> conductors. A detailed<br />
overview <strong>of</strong> the implemented algorithm is depicted in<br />
Fig. 4.<br />
Figure 4: Flowchart <strong>of</strong> computational algorithm<br />
B. Simulation results<br />
The same motor geometry is used in the 2D (Fig. 2)<br />
and the 3D (Fig. 3) case. The simulation time <strong>of</strong> a 3D<br />
model with DOFs on a cluster with 60 nodes<br />
was for a sweep <strong>of</strong> 7 frequency points. Figure 5<br />
shows the magnitude <strong>of</strong> self-impedance <strong>of</strong> one stator<br />
conductor extracted from 2D and 3D simulations. The 2D<br />
solution which employs the lamination model (equivalent<br />
- 206 - 15th IGTE Symposium 2012<br />
permeability) shows major discrepancies from the 3D<br />
reference case. For additional validation <strong>of</strong> the simulation<br />
models, the same geometry was tested without laminated<br />
materials, i.e. the iron core forms a massive block. Thus,<br />
2D analysis does not require the lamination formulae and<br />
therefore should give the same results as 3D analysis. The<br />
corresponding impedance is shown in Fig. 6. 2D and 3D<br />
impedances can be found in very good agreement.<br />
Z / Ω<br />
10 0<br />
10 -7<br />
9%<br />
10 1<br />
Figure 5: Laminated iron with . Comparison <strong>of</strong> conductor<br />
impedance extracted from field simulations and relative error between<br />
2D and 3D results.<br />
Z / Ω<br />
10 -1<br />
10 -2<br />
10 -3<br />
10 -4<br />
10 -5<br />
10 -6<br />
10 -3<br />
10 -4<br />
10 -5<br />
10 -6<br />
10 0<br />
10 -7<br />
0.00%<br />
53%<br />
0.02%<br />
10 1<br />
Figure 6: Massive iron with . Comparison <strong>of</strong> conductor<br />
impedance extracted from field simulations and relative error between<br />
2D and 3D results.<br />
V. COMMON MODE INPUT IMPEDANCE<br />
A. Assembling <strong>of</strong> transmission line model<br />
The state <strong>of</strong> a multi-conductor line in the frequency<br />
domain is described by telegrapher’s equations. Taking<br />
into account the lumped parameter approximation, three<br />
matrix equations are obtained<br />
<br />
Z 1,1 laminated μ r,Fe = 1000<br />
94%<br />
10 2<br />
10 2<br />
97%<br />
10 3<br />
f / Hz<br />
10 3<br />
f / Hz<br />
<br />
<br />
<br />
<br />
where , , and are the voltage and<br />
current vectors at the front (z=0) and rear end (z=l) <strong>of</strong> the<br />
95%<br />
10 4<br />
Z 1,1 massive iron μ Fe = 1000<br />
0.13%<br />
1.82%<br />
6.43%<br />
10 4<br />
88%<br />
10 5<br />
2D<br />
3D<br />
9.58%<br />
10 5<br />
2D<br />
3D<br />
80%<br />
6.04%<br />
10 6<br />
10 6
motor, respectively, and and <br />
<br />
<br />
are the lumped element matrices in the pi-equivalent<br />
circuit (Fig. 1). The vector holds the currents through<br />
the impedances and can be eliminated by inserting one<br />
equation into the others. In the next step, the winding<br />
scheme is taken into account which further reduces the<br />
degrees <strong>of</strong> freedom in the system (9a-c). The proceeding<br />
can be found in [3]. It is important to set the current at the<br />
star point <strong>of</strong> the machine to zero, according to the<br />
common mode measuring setup, where the star point is<br />
not grounded. When solving the final equation system, a<br />
given voltage at the motor terminals will yield a certain<br />
input current. The ratio <strong>of</strong> the two quantities is the<br />
Common Mode Input Impedance . B. 240 kW induction motor<br />
The 2D and 3D plots (Fig. 7) show very good<br />
agreement, despite the deviations in the impedance<br />
matrices (Fig. 5). This is understandable since the stray<br />
capacitances are dominant in the common mode circuit <strong>of</strong><br />
the machine. However, impedances are important to<br />
predict resonance points. The frequency <strong>of</strong> the first<br />
resonance in the 2D curve is shifted by 6 kHz while its<br />
magnitude differs by 4%.<br />
Z / Ω<br />
10 3<br />
10 2<br />
10 1<br />
10 3<br />
first resonance<br />
| Z com | 2D vs. 3D<br />
10 4<br />
f / Hz<br />
Figure 7: Comparison <strong>of</strong> the Common Mode Impedance between 2D<br />
and 3D results.<br />
VI. CONCLUSION<br />
We developed a specialized 3D FE simulation code<br />
which is able to efficiently extract the impedance matrix<br />
<strong>of</strong> a multi-conductor setup, e.g. a motor cross section.<br />
Since the laminated iron is modelled with its actual<br />
geometry and material properties, 3D results are more<br />
accurate than those from 2D simulations with<br />
homogenized material approach. Finally, capacitance<br />
matrix, impedance matrix and the winding scheme are<br />
combined to obtain the frequency-dependent Common<br />
Mode Input Impedance <strong>of</strong> the machine. It can be<br />
observed that the solution is dominated by the<br />
10 5<br />
2D<br />
3D<br />
10 6<br />
- 207 - 15th IGTE Symposium 2012<br />
capacitances and, therefore, less sensitive to inaccuracies<br />
in the impedance matrix. In future work the nonlinear<br />
properties <strong>of</strong> the iron will be considered in the<br />
simulations and the end regions <strong>of</strong> the motor will be<br />
included in the Transmission Line Model.<br />
Acknowledgement This work is founded by the<br />
Deutsche Forschungsgemeinschaft (DFG) under the<br />
collaborative research group grant FOR 575.<br />
[1]<br />
REFERENCES<br />
S. Chen, T.A. Lipo, D. Fitzgerald, “Source <strong>of</strong> induction motor<br />
bearing currents caused by PWM inverters”, IEEE Trans. on<br />
Energy Conv., Vol. 11, Iss. 1, 1996, pp. 25–32.<br />
[2] I. Boldea, S. A. Nasar, “The induction machine handbook”, CRC<br />
Press, 2002.<br />
[3] H. Jorks, E. Gjonaj, T. Weiland, O. Magdun, “Three-dimensional<br />
simulations <strong>of</strong> an induction motor including eddy current effects in<br />
core laminations”, IET Science, Measurement & <strong>Technology</strong>, Vol.<br />
6, Iss. 5, Sep. 2012, pp. 344 – 349.<br />
[4] P. Maeki-Ontto, J. Luomi, “Induction motor model for the analysis<br />
<strong>of</strong> capacitive and induced shaft voltages”, Proc. <strong>of</strong> IEMDC '05,<br />
May 2005, pp. 1653-1660.<br />
[5] R.L. Stoll, “The analysis <strong>of</strong> eddy currents”, Oxford <strong>University</strong><br />
Press, 1974.<br />
[6] FEMM by David Meeker, version 4.2, available at<br />
[7]<br />
www.femm.info.<br />
A. Bossavit, "Two dual formulations <strong>of</strong> the 3-D eddy-current<br />
problem", COMPEL, Vol. 4, Iss. 2, 1985, pp. 103 – 116.<br />
[8] H. De Gersem, O. Henze, T. Weiland, A. Binder, “Simulation <strong>of</strong><br />
wave propagation effects in machine windings”, COMPEL, Vol.<br />
29, Iss. 1, 2010, pp. 23 – 38.
- 208 - 15th IGTE Symposium 2012<br />
Computation <strong>of</strong> end-winding inductances <strong>of</strong> rotating<br />
electrical machinery through three-dimensional<br />
magnetostatic integral FEM formulation<br />
F. Calvano 1 , G. Dal Mut 2 , F. Ferraioli 2 , A. Formisano 3 , F. Marignetti 4 ,<br />
R. Martone 3 , G. Rubinacci 1 , A. Tamburrino 4 and S. Ventre 4<br />
1 Dip. di Ingegneria Elettrica, Università di Napoli Federico II, Via Claudio 25, I-80124, Naples, Italy<br />
2 Ansaldo Energia, Via N. Lorenzi 8, I-16152, Genova, Italy<br />
3 Dip. di Ing. Industriale e dell’Informazione, Seconda Università di Napoli, Via Roma 29, I-81031, Aversa (CE), Italy<br />
4 Dip. di Ing. Elettrica e dell’Informazione, Univ. di Cassino e del Lazio Merid., Via Di Biasio 43, I-03043, Cassino,<br />
Italy<br />
Abstract—An effective numerical technique to calculate end-winding inductances <strong>of</strong> rotating electrical machinery is presented.<br />
The algorithm is based on a 3D integral formulation; it allows to take into account non linearities, relative speed between<br />
stator and rotor and is well suited for the treatment <strong>of</strong> regions with large air volumes. Numerical implementation concerning<br />
the analysis <strong>of</strong> a large synchronous generator highlights the advantages <strong>of</strong> the proposed method. The aim <strong>of</strong> the paper is to<br />
assess, by means <strong>of</strong> an accurate 3D model, the correction factor to be applied to the inductances computed through 2D models<br />
to take into account the effects due to end windings.<br />
Index Terms— End windings, Integral FEM approach, Inductances, Flux density numerical computation, Synchronous<br />
generators.<br />
I. INTRODUCTION<br />
Although the computation <strong>of</strong> the inductances <strong>of</strong><br />
rotating electrical machinery is a key issue both in the<br />
design stage and in performance assessment, its accurate<br />
calculation by Finite Elements approaches is still an open<br />
problem.<br />
The inductances can be split into two contributions:<br />
main and leakage inductances.<br />
The end-winding effect affects both the contributions:<br />
main and, especially, leakage inductances. End-winding<br />
inductances are at the base <strong>of</strong> both the steady-state<br />
operation and the dynamical behavior <strong>of</strong> electrical<br />
machinery [1-3]. The main inductances are related to the<br />
flux linkages while the leakage inductances can be<br />
divided into slot inductances, tooth tip inductances,<br />
skewing inductances and zigzag leakage inductances. In<br />
large machines, end windings contribute significantly to<br />
the values <strong>of</strong> both main and more significantly leakage<br />
inductances [4].<br />
The contributions to the inductances due to the active<br />
length <strong>of</strong> the conductor can easily be computed either<br />
numerically, by standard 2D FEM analyses, or<br />
analytically by means <strong>of</strong> infinite length models. On the<br />
contrary, the end winding contribution can only be<br />
computed from the actual 3D magnetic field distribution.<br />
Most analytical approaches to the magnetic field<br />
computation [5, 6], including the most recent ones [7], are<br />
based on the solution <strong>of</strong> the Biot-Savart law through<br />
equivalent current sheet representations, using<br />
axisymmetric hypothesis [8] or the theory <strong>of</strong> images [9].<br />
Both three dimensional and two-dimensional<br />
techniques based on FEM can be used as an alternative,<br />
but they also rely on rough approximations to reduce the<br />
number <strong>of</strong> nodes [10,11]. End regions fields and fluxes<br />
can be very complex to be computed, especially for large<br />
power machines, where end regions may occupy up to<br />
one third <strong>of</strong> the total machine length.<br />
The aim <strong>of</strong> the paper is to propose a method based on<br />
the use <strong>of</strong> an accurate 3D FEM simulation to improve the<br />
accuracy <strong>of</strong> the standard 2D model achieved via a<br />
commercial s<strong>of</strong>tware.<br />
The 3D FEM technique here proposed takes advantage<br />
from an integral formulation implemented in an noncommercial<br />
code. Such an approach has been previously<br />
applied to compute end winding forces [12,13].<br />
In this paper the analysis <strong>of</strong> a large synchronous<br />
generator is considered as an example. Field simulations<br />
in different working conditions are used to assess the<br />
influence <strong>of</strong> end effects on flux linkages.<br />
The main advantages <strong>of</strong> the proposed approach are<br />
manifold: (1) a reduced number <strong>of</strong> elements is required to<br />
model the end regions; because the integral formulations<br />
do not require the discretization <strong>of</strong> the air region but,<br />
rather, <strong>of</strong> the material regions only (conducting and/or<br />
magnetic materials); (2) neighboring elements do not<br />
need to share nodes allowing for more freedom in<br />
meshing complex geometries; (3) the formulation<br />
provides an inherent capability to include air-spaced<br />
moving parts, as no interface mesh is needed [13].<br />
This approach is therefore particularly effective to<br />
model generator end regions because it can also take into
account rotor motion and magnetic nonlinearities.<br />
The paper is organized as follows: Section II presents<br />
the basis <strong>of</strong> the integral formulation and its numerical<br />
implementation. Flux expressions coming from integral<br />
formulation are also discussed. Then, the 3D correction<br />
with respect to 2D fluxes and inductances is introduced in<br />
Section III.<br />
Such formulation is used in Section IV to look for the<br />
3D flux density in the end regions <strong>of</strong> a large synchronous<br />
generator. The 3D correction to the 2D quantities are then<br />
performed according to the proposed formulation.<br />
Section IV contributes in particular to extend the<br />
knowledge <strong>of</strong> rotating electrical machinery by providing a<br />
powerful numerical tool to compute lumped parameters in<br />
the analytical model <strong>of</strong> synchronous generators with a<br />
precision superior to that achieved by classical 2D finite<br />
element models. As matter <strong>of</strong> fact, with the proposed<br />
model, the contribution to terminal quantities such as the<br />
reactances coming from the end winding region can be<br />
accurately taken into account. Terminal quantities are<br />
finally compared with experimental measurements.<br />
II. INTEGRAL FORMULATION AND ITS NUMERICAL<br />
IMPLEMENTATION<br />
It is well known that for a synchronous machine<br />
operating at steady state for the computation <strong>of</strong> the<br />
inductances it is sufficient to refer to a nonlinear<br />
magnetostatic model [16, 17]. The currents in the stator<br />
and in the rotor coils are supposed to be assigned for any<br />
position <strong>of</strong> the rotor in order to focus the attention on the<br />
magnetostatic problem formulation. However for assigned<br />
voltage, active and reactive power field currents and<br />
inductances can be computed by solving a sequence <strong>of</strong><br />
non linear mangetostatic problems [17]. In any case it is<br />
possible to neglect the effects <strong>of</strong> the eddy currents in the<br />
massive conductive parts <strong>of</strong> the device.<br />
The numerical model is based on an integral<br />
formulation <strong>of</strong> the nonlinear magnetostatic problem in<br />
terms <strong>of</strong> the unknown magnetization M. The solution is<br />
obtained by means <strong>of</strong> a Picard-Banach iteration whose<br />
convergence can be theoretically proved when the<br />
magnetic constitutive equation is uniformly monotonic<br />
and Lipschitzian [14, 15].<br />
In particular, by using the Biot-Savart law, the<br />
magnetic induction can be expressed in terms <strong>of</strong> its<br />
sources as:<br />
( ) = <br />
( ) =<br />
ˆ S ( ) +<br />
0 ( )<br />
μ0<br />
(<br />
( r−r') ) 3<br />
Br Mr B r<br />
μ Mr − ∇⋅ 'Mr' 4π Vf<br />
r−r' dV '+<br />
μ<br />
( r−r') + ( ') ⋅ ˆ ( ') dS ', for∈V<br />
3<br />
f<br />
4 Mr nr<br />
r<br />
r−r' 0<br />
π ∂Vf<br />
- 209 - 15th IGTE Symposium 2012<br />
(3.1)<br />
where BS is the magnetic induction produced in the free<br />
space by the stator and rotor currents, Vf is the region<br />
filled by the magnetic materials, ∂Vf is its boundary and<br />
ˆn is the outward unit normal on ∂Vf.<br />
The nonlinear constitutive relationship in Vf can be<br />
expressed by introducing the local operator as<br />
M(r)=[B(r)] in Vf<br />
(3.2)<br />
Therefore M is the solution <strong>of</strong> the following nonlinear<br />
problem:<br />
M(r)=[M] in Vf<br />
(3.3)<br />
As shown in [14], the operator is a contraction if is<br />
uniformly monotonic and Lipschitzian. Therefore, the<br />
solution exists, is unique and can be found by the fixed<br />
point iteration.<br />
From the numerical point <strong>of</strong> view, the magnetization<br />
can be expressed in terms <strong>of</strong> piece-wise constant vector<br />
shape functions in each elementary volume arising the<br />
after discretization <strong>of</strong> Vf, such as<br />
M<br />
() r = M jP<br />
j () r in V f<br />
j<br />
(3.4)<br />
where the Pj's are discontinuous shape functions obtained<br />
multiplying the scalar pulse functions pk's (pk = 1 in the kth<br />
element and it is zero elsewhere) by the (three) unit<br />
vectors along the coordinate axes.<br />
The numerical model is obtained by applying the<br />
Galerkin method to (3.3), rewritten as -1 [M]=[M] in<br />
Vf:<br />
-1<br />
Pi [ M] Pi [<br />
M]<br />
⋅ dV = ⋅ dV, ∀i<br />
Vf Vf<br />
The fixed point iteration is therefore rewritten as:<br />
<br />
V f<br />
k+<br />
1<br />
k<br />
[ M ] dV Pi<br />
⋅ [ M ]<br />
-1<br />
Pi ⋅ <br />
<br />
V f<br />
V f<br />
<br />
P ⋅ P dV<br />
i<br />
i<br />
=<br />
<br />
V f<br />
P ⋅ P dV<br />
i<br />
i<br />
dV<br />
, ∀i<br />
(3.5)<br />
(3.6)<br />
where the subscript k indicates the approximation <strong>of</strong> M<br />
and B at the k-th iteration. Note that being the term<br />
the volume <strong>of</strong> the i-th element, the r.h.s <strong>of</strong><br />
P ⋅ P dV<br />
<br />
V f<br />
i j<br />
(3.6) is the average <strong>of</strong> the magnetic induction B k =[M k ]<br />
in the i-th elementary volume at the iteration k. Being the<br />
magnetization piece-wise constant, eq. (3.6) can be solved<br />
for M k+1 in each element, by applying the constitutive<br />
relation to the average magnetic induction in the same<br />
element.<br />
Therefore, after discretization, (3.6) gives rise to the<br />
following fixed point iteration [14, 15]:<br />
( )<br />
k −1<br />
k<br />
B = D EM + W<br />
(3.7)
where:<br />
( )<br />
k+ 1<br />
k<br />
M = G B<br />
(3.8)<br />
( ) ⋅ ( ) ( ) ⋅ ( )<br />
μ ˆ ˆ ' '<br />
0 nr Pi r nr Pj r <br />
Eij = μ0Dij−<br />
<br />
dSdS'<br />
4π <br />
r−r' ∂Vi ∂Vj<br />
<br />
Dij<br />
= Pi⋅PjdV (3.9)<br />
Vf<br />
<br />
Wi<br />
= Pi⋅BSdV Vf<br />
Vi is the volume <strong>of</strong> the i-th element, M k the column<br />
vector <strong>of</strong> the coefficients in (3.4) at the k-th iteration, B k<br />
the column vector made by the average magnetic<br />
induction in the elements and G is the global relationship<br />
corresponding to after the discretization process.<br />
The flux Φn linked with the n-th circuit <strong>of</strong> volume τ and<br />
produced by the currents flowing in a set <strong>of</strong> coils is<br />
defined as:<br />
<br />
Φ n = Ar () ⋅Jn()<br />
r dτ<br />
τ n<br />
(3.10)<br />
where A is the magnetic vector potential associated to all<br />
the sources and Jn is the current density associated to the<br />
unit current impressed in the n-th circuit.<br />
This definition is consistent with the definition <strong>of</strong> the<br />
magnetic energy in the linear case and with the definition<br />
<strong>of</strong> the flux linked with a circuit <strong>of</strong> infinitely small crosssection.<br />
As a matter <strong>of</strong> fact, in this case it results:<br />
Jn() r<br />
Φ n = Ar () ⋅ dτ<br />
=<br />
I<br />
τ n<br />
n<br />
In<br />
1<br />
= () ˆ <br />
Ar ⋅ tSd<br />
n =<br />
S<br />
n<br />
n I<br />
γ<br />
n<br />
= Ar () ⋅tˆd<br />
<br />
γ n<br />
(3.11)<br />
being γn the closed curve defining the axis <strong>of</strong> the circuit,<br />
ˆt the unit tangent vector and In the unit current flowing<br />
in the n-th filamentary circuit.<br />
The magnetic vector potential appearing in (3.11) can<br />
be calculated from both free and (magnetic) polarization<br />
currents after (3.3) is solved with suitable boundary<br />
conditions by applying the Biot-Savart law for the<br />
magnetic vector potential:<br />
( ) = ˆ ( )<br />
Ar r<br />
μ<br />
<br />
( ', t)<br />
× ( − ')<br />
Mr r r<br />
0 AS+ dV', for<br />
r∈V<br />
3<br />
f<br />
4π V r−r' f<br />
(3.12)<br />
In (3.12) A has been written as the sum <strong>of</strong> the<br />
contribution <strong>of</strong> the free and magnetizing currents. All the<br />
procedure is quite time consuming when high number <strong>of</strong><br />
unknowns are treated; however a very effective<br />
computational tool has been recently proposed [18] based<br />
on a suitable use <strong>of</strong> high performance computing<br />
- 210 - 15th IGTE Symposium 2012<br />
architecture.<br />
III. THE 3D CORRECTION TO THE 2D SOLUTION<br />
In the case here treated the 2D solution provides an<br />
accurate solution in a large part <strong>of</strong> the domain <strong>of</strong> interest.<br />
As a consequence, the 3D analysis can be limited just to<br />
the region where it is really required.<br />
The classical theory [19], <strong>of</strong> the electrical machinery<br />
suggests to split fluxes and inductances in a 2D and 3D<br />
contributions, each corresponding to one <strong>of</strong> the two<br />
geometrical parts <strong>of</strong> the system geometry.<br />
Unfortunately such a separation is rather questionable<br />
and ambiguous. Then here a quite different approach is<br />
suggested: the actual 3D magnetic flux linked with a close<br />
line, is split in (a) the 2D part Φn (2D) (evaluated by 2D<br />
flux per the unit length, in the axial symmetrical region,<br />
multiplied for the length) and (b) the complement ΔΦn (3D)<br />
defined as the 3D correction requested for the case at<br />
hand. Similar consideration can be applied to other<br />
parameter including the main or flux leakage coefficients.<br />
IV. NUMERICAL EXAMPLE<br />
Among the rotating electrical machinery, large turbogenerators<br />
are endowed with quite long end windings<br />
which contribute poorly to produce linkage flux but,<br />
unfortunately, to produce significant leakage flux [20].<br />
Such a contribution is generally evaluated by simplifying<br />
considerably the complex geometry and, in addition, by<br />
neglecting the non linearity’s [7].<br />
The integral approach presented in the Section II is a<br />
powerful tool able provide a deeper analysis <strong>of</strong> the<br />
machinery and to overcome both limitations. This kind <strong>of</strong><br />
information is particularly relevant in the design process<br />
<strong>of</strong> the turbine-generator where the flux leakage has a<br />
preeminent significance.<br />
In the following a numerical example is presented in<br />
order to assess the consistency <strong>of</strong> the 3D corrections to be<br />
considered for different operating conditions.<br />
The turbine generator simulated has a rated apparent<br />
power in the range 300-350 MVA depending on the room<br />
temperature affecting the cooling system. It has two poles<br />
and the nominal frequency is 50 Hz.<br />
Some details <strong>of</strong> the finite element mesh <strong>of</strong> the iron<br />
regions denoted (Vf ) as well as <strong>of</strong> the field and armature<br />
coils, are reported in Fig.1.<br />
The 3D FEM model is characterized by a number <strong>of</strong><br />
45593 nodes (24201 in the iron region Vf including stator<br />
and rotor iron as well as the enclosure) and 17774<br />
elements (12478 in Vf ). It is worth noticing that the a non<br />
conformal mesh <strong>of</strong> the iron regions Vf has been adopted in<br />
order to exploit some additional geometrical degrees <strong>of</strong><br />
freedom in the sub-regions where a particular refinement<br />
<strong>of</strong> the mesh is necessary.
Armature coil<br />
Field coil<br />
Stator<br />
iron<br />
Rotor<br />
iron<br />
Enclosure<br />
Figure 1: Section <strong>of</strong> the Finite element Mesh used in the<br />
computations.<br />
Boundary conditions. In principle the complete<br />
geometry <strong>of</strong> the machine should be treated. However just<br />
a part has been considered while the effect <strong>of</strong> the<br />
remaining part has been forced by suitable boundary<br />
conditions in a cutting plane in the region where the 3D<br />
solution actually meets the 2D approximation.<br />
Material characterization. The magnetic iron properties<br />
has been represented in the (3.2) form by substituting the<br />
BH curve in B=μ0(H+M).<br />
In the following the no load operation mode is<br />
considered: the rotor is assumed to rotate at the nominal<br />
angular speed, a DC current is applied to the field coil<br />
and an open circuit is imposed to the armature terminals.<br />
Notice that, for a synchronous machine such an<br />
operative condition can be examined by means <strong>of</strong> just a<br />
single magneto-static problem. As a matter <strong>of</strong> fact, such<br />
solution provides as many samples <strong>of</strong> the time evolution<br />
<strong>of</strong> the terminal voltage as the number <strong>of</strong> the stator slot if<br />
the stator winding is a double layer one [16].<br />
Of course the non-linear iron effects affects the main<br />
flux and, consequently, the output voltages. Therefore the<br />
magnetic analysis has been repeated for several rotor<br />
currents, including, 100, 500 and 1000A, respectively.<br />
In particular the field current 500 A corresponds to the<br />
rated voltage at the generator terminals. In the following,<br />
the 500 A case is discussed in details while the other two<br />
currents are considered just to evaluate the saturation<br />
effect on ΔΦn (3D) .<br />
The three-dimensional distribution <strong>of</strong> the magnetic<br />
vector potential on the field and armature coils are<br />
sketched in figs. 2, 3 and the flux density in figs. 4, 5,<br />
assuming the boundary condition imposed on the<br />
symmetry plane and an excitation current <strong>of</strong> 500 A.<br />
Flux waveforms as well as their amplitude spectrum is<br />
then calculated according to (3.11); the results are<br />
reported in fig. 6. In order to look for higher harmonics, a<br />
spectrum analysis <strong>of</strong> the principal flux has been<br />
performed (fig. 7).<br />
- 211 - 15th IGTE Symposium 2012<br />
Figure. 2: amplitude <strong>of</strong> the magnetic vector potential<br />
[Tm] in the armature coil.<br />
Figure. 3: amplitude <strong>of</strong> the magnetic vector potential<br />
[Tm] in the field coil.<br />
Figure. 4: amplitude <strong>of</strong> the magnetic induction [T] in the<br />
armature coil.<br />
Figure. 5: amplitude <strong>of</strong> the magnetic induction [T] in the<br />
field coil.
As mentioned before, the 2D field coincides with the<br />
3D field in the neighborhood <strong>of</strong> the symmetry plane. The<br />
analysis <strong>of</strong> the vector potential and flux density<br />
components <strong>of</strong> the 3D model can be compared to the 2D<br />
solution, to evaluate the validity <strong>of</strong> the 2D approximation.<br />
The 2D solution matches the 3D one in a large part <strong>of</strong> the<br />
active length, with a good approximation (in the order <strong>of</strong><br />
90%).<br />
The comparison <strong>of</strong> magnetic flux, inductance<br />
coefficient per unit length <strong>of</strong> both solutions provides the<br />
desired correction factors. The flux waveforms as well as<br />
the amplitude spectrum from the 2D solution are reported<br />
in figs. 6, 7 and the results are compared with those from<br />
3D solution.<br />
Figure. 6: Flux linkage [Wb] waveforms <strong>of</strong> a single a<br />
stator coil<br />
From the comparison <strong>of</strong> the main fluxes it follows the<br />
2D solution underestimates the flux as well as the no load<br />
voltage <strong>of</strong> ΔΦn (3D) =2%.<br />
Unfortunately, due to its complexity, both the accuracy<br />
and the resolution <strong>of</strong> the 3D solution could be<br />
unsatisfactory for a number <strong>of</strong> applications. Of course, the<br />
2D analysis is able to provide more accurate and robust<br />
solution in the plane. Therefore, in order to assess the<br />
quality <strong>of</strong> the 2D solution given by the 3D analysis, a 2D<br />
analysis <strong>of</strong> linkage fluxes has been carried out by using<br />
the commercial s<strong>of</strong>tware package Ansys (Release 13) and<br />
the voltage computed from the principal flux has been<br />
compared with both the 3D evaluations and the<br />
experimental measurements.<br />
The finite element mesh used in this case is shown in<br />
fig. 8 and consists <strong>of</strong> 125549 nodes and 5754 second<br />
order triangular elements.<br />
The no load voltage ha been computed and compared<br />
to the 3D calculations. The actual end-winding effect,<br />
neglected by the 2D solution, is highlighted in fig. 9<br />
where the air gap radial magnetic induction as a function<br />
<strong>of</strong> the angle θ and <strong>of</strong> the axial position Z is plotted. In<br />
addition, to further assess the analysis the 2D no load<br />
voltage has been also compared with a set <strong>of</strong> experimental<br />
measurements.<br />
The discrepancy is rather vanishing (below 1%). Of<br />
- 212 - 15th IGTE Symposium 2012<br />
course such a result comes by an equilibrium <strong>of</strong> two<br />
conflicting effects: the first is lack <strong>of</strong> the end winding<br />
contribution and the second the error introduced by<br />
neglecting the 3D effects in the 2D evaluations.<br />
Figure. 7: Flux linkage [Wb] amplitude spectrum.<br />
Fig. 8: 2D finite element mesh used in the 2D<br />
calculations.<br />
Figure. 9: 3D rendering <strong>of</strong> the radial magnetic flux at the<br />
air gap.
Similar results can be achieved with different currents<br />
(discrepancy in the order 2-3% <strong>of</strong> the actual flux linkage<br />
with 100 A or 1000 A) showing that, in no load operation<br />
the effect <strong>of</strong> iron non linearity is quite limited.<br />
The same procedure is applied to evaluate the leakage<br />
flux and its contribution coming from 3D effects. The<br />
results show that the discrepancy is much more relevant.<br />
In the order <strong>of</strong> 10, 15, 20 % for an exciting current <strong>of</strong><br />
100, 500 and 1000 A, respectively.<br />
V. CONCLUSION<br />
The computation <strong>of</strong> end winding inductances <strong>of</strong> large<br />
turbo generators requires proper mathematical tools to be<br />
performed. This paper proposes an integral FEM<br />
formulation to compute the 3D vector potential and flux<br />
density distribution in the end regions. The approach does<br />
not requires the meshing <strong>of</strong> the free space than allowing a<br />
significant reduction <strong>of</strong> computer burden.<br />
A large synchronous generator with power in the range<br />
300-350 MVA has been analyzed as a case study. Both<br />
axial and radial components <strong>of</strong> the flux density generated<br />
by stator coils have been computed.<br />
The flux linkages for all coils and the no load voltages<br />
have been computed.<br />
In order to assess the influence <strong>of</strong> the end regions,<br />
different stack lengths have been simulated for the same<br />
end windings length. The results achieved include the<br />
definition <strong>of</strong> 3D correction to be applied to 2D<br />
simulation and, in addition, the variation <strong>of</strong> the correction<br />
as a function <strong>of</strong> the exciting currents has been evaluated.<br />
The comparison between terminal quantities coming from<br />
widely overspread 2D models and experimental<br />
measurements have been reviewed by considering 3D<br />
effects evaluated by using the proposed model.<br />
REFERENCES<br />
[1] B. Hosninger, “Theory <strong>of</strong> end-winding leakage reactance”, Power<br />
Apparatus and Systems, Part III, vol. 78, pp. 417-426, Aug. 1959.<br />
[2] W. M. Arshad, H. Lendenmann, Y. Liu, J.-O. Lamell and H.<br />
Persson, “Finding end winding inductances <strong>of</strong> MVA machines”<br />
Proc. Fortieth IAS Meeting, vol. 4, pp. 2309-2314, 2005.<br />
[3] M.F. Hsieh, Y.C. Hsu, D.G. Dorrell, and K.H. Hu, "Investigation<br />
on end winding inductance in motor stator windings", IEEE<br />
Trans. on Magn., vol. 43, pp. 2513–2515, June 2007.<br />
[4] J. Pyrhönen, T. Jokinen and V. Hrabovcová, Design <strong>of</strong> Rotating<br />
Electrical Machines, John Wiley and Sons, Ltd, 2008.<br />
[5] J. A. Tegopoulos, "End component <strong>of</strong> armature leakage reactance<br />
<strong>of</strong> turbine generators", IEEE Trans. on PAS, vol. 83, pp. 632-637,<br />
June 1964.<br />
[6] J. A. Tegopoulos, “Current sheets equivalent to the end-winding<br />
currents <strong>of</strong> turbine generator stator and rotor,” AIEE Trans. Pt. III,<br />
vol. PAS-81, pp. 695–700, February 1963.<br />
[7] V.S Lazarns, A.G Kladas, A.G Mamalis, and J.A. Tegopoulos,<br />
"Analysis <strong>of</strong> end zone magnetic field in generators and shield<br />
optimization for force reduction on end windings", IEEE Trans.<br />
on Mag., vol. 45, pp.1470–1473, March 2009.<br />
- 213 - 15th IGTE Symposium 2012<br />
[8] D. J. Scott, S. J. Salon, and G. L. Kusik, “Electromagnetic forces<br />
on the armature end windings <strong>of</strong> large turbine generators I—<br />
Steady state conditions”, IEEE Trans. PAS., vol. PAS-100, pp.<br />
4597–4603, Nov. 1981.<br />
[9] Q. Li and F. Wang, “Application <strong>of</strong> image method to calculate 3-<br />
D magnetic field and parameters <strong>of</strong> SC alternator”, IEEE Trans.<br />
on Mag., vol. 25, pp. 1850–1853, Feb. 1989.<br />
[10] D. Ban, D. Zarko, and I. Mandic, “Turbo-generator end winding<br />
leakage inductance calculation using a 3D analytical approach<br />
based on the solution <strong>of</strong> Neumann integrals”, IEEE Trans. on En.<br />
Conv., vol. 20, pp. 98–105, March 2005.<br />
[11] A.T. Brahimi, A. Foggia, and G. Meunier, "End winding<br />
reactance computation results using a 3D finite element program"<br />
IEEE Trans. on Mag., vol. 29, pp. 1411-1414, March 1993.<br />
[12] R. Albanese, F. Calvano, G. Dal Mut, F. Ferraioli, A. Formisano,<br />
F. Marignetti, R. Martone, G. Rubinacci, A. Tamburrino and S.<br />
Ventre, "Coupled three dimensional numerical calculation <strong>of</strong><br />
forces and stresses on the end windings <strong>of</strong> large turbo generators<br />
via Integral Formulation", IEEE Trans. on Mag., vol. 48, pp. 875<br />
- 878, Feb. 2012.<br />
[13] F. Calvano, G. Dal Mut, F. Ferraioli, A. Formisano, F. Marignetti,<br />
R. Martone, G. Rubinacci, A. Tamburrino and S. Ventre, “A<br />
novel technique based on integral formulation to treat the motion<br />
in the analysis <strong>of</strong> electric machinery”, International Journal <strong>of</strong><br />
Applied Mathematics and Mechanics, in press.<br />
[14] R. Albanese, F. I. Hantila, and G. Rubinacci, “A nonlinear eddy<br />
current integral formulation in terms <strong>of</strong> a two-components current<br />
density vector potential”, IEEE Trans. Mag. 32, pp. 784-787,<br />
March 1996.<br />
[15] R. Albanese, and G. Rubinacci, “Finite elements methods for the<br />
solution <strong>of</strong> 3D eddy current problems”, Advances in Imaging and<br />
Electron Physics, vol. 102, pp. 1-86, 1998.<br />
[16] N. Bianchi, Electrical machine analysis using finite elements,<br />
Taylor and Francys, pp. 141-162, 2005.<br />
[17] M.V.K. Chari, S.H. Minnich, S.C. Tandon, Z.J. Csendes, J.<br />
Berkery, “Load characteristics <strong>of</strong> synchronous generator by the<br />
finite element method”, IEEE Trans. on PAS, vol. 100, pp.1-13,<br />
January 1981.<br />
[18] R. Albanese, F. Calvano, G. Dal Mut, F. Ferraioli, A. Formisano,<br />
F. Marignetti, R. Martone, G. Rubinacci, A. Tamburrino and S.<br />
Ventre, “Electromechanical analysis <strong>of</strong> end windings in turbo<br />
generators”. COMPEL, vol. 30, pp. 1885-1898, 2011.<br />
[19] J. Pyrhonen, T. Jokinen, V. Hrabokova, Design <strong>of</strong> Rotating<br />
Electrical Machines, John Wiley and sons, Ltd, 2008, pp.246-249<br />
[20] M.V. Deshpande, Design and Testing <strong>of</strong> Electrical Machines,<br />
Phi learning Pvt. Ltd., 2010.
- 214 - 15th IGTE Symposium 2012<br />
Magnetomechanical Coupled FE Simulations <strong>of</strong><br />
Rotating Electrical Machines<br />
*A. Belahcen, *K. Fonteyn, † R. Kouhia, *P. Rasilo, and *A. Arkkio<br />
*Aalto <strong>University</strong>, Dept. <strong>of</strong> Electrical Engineering, POBox 13000, FIN-00076 Aalto, Finland<br />
† Tampere <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Dept. <strong>of</strong> Mechanics and Design, P.O BOX 589, 33101 Tampere, Finland<br />
E-mail: anouar.belahcen@aalto.fi<br />
Abstract— Regardless <strong>of</strong> the relatively large amount <strong>of</strong> published models <strong>of</strong> magnetostriction, only few <strong>of</strong> them have been<br />
applied to describe this phenomenon in electrical steel and even less have been incorporated in the FE simulation <strong>of</strong> electrical<br />
machines. In this paper we review the models <strong>of</strong> magnetostriction and magnetomechanical coupling in electrical steel and their<br />
incorporation into the FE analysis <strong>of</strong> rotating electrical machines. We also discuss the advantages and disadvantages <strong>of</strong> the<br />
different models and present an energy-based coupled magnetomechanical set <strong>of</strong> constitutive equations that describe both the<br />
magnetostriction and the magnetic nonlinearity and its dependency on stresses in the electrical steel. We further present the<br />
implementation <strong>of</strong> these equations into in-house 2D FE s<strong>of</strong>tware for the simulations <strong>of</strong> electrical machines. The simulations<br />
carried out show that the energy based model describes well the vibrations <strong>of</strong> electrical machines due to magnetostriction and<br />
reluctance forces. A discussion on how the model should be improved to account for hysteresis is also presented.<br />
Index Terms—coupled models, electrical machines, finite elements, magnetostriction.<br />
current density and the geometry <strong>of</strong> the windings are<br />
known. However, some issues related to the skin-effect<br />
and the eddy-currents make this computation rather<br />
complex in some special cases as explained by Islam et al.<br />
[1]. In iron, and due to its magnetic domain structure and<br />
its finite electric conductivity, the flow <strong>of</strong> a time-varying<br />
flux produces hysteresis and eddy-current losses (in some<br />
approaches also excess losses). The computation <strong>of</strong> these<br />
so-called iron losses is still one <strong>of</strong> the most active<br />
research fields in the simulation <strong>of</strong> electrical machines.<br />
Finally, the motion <strong>of</strong> the rotating parts <strong>of</strong> the machine<br />
and the friction in the bearings <strong>of</strong> the machine as well as<br />
the one between the moving parts and the air produces<br />
mechanical losses that can be computed through complex<br />
CFD models in case <strong>of</strong> high speed machines or<br />
approximated by semi-empirical equations.<br />
The knowledge <strong>of</strong> the above loss components is<br />
valuable information for the designers <strong>of</strong> electrical<br />
machines as they allow them to optimize the structure <strong>of</strong><br />
the machine with regards to the cooling and the<br />
mechanical strength as well as the use <strong>of</strong> magnetic and<br />
other materials.<br />
Besides the structural and cooling optimization <strong>of</strong> the<br />
machine, the designers are bind by environmental aspects<br />
such as the level <strong>of</strong> acoustic noise and the estimation <strong>of</strong><br />
the lifecycle <strong>of</strong> the machine. The acoustic noise is<br />
produced by the vibrations <strong>of</strong> the structure <strong>of</strong> the machine<br />
under the effect <strong>of</strong> different forces and other excitations<br />
and by the airflow in different channels.<br />
I. INTRODUCTION<br />
Owing to the high demand on energy efficient and<br />
environmental friendly apparatuses, the designers <strong>of</strong><br />
electrical machines, among others, are more and more<br />
interested in accurate computational methods for the<br />
analysis <strong>of</strong> their design. The energy conversion in<br />
electrical machines occurs within three different but<br />
tightly coupled subsystems, i.e., the electrical system, the<br />
magnetic system and the mechanical system as shown in<br />
Fig. 1. The electric system consists <strong>of</strong> the windings <strong>of</strong> the<br />
machine that are connected to the supply in the case <strong>of</strong> a<br />
motor or to the load in case <strong>of</strong> generator operation. The<br />
electric supply/load is nowadays typically a voltage<br />
source frequency converter, the current <strong>of</strong> which is<br />
controlled according to the operation point <strong>of</strong> the<br />
machine. Such current depends on the load <strong>of</strong> the<br />
machine and consists <strong>of</strong> a torque producing component as<br />
well as a component necessary for the magnetization <strong>of</strong><br />
the iron core and another compensating for the core<br />
losses. The magnetic system consists <strong>of</strong> the iron core <strong>of</strong><br />
the machine as well as the airgap and other construction<br />
parts in which a magnetic flux is produced by the coils’<br />
currents according to the Ampere’s law <strong>of</strong> induction.<br />
These fluxes and their interaction with the magnetic<br />
materials produce forces and torque that are transferred to<br />
the mechanical system consisting <strong>of</strong> the rotor, the shaft<br />
and the bearings <strong>of</strong> the machine as well as a possible<br />
cooling fan mounted on the shaft <strong>of</strong> the machine.<br />
The electric, magnetic and mechanical systems,<br />
although represented as separate subsystems, are tightly<br />
coupled to each other and their operation quantities<br />
cannot be solved separately. Indeed the current drawn by<br />
the machine, the torque it produces and the magnetic flux<br />
density in the airgap and the iron core are usually solved<br />
simultaneously especially if the machine is voltage fed.<br />
The operation <strong>of</strong> the above subsystems is known to be<br />
dissipative as there are energy losses related to each<br />
subsystem. The Joule losses resulting from the currents<br />
flowing in the windings are usually easy to compute if the<br />
Electrical power<br />
Lorentz forces<br />
Electrical<br />
System (windings)<br />
Electrical<br />
Losses<br />
Magnetic forces and<br />
Magnetostriction<br />
Coupling field<br />
(iron and air)<br />
Air flow and Friction<br />
Vibrations Wearing and Noise<br />
Magnetic<br />
Losses<br />
Mechanical system<br />
(bearings and fan)<br />
Mechanical<br />
Losses<br />
Mechanical power<br />
Fig. 1. Illustration <strong>of</strong> the energy conversion process with the<br />
related electric, magnetic and mechanical subsystems and<br />
related losses, forces and vibrations and their origins.
The interaction between the flux and the currents<br />
produces Lorentz forces acting on the windings, while the<br />
flow <strong>of</strong> the magnetic flux in the iron core and the airgap<br />
gives rise to magnetic forces and strains in the core.<br />
The vibrations produced by the interaction between the<br />
magnetic forces, the deformations and the structure result<br />
in acoustic noise and mechanical wearing <strong>of</strong> different<br />
parts <strong>of</strong> the machine such as the winding insulation and<br />
the bearings. The knowledge <strong>of</strong> these parasitic effects at<br />
the design stage will help in reducing the vibrations and<br />
noise <strong>of</strong> the machine as well as estimating the lifecycle <strong>of</strong><br />
the machine and optimizing the structure for longer life<br />
too. An illustration <strong>of</strong> the different losses and vibration<br />
phenomena occurring at different subsystems <strong>of</strong> the<br />
energy conversion is given in Fig. 1.<br />
The strains and stresses in the iron core <strong>of</strong> an electrical<br />
machine are produced by different sources and have<br />
strong degrading effect on the quality <strong>of</strong> the iron, thus<br />
reducing the efficiency <strong>of</strong> the energy conversion process<br />
and increasing the amount <strong>of</strong> iron needed for a given<br />
power <strong>of</strong> the machine. The effect <strong>of</strong> the mechanical stress<br />
on the power losses in electrical steel have been known<br />
for quite long time. Already in the 70’s <strong>of</strong> the last century<br />
Moses [2], [3], among others, showed through magnetic<br />
measurements that the mechanical stress affects the<br />
losses, the magnetization, and the magnetostriction <strong>of</strong><br />
electrical steel. By magnetostriction it was meant the<br />
relative change in the length <strong>of</strong> a specimen <strong>of</strong> magnetic<br />
material when it is subjected to a magnetic field.<br />
For a better understanding let us first clarify what is<br />
magnetostriction. In 1842 W. P. Joule discovered what is<br />
today called Joule magnetostriction, which is a volume<br />
conserving deformation <strong>of</strong> magnetic material caused by<br />
its magnetization. Such a deformation results in an<br />
elongation <strong>of</strong> the material in the direction <strong>of</strong><br />
magnetization and a shrink in the orthogonal directions<br />
for positive magnetostrictive materials and vice versa for<br />
negative magnetostrictive materials. Later on, it was<br />
observed that at high values <strong>of</strong> the magnetization the<br />
deformation is no more volume conserving and this<br />
phenomena was called volume magnetostriction. Both<br />
types <strong>of</strong> magnetostriction are called forced<br />
magnetostriction in a sense that they are caused by the<br />
magnetization that forces the magnetic domain walls to<br />
move and the domains to rotate thus producing the<br />
mechanical deformation. On the other hand, when the<br />
magnetic material is cooled down from a high<br />
temperature, it undergoes a strong isotropic change in its<br />
dimensions as it goes though the Curie temperature. Such<br />
a change in volume was explained by the formation <strong>of</strong><br />
magnetic domains and the orientation <strong>of</strong> elementary<br />
magnetic moments within the domains. Several other<br />
experimental works have been conducted on magnetic<br />
materials and separate magnetomechanical phenomena<br />
obtained separate names according to their respective<br />
discoverers. The skew magnetostriction ,e.g., resulting<br />
from a helical magnetization and producing a bending <strong>of</strong><br />
some electrically conducting magnetic material has been<br />
called Wiedemann-effect and the inverse effect <strong>of</strong><br />
- 215 - 15th IGTE Symposium 2012<br />
magnetostriction which results in a change <strong>of</strong> the<br />
magnetization properties <strong>of</strong> magnetic materials under the<br />
action <strong>of</strong> applied mechanical stress was called Vilari<br />
effect. Also the apparent change in the Young modulus <strong>of</strong><br />
magnetic material, which is due to the intrinsic<br />
rearrangement <strong>of</strong> the magnetic domains and thus the<br />
intrinsic elongation following this rearrangement, has<br />
been called Delta-E effect. A comprehensive description<br />
<strong>of</strong> these phenomena can be found, e.g., from [4]. The<br />
intrinsic forms <strong>of</strong> magnetostriction are nowadays referred<br />
to as spontaneous magnetostriction. They are <strong>of</strong> great<br />
interest for metallurgist but are not <strong>of</strong> importance for the<br />
simulation <strong>of</strong> electrical machines under operation as they<br />
do not occur anymore at this stage. Fig. 2 shows an<br />
illustration <strong>of</strong> the difference between spontaneous and<br />
forced magnetostriction.<br />
Cooling through<br />
Curie temperature<br />
H<br />
Positive Joule<br />
magnetostriction<br />
H=0<br />
Applying external<br />
magnetic field<br />
Spontaneous<br />
Negative Joule<br />
magnetostriction<br />
H<br />
Forced magnetostriction<br />
magnetostriction<br />
Fig. 2. Illustration <strong>of</strong> spontaneous and forced magnetostriction.<br />
The mechanical stress or strain acting on the magnetic<br />
material can originate from different phenomena some <strong>of</strong><br />
them produce static stresses and other dynamic stresses.<br />
The shrink fitting <strong>of</strong> the stator into the frame <strong>of</strong> the<br />
machine produces static compressive stresses <strong>of</strong> the order<br />
<strong>of</strong> 10 MPa as shown by Fujisaki et al. [5]. These stresses<br />
can be evaluated by means <strong>of</strong> numerical simulations or<br />
through analytical approximations, but due to the<br />
manufacturing tolerances they may have excessive local<br />
values. On the other hand the so called reluctance forces<br />
occurring between the stator and the rotor <strong>of</strong> the machine<br />
and which are time and space dependent, produce<br />
dynamic tensile and compressive stresses at different<br />
locations <strong>of</strong> the machine. These stresses are at the origin<br />
<strong>of</strong> the so called magnetic noise, i.e., the acoustic noise<br />
due to magnetically excited vibrations. The level <strong>of</strong> these<br />
stresses depends on the magnetic flux density in the air<br />
gape <strong>of</strong> the machine and can be <strong>of</strong> the order <strong>of</strong> 200 MPa.<br />
Further, the rotating and alternating magnetization <strong>of</strong> the<br />
iron core produces dynamic magnetostrictive strains the<br />
level <strong>of</strong> which depends on the state <strong>of</strong> stress in the core.<br />
At last but not least, the punching <strong>of</strong> the magnetic<br />
material in the manufacturing process produces residual<br />
stresses and plastic strains at some regions <strong>of</strong> the<br />
magnetic material. The level <strong>of</strong> these stresses and strains<br />
depend the manufacturing process and the quality <strong>of</strong><br />
punching tools.<br />
All these stresses and strains will affect both the<br />
magnetization characteristics and the energy losses <strong>of</strong> the<br />
magnetic material as well as the vibrations <strong>of</strong> the core and<br />
thus the acoustic noise and the wearing <strong>of</strong> the materials<br />
and parts <strong>of</strong> the machine.<br />
The modeling <strong>of</strong> magnetostriction started In the 50’s <strong>of</strong><br />
the last century as related to the so called giant<br />
magnetostrictive materials and their applications in<br />
ultrasound generators and receptors. The effect <strong>of</strong>
magnetostriction on the noise <strong>of</strong> power transformer was<br />
also investigated through experimental studies starting in<br />
the 70’s by Moses [6] and later by Weiser and Pfutzner<br />
[7] but the modeling <strong>of</strong> such phenomena in electrical<br />
machines started only at the beginning <strong>of</strong> the last two<br />
decades as it was noted that the prediction <strong>of</strong> the losses<br />
and vibrations <strong>of</strong> the machine requires adequate models<br />
able to account for the magnetomechanical coupling as<br />
well as the other electromagnetic couplings.<br />
In this paper we will concentrate on the modeling <strong>of</strong><br />
magnetostriction and the related magnetomechaical<br />
effects and their incorporation into the simulation <strong>of</strong><br />
electrical machines as well as the effect <strong>of</strong> these<br />
phenomena on the computation <strong>of</strong> iron losses. The<br />
coupled electromagnetic simulation methodology for<br />
electrical machines, which made it possible to develop the<br />
magnetomechanical models is now quite established as<br />
reported by Arkkio [8] and Salon et al. [9] among others<br />
and will not be handled here.<br />
In section II we will explore the force-based models <strong>of</strong><br />
magnetostriction and in section III the stress-strain based<br />
models. In section IV we will introduce the energy based<br />
model and in section V we will discuss the incorporation<br />
<strong>of</strong> the different models into 2D finite element simulation<br />
<strong>of</strong> electrical machines. In Section VI we will discuss the<br />
impact <strong>of</strong> these models on the computation <strong>of</strong> iron losses<br />
and sketch the necessity for future developments in view<br />
<strong>of</strong> accurate simulations and computation <strong>of</strong> losses and<br />
vibrations.<br />
II. FORCE-BASED MODELS OF MAGNETOSTRICTION<br />
The main idea behind the force-based models <strong>of</strong><br />
magnetostriction is that the magnetostrictive deformation<br />
<strong>of</strong> a sample <strong>of</strong> magnetic material under homogeneous<br />
magnetization can be produced by a distributed set <strong>of</strong><br />
forces acting on the boundaries <strong>of</strong> the sample as<br />
explained by Delaere et al. [10] and sketched in Fig. 3 for<br />
an arbitrary element.<br />
Such a representation <strong>of</strong> the magnetostriction emanates<br />
from earlier work <strong>of</strong> Besbes et al. [11], where the<br />
magnetostrictive forces were directly derived from the<br />
principle <strong>of</strong> virtual work and by accounting for the<br />
variation <strong>of</strong> the permeability with the mechanical stress. It<br />
should be mentioned that in [11], the magnetostriction has<br />
not been well described as sever assumptions such as<br />
linear magnetization and its linear dependency on the<br />
stress have been made. The local application <strong>of</strong> the<br />
principle <strong>of</strong> virtual work for the computation <strong>of</strong> magnetic<br />
forces itself was developed by Bossavite [12] after its<br />
introduction at a global level by Coulomb [13].<br />
The development <strong>of</strong> such force-based models and their<br />
coupling with the electromagnetic simulation <strong>of</strong> electrical<br />
machines was also reported by Mohamed et al. [14],<br />
Vandevelde et al. [15] and Belahcen [16]. Vandevelde<br />
and Belahcen used a stress approach to compute the<br />
magnetostrictive forces, whereas Delayer used a strain<br />
approach. In all these works, the other magnetic forces<br />
were computed either according to the principle <strong>of</strong> virtual<br />
- 216 - 15th IGTE Symposium 2012<br />
work and introduced into the FE simulation as<br />
generalized nodal forces or through the Maxwell stress<br />
tensor. The two methodologies have been earlier<br />
demonstrated by Kameari [17] to be equivalent. Lately,<br />
many authors applied the concept <strong>of</strong> magnetostrictive<br />
forces in the FE simulation <strong>of</strong> electrical machines.<br />
The equation for the computation <strong>of</strong> the generalized<br />
nodal magnetic forces is given bellow<br />
B<br />
T 1 <br />
F A A d dSˆ<br />
J e<br />
Sˆ<br />
e<br />
H<br />
B J<br />
(1)<br />
<br />
e U U<br />
0<br />
<br />
where B and H are the magnetic flux distribution and<br />
field strength respectively. J is the Jacobian matrix for the<br />
transformation from the reference finite element to actual<br />
one and ˆ Se stands for the reference element. U is the<br />
vector <strong>of</strong> nodal displacement. The integration with respect<br />
to the magnetic flux density in (1) as well as the<br />
differentiation with respect to the displacements is carried<br />
out analytically. For this purpose and for FE computation,<br />
a cubic spline representation <strong>of</strong> the HB-curve <strong>of</strong> the iron<br />
sheets is used. The different approaches for the<br />
computation <strong>of</strong> nodal magnetostrictive forces can be<br />
found in [10], [14], [15], and [16]. Fig. 4 shows the<br />
magnetic and magnetostrictive force computed for the<br />
stator core <strong>of</strong> a synchronous machine [18]. Similar forces<br />
for the induction machines have been reported in [19].<br />
Fig. 3. Sketch <strong>of</strong> the computation <strong>of</strong> magnetostrictive force from<br />
magnetostrictive deformation after Delaere [10].<br />
Fig. 4. Generalized magnetic forces (left) and equivalent<br />
magnetostrictive forces computed in the stator core <strong>of</strong> a<br />
synchronous machine. The forces have been normalized.<br />
The structural deformation can be computed either in a<br />
coupled or uncoupled methodology. In the coupled<br />
approach the mechanical and magnetic problems are<br />
solved simultaneously and the forces are updated at<br />
iteration level. In the uncoupled approach the magnetic<br />
problem is first solved and the forces computed as postprocessing<br />
quantities from the magnetic problem then<br />
introduced in the mechanical problem as loads. The<br />
results <strong>of</strong> the mechanical problem are the nodal<br />
displacements from which the deformation as well as the<br />
strains and stresses can be computed.<br />
Although the concept <strong>of</strong> magnetostrictive forces<br />
describes quit well the deformation <strong>of</strong> the magnetic
material it has a major drawback that consist <strong>of</strong> resulting<br />
in erroneous stress state in the material. Indeed, the<br />
magnetostrictive forces result into tensile stress if the<br />
boundaries <strong>of</strong> the element are free to move (refer to Fig.<br />
3). However, the actual state <strong>of</strong> stress in this case should<br />
be a zero stress in the element. If the boundaries <strong>of</strong> the<br />
element are fixed, the magnetostrictive forces will result<br />
into a zero stress, while the actual stress is a compressive<br />
one, the magnitude <strong>of</strong> which depends on the material<br />
properties and the level <strong>of</strong> magnetization. This erroneous<br />
behavior due to equivalent magnetostriction forces is<br />
illustrated in Fig. 5, where the magnetostrictive<br />
elongation is solved correctly but the stress is wrong.<br />
From the above discussion it is clear that the concept <strong>of</strong><br />
magnetostrictive force is not able to describe the stress<br />
dependency <strong>of</strong> the magnetic properties <strong>of</strong> the material and<br />
thus <strong>of</strong> the magnetostriction itself when it has to be<br />
computed under different boundary conditions dictated by<br />
the geometry and the topology <strong>of</strong> the electrical machine.<br />
III. STRAIN-BASED MODELS OF MAGNETOSTRICTION<br />
In the strain-based models <strong>of</strong> magnetostriction there is<br />
no need for the calculation <strong>of</strong> equivalent forces. The<br />
magnetostrictive strains are incorporated in the structural<br />
analysis in a similar way to the thermal dilatation <strong>of</strong><br />
metals. Here also, the magnetostriction can be modeled in<br />
a decoupled or coupled approach. In the decoupled<br />
approach [20] the magnetostrictive strains are computed<br />
from per element magnetic flux densities and the<br />
measured single valued flux densty-elongation<br />
relationship. These strains are then incorporated in a<br />
structural finite element model <strong>of</strong> the electrical machine<br />
to produce the deformation. If the effect <strong>of</strong> stress is to be<br />
accounted for in the computation <strong>of</strong> the magnetostriction,<br />
a coupled model should be used. Such a model emanates<br />
from the coupled magnetomechanical constitutive<br />
equations <strong>of</strong> the material [21] as explained in the next<br />
Section.<br />
Equivalent forces:<br />
Equivalent forces:<br />
= 0 ; = 0 = ms ; = -ms<br />
Iron<br />
Actual behavior:<br />
Actual behavior:<br />
= 0 ; = ms = ms ; = 0<br />
Fig. 5. Illustration <strong>of</strong> the magnetostriction and the state <strong>of</strong><br />
stress in the sample under the effect <strong>of</strong> magnetostrictive forces.<br />
Left clamped sample and right free sample. In both case the<br />
elongation is correct but the stress is wrong.<br />
IV. THE ENERGY-BASED CONSTITUTIVE EQUATIONS<br />
The energy-based, coupled constitutive equations <strong>of</strong><br />
the electrical steel are derived from an appropriate<br />
representation <strong>of</strong> the Helmholtz free energy [21], which<br />
itself is based on previous empirical observations made<br />
from the measurement <strong>of</strong> magnetostriction under different<br />
stresses and flux densities [22]. A summary <strong>of</strong> the results<br />
<strong>of</strong> these measurements all together with the model<br />
prediction are shown in Fig. 6. The Helmholtz free energy<br />
in a sample is written as:<br />
Iron<br />
- 217 - 15th IGTE Symposium 2012<br />
1<br />
I<br />
<br />
2<br />
2<br />
1<br />
4 1<br />
g ( I ) I <br />
i 1 4 1 1<br />
<br />
2 <br />
2 4 5 5 6 6<br />
2<br />
<br />
i0 i 1 <br />
<br />
B <br />
2 2<br />
ref <br />
2GI<br />
I I I<br />
<br />
where the invariants I .. I are:<br />
1 6<br />
1 2 1 3<br />
I tr( )<br />
, I tr( ) , I tr( )<br />
1 2<br />
3<br />
2<br />
3<br />
(3)<br />
I <br />
4 B B, 2<br />
I BB, I BB 5<br />
6<br />
(4)<br />
is the first Lamé parameter,G the shear modulus <strong>of</strong> the<br />
material, the mass density and<br />
3 3 1<br />
g exp( I ) 0 0 1 0 5<br />
4 4 3<br />
(5)<br />
3( i 1) 4( i 1) <br />
g exp <br />
I<br />
<br />
; i i<br />
1<br />
4 <br />
<br />
3 <br />
i 1..4(6)<br />
; i 0..6 are model parameters and B is a reference<br />
i ref<br />
flux density. The magnetomecjanically coupled<br />
constitutive equations are derived from (2) as<br />
6<br />
6<br />
<br />
Ii<br />
<br />
Ii<br />
and M <br />
i1, i3I i<br />
i1, i3IB i<br />
(7)<br />
where and are the stress and strain tensors and M the<br />
magnetization. An extensive derivation <strong>of</strong> the model and<br />
its equations is given in [23]. The model results in an<br />
explicitly coupled formulation for the stress tensor and<br />
the magnetic field strength vector in terms <strong>of</strong> the magnetic<br />
flux density and the mechanical strain tensor as<br />
( B,<br />
) I + (<br />
I ) 2G<br />
1 1 4<br />
<br />
1 1 <br />
<br />
( ) <br />
<br />
2 ( ) <br />
0 <br />
B B BB <br />
B B BB 4<br />
2 <br />
<br />
B B( BB) <br />
5<br />
2<br />
2<br />
<br />
<br />
<br />
B B B B<br />
<br />
6 2<br />
2 ( ) +( )<br />
<br />
B B B B B B<br />
<br />
(8)<br />
1<br />
5 <br />
2<br />
HB ( , ) B2 B B B <br />
0 4 5<br />
2 2<br />
(9)<br />
Magnetostriction (m/m)<br />
(2)<br />
In (8) and (9) 1 and are used to shorten the notation:<br />
4<br />
x 10-6<br />
5<br />
4<br />
3<br />
2<br />
1<br />
0<br />
-1<br />
1 g<br />
<br />
<br />
I and<br />
4<br />
i i1<br />
1 4<br />
2<br />
i0<br />
I1<br />
3.9 MPa<br />
0.0 MPa<br />
-1.7 MPa<br />
-6.1 MPa<br />
-2<br />
-2 -1.5 -1 -0.5 0 0.5 1 1.5 2<br />
Flux density (T)<br />
<br />
1<br />
g<br />
(10)<br />
4<br />
i i<br />
<br />
I<br />
4 4<br />
2i0i1 Fig. 6. Measured magnetostrictive strains at different flux<br />
densities and applied mechanical stresses (left) and comparison<br />
with model prediction (right). Positive stresses are tensile and<br />
negative ones are compressive.<br />
V. INCORPORATION INTO FE SIMULATIONS<br />
The starting point for the implementation <strong>of</strong> the<br />
magnetomechanically coupled equation in FE analysis <strong>of</strong><br />
electrical machines is an in-house 2D s<strong>of</strong>tware package.<br />
The field equations have been previously coupled with
the electrical circuit equations <strong>of</strong> the machine winding,<br />
which makes it possible to feed the model from a voltage<br />
source and also resolve the induced voltages and currents<br />
in other parts <strong>of</strong> the machine, e.g., cage winding or solid<br />
rotor, through time stepping analysis [8].<br />
The coupled constitutive equations (8) and (9) were<br />
first linearized to the first order and then the weak form <strong>of</strong><br />
Galerkin method was applied to (8) and the principle <strong>of</strong><br />
virtual work applied to (9). This resulted in:<br />
<br />
H ( w) <br />
d H<br />
( w) <br />
<br />
B d<br />
<br />
B <br />
<br />
<br />
(11)<br />
( w) H d wH ds 0<br />
0 0<br />
<br />
<br />
T ˆ <br />
<br />
<br />
<br />
d <br />
<br />
T <br />
<br />
<br />
ˆ<br />
<br />
<br />
d<br />
<br />
<br />
ˆ d uˆ ( f f ) d<br />
B<br />
<br />
T<br />
0<br />
B<br />
<br />
<br />
<br />
T<br />
<br />
T<br />
uˆf ds 0<br />
surf<br />
<br />
mech inert<br />
- 218 - 15th IGTE Symposium 2012<br />
(12)<br />
where w is a test or weight function and the quantities<br />
with hat are virtual ones.u is the mechanical displacement<br />
vector and f , f , f are respectively mechanical,<br />
mech inert surf<br />
inertia body forces and surface forces. In the<br />
implementation the shape functions <strong>of</strong> the finite element<br />
approximation are used as weight function.<br />
Equations (11) and (12) are then spatially descitized<br />
using standard finite element procedure and inserted in<br />
the in-house code, thus replacing the nonlinear model <strong>of</strong><br />
iron cores. The insertion <strong>of</strong> these equations in the code<br />
does not affected the electromagnetic coupling as this<br />
latter one takes place in the windings and conducting<br />
regions only whereas the magnetomechanical coupling<br />
takes place in non conduction iron. Special attention<br />
however has to be given to the region formed by the<br />
airgap or more properly the interface between any noniron<br />
region and the iron core. This is because the Maxwell<br />
stress tensor makes sense only if it is computed from both<br />
regions at any interface. Thus when assembling the<br />
system matrix, the contribution to the nodal values <strong>of</strong> the<br />
Maxwell stress are computed from both iron and non-iron<br />
element with common interface with the iron core.<br />
In the case <strong>of</strong> force based model <strong>of</strong> magnetostriction,<br />
the approach is quite similar to the one presented above,<br />
except that the equivalent magnetostrictive forces as well<br />
as the other magnetic forces computed with (1) have been<br />
inserted in the model as external mechanical forces. Such<br />
implementation was already reported in [19].<br />
The implemented formulation and s<strong>of</strong>tware were first<br />
tested on a simple model consisting <strong>of</strong> an iron disc<br />
excited through Direchlet boundary condition on its outer<br />
edge. The magnetic vector potential on this edge was set<br />
to time dependent values as to create a rotating field<br />
uniformly distributed on the surface <strong>of</strong> the test sample.<br />
The boundaries <strong>of</strong> the sample were free to move and only<br />
the center <strong>of</strong> the disc was fixed in both x- and y-direction.<br />
Fig. 7. Shows the original and deformed mesh used in the<br />
model when the flux density was 1.5 T either along the xaxis<br />
or at an angle <strong>of</strong> 45 deg. to it. The results show that<br />
thanks to the tensor representation and formulation the<br />
effect <strong>of</strong> the shear stress and strain are correctly<br />
computed. The model was also applied to a induction<br />
machine-like device without the airgap in view <strong>of</strong><br />
minimizing all the other magnetic forces. The results from<br />
this verification are reported in [23] and show good<br />
agreement between the measured and computed<br />
displacements. The model was applied to the computation<br />
<strong>of</strong> the deformation and vibrations <strong>of</strong> two induction<br />
machines, the parameters <strong>of</strong> which are given in Table I.<br />
The extensive results from the simulations <strong>of</strong> the two<br />
machines as well as a comprehensive analysis <strong>of</strong> these<br />
results are reported in [24]. Here we present a comparison<br />
between the computed displacements <strong>of</strong> nodes on the<br />
tooth <strong>of</strong> the machines when the magnetostriction only is<br />
accounted for and when the so called reluctance forces<br />
(Maxwell stress) are also accounted for. Although, the<br />
vibrations depend on the machine construction and could<br />
not be generalized, this result gives the reader an estimate<br />
<strong>of</strong> the effect <strong>of</strong> magnetostriction on the vibrations <strong>of</strong><br />
rotating electrical machines. Fig. 8 shows this comparison<br />
for both machines. Due to the differences in the number<br />
<strong>of</strong> pole pairs the vibration behaviors are different too.<br />
y-coordinate (m)<br />
0.15<br />
0.1<br />
0.05<br />
0<br />
-0.05<br />
-0.1<br />
-0.1 -0.05 0 0.05 0.1 0.15<br />
x-coordinate (m)<br />
y-coordinate (m)<br />
0.15<br />
0.1<br />
0.05<br />
0<br />
-0.05<br />
-0.1<br />
-0.1 -0.05 0 0.05 0.1 0.15<br />
x-coordinate (m)<br />
Fig. 7. Test model, original and deformed mesh computed with<br />
the developed method with a flux density <strong>of</strong> 1.5 T along the xaxis<br />
(left) and at 45 deg (right).<br />
Table I. Parameters <strong>of</strong> the simulated induction machines<br />
Parameter Machine I Machine II<br />
Rated voltage 380 V 380 V<br />
Slip 2 % 3.2 %<br />
Rated current 60 A 27 A<br />
Rated power 30 kW 15 kW<br />
Number <strong>of</strong> pole pairs 1 2<br />
Outer diameter <strong>of</strong> the stator core 323 mm 235 mm<br />
Inner diameter <strong>of</strong> the stator core<br />
190.2 mm 145 mm<br />
Number <strong>of</strong> stator slots<br />
36<br />
36<br />
Outer diameter <strong>of</strong> the rotor core 188.37 mm 144.1 mm<br />
Number <strong>of</strong> rotor slots 28 34<br />
Fig. 8.Comparison between computed displacements <strong>of</strong> a<br />
node on the stator tooth in machine 1 (left) and machine 2<br />
(right). Subscripts r and stand for the radial and tangential<br />
directions. 1 is the case with only magnetostriction and 2 when<br />
both magnetostriction and reluctance forces are considered.
VI. FUTURE TRENDS AND DEVELOPMENTS<br />
The developed coupled magnetomechanical model<br />
although bidirectional in magnetic and mechanics is<br />
single valued and thus does not take dissipation into<br />
account. On the other hand, existing hysteresis models<br />
either for magnetic [25] or mechanics [26] are based on<br />
the Preisach approach, which is mathematically rigorous<br />
but does not give clear insight into the energetic balance<br />
between the two subsystems. The only energy-based<br />
hysteresis model that describes both magnetism and<br />
mechanics is the one developed by Jiles [27] but its<br />
application in electric steel and further to the simulation<br />
<strong>of</strong> electrical machines has not been reported yet. This<br />
might be due to the sharp saturation <strong>of</strong> the magnetization<br />
curves <strong>of</strong> electrical steel, which cause convergence<br />
problems but also to the fact that the original model traits<br />
the compressive and tensile stresses in a symmetric way,<br />
which is not adequate for the electric steel where the<br />
compressive stresses have much pronounced effect on the<br />
magnetic and magnetostrictive properties <strong>of</strong> the material.<br />
The dynamic behavior <strong>of</strong> magnetostriction also needs to<br />
be addressed [28] as well as the effect <strong>of</strong> anisotropy [20].<br />
These shortcuts in the modeling and simulation <strong>of</strong><br />
electrical machines still need to be addressed in view <strong>of</strong><br />
better estimation <strong>of</strong> the vibrations <strong>of</strong> electrical machines<br />
and iron losses, which are known to depend on the state<br />
<strong>of</strong> stress in the material. The presented model already<br />
estimates the effect <strong>of</strong> magnetostriction on the state <strong>of</strong><br />
stress but other causes <strong>of</strong> stress have to be added too.<br />
The models to be developed and used need both<br />
characterization <strong>of</strong> the material under different flux<br />
densities and mechanical stress and verification<br />
procedures to assess the validity <strong>of</strong> the models. The work<br />
presented in [29] is a good start for the characterization<br />
work. We have also developed characterization<br />
methodologies and analyzed their accuracy [30]; this<br />
work is still continuing. The verification work needs still<br />
some development.<br />
REFERENCES<br />
[1] M. J. Islam, A. Arkkio, “Effects <strong>of</strong> pulse-width-modulated supply<br />
voltage on eddy currents in the form-wound stator winding <strong>of</strong> a cage<br />
induction motor,” IET Electric Power Applications, vol. 3, no. 1, pp.<br />
50-58, January 2009.<br />
[2] A. Moses, P. Phillips, “Some effects <strong>of</strong> stress in Goss-oriented siliconiron,”<br />
IEEE Trans. Magn. , vol. 14, no. 5, pp. 353-355, Sep 1978.<br />
[3] A. Moses, “Effects <strong>of</strong> applied stress on the magnetic properties <strong>of</strong><br />
high permeability silicon-iron,” IEEE Trans. Magn., vol. 15, no. 6, pp.<br />
1575-1579, Nov 1979.<br />
[4] Du Trémolet de Lacheisserie, E., 1993. Magnetostriction–Theory<br />
and Applications <strong>of</strong> Magnetoelasticity. CRC Press Inc. 432 pages.<br />
[5] K. Fujisaki, R. Hirayama, T. Kawachi, S. Satou, C. Kaidou, M.<br />
Yabumoto, T. Kubota, “Motor core iron loss analysis evaluating<br />
shrink fitting and stamping by finite-element method,” IEEE Trans.<br />
Magn., vol. 43, no. 5, pp.1950-1954, May 2007<br />
[6] A. Moses, “Measurement <strong>of</strong> magnetostriction and vibration with<br />
regard to transformer noise,” IEEE Trans. Magn., vol. 10, no. 2, pp.<br />
154-156, Jun 1974<br />
[7] B. Weiser, H. Pfutzner, J. Anger, “Relevance <strong>of</strong> magnetostriction and<br />
forces for the generation <strong>of</strong> audible noise <strong>of</strong> transformer cores,” IEEE<br />
Trans. Magn., vol. 36, no. 5, pp.3759-3777, Sep 2000<br />
[8] A. Arkkio, Analysis <strong>of</strong> induction motors based on the numerical<br />
solution <strong>of</strong> the magnetic field and circuit equations, Doctoral<br />
dissertation, 1987, Helsinki <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Espoo, Finland<br />
- 219 - 15th IGTE Symposium 2012<br />
[9] S. J. Salon, R. Palma, C. C. Hwang, “Dynamic modeling <strong>of</strong> an<br />
induction motor connected to an adjustable speed drive,” IEEE Trans.<br />
Magn., vol. 25, no. 4, pp. 3061-3063, Jul 1989.<br />
[10] K. Delaere, W. Heylen, R. Belmans, K. Hameyer, “Comparison <strong>of</strong><br />
induction machine stator vibration spectra induced by reluctance<br />
forces and magnetostriction,” IEEE Trans. Magn., vol. 38, no. 2, pp.<br />
969-972, Mar 2002<br />
[11] M. Besbes, Z. Ren, A. Razek, “Finite element analysis <strong>of</strong> magnetomechanical<br />
coupled phenomena in magnetostrictive materials,” IEEE<br />
Trans. Magn., vol. 32, no. 3, pp. 1058-1061, May 1996<br />
[12] A. Bossavit, “Edge-element computation <strong>of</strong> the force field in<br />
deformable bodies,” IEEE Trans. Magn., vol. 28, no. 2, pp. 1263-<br />
1266, Mar 1992.<br />
[13] J. L. Coulomb, “A methodology for the determination <strong>of</strong> global<br />
electromechanical quantities from a finite element analysis and its<br />
application to the evaluation <strong>of</strong> magnetic forces, torques and<br />
stiffness,” IEEE Trans. Magn., vol. 19. 6, pp. 2514-19, 1983.<br />
[14] O.A. Mohammed, T. Calvert, R. McConnell, “Coupled<br />
magnetoelastic finite element formulation including anisotropic<br />
reluctivity tensor and magnetostriction effects for machinery<br />
applications,” IEEE Trans. Magn., vol. 37, no. 5, pp. 3388-3392, Sep<br />
2001.<br />
[15] L. Vandevelde, J.A.A. Melkebeek, “Magnetic forces and<br />
magnetostriction in electrical machines and transformer cores,” IEEE<br />
Trans. Magn., vol. 39, no. 3, pp. 1618- 1621, May 2003.<br />
[16] A. Belahcen, “Vibrations <strong>of</strong> rotating electrical machines due to<br />
magnetomechanical coupling and magnetostriction,” IEEE Trans.<br />
Magn., vol. 42, no. 4, pp. 971-974, Apr. 2006.<br />
[17] A. Kameari, “Local calculation <strong>of</strong> forces in 3D FEM with edge<br />
elements,” International Journal <strong>of</strong> applied Electromagnetics in<br />
Materials, vol. 3, pp. 231-240, 1993.<br />
[18] A. Belahcen, “Magnetoelastic coupling in rotating electrical<br />
machines,” IEEE Trans. Magn., vol. 41, no. 5, pp. 1624-1627, May<br />
2005.<br />
[19] A. Belahcen, Magnetoelasticity, magnetic forces and<br />
magnetostriction in electrical machines. Doctoral dissertation, 2004,<br />
Helsinki <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Espoo, Finland.<br />
[20] S. Somkun, A. J. Moses, P. I. Anderson, P. Klimczyk,<br />
“Magnetostriction anisotropy and rotational magnetostriction <strong>of</strong> a<br />
nonoriented electrical steel,” IEEE Trans. Magn., vol. 46, no. 2, pp.<br />
302-305, Feb. 2010.<br />
[21] A. Belahcen, K. Fonteyn, A. Hannukainen and R. Kouhia, “On<br />
numerical modeling <strong>of</strong> coupled magnetoelastic problem”, Nordic<br />
Seminar on Computational Mechanics NSCM-21, pp. 203-206, Oct.<br />
16-17.2008, Trondheim, Norway.<br />
[22] A. Belahcen and M. El Amri, “Measurement <strong>of</strong> stress-dependent<br />
magnetisation and magnetostriction <strong>of</strong> electrical steel sheets,”<br />
International Conference on Electrical Machines ICEM, Sep. 5-<br />
8.2004, Cracow, Poland.<br />
[23] K. A. Fonteyn, Energy-based magneto-mechanical model for<br />
electrical steel sheets, Doctoral dissertation, 2010, Aalto <strong>University</strong>,<br />
Finland<br />
[24] K. A. Fonteyn, A. Belahcen, P. Rasilo, R. Kouhia, A. Arkkio,<br />
“Contribution <strong>of</strong> Maxwell stress in air on the deformations <strong>of</strong><br />
induction machines,” Journal <strong>of</strong> Electrical Engineering & <strong>Technology</strong>,<br />
vol. 7, no. 3, pp. 336-341, 2012.<br />
[25] E. Dlala, A. Belahcen, K. Fonteyn, M. Belkasim, “Improving loss<br />
properties <strong>of</strong> the mayergoyz vector hysteresis model,” IEEE Trans.<br />
Magn., vol. 46, no. 3, pp. 918-924, March 2010.<br />
[26] A. Bergqvist, On magnetic hysteresis modelling, Doctoral<br />
dissertation, 1994, Royal Institute <strong>of</strong> <strong>Technology</strong>, Stockholm,<br />
Sweden<br />
[27] D. C. Jiles, D. L. Atherton, “Theory <strong>of</strong> ferromagnetic hysteresis<br />
(invited),” Journal <strong>of</strong> Applied Physics, vol. 55, no. 6, pp. 2115-2120,<br />
Mar 1984.<br />
[28] P. Rasilo, A. Belahcen, “Iron losses, magnetoelasticity and<br />
magnetostriction in ferromagnetic steel laminations,” IEEE<br />
Conference on Electromagnetic Field Computation CEFC, 11-<br />
14.11.2012, Oita, Japan.<br />
[29] Y. Kai, Y. Tsuchida, T. Todaka, M. Enokizono, “Influence <strong>of</strong> stress<br />
on vector magnetic property under rotating magnetic flux conditions,”<br />
IEEE Trans. Magn., vol. 48, no. 4, pp. 1421-1424, April 2012.<br />
[30] A. Belahcen, P. Rasilo, K. Fonteyn, R. Kouhia and A. Arkkio,<br />
“Modeling the stress effect on the measurement <strong>of</strong> magnetostriction in<br />
electrical sheets under rotational magnetization,” IEEE Conference on<br />
Electromagnetic Field Computation CEFC, 11-14.11.2012, Oita,<br />
Japan.
- 220 - 15th IGTE Symposium 2012<br />
Magnetic Saturation Effect on Modeling Squirrel-cage<br />
Induction Motors with Stator Inter-turn Fault<br />
*Jawad Faiz, † Mansour Ojaghi and † Mahdi Sabouri<br />
*Center <strong>of</strong> Excellence on Applied Electromagnetic Systems, School <strong>of</strong> Electrical and Computer Engineering,<br />
College <strong>of</strong> Engineering, <strong>University</strong> <strong>of</strong> Tehran, Tehran, Iran, E-mail: jfaiz@ut.ac.ir<br />
† Department <strong>of</strong> Electrical Engineering, <strong>University</strong> <strong>of</strong> Zanjan, Zanjan, Iran<br />
Abstract— Coupled circuits model (CCM) <strong>of</strong> squirrel-cage induction motors is the most detailed and complete analytical<br />
model for analyzing the performance <strong>of</strong> the faulty induction motors. This paper extends the CCM to a saturable model<br />
including variable degrees <strong>of</strong> the saturation effects using an appropriate air gap function and novel techniques for locating<br />
the angular position <strong>of</strong> the air gap flux density and estimating the saturation factor. Comparing simulated and experimental<br />
magnetization characteristics shows the accuracy <strong>of</strong> the new saturable model. Using saturable and non-saturable models,<br />
various simulations are carried out on faulty induction motors, and then, by comparing the results, the impacts <strong>of</strong> the<br />
saturation on the performance <strong>of</strong> the faulty motor are presented.<br />
Index Terms—Induction motors, Inter-turn fault, Magnetic saturation, Coupled circuits model.<br />
I. INTRODUCTION<br />
Implementing a proper condition monitoring system is<br />
essential to prevent squirrel-cage induction motors<br />
(SCIMs) from catastrophic failure. The stator inter-turn<br />
fault is a relatively frequent fault and if not diagnosed on<br />
time, causes major breakdown <strong>of</strong> the SCIM. SCIM<br />
performance under the inter-turn fault can be analyzed<br />
using magnetically coupled circuits model (CCM) [1],<br />
[2]. Such analysis helps to realize the faulty SCIMs<br />
performance, to extract proper indexes for the various<br />
faults and to develop effective fault diagnosis and<br />
condition monitoring techniques for SCIMs. To do so,<br />
CCM must be as exact as possible; however, ignoring the<br />
magnetic saturation may keep it away from the required<br />
exactness [3].<br />
For economical utilization <strong>of</strong> the magnetic material,<br />
electrical machines operating regions have to be extended<br />
above the knee <strong>of</strong> the magnetization characteristic, which<br />
forces the machine into the saturation region. Many<br />
attempts have been so far made to include saturation<br />
effects in SCIM models including CCM [4]–[9]. An<br />
extension to CCM <strong>of</strong> healthy SCIM, which includes<br />
variable degrees <strong>of</strong> saturation, has been reported in [9].<br />
The proposed saturable CCM (SCCM) needs to track the<br />
air gap rotating flux density, and this has been done by a<br />
rather simple technique in [9]. However, distortion <strong>of</strong> the<br />
air gap flux density distribution due to the fault causes<br />
the application <strong>of</strong> that technique erroneous. Thus, the<br />
existing SCCM is not viable to analyze the faulty SCIMs.<br />
In this paper, the rotor meshes are used as the air gap<br />
flux samplers. The flux-linkages <strong>of</strong> the rotor meshes,<br />
calculated at each step, are used to estimate the air gap<br />
flux density distribution. Then, Fourier series analysis is<br />
used to determine the space harmonics <strong>of</strong> the air gap flux<br />
density, including fundamental harmonic amplitude (B1)<br />
and its phase angle (1). B1 is used to determine<br />
saturation factor (Ksat) properly and 1 is utilized to track<br />
the air gap flux density. Therefore, a modified version <strong>of</strong><br />
the SCCM is obtained, whose accuracy is proved by<br />
comparing the magnetization characteristics determined<br />
through the simulation and experiments. Then, using<br />
both the modified SCCM and the normal CCM, the<br />
performance <strong>of</strong> SCIM with some inter-turn faults are<br />
simulated. By comparing the results, impacts <strong>of</strong> magnetic<br />
saturation on the faulty SCIMs performance and their<br />
fault indexes will be clear. Comparisons are also made<br />
with experimental results, which confirm the accuracy <strong>of</strong><br />
the proposed model.<br />
II. SCCM OF SCIM WITH INTER-TURN FAULT<br />
Any loop on the rotor <strong>of</strong> SCIM consisting <strong>of</strong> any two<br />
adjacent rotor bars is considered as a circuit mesh. The<br />
shorted turns in the stator are also considered as an<br />
independent circuit (phase 'd'). Applying KVL to the<br />
rotor meshes, stator phases and stator shorted turns and<br />
adding the related torque and mechanical equations,<br />
CCM dynamic equations are attained. These equations<br />
with complete details are presented in [2]. Self/mutual<br />
inductances <strong>of</strong> the various circuits and their derivatives<br />
versus the rotor position are the most important<br />
parameters <strong>of</strong> CCM equations. These inductances are<br />
calculated using the following equation [10]:<br />
2<br />
1<br />
Lxy or<br />
l<br />
g n N d<br />
0 x y<br />
(1)<br />
where x and y can be any phase <strong>of</strong> the stator (a, b, c or d)<br />
or any mesh <strong>of</strong> the rotor (1 to R), μ0 is the air magnetic<br />
permeability, r is the air gap mean radius, l is the stack<br />
length, g -1 is the inverse air gap function, nx is the x<br />
phase (mesh) turn function and is the angle in the stator<br />
stationary reference. In the case <strong>of</strong> the uniform air gap<br />
machine, Ny is the winding function <strong>of</strong> the y circuit, but<br />
in the case <strong>of</strong> the non-uniform air gap machine, it is the<br />
modified winding function <strong>of</strong> the y circuit.<br />
Saturation <strong>of</strong> the magnetic material causes its<br />
reluctance to be increased against the machine's flux.<br />
Similar increase <strong>of</strong> the reluctance can be achieved by a<br />
proportional increase in the air gap length along the main<br />
flux path [4]. Anywhere within the core material,<br />
reluctance increase caused by the saturation depends on<br />
the exact value <strong>of</strong> the flux density, but independent <strong>of</strong> the<br />
flux direction. Thus, it is expected that the fictitious air<br />
gap length (gf) fluctuates a complete cycle every half
cycle <strong>of</strong> the air gap flux density distribution around the<br />
air gap. Assuming a sinusoidal form for this fluctuation,<br />
the following satisfies the mentioned requirements [9]:<br />
g f g[<br />
1<br />
cos( 2P(<br />
f ))] (2)<br />
where f is the angular position <strong>of</strong> a zero crossing point<br />
<strong>of</strong> the air gap flux density in the stator reference, P is the<br />
pole pairs, g' is the mean value <strong>of</strong> gf and is the peak<br />
value <strong>of</strong> its fluctuation. g' and are determined as<br />
follows:<br />
3K<br />
sat<br />
g <br />
ge<br />
(3)<br />
K 2<br />
2( Ksat<br />
1)<br />
<br />
(4)<br />
3Ksat<br />
where ge is the effective air gap length <strong>of</strong> the unsaturated<br />
machine, which is related to the mechanical air gap<br />
length (g0) by the Carter's coefficient (ge=kcg0). Ksat is the<br />
saturation factor which is defined as the ratio <strong>of</strong> the<br />
fundamental components <strong>of</strong> the air gap voltage for the<br />
saturated and unsaturated conditions [4].<br />
Replacing the inverse <strong>of</strong> (2) into (1), using modied<br />
winding function theory, assuming the turn functions to<br />
have only step variation in the center <strong>of</strong> the slots and<br />
determining the indefinite integrals, exact analytic<br />
equations are obtained for the various inductances. Then,<br />
differentiating the equations versus the rotor position<br />
(r), exact analytic equations also are obtained for the<br />
derivatives <strong>of</strong> the inductances. The equations are<br />
functions <strong>of</strong> Ksat and f, thus by using them; variable<br />
degree <strong>of</strong> saturation effect enters to the model <strong>of</strong> the<br />
SCIM. The equations are included in the appendix.<br />
For the proposed SCIM (see Section V), Figure 1<br />
shows the turn functions <strong>of</strong> the phase 'a' winding before<br />
and after inter-turn fault. The winding has 4 concentric<br />
coils each with 90 turns in healthy condition. In the faulty<br />
case 14 turns from the outer coils are short-circuited.<br />
Figure 1 also shows the turn function <strong>of</strong> the shorted turns<br />
(phase 'd'). As an example, Figure 2 shows the variations<br />
<strong>of</strong> the self inductance <strong>of</strong> the phase 'd' (Ldd) by the<br />
variations <strong>of</strong> Ksat and f, which has been calculated using<br />
the proposed analytical equations. As seen, by increasing<br />
Ksat and Ldd more variations occur around its decreasing<br />
mean value. The increase <strong>of</strong> g' and by increasing Ksat<br />
are respectively the reasons for the decreasing mean<br />
value and increasing uctuations amplitude <strong>of</strong> the<br />
inductance. A complete rotation <strong>of</strong> f causes two<br />
complete cycles <strong>of</strong> variation for Ldd, which is due to the<br />
two poles <strong>of</strong> the SCIM.<br />
sat<br />
- 221 - 15th IGTE Symposium 2012<br />
Figure 1: Turn function <strong>of</strong> phase 'a' winding: a) before and b) after<br />
inter-turn fault with 14 shorted turns in the outer coil and c) turn<br />
function <strong>of</strong> phase 'd' after the fault occurrence<br />
Figure 2: Phase 'd' self inductance (Ldd) variation versus Ksat and f<br />
III. DETERMINING K SAT AND F IN FAULTY SCIM DURING<br />
SIMULATION STEPS<br />
Generally, the air gap flux density distribution is<br />
disturbed in the faulty SCIMs and this leads to an error in<br />
the use <strong>of</strong> the technique for determining f [9]. In<br />
addition, Ksat was determined using the air gap voltage,<br />
which depends on the rotation speed <strong>of</strong> the air gap flux as<br />
well as its amplitude, while the saturation degree depends<br />
only on the flux density amplitude. This bring difficulty<br />
when applying the model in variable-speed drives<br />
systems, as the air gap voltage before saturation is no<br />
more a constant, but varies by the reference speed<br />
variation.<br />
When simulating SCIM using the CCM or SCCM,<br />
flux-linkages <strong>of</strong> all the rotor meshes are evaluated within<br />
any simulation step. Since any rotor mesh consists <strong>of</strong><br />
only one turn, these flux-linkages are the total fluxes<br />
passing through the meshes. Considering the short air<br />
gap length and ignoring the small flux leakages, the air<br />
gap flux density next to any rotor mesh i can be estimated<br />
as:<br />
Bai i<br />
A<br />
(5)<br />
where i is the flux-linkage <strong>of</strong> mesh i and A is the area<br />
above the mesh in the air gap:<br />
A 2rl<br />
R<br />
(6)<br />
where R is the rotor bars number. Therefore, the air gap<br />
flux density distribution is estimated within any<br />
simulation step. Then, using Fourier series analysis, the<br />
space harmonics <strong>of</strong> the air gap flux density is determined<br />
as follows:<br />
1 2<br />
R 1<br />
i1<br />
Bsn<br />
B(<br />
) sin( nP<br />
) d<br />
Bai<br />
sin( )<br />
0<br />
nP<br />
d<br />
<br />
<br />
i<br />
(7)<br />
B<br />
cn<br />
1<br />
<br />
np<br />
<br />
<br />
R<br />
<br />
i1<br />
B [cos( nP<br />
) cos( nP<br />
) ]<br />
ai<br />
i1<br />
R<br />
1<br />
Bai[sin(<br />
nPi<br />
1)<br />
sin( nPi<br />
)]<br />
np<br />
i1<br />
where Bsn and Bcn are the sine and cosine components <strong>of</strong><br />
the nth space harmonic <strong>of</strong> the estimated air gap flux<br />
density respectively, is the angle in the rotor reference<br />
and i the angle <strong>of</strong> center <strong>of</strong> the rotor bar i. Then, the<br />
phase angle <strong>of</strong> the space harmonics (in electrical<br />
i1<br />
R<br />
1 2<br />
1<br />
i1<br />
B(<br />
)<br />
cos( nP)<br />
d<br />
Bai<br />
0<br />
<br />
i<br />
i 1<br />
i<br />
cos( nP)<br />
d<br />
(8)
adians) and their amplitudes can be calculated as follows<br />
respectively:<br />
tan ( )<br />
1 Bsn<br />
n <br />
(9)<br />
B<br />
cn<br />
2<br />
Bsn<br />
2<br />
Bn Bcn<br />
<br />
(10)<br />
Having the phase angle <strong>of</strong> the fundamental harmonic<br />
(1), f could be estimated by:<br />
1<br />
<br />
f r<br />
<br />
(11)<br />
P 2P<br />
Also using the amplitude <strong>of</strong> the fundamental harmonic,<br />
Ksat obtained by:<br />
1 0 B B Ksat (12)<br />
where B0 is related to the flux density <strong>of</strong> knee (Bkp) <strong>of</strong> the<br />
core material within the teeth.<br />
These modifications in f and Ksat estimations make<br />
the SCCM applicable to the simulation <strong>of</strong> mains-/driveconnected<br />
SCIMs under the inter-turn fault. However,<br />
knee point is not a distinct point on the magnetization<br />
characteristic and there is not precise analytic method to<br />
determine the Carter's coefficient (kc). Therefore, Genetic<br />
Algorithm (GA) is used for the optimal estimation <strong>of</strong> the<br />
required B0 and kc in the next section.<br />
IV. ESTIMATION OF B0 AND K C<br />
GA is a heuristic searching method for the optimal<br />
solution based on mechanics <strong>of</strong> natural selection and<br />
natural genetics. It evolves into new generations <strong>of</strong><br />
individuals by using knowledge from the previous<br />
generations and generally includes three fundamental<br />
genetic operations <strong>of</strong> reproduction, crossover and<br />
mutation. The searching process is independent <strong>of</strong> the<br />
form <strong>of</strong> the objective function, and will not be trapped in<br />
the rapid descending direction introduced by the local<br />
optimum solutions. The solution <strong>of</strong> a complex problem<br />
can be started with weak initial estimations and then be<br />
corrected in evolutionary process <strong>of</strong> fitness. Figure 3<br />
shows a flowchart <strong>of</strong> the applied GA. More details about<br />
GA can be found in [11].<br />
To use GA for estimating B0 and kc in the proposed<br />
SCIM, a proper objective (fitness) function must be<br />
defined. Such fitness function may be achieved by<br />
closely fitting the magnetization characteristic (i.e. the<br />
no-load voltage versus no-load current curve) <strong>of</strong> the<br />
SCIM obtained from SCCM to that obtained by<br />
experiments. To do so, the no-load stator RMS line<br />
currents are measured in the laboratory with n different<br />
stator line voltages up to<br />
- 222 - 15th IGTE Symposium 2012<br />
Reproduction<br />
No<br />
Figure 3: Flowchart <strong>of</strong> the applied Genetic Algorithm<br />
Figure 4: Convergence rate <strong>of</strong> the algorithm<br />
the nominal voltage (I e ai, I e bi, I e ci, for i=1,2,…,n). Then,<br />
for any distinct values <strong>of</strong> B0 and kc, corresponding line<br />
currents with the same stator voltages are obtained by<br />
simulation (I s ai, I s bi, I s ci, for i=1,2,…,n). Now, the fitness<br />
function is defined as follows:<br />
n<br />
<br />
i1<br />
e<br />
ai<br />
s 2<br />
ai<br />
e<br />
bi<br />
Start<br />
Select variables<br />
(Solution space)<br />
Construct initial<br />
population randomly<br />
Calculate the fitness for<br />
each population<br />
Apply crossover<br />
Apply mutation<br />
Ending<br />
condition<br />
reached?<br />
Select the best population<br />
End<br />
Yes<br />
s 2<br />
bi<br />
Fit.<br />
(( I I ) ( I I ) ( I I ) ) (13)<br />
e<br />
ci<br />
s 2<br />
ci<br />
The lower the fitness, the better will be the estimation <strong>of</strong><br />
B0 and kc. With a population size <strong>of</strong> 15, GA converges to<br />
the required solution after about 130 iterations. Figure 4<br />
shows the convergence rate <strong>of</strong> the algorithm. The<br />
optimum values for B0 and kc are 0.5007 and 1.2058<br />
respectively. Figure 5 compares the<br />
magnetization characteristics obtained from the<br />
experiment and SCCM with optimal B0 and kc. Good<br />
agreement between the simulated and experimental<br />
results is evident.<br />
V. EXPERIMENTAL TEST RIG<br />
A test rig consisting <strong>of</strong> a 750 W, 380 V, 50 Hz, 2-pole,<br />
Y-connected SCIM was set up in the laboratory. Three-
Figure 5: Magnetization Characteristic obtained from simulation ()<br />
and experiments (---).<br />
Figure 6: Photograph <strong>of</strong> the test rig.<br />
phase windings <strong>of</strong> the stator <strong>of</strong> the SCIM removed and<br />
replaced by similar windings with various taps taken out<br />
from different turns <strong>of</strong> the phase ‘a’ winding. Inter-turn<br />
fault with variable number <strong>of</strong> shorted turns is produced in<br />
the SCIM by connecting any two <strong>of</strong> the taps. The motor<br />
is mechanically coupled with a magnetic powder brake to<br />
produce adjustable mechanical load. A digital scopemeter<br />
is used for sampling the line currents <strong>of</strong> the SCIM<br />
[12]. Two independent current or voltage signals can be<br />
sampled and recorded with 5000 samples per second.<br />
Figure 6 shows a photograph <strong>of</strong> the test rig.<br />
VI. SIMULATION AND ANALYSIS<br />
The proposed SCIM is simulated using the<br />
developed SCCM under various loads, supply and fault<br />
conditions and corresponding tests are performed on the<br />
real SCIM in the laboratory. The results are presented,<br />
compared and analyzed in this section. Figure 7 shows<br />
the variation <strong>of</strong> f with time in an interval between - to <br />
obtained during a simulation by the method introduced in<br />
the Section III. Constant slop <strong>of</strong> the variation <strong>of</strong> f is due<br />
to the constant speed <strong>of</strong> the air gap rotating magnetic<br />
field (the synchronous speed).<br />
Figure 8 shows the normalized spectra <strong>of</strong> the stator<br />
line current in the faulty SCIM with 21 shorted turns<br />
under no load. As seen, all the even/odd harmonics are<br />
present in the experimental and SCCM result but not in<br />
the CCM result. The amplitude <strong>of</strong> the 3 rd harmonic <strong>of</strong> the<br />
stator line current (150 Hz) might be considered as an<br />
index for the inter-turn fault [13], [14]. As seen, this<br />
amplitude in the SCCM result is very closer to the<br />
experimental result than that in the CCM result.<br />
The stator negative-sequence current component at the<br />
fundamental frequency is one <strong>of</strong> the old indexes<br />
introduced to diagnose the inter-turn fault [15], [16]. In<br />
the healthy symmetrical SCIM with the balanced threephase<br />
supply, the negative-sequence current is zero.<br />
- 223 - 15th IGTE Symposium 2012<br />
Figure 7: Simulated time variations <strong>of</strong> f<br />
Figure 8: Normalized spectra <strong>of</strong> stator line current under no load with<br />
21 shorted turns obtained by: a) experiment, b) SCCM and c) CCM<br />
However, the inter-turn fault quickly increases this<br />
current component. To determine the negative sequence<br />
current at the required frequency, the related line current<br />
phasors <strong>of</strong> the stator are obtained first by using the<br />
sampled currents and the Fourier algorithm which is<br />
conventional in the field <strong>of</strong> the digital protection [17].<br />
Then, using the line current phasors, the negative<br />
sequence current phasor is determined [15]. Knowing the<br />
amplitude, phase angle and frequency <strong>of</strong> the negative<br />
sequence current, its waveform can also be sketched. For<br />
the proposed SCIM with 21 shorted turns under full load,<br />
the negative-sequence currents obtained through the<br />
simulation and experiments have been shown in Figure 9.<br />
As seen, the saturation effect, introduced by the SCCM,<br />
increases the amplitude <strong>of</strong> the current in order to<br />
approach the experimental results<br />
Simulations and experiments on the proposed SCIM<br />
with 14 and 21 shorted turns were repeated under<br />
various load levels. Table I compares the attained<br />
amplitudes <strong>of</strong> the stator negative sequence current at the<br />
fundamental frequency. As seen, the negative sequence<br />
current increases by increasing the fault degree, while the<br />
load level change has negligible impact on the current.<br />
Also, the SCCM results follow the experimental results<br />
more<br />
closely than the CCM results.<br />
However, any negative sequence component in the<br />
TABLE. I
Current<br />
(mA)<br />
- 224 - 15th IGTE Symposium 2012<br />
AMPLITUDE OF STATOR NEGATIVE SEQUENCE CURRENT UNDER VARIOUS LOAD LEVELS<br />
Faulty SCIM with 14 short-circuited turns Faulty SCIM with 21 short-circuited turns<br />
No<br />
load<br />
20%<br />
rated<br />
load<br />
40%<br />
rated<br />
load<br />
60%<br />
rated<br />
load<br />
80%<br />
rated<br />
load<br />
Full<br />
load<br />
No<br />
load<br />
20%<br />
rated<br />
load<br />
40%<br />
rated<br />
load<br />
60%<br />
rated<br />
load<br />
80%<br />
rated<br />
load<br />
Experimental 470 478 481 482 489 494 698 690 690 712 726 730<br />
SCCM 467 490 480 456 462 487 693 700 668 701 702 702<br />
CCM 413 457 438 411 422 430 619 623 609 647 648 650<br />
Figure 9: Stator negative sequence current obtained by a) experiment, b)<br />
SCCM and c) CCM in faulty SCIM with 21 shorted turns under fullload.<br />
mains voltage, which is permissible to some small extent<br />
in real mains, produces negative sequence current in the<br />
stator <strong>of</strong> the healthy SCIM. Simulation result indicates<br />
that with 2% negative sequence in the stator voltage <strong>of</strong><br />
the healthy SCIM, the negative sequence current changes<br />
by 0.54% from no-load to full-load, by 1.86% from nonsaturable<br />
(CCM) to saturable motor (SCCM) and by<br />
6.65% from healthy to single-turn shorted condition (the<br />
weakest fault), while changing the negative sequence<br />
voltage from 2% to 5% changes the negative sequence<br />
current by 148.8% in the healthy SCIM. Therefore, the<br />
negative sequence current as an inter-turn fault index is<br />
highly sensitive to the voltage imbalance level, which is<br />
not pleasing as shown in Figure 10.<br />
The negative sequence apparent impedance <strong>of</strong> the<br />
SCIM is also used as an index to diagnose the stator<br />
inter-turn faults [18], [19]. This impedance is the ratio <strong>of</strong><br />
the stator negative sequence voltage phasor to its<br />
negative sequence current phasor. Figure 11 shows the<br />
similar results with Figure 10 for this index which is<br />
obtained using simulation. As seen, this index has very<br />
smaller sensitivity to the voltage imbalance and the<br />
magnetic saturation is the most effective lateral factor<br />
affecting this index.<br />
VII. CONCLUSION<br />
The flux-linkages <strong>of</strong> the rotor meshes, calculated in<br />
every simulation step <strong>of</strong> the CCM for the SCIM was used<br />
to estimate the air gap flux density distribution. Then,<br />
space harmonic components <strong>of</strong> the air gap flux density<br />
were determined using Fourier series analysis. The phase<br />
angle <strong>of</strong> the space fundamental harmonic was utilized to<br />
locate the air gap flux density during simulation <strong>of</strong> the<br />
faulty SCIMs. Also, the amplitude <strong>of</strong> this fundamental<br />
harmonic is applicable to evaluate the saturation factor<br />
more reasonably. Therefore, a saturable CCM was<br />
Full<br />
load<br />
Figure 10: Sensitivity <strong>of</strong> stator negative sequence current to the weakest<br />
fault (single-turn), saturation, voltage imbalance and load level,<br />
evaluated by simulations.<br />
Figure 11: Sensitivity <strong>of</strong> the negative sequence apparent impedance <strong>of</strong><br />
the SCIM to weakest fault (single-turn), saturation, voltage imbalance<br />
and load level, evaluated by simulations.<br />
developed which is capable to analyze faulty SCIMs.<br />
Comparing the simulation results with the corresponding<br />
experimental results indicates that the saturable model is<br />
more precise than the non-saturable model. Further study<br />
showed that the magnetic saturation affects the inter-turn<br />
fault indices more than the load level and the stator<br />
voltage imbalance.<br />
APPENDIX<br />
The indefinite integral <strong>of</strong> the inverse air gap function<br />
(gf -1 ) is determined first as follows:<br />
f ( ,<br />
, K<br />
f<br />
sat<br />
<br />
1<br />
) g f ( ,<br />
f , K sat ) d<br />
1<br />
cos( 2P(<br />
)) <br />
1<br />
<br />
cos <br />
f <br />
<br />
<br />
<br />
2 <br />
2 P g 1 <br />
1 cos( 2P(<br />
f ))<br />
<br />
(A1)<br />
Then, the analytical equation for the inductances <strong>of</strong> the<br />
`stator windings is obtained as:<br />
L<br />
x y<br />
m<br />
o<br />
r l<br />
nx<br />
( ti )[ n y ( ti ) f s ( f , K sat )]<br />
(A2)<br />
i1<br />
[ f ( <br />
i1<br />
, , K<br />
f<br />
sat<br />
) f ( , , K<br />
where x and y accounts for the stator phases, m is the<br />
number <strong>of</strong> the stator slots, i is the angle <strong>of</strong> center <strong>of</strong> the<br />
stator slot i, ti is the angle <strong>of</strong> center <strong>of</strong> the stator tooth<br />
after the stator slot i, and fs is:<br />
2<br />
g<br />
1<br />
m<br />
fs ( f , Ksat<br />
) ny<br />
( ti)<br />
[ f ( i1,<br />
f , Ksat<br />
) f ( i,<br />
f , K<br />
2<br />
i1<br />
i<br />
f<br />
sat<br />
)]<br />
sat<br />
)]<br />
(A3)
Equation (A2) is independent <strong>of</strong> r and its derivative<br />
versus r is zero for all x and y. For the rotor meshes the<br />
inductance equation is:<br />
Luv o r l [ C fr<br />
( f , K sat )] [ f ( x1,<br />
f , K sat ) f ( x,<br />
f , K sat )] (A4)<br />
now u and v accounts for the rotor meshes 1 to R, f is f<br />
in the rotor reference (f = f - r), u is the angle <strong>of</strong> center<br />
<strong>of</strong> the rotor bar number u in the rotor reference, C=1 for<br />
u = v and C=0 otherwise and fr is:<br />
2<br />
g<br />
1<br />
<br />
fr ( f , K sat ) [ f ( y1,<br />
f , K sat ) f ( y , f , K<br />
2<br />
sat<br />
)]<br />
- 225 - 15th IGTE Symposium 2012<br />
(A5)<br />
Equations (A4)-(A5) depend on r because <strong>of</strong> f.<br />
Considering the relationship between these two variables<br />
yields Luv/r= -Luv/f and thus:<br />
Luv<br />
or<br />
l[<br />
C fr<br />
( f , K<br />
<br />
r<br />
where:<br />
f<br />
r ( f , K<br />
o<br />
rl<br />
<br />
f<br />
sat<br />
f<br />
( u1,<br />
f , Ksat)<br />
f<br />
( u,<br />
f , Ksat)<br />
)] [<br />
<br />
]<br />
<br />
<br />
sat<br />
<br />
)]<br />
[ f ( , , K ) f ( , , K<br />
u1<br />
f<br />
f<br />
sat<br />
u<br />
f<br />
f<br />
sat<br />
)]<br />
(A6)<br />
f r ( f , Ksat<br />
) g<br />
<br />
<br />
f<br />
2<br />
1<br />
<br />
2<br />
f<br />
( y 1,<br />
f , Ksat<br />
) f<br />
( y , f , Ksat<br />
)<br />
[<br />
<br />
]<br />
<br />
f<br />
<br />
f<br />
(A7)<br />
f ( i , f , Ksat<br />
)<br />
1<br />
<br />
; i x1,<br />
x , y1,<br />
y<br />
<br />
f g[<br />
1<br />
cos( 2P(<br />
i <br />
f ))]<br />
(A8)<br />
For the mutual inductances between the rotor meshes and<br />
stator phases the following equation is obtained:<br />
k2<br />
1<br />
i1<br />
<br />
i<br />
Lmn<br />
o<br />
r l<br />
[ nn<br />
( ) f s ( f , K sat )]<br />
ik<br />
2<br />
1<br />
(A9)<br />
[ f ( , , K ) f ( , , K )]<br />
i<br />
f<br />
sat<br />
i1<br />
where x and y account for the rotor meshes and stator<br />
phases respectively, k1-1 and k2+1 are the angles <strong>of</strong> the<br />
two bars <strong>of</strong> mesh x in the stator reference, k1 to k2 are the<br />
stator slots between k1-1 and k2+1 and e.g. k1 is the angle<br />
<strong>of</strong> the stator slot k1. Now k1-1 and k2+1 are responsible to<br />
the dependency <strong>of</strong> inductances on r. The derivatives <strong>of</strong><br />
the two mentioned parameters versus r are equal to 1,<br />
while the derivatives <strong>of</strong> the other parameters in (A9)<br />
versus r are zero. Using these facts and some<br />
differentiation rules leads to:<br />
Lx<br />
y<br />
<br />
r<br />
<br />
o r l<br />
k1<br />
<br />
k11<br />
[ n y ( ) f s ( f , K sat )]<br />
2<br />
g[<br />
1<br />
cos( 2P(<br />
k11<br />
<br />
f ))]<br />
<br />
o<br />
r l<br />
k 21<br />
<br />
k 2<br />
[ n y ( ) f s ( f , K sat )]<br />
2<br />
g[<br />
1<br />
cos( 2P(<br />
<br />
))]<br />
k 21<br />
REFERENCES<br />
f<br />
f<br />
sat<br />
(A10)<br />
[1] X. Luo, Y. Liao, H. A. Toliyat, A. El-Antably and T. A. Lipo,<br />
“Multiple coupled circuit modeling <strong>of</strong> induction machines,”<br />
IEEE Trans. Ind. Applications, vol. 31, pp. 311 - 318,<br />
March/April 1995.<br />
[2] A. Raie, and V. Rashtchi, "Using a genetic algorithm for<br />
detection and magnitude determination <strong>of</strong> turn faults in an<br />
induction motor", Springer-Verlag, vol. 84, pp. 275–279, August<br />
2002.<br />
[3] S. Nandi, “Detection <strong>of</strong> stator faults in induction machines using<br />
residual saturation harmonics,” IEEE Trans. Ind. Applications,<br />
vol. 42, no. 5, pp. 1201 - 1208, 2006.<br />
[4] J. C. Moreira and T.A. Lipo, “Modeling <strong>of</strong> saturated ac machines<br />
including air gap flux harmonic components,” IEEE Trans. Ind.<br />
Applications, vol. 28, pp. 343 - 349, March/April 1992.<br />
[5] D. Bispo, L. M. Neto, J. T. Resende and D. A. Andrade, “A new<br />
strategy for induction machine modeling taking into account the<br />
magnetic saturation ,” IEEE Trans. Ind. Applications, vol. 37, no.<br />
6, pp. 1710 - 1719, Nov./Dec. 2001.<br />
[6] T. Tuovinen, M. Hinkkanen, and J. Luomi, “Modeling <strong>of</strong><br />
saturation due to main and leakage fux interaction in induction<br />
machines,” IEEE Trans. Ind. Applications, vol. 46, no. 3, pp.<br />
937 - 945, 2010.<br />
[7] Tu Xiaoping, L.-A. Dessaint, R. Champagne, and K. Al-Haddad,<br />
“Transient modeling <strong>of</strong> squirrel-cage induction machine<br />
considering air-gap flux saturation harmonics,” IEEE Trans. Ind.<br />
Electronics, vol. 55, no. 7, pp. 2798 - 2809, 2008.<br />
[8] S. Nandi, “A detailed model <strong>of</strong> induction machines with<br />
saturation extendable for fault analysis,” IEEE Trans. Ind.<br />
Applications, vol. 40, pp. 1302 - 1309, September/October 2004.<br />
[9] M. Ojaghi and J. Faiz, “Extension to multiple coupled circuit<br />
modeling <strong>of</strong> induction machines to include variable degrees <strong>of</strong><br />
saturation effects,” IEEE Trans. Magn., vol. 44, no. 11, pp.<br />
4053-4056, Nov. 2008.<br />
[10] J. Faiz, and I. Tabatabaei, "Extension <strong>of</strong> winding function theory<br />
for nonuniform air gap in electric machinery," IEEE Trans. Magn.,<br />
vol. 38, pp. 3654-3657, November 2002.<br />
[11] Z. Michalewicz, Genetic Algorithms & Data Structures, Evalution<br />
Programs, Springer-Verlag, 1992.<br />
[12] Fluke 196c/199C Scope-Meter User's Manual, Fluke<br />
Corporation, Oct. 2001, Netherlands.<br />
[13] G. Joksimovic, J. Penman, “The detection <strong>of</strong> interturn short<br />
circuits in the stator windings <strong>of</strong> operating motors,” IEEE Trans<br />
Ind. Electronics, vol. 47, no.5, pp.1078–1084, Oct. 2000.<br />
[14] J.H. Jung, J.J. Lee, and B.H. Kwon, “Online diagnosis <strong>of</strong><br />
induction motors using MCSA,” IEEE Trans. Ind. Electronics,<br />
vol. 53, no. 6, pp. 1842–1852, Dec. 2006.<br />
[15] A.Bellini, F.Filippetti, C.Tassoni, G.A.Capolino, “Advances in<br />
Diagnostic Techniques for Induction Machines,” IEEE Trans.<br />
Industrial Electronics, vol.55, no12, pp. 4109-4126, Dec. 2008.<br />
[16] Wu Qing, and S. Nandi, “Fast single-turn sensitive stator interturn<br />
fault detection <strong>of</strong> induction machines based on positive- and<br />
negative-sequence third harmonic components <strong>of</strong> line currents,”<br />
IEEE Trans. Ind. Applications, vol. 46, pp. 974 - 983, 2010.<br />
[17] A.T.Johns and S.K. Salman, Digital Protection for Power<br />
Systems. IEE Power series 15, London, UK 1995.<br />
[18] J.L. Kohler, J. Sottile, and F.C. Trutt, “Condition monitoring <strong>of</strong><br />
stator windings in induction motors: I. Experimental<br />
investigation on effective negative-sequence impedance<br />
detector,” IEEE Trans. Ind. Application, vol. 38, pp. 1447–1453,<br />
2002.<br />
[19] L. Sang Bin, R.M. Tallam, and T.G. Habetler, “A robust, on-line<br />
turn-fault detection technique for induction machines based on<br />
monitoring the sequence component impedance matrix,” IEEE<br />
Trans. Power Electronics, vol. 18, pp. 865–872, 2003.
- 226 - 15th IGTE Symposium 2012<br />
Accurate Magnetostatic Simulation <strong>of</strong> Step-Lap<br />
Joints in Transformer Cores Using Anisotropic<br />
Higher Order FEM<br />
A. Hauck∗ , M. Ertl † ,J.Schöberl ‡ and M. Kaltenbacher §<br />
∗ SIMetris GmbH, Erlangen, Germany † SIEMENS Transformers, Nuremberg, Germany<br />
‡ Institute for Analysis and Scientific Computing, Vienna <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Austria<br />
§ Institute <strong>of</strong> Mechanics and Mechatronics, Vienna <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Austria<br />
E-mail: andreas.hauck@simetris.de<br />
Abstract—We present a simulation scheme for the accurate simulation <strong>of</strong> thin magnetic structures, specifically the nonlinear<br />
magnetic flux distribution in a core step-lap joint with interest in the local saturation near the air gaps. Due to the high<br />
aspect ratio <strong>of</strong> the model, we utilize hierarchical higher order finite elements, where the polynomial degree is spatially<br />
adapted to resolve the flux distribution within the steel sheets. The deterioration <strong>of</strong> convergence in the iterative conjugate<br />
gradient (CG) solver is handled by an anisotropic Schwarz-type block preconditioner, grouping the unknowns depending<br />
on the aspect ratio <strong>of</strong> the elements. The resulting Newton scheme can optionally be accelerated by a 2-step solution strategy,<br />
where a start value is computed on a coarse subspace <strong>of</strong> lowest order in analogy to a full multigrid scheme.<br />
Index Terms—Step-Lap Joints, Higher Order Finite Elements, Block Preconditioner, Nonlinear Solver<br />
I. INTRODUCTION<br />
Precise knowledge about the accurate flux distribution<br />
in transformer cores is both important for reducing the<br />
magnetic losses, as well as for localizing sources for<br />
forces (magnetostriction, interlaminar forces). In recent<br />
years significant reduction <strong>of</strong> both effects was achieved<br />
using the multi-step-lap technique (see Fig. 1 and 2),<br />
where the overlap region <strong>of</strong> transformer sheets is shifted<br />
in several steps [1]. In order to optimize the layout<br />
further, a detailed simulation <strong>of</strong> the fluxes, including the<br />
nonlinear B-H curve <strong>of</strong> the core has to be performed.<br />
Accurate simulation <strong>of</strong> such a problem poses some diffi-<br />
Fig. 1. Sketch <strong>of</strong> transformer core<br />
with step-lap corners.<br />
A<br />
A<br />
View A – A<br />
Fig. 2. Magnetic flux concentration<br />
(45 ◦ -view).<br />
culties, as the thickness <strong>of</strong> the steel sheets is typically in<br />
the range <strong>of</strong> 200 - 300 μm, while the length can be a few<br />
meters and the number <strong>of</strong> vertically stacked sheets add up<br />
to some thousand layers. In addition, the flux variation is<br />
very high in the vicinity <strong>of</strong> the air gaps in the corner, but<br />
rather smooth in some distance away from it, making it<br />
difficult to resolve accurately. Furthermore, the nonlinear<br />
permeability <strong>of</strong> the grain-oriented electrical steel sheets<br />
has to be taken into account and modeled correctly.<br />
Within this paper we solve the nonlinear magnetostatic<br />
problem using finite elements <strong>of</strong> higher order, together<br />
with an iterative preconditioned conjugate gradient (CG)<br />
method. By exploiting the special structure <strong>of</strong> the Finite<br />
Element (FE) basis [2], we can build an effective<br />
preconditioner for handling elements with high aspect<br />
ratios, while at the same time the spatial accuracy can be<br />
adapted to the discretization <strong>of</strong> the model. The nonlinear<br />
Newton scheme can be further accelerated by calculating<br />
a good initial start value on a coarse sub-space. Finally,<br />
the applicability <strong>of</strong> the method is demonstrated for the<br />
aforementioned step-lap core model.<br />
II. NONLINEAR MAGNETOSTATIC FORMULATION<br />
The nonlinear magnetostatic problem can be written in<br />
terms <strong>of</strong> the magnetic vector potential A as<br />
∇×ν(|∇ × A|)(∇×A) =J + ∇×νB0<br />
B = ∇×A , (1)<br />
where ν(|∇ × A|) is the nonlinear reluctivity (e.g. <strong>of</strong><br />
steel), J the impressed current density (e.g. <strong>of</strong> a coil)<br />
and B0 an additional prescribed flux density. As the<br />
impressed current density J and the curl <strong>of</strong> the prescribed<br />
flux density B0 are assumed to be divergence free, a<br />
unique solution can either be guaranteed by enforcing<br />
the Coulomb gauge<br />
∇·A =0 (2)<br />
explicitly (leading to a mixed formulation) or by adding<br />
a small regularization term αA to (1), where α ≈ 10 −6 ν<br />
[2]. We will modify this strategy in Section III.<br />
For physical reasons, the vector potential A is only<br />
continuous in the tangential part, which requires the use<br />
<strong>of</strong> H(curl)-conforming vectorial elements, which will be<br />
introduced in Sec. III.
The nonlinear problem is solved using a Newton<br />
formulation<br />
F ′ (Ak)[ΔA] =−F(Ak) (3)<br />
Ak+1 = Ak + ηΔA . (4)<br />
Improvement <strong>of</strong> convergence is achieved by computing<br />
a line search parameter η ∈ ]0, 1], which is determined<br />
by minimizing the residual <strong>of</strong> (3). The magnetic B-H<br />
commutation curve is extracted from measured hysteresis<br />
curves, reducing the problem to a simplified nonlinear<br />
one, which is given in terms <strong>of</strong> measured (Bi,Hi)-value<br />
pairs. By applying a C 1 -spline approximation, a smooth<br />
monotone approximation is calculated, from which the<br />
reluctivity ν and its derivative ν ′ (entering the Fréchet<br />
derivative F ′ ) can be derived (see Fig. 3). For details we<br />
refer to [3].<br />
Fig. 3. Approximation <strong>of</strong> measured B-H-data by C 1 -splines.<br />
III. HIGHER ORDER FINITE ELEMENTS<br />
The two key requirements for our choice <strong>of</strong> higher<br />
order H(curl)-conforming FE shape functions are the<br />
ability to choose the polynomial degree p independently<br />
in each local direction (ξ,η,ζ), as well as the availability<br />
<strong>of</strong> efficient iterative solution techniques (i.e. an efficient<br />
preconditioner). The first requirement leads to the use<br />
<strong>of</strong> hierarchical shape functions, where we we utilize the<br />
hierarchical shape functions <strong>of</strong> [2], which can be written<br />
as<br />
N(T )= <br />
N 0 E ⊕ <br />
N ∇ E ⊕ <br />
NF ⊕<br />
E<br />
E<br />
F<br />
N ∇ F ⊕ NI ⊕ N ∇ I<br />
The shape functions N are composed <strong>of</strong> unknowns<br />
defined on edges, faces and in the interior (subscripts<br />
E, F and I), see Fig. 4 for a hexahedral element.<br />
The lowest order Nédélec functions N0 E , which have a<br />
pζ pη pξ E 5<br />
E 12<br />
F 2<br />
E 4<br />
E 8<br />
E 1<br />
ζ<br />
η<br />
ξ<br />
F 1<br />
F 6<br />
E 9<br />
E 11<br />
E 3<br />
E 6<br />
E 10<br />
E 2<br />
F4 E7 Fig. 4. Degrees <strong>of</strong> freedom for the hexahedral element.<br />
- 227 - 15th IGTE Symposium 2012<br />
(5)<br />
constant tangential component along one edge (p = 0),<br />
are explicitly included. In addition, higher order gradient<br />
components on edges N∇ E , faces N∇F and in the interior<br />
N∇ I are represented separately. This key feature - also<br />
known as the local exact sequence property - is equivalent<br />
to fulfilling the so-called De-Rham complex<br />
R id<br />
−→ H 1 (Ω) grad<br />
−→ H(curl,Ω)<br />
curl<br />
−→ H(div,Ω) div<br />
0<br />
−→ L2(Ω) −→ {0} (6)<br />
already on the finite element level, i.e. the gradients,<br />
forming the null-space <strong>of</strong> the curl-operator, can be completely<br />
omitted for each type <strong>of</strong> unknowns (edge, face,<br />
interior) separately if only the flux density B is <strong>of</strong><br />
interest. In [2], this is denoted as the reduced basis and<br />
can be used to gauge the problem in the following way:<br />
• For the lowest order Nédélec functions N0 E , we add<br />
the regularization term α as described in Sec. I.<br />
• For the higher order terms, we simply skip the<br />
gradient functions N∇ E , N∇F and N∇I .<br />
Another unique advantage <strong>of</strong> this basis according to [9]<br />
is that shape functions <strong>of</strong> arbitrary order are available<br />
for all types <strong>of</strong> elements in 2-D and 3-D, utilizing any<br />
kind <strong>of</strong> hierarchical 1-D shape functions, e.g. Legendre<br />
or Gegenbauer.<br />
A. Anisotropic Adapted Polynomial Degree<br />
In general the magnetic flux density B is defined as<br />
⎛<br />
B = ⎝ Bx<br />
⎞<br />
⎛<br />
⎞<br />
∂Az ∂Ay<br />
∂y − ∂z<br />
By ⎠ ⎜ ∂Ax ⎟<br />
= ∇×A = ⎝<br />
⎠ . (7)<br />
Bz<br />
∂z<br />
∂Ay<br />
∂x<br />
∂Az<br />
− ∂x<br />
∂Ax<br />
− ∂y<br />
In case <strong>of</strong> thin structures, the in-plane components<br />
(Bx,By) are dominant (see Fig. 5). Additionally, the<br />
variation <strong>of</strong> the in-plane components <strong>of</strong> A in z-direction<br />
( ∂Ax ∂Ay<br />
, ) in (7) is already resolved accurately by the<br />
∂z<br />
∂z<br />
FE-discretization in thickness direction <strong>of</strong> the single sheet<br />
, i.e.<br />
layers. Thus the dominant terms left are ∂Az<br />
∂y<br />
A (z)<br />
z<br />
z<br />
y<br />
x<br />
z<br />
B (x)<br />
y<br />
A (x)<br />
z<br />
A (x)<br />
z<br />
x<br />
and ∂Az<br />
∂x<br />
Fig. 5. Flux / potential distribution in face on (x, z)-plane.<br />
the magnetic vector potential A should be approximated<br />
quite accurately in the in-plane direction.<br />
Therefore, it is advantageous to reflect his behavior in<br />
the anisotropic polynomial degree as<br />
pη,pξ >pζ , (8)<br />
assuming that the global z and the local ζ direction<br />
coincide. The increase <strong>of</strong> the polynomial degree only<br />
affects the face NF and inner NI degrees <strong>of</strong> freedoms,
as we skip higher order gradient functions N ∇ E and N∇ F<br />
due to gauging. This leads to practical order templates<br />
like paniso =(2, 2, 1) or paniso =(3, 3, 1). Although it<br />
seems that the lowest order anisotropic template should<br />
be paniso =(1, 1, 0), this does not lead to more accurate<br />
results, as only faces in (x, y)-direction get additional<br />
unknowns, which do not contribute to an improved<br />
resolution <strong>of</strong> Az.<br />
As the permeability in air μ0 is typically several<br />
orders <strong>of</strong> magnitude smaller compared to the one in the<br />
ferromagnetic core, the flux is mostly concentrated in<br />
the core. This allows us the choose a small isotropic<br />
polynomial degree <strong>of</strong> pair = 0 for the approximation.<br />
The last step is especially effective, if structured grids<br />
are utilized or if the air domain is significantly large.<br />
B. Single Step Iterative Solution Scheme<br />
The resulting system <strong>of</strong> equations after FE discretization<br />
can be written as<br />
with<br />
⎛<br />
⎝<br />
KN0N0 KN0F KN0I<br />
KF N0 KFF KFI<br />
KIN0 KIF KII<br />
K(A)A = f , (9)<br />
⎞ ⎛<br />
⎠ ⎝<br />
AN0<br />
AF<br />
AI<br />
⎞ ⎛<br />
⎠ = ⎝<br />
fN0<br />
fF<br />
fI<br />
- 228 - 15th IGTE Symposium 2012<br />
⎞<br />
⎠ .<br />
(10)<br />
The interior unknowns AI can be eliminated by static<br />
condensation as<br />
AI = K −1<br />
II (fI − KIN0 AN0 − KIFAF ) , (11)<br />
where K −1<br />
II can be inverted on the element level. Substituting<br />
this result back in (10) results in the reduced<br />
system<br />
<br />
ˆKN0N0 ˆKN0F<br />
ˆfN0<br />
AN0 = , (12)<br />
ˆKF<br />
ˆKFF AF ˆfF<br />
N0<br />
with the modified matrices and RHS vectors as<br />
−1<br />
= KN0N0 − KN0I(KII )KIN0<br />
ˆKN0F = KN0F − KN0I(K −1<br />
II )KIF<br />
ˆKN0N0<br />
ˆKF N0 = KF N0 − KIF(K −1<br />
II )KN0I<br />
(13)<br />
(14)<br />
(15)<br />
ˆKFF = KFF − KIF(K −1<br />
II )KFI (16)<br />
ˆfN0 = fN0 − KN0I(K −1<br />
II )fI (17)<br />
ˆfF = fF − KFI(K −1<br />
II )fI . (18)<br />
The two main effects <strong>of</strong> the static condensation are:<br />
• The number <strong>of</strong> unknowns is reduced significantly,<br />
as with increasing polynomial degree p only face<br />
and interior unknowns are added.<br />
• The condition number κ <strong>of</strong> the reduced system (12)<br />
is much smaller compared to the one <strong>of</strong> the full<br />
system (10), causing less iterations <strong>of</strong> the CG solver.<br />
In order to solve the reduced system (12), we apply a<br />
Preconditioned Conjugate Gradient (PCG) method. It was<br />
shown in [2], that a α-robust preconditioner C−1 can be<br />
simply defined by a block Jacobian preconditioner (i.e.<br />
an additive Schwarz method, ASM), defined by<br />
<br />
C =<br />
ˆKN0N0<br />
0<br />
0<br />
Kˆ B<br />
FF<br />
<br />
, (19)<br />
where the single blocks are formed as follows:<br />
• ˆ KN0N0 : The lowest order Nédélec functions can be<br />
either solved by a sparse direct solver [10] or by a<br />
suitable iterative method, respecting the Helmholtzdecomposition<br />
<strong>of</strong> the magnetic vector potential (see<br />
e.g. [11]).<br />
: For every face, all unknowns are grouped in<br />
• ˆ KB FF<br />
one block (superscript B). The application <strong>of</strong> the<br />
preconditioner basically is just the inversion <strong>of</strong> the<br />
single face blocks K −1<br />
FF .<br />
IV. ANISOTROPIC BLOCK PRECONDITIONER<br />
If the preconditioner (19) is applied for structures with<br />
a very high aspect ratio (AR), the convergence <strong>of</strong> the<br />
iterative solver deteriorates. This can be explained by an<br />
increase in the condition number κ = λmax/λmin, asthe<br />
entries in the stiffness matrix K scale with 1/h, where h<br />
is the mesh size, leading to strongly coupled entries for<br />
nearly parallel edges / faces and thus to nearly singular<br />
systems with high condition numbers [5].<br />
The idea proposed in [5] is based on a singularity<br />
decomposition technique, where new unknown variables<br />
are introduced and assigned to groups <strong>of</strong> parallel edges<br />
with small distance. However, this method introduces<br />
new matrix entries, as all edges in one group couple via<br />
the auxiliary variable. In addition, the method is only<br />
applicable to 1st order elements, as only edge degrees <strong>of</strong><br />
freedom are considered.<br />
An alternative approach is taken in [4], where a plane<br />
smoother for nodal and edge components <strong>of</strong> the A − φformulation,<br />
respecting the Helmholtz-decomposition, is<br />
applied within a geometric multigrid (MG) solver. Again,<br />
explicit knowledge <strong>of</strong> the anisotropic direction is needed<br />
a priori. In our approach, the idea <strong>of</strong> [4] is extended to<br />
η<br />
ζ<br />
ξ<br />
F3<br />
F2<br />
F1<br />
F8<br />
F7<br />
F6<br />
F5<br />
F4<br />
thin direction(s)<br />
Fig. 6. Thin structure with 2 distinct face groups {F1, F2, F3} and<br />
{F4, ..., F10}, where η is the long direction.<br />
the p-version <strong>of</strong> the FEM. As the lowest order edge contributions<br />
KN0N0 are already solved with a direct solver<br />
and the inner degrees <strong>of</strong> freedom KII get eliminated by<br />
static condensation, only the face contributions ˆ KFF are<br />
affected by the anisotropy. Thus we can modify the initial<br />
face blocks ˆ KB FF <strong>of</strong> the preconditioner matrix C (19) by<br />
grouping all unknowns <strong>of</strong> the faces perpendicular to the<br />
F10<br />
F9
thin direction in one diagonal block ˆ K Bai<br />
FF , if the aspect<br />
ratio <strong>of</strong> the element exceeds a user-defined threshold<br />
ARth. We can even generalize the idea, allowing for two<br />
anisotropic / thin directions within one element, e.g. in<br />
Fig. 6 the faces F4 to F10 couple strongly, as the size in<br />
both, ξ- and ζ-direction, is small compared to the extend<br />
in η-direction. This is especially useful in meshes with<br />
tensor-product structure.<br />
The modified preconditioner matrix is then defined as<br />
<br />
ˆKN0N0 0<br />
C =<br />
0 Kˆ Bai . (20)<br />
FF<br />
The procedure for computing ˆ K Bai<br />
FF without explicit<br />
knowledge <strong>of</strong> the thin direction(s) is sketched in Algorithm<br />
1. It collects strongly coupled (thin) faces in a graph<br />
and determines the blocks <strong>of</strong> ˆ K Bai<br />
FF by calculating the set<br />
<strong>of</strong> connected components <strong>of</strong> it. As the only information<br />
needed for the algorithm is the size <strong>of</strong> the elements in<br />
each direction, the procedure can be applied to general<br />
3-D elements (tetrahedron, wedge, pyramid).<br />
Algorithm 1: Definition <strong>of</strong> anisotropic face blocks.<br />
Input: elements e <strong>of</strong> mesh T<br />
Output: groups <strong>of</strong> thin faces Gi,i=1,...,nG<br />
Data: graph <strong>of</strong> connected anisotr. faces G=(V,E)<br />
foreach e ∈T do<br />
if ARmax(e) ≥ ARth then<br />
compute size <strong>of</strong> element w.r.t. local<br />
directions (hξ,hη,hζ)<br />
hmax = max(hξ,hη,hζ)<br />
foreach d ∈{ξ,η,ζ} do<br />
if hd/hmax ≥ ARth then<br />
get faces F1, F2 perpendicular to<br />
d-direction<br />
insert (F1,F2) in G<br />
nG = # <strong>of</strong> non-connected components <strong>of</strong> G<br />
for i =1to nG do<br />
Gi = connected faces in i-th component <strong>of</strong> G<br />
V. NONLINEAR TWO-STEP STRATEGY<br />
In order to accelerate the resulting Newton scheme,<br />
we utilize a 2-step strategy, motivated by full multigrid<br />
methods (see [6] and [8]). The idea is to compute a<br />
good start approximation for the Newton scheme on a<br />
coarse sub-space T H , which is formed in the p-version<br />
by the lowest order functions KN0N0. The fine space T h<br />
is spanned by the complete set <strong>of</strong> polynomials [7]. Thus<br />
the stiffness matrix K in (10) can be splitted into KH<br />
and Kh as follows<br />
KH = K00<br />
Kh = K =<br />
<br />
= KN0N0 (21)<br />
<br />
. (22)<br />
K00 K01<br />
K10 K11<br />
Due to the hierarchical basis functions, the interpolation<br />
operator Ih H is trivially defined as<br />
Ah =[I, 0] T AH = I h HAH . (23)<br />
- 229 - 15th IGTE Symposium 2012<br />
15 cm<br />
y<br />
z<br />
x<br />
30 cm<br />
0.96 mm<br />
1 layer<br />
air gaps<br />
(exploded view)<br />
Fig. 7. Model <strong>of</strong> step-lap core without air domain (scale factor 30 in<br />
thickness direction).<br />
This allows us to perform a 2-step solution strategy as<br />
follows:<br />
1) Solve a few Newton steps on the small coarse<br />
system KN0N0 (21), using a direct solver.<br />
2) Interpolate the coarse space solution AH to the fine<br />
space Ah according to (23).<br />
3) Proceed with the full system Kh (22) using the<br />
solution strategy <strong>of</strong> Section III-B.<br />
VI. APPLICATION: STEP-LAP CORE MODEL<br />
The applicability <strong>of</strong> the method is demonstrated for a<br />
typical 45◦-multi-step-lap joint region <strong>of</strong> a transformer<br />
core (see Fig. 7) with 4 layers <strong>of</strong> steel sheet, each<br />
0.24 mm in thickness, with a step-lap <strong>of</strong> 2 air gaps<br />
(width: 1 mm) in each sheet. The model is discretized<br />
by 2568 hexahedral elements and 3172 nodes. The used<br />
nonlinear B-H curve is the one depicted in Fig. 3. As<br />
excitation, we apply a prescribed flux density B0 =<br />
0.1 − 2.5 Tiny-direction.<br />
Fig. 8. Concentration <strong>of</strong> magnetic flux lines in corner (45 ◦ -view) for<br />
uniform p =0(top) and p =3(bottom) with B0 =1.0 T (scale<br />
factor 30 in thickness direction).<br />
A. Initial Results<br />
Initially, we choose an isotropic polynomial degree<br />
p =0,...,3 and compare the spatial resolution <strong>of</strong> the<br />
magnetic flux density near the air gaps. Here, the Newton<br />
algorithm takes between 3 and 9 iterations.<br />
The results <strong>of</strong> the simulation are visualized on a very<br />
fine postprocessing mesh. In Fig. 8 it is clearly visible,<br />
B 0
Fig. 9. Flux distribution for uniform p =0(top) and p =3(bottom)<br />
with B0 =1.0 T (scale factor 30 in thickness direction).<br />
that the continuation <strong>of</strong> the fluxlines across the air gaps<br />
is poorly approximated for p =0and that the curvature<br />
<strong>of</strong> the streamlines is unphysical between the air gaps. In<br />
contrast, the simulation using p =3resolves accurately<br />
the flux concentration above and below the air gaps<br />
(depicted in red). The same observation holds true for<br />
the absolute value <strong>of</strong> the magnetic flux density in Fig.<br />
9, where the flux concentration between the air gaps is<br />
smeared over a large area for p =0.<br />
From Fig. 10 we deduce, that the iterative method is<br />
by a factor 2 to 10 slower compared to the direct method<br />
for all excitation values B0. In contrast, the memory<br />
consumption is only about 50% compared to the direct<br />
one for higher polynomial degrees p, as seen in Table I<br />
(SC denotes the use <strong>of</strong> static condensation).<br />
Fig. 10. Simulation time for direct and iterative solution approach<br />
without anisotropic block preconditioner.<br />
TABLE I<br />
MEMORY REQUIREMENT AND DOFS FOR DIFFERENT POLYNOMIAL<br />
DEGREES (SC: STATIC CONDENSATION)<br />
Polynomial Degree piso<br />
0 1 2 3<br />
# Total DOFs 7824 43956 141840 332292<br />
# Inner DOFs - 12840 71904 208008<br />
Memory Usage (GB)<br />
Direct Solver 0.24 0.39 1.12 3.61<br />
Direct Solver (SC) 0.24 0.37 0.92 2.32<br />
Iterative 1-step (SC) 0.24 0.28 0.53 1.61<br />
- 230 - 15th IGTE Symposium 2012<br />
B. Use <strong>of</strong> Anisotropic Block Preconditioner<br />
The poor runtime performance <strong>of</strong> the iterative 1-step<br />
scheme in Fig. 10 can be explained by the extremely<br />
high aspect ratios up to 1:1000 in Fig. 11: All elements<br />
within the steel sheets have aspect ratios higher than<br />
1:400, leading to over 3000 CG iterations on average.<br />
Fig. 11. Aspect ratio <strong>of</strong> step-lap setup (not shown for elements in air).<br />
If we utilize the anisotropic preconditioner<br />
(20) for varying aspect ratio thresholds<br />
ARth = {1000, 500, 100, 50, 10} the iteration numbers<br />
and time for solving the linear equation system drops<br />
significantly (see Fig. 12). The results are compared for<br />
the iterative 1-step solver with B0 = 1.0 T. From an<br />
Fig. 12. Reduction <strong>of</strong> CG iterations (left) and solution time (right)<br />
for varying aspect ratio threshold ARth.<br />
initial CG iteration count <strong>of</strong> about 3000 (ARth = 1000)<br />
we achieve an average reduction to 100-150 iterations<br />
for ARth = 10, corresponding to a factor <strong>of</strong> 20-30,<br />
depending on the polynomial degree.<br />
The effect on the solution time is similar, where<br />
a reduction by a factor <strong>of</strong> 9 (p=3) to 25 (p=1) can<br />
be achieved, making it comparable in runtime to the<br />
direct solver. The increase in memory for storing larger<br />
diagonal blocks is very moderate, being in the range <strong>of</strong><br />
5-15% compared to the non-blocked version.<br />
The rate <strong>of</strong> reduction in iterations is not heavily<br />
depending <strong>of</strong> the polynomial degree, making the preconditioner<br />
a p-robust method for practical applications. For<br />
all the following results, we apply the preconditioner with<br />
a default threshold <strong>of</strong> ARth =10.
C. Application <strong>of</strong> Two-Step Approach<br />
By applying the 2-step strategy <strong>of</strong> Section V, an<br />
additional decrease in runtime can be observed (see Fig.<br />
13). We start here with 2 Newton iterations on the coarse<br />
Fig. 13. Runtime comparison <strong>of</strong> standard 1-step iterative solution<br />
approach and 2-step approach.<br />
space T H with p =0and use it as start value for the<br />
fine space T h . On average, this saves 1 to 2 Newton<br />
iterations on the fine space, resulting in a reduction <strong>of</strong><br />
runtime between 4% and 48%, which is in a similar range<br />
as reported in [8]. However, the effect diminishes with<br />
higher flux values for all polynomial degrees.<br />
D. Anisotropic Polynomial Degree<br />
Finally, we utilize the strategy as explained in Section<br />
III-A by reducing the polynomial degree anisotropically<br />
in thickness direction pζ
- 232 - 15th IGTE Symposium 2012<br />
Parameter Identification <strong>of</strong> a Finite Element<br />
Based Model <strong>of</strong> Wound Rotor Induction Machines<br />
*Martin Mohr, *Oszkár Bíró, *Andrej Stermecki and † Franz Diwoky<br />
* Christian Doppler Laboratory for Multiphysical Simulation, Analysis and Design <strong>of</strong> Electrical Machines at the<br />
Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />
† AVL List GmbH, Hans-List-Platz 1, A-8020 <strong>Graz</strong>, Austria<br />
E-mail: martin.mohr@TU<strong>Graz</strong>.at<br />
Abstract—This paper presents two efficient algorithms for the parameter identification in a finite element based circuit model<br />
<strong>of</strong> a wound rotor induction machine. This approach uses magneto-static finite element method simulations for building lookup<br />
tables and employs quint-cubic splines for the interpolation. A reduction <strong>of</strong> the finite element simulation cost is achieved by<br />
decreasing the number <strong>of</strong> nonlinear iterations using a special simulation order keeping the magneto-motive force in the<br />
machine for several sampling points constant. Furthermore, the quint-cubic spline parameter calculation method has been<br />
revised using a dimensional recursive evaluation approach allowing a fast parameter calculation with lower memory demand<br />
and <strong>of</strong>fering possibility <strong>of</strong> parallelization.<br />
Index Terms— Finite element methods, motor drives, numerical models.<br />
I. INTRODUCTION<br />
Finite element (FE) based models, in particular<br />
physical phase variable (PPV) models, use look-up tables<br />
(LUT) generated by magneto-static finite element method<br />
(FEM) simulations. During the following transient<br />
simulations, only the evaluation <strong>of</strong> these LUTs by an<br />
appropriate interpolation method is needed. Several<br />
applications <strong>of</strong> this approach to electrical machines have<br />
already been published [1]-[8].<br />
For permanent magnet machines, this approach is<br />
straightforward [1]-[5]. Typically three independent state<br />
variables are sufficient for prescribing the state <strong>of</strong> the<br />
machine. Therefore, no more than a few hundred or<br />
thousand FEM simulations are needed and a three<br />
variable interpolation method is sufficient.<br />
In contrast, wound rotor induction machines (WRIM)<br />
necessitate a higher number <strong>of</strong> state variables. This is a<br />
consequence <strong>of</strong> the second coil system on the rotor. The<br />
number <strong>of</strong> simulations Ntot increases exponentially with<br />
the number <strong>of</strong> state variables:<br />
Ntot Ni<br />
(1)<br />
states<br />
where Ni is the number <strong>of</strong> sampling points for the i th state<br />
variable. Furthermore, a higher dimensional interpolation<br />
method is necessary. Nevertheless, several<br />
implementations <strong>of</strong> FE-based WRIM models are found in<br />
the literature [6]-[8].<br />
This work refers to the PPV-model <strong>of</strong> a WRIM with<br />
stranded coils introduced in [8] and can be perceived as<br />
an addendum to it. Since the parameter identification has<br />
not been treated in [8], this paper presents the practical<br />
application <strong>of</strong> this model.<br />
The drawbacks <strong>of</strong> the PPV-WRIM model are the very<br />
high number <strong>of</strong> needed FEM-simulations and the<br />
demanding quint-cubic spline parameter calculation. Both<br />
are caused by the number <strong>of</strong> state variables <strong>of</strong> the model<br />
being five as shown in the next section.<br />
II. INTRODUCTION TO THE PPV-WRIM MODEL<br />
This model uses the electrical rotor position Rot and<br />
S S<br />
two transformed currents for each <strong>of</strong> the stator i, i and<br />
R R<br />
the rotor i , i as state variables. This reduction can be<br />
done under the assumption that the rotor and stator<br />
systems are in Y-connection with isolated star point.<br />
The current state variables used are the phase currents<br />
S S S<br />
R R R<br />
<strong>of</strong> the stator iA, iB, i C and rotor iA, iB, i C transformed<br />
into the rotor related reference system:<br />
S S S R R R<br />
iA iB iC 0, iA iB iC 0,<br />
S 1 0 S<br />
i <br />
<br />
<br />
cosRot sin <br />
Rot 2 i<br />
1 2 A<br />
<br />
<br />
S , S<br />
i<br />
<br />
sinRot cosRot 3 i<br />
(2)<br />
<br />
<br />
<br />
<br />
B <br />
3 3<br />
R 1 0 R<br />
i <br />
<br />
2 i<br />
1 2 A<br />
<br />
<br />
R R .<br />
i<br />
<br />
3 i<br />
<br />
<br />
<br />
B <br />
3 3<br />
The rotor position is derived from the mechanical<br />
model and is an input variable for the electrical machine<br />
model.<br />
The used LUTs <strong>of</strong> this model approach contain the<br />
S S S<br />
phase flux linkages <strong>of</strong> the stator A, B, C<br />
and <strong>of</strong> the<br />
R R R<br />
rotor A, B , C<br />
for the electrical system <strong>of</strong> equations as<br />
well as the machine torque T.<br />
All these quantities are parameterized by the five state<br />
variables:<br />
S S R R<br />
LUT f Rot , i, i, i , i<br />
.<br />
(3)<br />
In contrast to [6], the total machine torque T is also<br />
included in the LUT.<br />
The electrical system <strong>of</strong> equations uses the line to line<br />
quantities defined as<br />
S S S S S S S S S S S S<br />
vAB: vBvA, vBC : vCvB, iAB : iBiA, iBC : iCiB R R R R R R R R R R R R<br />
vAB: vBvA, vBC : vCvB, iAB : iBiA, iBC: iC iB<br />
(4)<br />
S S S<br />
with the phase voltages <strong>of</strong> the stator v , v , v and <strong>of</strong> the<br />
A B C
R R R<br />
rotor vA, vB, v C . This system can be rewritten as<br />
<br />
S S S S S <br />
AB AB AB AB <br />
Rot <br />
AB <br />
<br />
dt <br />
Rot iS iS iR i<br />
R<br />
S<br />
di<br />
<br />
S<br />
S S<br />
S S S S S<br />
<br />
v <br />
AB RCUi AB<br />
BC BC BC BC <br />
<br />
BC<br />
<br />
S <br />
dt<br />
<br />
S S<br />
v <br />
BC RCUi <br />
<br />
BC Rot iS iS iR i<br />
<br />
<br />
<br />
S <br />
R<br />
<br />
di <br />
<br />
R R R R<br />
v <br />
AB RCUi<br />
<br />
R R R R<br />
<br />
<br />
AB <br />
<br />
<br />
(5)<br />
AB AB AB AB AB<br />
R R R <br />
<br />
dt<br />
v BC RCUi R<br />
BC <br />
<br />
Rot<br />
iS iS iR i<br />
<br />
R<br />
di R R R R R <br />
<br />
BC BC BC BC BC dt R<br />
<br />
di<br />
Rot i S i S i R i <br />
<br />
R<br />
dt <br />
S<br />
with the stator phase resistance R CU and the rotor phase<br />
R<br />
resistance R CU .<br />
For the evaluation <strong>of</strong> the function values as well as the<br />
partial derivatives, a quint-cubic spline interpolation has<br />
been suggested in [8]. Furthermore, it has been shown<br />
that this model is equivalent to a FEM model in all but<br />
interpolation errors. However, the parameter<br />
identification process has not been discussed there.<br />
In this work, an improved simulation workflow for the<br />
needed FEM simulations is proposed and presented.<br />
Furthermore, a revised quint-cubic spline parameter<br />
calculation method is introduced in detail.<br />
III. IMPROVED FEM SIMULATION WORKFLOW<br />
The motivation for an improved simulation workflow is<br />
the circumstance that the number <strong>of</strong> magneto-static FEM<br />
simulations needed to achieve an accurate interpolation is<br />
higher than in usual applications. However, there are only<br />
slight changes in the input data <strong>of</strong> these simulations.<br />
Utilizing this circumstance, a reduction <strong>of</strong> the FEM<br />
simulation time can be achieved as shown below.<br />
A. Finite Element WRIM Model for Parameter<br />
Identification<br />
The FEM model <strong>of</strong> the WRIM under investigation is<br />
shown in Fig. 1. This three phase machine has three<br />
magnetic pole pairs. The rotor and stator coil systems are<br />
in Y-connection with isolated star points. Furthermore, all<br />
coils are modeled as stranded coils, skin effect is not<br />
considered. Due to the symmetry <strong>of</strong> the machine, only a<br />
third <strong>of</strong> the geometry is modeled, decreasing the number<br />
<strong>of</strong> finite elements and thus the simulation time. Periodic<br />
boundary conditions are used at the cutting planes. Rotor<br />
and stator are coupled with constraint equations, thus no<br />
conforming mesh is needed, and the rotation <strong>of</strong> the rotor<br />
is taken into account [9].<br />
The number <strong>of</strong> finite elements is 16128 and the<br />
number <strong>of</strong> DOFs is 48 025. A single magneto-static FEM<br />
simulation with ANSYS 12.1 needs approximately 21<br />
seconds on a computer with Intel Core2 Quad CPU<br />
(Q9400) and 8GB RAM. During this simulation, in<br />
average 20 nonlinear iterations are needed. Simulation<br />
time and number <strong>of</strong> iterations depend on the operating<br />
point.<br />
For a test example <strong>of</strong> nine sampling points for each<br />
state current and 15 different rotor positions, 98 415<br />
- 233 - 15th IGTE Symposium 2012<br />
FEM simulations have to be carried out. The overall<br />
calculation time is approximately 574 hours. In order to<br />
reduce the computation time, the approach described<br />
below and called method <strong>of</strong> constant magneto-motive<br />
force (MMF) is proposed.<br />
Figure 1: FEM model <strong>of</strong> wound rotor induction machine with rotor and<br />
stator coil system.<br />
B. Method <strong>of</strong> Constant Magneto-Motive Force<br />
A reduction <strong>of</strong> the FEM simulation time can be<br />
achieved in general by reducing<br />
o the number <strong>of</strong> finite elements,<br />
o the number <strong>of</strong> simulations or<br />
o the number <strong>of</strong> nonlinear iterations.<br />
A reduction <strong>of</strong> the number <strong>of</strong> elements decreases the<br />
model quality and a reduction <strong>of</strong> the total number <strong>of</strong><br />
simulations decreases the quality <strong>of</strong> the interpolation.<br />
However, a reduction <strong>of</strong> the number <strong>of</strong> nonlinear<br />
iterations reduces the simulation time without decreasing<br />
the model quality. Nevertheless, a direct manipulation <strong>of</strong><br />
this quantity is not possible because it depends on the<br />
nonlinear behavior <strong>of</strong> the material used in the model.<br />
However, the material properties <strong>of</strong> a prior simulation can<br />
be used as initial guess for the new simulation.<br />
This can be done for several magneto-static simulations<br />
although they are independent from each other under the<br />
assumption that the saturation state between these<br />
simulations is approximately the same. This can be<br />
accomplished if the magneto-motive force between<br />
these simulations does not change significantly.<br />
If the same reference system is used for the rotor and<br />
stator current transformation, then can be easily<br />
calculated component-by-component using<br />
S R<br />
Θα i i <br />
α α<br />
= =<br />
NS S + NR<br />
R<br />
, (6)<br />
Θβ iβ iβ <br />
with the number <strong>of</strong> windings on the stator side NS and on<br />
the rotor side NR. So, a constant MMF can be reached by
keeping the MMF components and constant as<br />
shown in Fig.2.<br />
R<br />
<br />
S<br />
<br />
<br />
S<br />
<br />
R<br />
<br />
<br />
<br />
Figure 2: Several parameter setups (different stator and rotor currents)<br />
with constant total magneto-motive force.<br />
This leads to a simple algorithm using a special<br />
simulation order for the different parameter setups.<br />
S<br />
i<br />
<br />
S<br />
i<br />
Rot<br />
R<br />
i<br />
R<br />
i<br />
Figure 3: a) Blockdiagram <strong>of</strong> algorithm with five independent loops for<br />
each state variable. b) Blockdiagram <strong>of</strong> constant MMF algorithm<br />
Figure 3a shows the straightforward approach without<br />
any optimization. All loops are independent, the loop<br />
order is arbitrarily and massive parallelization is possible.<br />
Figure 3b shows the improved constant MMF<br />
algorithm. Instead <strong>of</strong> five independent loops for each state<br />
variable, this algorithm consists <strong>of</strong> two coupled loops for<br />
each <strong>of</strong> the orthogonal - and -components and one<br />
independent loop for the rotor position. The outermost<br />
loops define the MMF-vector component-by-component,<br />
the innermost loops change the rotor and stator current<br />
ratio for the relevant components. By this coupling, the<br />
step sizes <strong>of</strong> the state variables are also coupled and<br />
cannot be chosen freely. Nevertheless, a high number <strong>of</strong><br />
simulations with constant MMF can be carried out.<br />
The variation <strong>of</strong> the rotor position is done between<br />
these loops. This is suggested because the saturation state<br />
between adjacent rotor positions does not change<br />
significantly. However, this loop can be easily made the<br />
outermost one, allowing a parallelization for each rotor<br />
position.<br />
<br />
<br />
Rot<br />
S R<br />
Ni Ni S R <br />
S R<br />
Ni Ni S R <br />
- 234 - 15th IGTE Symposium 2012<br />
A direct comparison between the two algorithms in<br />
Fig. 3 is shown in Fig. 4, highlighting the benefits <strong>of</strong> this<br />
simple algorithm.<br />
Figure 4: Comparison <strong>of</strong> the number <strong>of</strong> nonlinear iterations for 100<br />
magneto-static FEM simulations with and without constant MMF.<br />
The number <strong>of</strong> nonlinear iterations has been reduced<br />
approximately by a factor <strong>of</strong> four, the simulation time by<br />
a factor <strong>of</strong> three. This lower improvement for the<br />
simulation time can be explained by computational costs<br />
during the simulation setup and initialization. For the test<br />
example, the total simulation time has decreased to 190h<br />
from 574h, without any parallelization.<br />
IV. QUINT CUBIC SPLINE PARAMETER CALCULATION<br />
As shown in [8], a quint cubic spline interpolation<br />
method is well suited to be applied in the PPV-WRIM<br />
model, because it allows a continuous interpolation <strong>of</strong> the<br />
function value as well as the first partial derivatives <strong>of</strong> the<br />
function defined by the LUT. Nevertheless, the number <strong>of</strong><br />
parameters per segment is 1024 and this results in a high<br />
memory demand for the LUTs holding these spline<br />
parameters.<br />
For the test example, the memory demand per LUT is<br />
approximately 480MB. Thus, approximately 3.3GB are<br />
needed in total for the PPV-WRIM model. However, this<br />
is not a problem for state <strong>of</strong> the art computers and<br />
moreover it is not necessary to load all data into the<br />
RAM. But the calculation <strong>of</strong> the parameters is very<br />
demanding, since their number is as high as 62.9 millions<br />
(Table III).<br />
Nevertheless, under the assumption <strong>of</strong> a regular and<br />
orthogonal grid and with the continuity conditions for<br />
quint-cubic splines utilized, a fragmentation <strong>of</strong> this<br />
system <strong>of</strong> equations is possible, allowing a fast<br />
calculation <strong>of</strong> the spline parameters with less memory<br />
demand and a parallelization capability. In this section,<br />
this method is presented and explained in detail.<br />
A. Quint-cubic spline interpolation<br />
A quint-cubic spline interpolation is a piecewise third<br />
order polynomial interpolation in five dimensions as<br />
shown in (8). C 1 -continuity (continuity <strong>of</strong> the function<br />
value and its first order partial derivatives) at the segment<br />
boundaries can be forced by the right choice <strong>of</strong> continuity<br />
conditions. Within the segments all derivatives are<br />
continues due to the properties <strong>of</strong> polynomials.<br />
Each segment in this sense is a five dimensional (5D)
hyper-cuboid with 32 corners and 10 adjacent segments<br />
sharing a four dimensional (4D) hyper-cuboid. The<br />
number <strong>of</strong> such segments NSeg depends on the number <strong>of</strong><br />
sampling points <strong>of</strong> each coordinate variable x1, x2, x3, x4,<br />
x5 and can be calculated as<br />
NSeg Ni1. (7)<br />
ix , x , x , x , x <br />
1 2 3 4 5<br />
The quint-cubic spline is defined as<br />
x : x1, x2, x3, x4, x5,<br />
3 3 3 3 3<br />
i j k l m<br />
f xaijklmx 1x2x3x 4x5 ,<br />
(8)<br />
i0 j0 k0 l0 m0<br />
with the spline parameters aijklm <strong>of</strong> the segment fulfilling<br />
L U L U L U<br />
x1 x1 x1 , x2 x2 x2 , x3 x3 x3<br />
,<br />
(9)<br />
L U L U<br />
x4 x4 x4 and x5 x5 x5<br />
,<br />
L L L L L<br />
with the lower segment boundaries x1 , x2, x3, x4, x 5 , the<br />
U U U U U<br />
upper segment boundaries x1 , x2 , x3 , x4 , x 5 and the<br />
local coordinates x1, x2, x3, x4, x5<br />
defined by<br />
L L L<br />
x1 x1x1 , x2 x2x2, x3<br />
x3x3, (10)<br />
L L<br />
x4 x4x4 and x5<br />
x5x5. The first order partial derivative with respect to x1 can<br />
be interpolated with<br />
3 3 3 3 3<br />
f i1j k l m<br />
fx1 ia ijklmx1<br />
xxxx 2 3 4 5 (11)<br />
x1 i1 j0 k0 l0 m0<br />
and in a similar manner those with respect to the other<br />
variables.<br />
Linear extrapolation or a periodic behavior at the<br />
domain boundaries can be easily realized using<br />
appropriate additional constraints [11].<br />
B. Continuity conditions for quint-cubic splines<br />
The choice <strong>of</strong> the continuity conditions for the spline<br />
parameter determination is important for the continuity at<br />
the segment boundaries. C 1 -continuity even at the<br />
segment boundaries can be achieved for tri-cubic splines<br />
if the function value f, the first order partial derivatives<br />
fx1, fx2, fx3, and all higher order pure mixed derivatives<br />
fx1x2, fx1x3, fx2x3 and fx1x2x3 are continuous at the corners <strong>of</strong><br />
each cuboid-seqment. This prerequisite is proven in [10]<br />
for tri-cubic splines.<br />
However, this pro<strong>of</strong> can also be used for higher<br />
dimensional splines. This leads to the conclusion that for<br />
C 1 -continuity, the same prerequisites are necessary. In the<br />
case <strong>of</strong> quint-cubic splines, these are the function value,<br />
the five first order partial derivatives and all 26 higher<br />
order pure mixed derivatives. These are in sum 32<br />
equations per corner and in total 1024 constraints for<br />
1024 parameters per segment. It can easily be proven that<br />
these equations are linear independent.<br />
C. Continuity <strong>of</strong> f, fx1, fx2, fx3, fx4 and fx5 on the segment<br />
faces<br />
For the interpolation <strong>of</strong> the partial derivatives is<br />
necessary to show their continuity also on the segment<br />
faces. Therefore, let us assume a regular and orthogonal<br />
grid for the sampling variables and two adjacent segments<br />
- 235 - 15th IGTE Symposium 2012<br />
S 1 and S 2 that share a face with x1=const. Without loss <strong>of</strong><br />
U<br />
generality, the face defined by x 1 is used for S 1 and for<br />
S 2 L<br />
the face defined by x 1 is used. On both faces, the<br />
quint-cubic splines become the quad-cubic splines f S1 and<br />
f S2 below:<br />
3 3<br />
S1U L<br />
i<br />
j k l m<br />
ijklm 1 1 2 3 4 5<br />
jklm , , , 0 i0<br />
S1 quad-cubic spline parameter bjklm<br />
S2 <br />
3<br />
<br />
jklm , , , 0<br />
j k l m <br />
0jklm 2 3 4 5 with:<br />
S2<br />
jklm 0jklm<br />
3 3<br />
S1U L<br />
i1<br />
j k l m<br />
x1 ijklm 1 1 xxxx 2 3 4 5<br />
jklm , , , 0<br />
i1<br />
S 1<br />
quad-cubic spline<br />
parameter b<br />
jklm<br />
S2 x1 <br />
3<br />
<br />
jklm , , , 0<br />
j k l m<br />
<br />
1jklm 2 3 4 5<br />
S2<br />
with: <br />
jklm 1jklm<br />
f a x x x x x x<br />
f a x x x x b a<br />
f i a x x<br />
f a x x x x b a<br />
(12)<br />
For continuity <strong>of</strong> f, fx2, fx3, fx4 and fx5 at this face, it is<br />
sufficient that the quad-cubic splines f S1 and f S2 are equal.<br />
S1<br />
Therefore the all spline parameters b jklm <strong>of</strong> f S1 must be<br />
S 2<br />
equal to the corresponding parameter b jklm <strong>of</strong> f S2 .<br />
This equality can easily be shown, since both <strong>of</strong> these<br />
quad-cubic splines fulfill the same 16 constraints<br />
f, fx2, fx3, fx4, fx5, fx2x3,..., f x2x3x4x5 in all 16 involved<br />
nodes and these 256 equations for 256 unknowns are<br />
linearly independent. Therefore, the solution is unique<br />
and the two splines are the same.<br />
For continuity <strong>of</strong> fx1 at this face, the two additional<br />
S1<br />
S 2<br />
quad-cubic splines f x1<br />
and f x1<br />
representing the partial<br />
derivative with respect to x1 on this face, must be equal.<br />
This can be proven in a similar manner as for the other<br />
constraints fx1, fx1x2, fx1x3, fx1x4, fx1x5,..., f x1x2x3x4x5 per<br />
node.<br />
Furthermore, the same pro<strong>of</strong> can be used for the other<br />
faces. Thus, C 1 -continuity for the quint-cubic spline<br />
interpolation has been shown for a regular and orthogonal<br />
grid.<br />
D. Segmented parameter calculation<br />
The pro<strong>of</strong> <strong>of</strong> continuity shows that f, fx1, fx2, fx3, fx4, fx5,<br />
fx1x2, fx1x3,…, fx2x3x4x5 and fx1x2x3x4x5 must be continuous in<br />
each node. If all <strong>of</strong> those 32 conditions per node can be<br />
determined in advance, it is not necessary to assemble a<br />
single system <strong>of</strong> equations for all segments. Instead, each<br />
segment could be solved independently. This will lead to<br />
NSeg systems <strong>of</strong> equations with 1024 unknowns each and<br />
furthermore allows parallel evaluation.<br />
Under the assumption <strong>of</strong> a regular and orthogonal grid<br />
for the sampling data, only one parameter changes along<br />
each edge <strong>of</strong> the segments. Therefore, the quint-cubic<br />
spline becomes a normal cubic spline<br />
i<br />
f x aˆ x 1<br />
2, x3, x4, x i x<br />
5const<br />
i0<br />
3 3 3 3<br />
j k l m<br />
i <br />
ijklm 2 3 4 5<br />
j0 k0 l0 m0<br />
with aˆ a xxxx,<br />
3<br />
<br />
(13)
as shown for an edge with constant values for x2, x3, x4<br />
and x5. The resulting cubic splines along the straight lines<br />
with constant values for x2, x3, x4 and x5 could be<br />
evaluated independent <strong>of</strong> each other. This can also be<br />
done for all other straight lines where only one parameter<br />
changes. This leads to a number <strong>of</strong> NCSP simple cubic<br />
splines, where<br />
N N with M : x<br />
, x , x , x , x .<br />
(14)<br />
<br />
CSP j<br />
iM jM/ i<br />
1 2 3 4 5<br />
With these cubic splines, all first order partial<br />
derivatives in each node can be evaluated as for x1 below:<br />
3<br />
i 1<br />
f x i aˆx <br />
. (15)<br />
<br />
x1 1 i 1<br />
i1<br />
These derivatives can be used in a similar manner for<br />
determining all higher order mixed derivatives.<br />
This approach needs a high number <strong>of</strong> cubic spline<br />
determinations, but all these systems <strong>of</strong> equations have<br />
only three unknowns per segment and thus in total<br />
3Nx1 1<br />
unknowns for the cubic splines along the x1direction,<br />
for example. Furthermore, the system matrices<br />
for all splines in one direction are equal. This leads to a<br />
single system <strong>of</strong> equations with many right hand sides that<br />
can be solved very effectively.<br />
E. Dimensional recursive approach<br />
Obviously, not all 1024 parameters per segment are<br />
unknown. For example the constant parameter a00000 <strong>of</strong><br />
each segment is equal to the function value in the segment<br />
L L L L L<br />
reference node x1 , x2, x3, x4, x 5 where all local<br />
coordinates are zero<br />
L L L L L<br />
a00000 f x1, x2, x3, x4, x5<br />
(16)<br />
and all derivatives in the reference node can be directly<br />
identified as parameters <strong>of</strong> the quint-cubic spline<br />
L L L L L<br />
a10000 fx1x1, x2, x3, x4, x5,<br />
L L L L L<br />
a01000 fx2x1, x2, x3, x4, x5,<br />
(17)<br />
Nevertheless, most <strong>of</strong> the parameters per segment must<br />
still be calculated. However, the construction <strong>of</strong> the quintcubic<br />
spline by using a reference node can be easily<br />
combined with the continuity conditions. This leads to a<br />
dimensional recursive approach allowing a well<br />
structured determination <strong>of</strong> all parameters.<br />
There are five faces <strong>of</strong> the segment including the<br />
reference node. For each <strong>of</strong> these faces, one coordinate is<br />
constant zero, resulting in a quad-cubic spline, as shown<br />
for x1=0:<br />
3 3 3 3<br />
j k l m<br />
f 0, x2x3x4x5a 0 jklm x2x3x4x5 . (18)<br />
j0 k0 l0 m0<br />
All parameters <strong>of</strong> this quad-cubic spline are also<br />
parameters <strong>of</strong> the quint-cubic spline as already indicated<br />
by the parameter indices in (18). Furthermore, the first<br />
order partial derivative with respect to x1 for the face<br />
x1=0 leads to a quad-cubic spline<br />
3 3 3 3<br />
j k l m<br />
fx10, x 2x3x4x5a1jklmx 2x3x4x5 . (19)<br />
j0 k0 l0 m0<br />
- 236 - 15th IGTE Symposium 2012<br />
All <strong>of</strong> these parameters can be also identified as quintcubic<br />
spline parameters. Doing this also for the other four<br />
coordinates, leads to ten quad-cubic splines. Each <strong>of</strong> them<br />
is defined by the continuity conditions <strong>of</strong> the<br />
corresponding nodes. Finally, 992 quint-cubic parameters<br />
per segment can be determined by this method, only the<br />
32 parameters aijklm with i, j, k, l, m 2,3 have to be<br />
solved for separately. This is due to the segment diagonal<br />
U U U U U<br />
node x1 , x2 , x3 , x4 , x 5 ,<br />
since this node is not one <strong>of</strong><br />
the determined quad-cubic splines.<br />
For a quad-cubic spline, this approach can be used in<br />
the same manner, leading to eight tri-cubic splines and an<br />
additional system <strong>of</strong> equations <strong>of</strong> 16 unknowns for the<br />
diagonal node per segment. The further dimensional<br />
recursion is shown in Fig. 5.<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
Figure 5: Dimensional recursive approach for quint-cubic spline<br />
parameter determination.<br />
It is pointed out that the function calls shown in Fig. 5<br />
are multi-function calls. This means that each call has to<br />
be multiplied with the number <strong>of</strong> sampling points for the<br />
corresponding dimension to get the real number <strong>of</strong> single<br />
function calls. As an example, the number <strong>of</strong> single quadcubic<br />
spline function calls NQ4 can be calculated as<br />
NQ4 2Nx1Nx2 Nx3 Nx4 Nx5,<br />
(20)<br />
i.e. two multi-function calls per dimension.<br />
Furthermore, there is also some redundancy within this<br />
recursion. It is not necessary to call 80 multi-function<br />
calls for tri-cubic splines as Fig. 5 hypothesizes. Some <strong>of</strong><br />
them are equal for different quad-cubic splines. This can<br />
easily be shown by the fact that each quad-cubic spline is<br />
defined by 256 parameters and each <strong>of</strong> them is also a<br />
parameter <strong>of</strong> the quint-cubic spline. However, the ten<br />
quad-cubic splines define only 992 quint-cubic<br />
parameters.<br />
TABLE I<br />
RECURSIVE MULTI-FUNCTION CALLS<br />
QuintQuadTriBi- Cubic<br />
cubiccubiccubiccubic 1 10 40 80 80<br />
Table 1 shows the actual number <strong>of</strong> multi-function
calls during this recursive approach. Table 2 shows an<br />
overview for different sampling rate setups and Table 3<br />
shows the corresponding calculation time and memory<br />
demand for an implementation in Fortran95. These<br />
calculations were carried out without parallelization on a<br />
computer with Intel Core2 Duo CPU (E8400) and 8GB<br />
RAM.<br />
TABLE II<br />
SETUP OVERVIEW FOR VARIOUS SAMPLING RATES<br />
Setup Nx1 Nx2 Nx3 Nx4 Nx5 NSample NSeg<br />
1 6 6 6 6 6 7776 3125<br />
2 8 8 8 8 8 32768 16807<br />
3 11 11 11 11 11 161051 100000<br />
4 12 12 12 12 12 248832 161051<br />
5 16 8 8 8 8 65536 36015<br />
6 16 9 9 9 9 104976 61440<br />
TABLE III<br />
SIMULATION TIME AND MEMORY DEMAND<br />
Setup Number <strong>of</strong> Calculation Memory<br />
Parameters time demand<br />
in Mio [s] [MByte]<br />
1 3.20 3,04 60,8<br />
2 17.21 15,61 131,3<br />
3 102.40 83,80 781,3<br />
4 1649.16 134,53 1258<br />
5 36.88 33,76 281,4<br />
6 62.91 54,39 480,0<br />
These setups illustrate the scaling <strong>of</strong> this approach. A<br />
comparison shows that the calculation time increases<br />
slightly slower than the number <strong>of</strong> segments does.<br />
The last setup corresponds to the test example <strong>of</strong><br />
section III-A. For this example, Nx1=16, although there<br />
are 15 different rotor positions. This can be explained by<br />
the periodicity <strong>of</strong> the rotor position, i.e. the first sampling<br />
point is identical with the last one.<br />
V. CONCLUSION<br />
The method <strong>of</strong> constant magneto-motive force<br />
presented in this paper reduces the number <strong>of</strong> nonlinear<br />
iterations and thus the over-all simulation time for the<br />
FEM simulations. This is achieved by just changing the<br />
order <strong>of</strong> parameter setups and thus makes this method<br />
simply usable in every commercial FEM simulation tool.<br />
The revised quint-cubic spline parameter calculation<br />
method introduced in this work allows a very fast and<br />
memory saving pre-calculation <strong>of</strong> the needed spline<br />
- 237 - 15th IGTE Symposium 2012<br />
parameters. This was achieved by a dimensional recursive<br />
approach that leads to a segmentation <strong>of</strong> the whole system<br />
<strong>of</strong> equations. The parameters can also be evaluated in<br />
parallel.<br />
These two improvements make the PPV-WRIM model<br />
approach introduced in [8] applicable to simulation tasks<br />
during the design stage <strong>of</strong> electrical drive chains.<br />
VI. ACKNOWLEDGEMENT<br />
This work has been supported by the Christian Doppler<br />
Research Association (CDG) and by the industrial partner<br />
AVL List GmbH.<br />
[1]<br />
REFERENCES<br />
Mohammed, O.A.; Liu, S.; Liu, Z.; , "Physical modeling <strong>of</strong><br />
electric machines for motor drive system simulation," Power<br />
Systems Conference and Exposition, 2004. IEEE PES , vol., no.,<br />
pp. 781-786 vol.2, 10-13 Oct. 2004<br />
[2] Liu, Z.; Mohammed, O.A.; Liu, S.; , "An improved physics-based<br />
phase variable model <strong>of</strong> PM synchronous machines obtained<br />
through field computation," Computation in Electromagnetics,<br />
2008. CEM 2008. 2008 IET 7th International Conference on ,<br />
vol., no., pp.166-167, 7-10 April 2008<br />
[3] Kallio, S.; Karttunen, J.; Andriollo, M.; Peltoniemi, P.;<br />
Silventoinen, P.; , "Finite element based phase-variable model in<br />
the analysis <strong>of</strong> double-star permanent magnet synchronous<br />
machines," Power Electronics, Electrical Drives, Automation and<br />
Motion (SPEEDAM), 2012 International Symposium on , vol.,<br />
no., pp.1462-1467, 20-22 June 2012 doi:<br />
[4]<br />
10.1109/SPEEDAM.2012.6264434<br />
Mohammed, O.A.; Liu, S.; Liu, Z.; Khan, A.A.; , "Improved<br />
physics-based permanent magnet synchronous machine model<br />
obtained from field computation," Electric Machines and Drives<br />
Conference, 2009. IEMDC '09. IEEE International , vol., no.,<br />
pp.1088-1093, 3-6 May 2009, doi:<br />
[5]<br />
10.1109/IEMDC.2009.5075339<br />
Mohr, M.; Bíró, O.; Stermecki, A.; Diwoky, F.; ,”An Improved<br />
Physical Phase Variable Model for Permanent Magnet Machines”,<br />
submitted and accepted at ICEM, Marseille, France, 2012<br />
[6] Sarikhani, A.; Mohammed, O.A.; , "Development <strong>of</strong> transient FEphysics-based<br />
model <strong>of</strong> induction for real time integrated drive<br />
simulations," Electric Machines & Drives Conference (IEMDC),<br />
2011 IEEE International , vol., no., pp.687-692, 15-18 May 2011<br />
[7] Sarikhani, A.; Mohammed, O. A.; , "Non-linear FE-based<br />
modeling <strong>of</strong> induction machine for integrataed drives,"<br />
[8]<br />
Computation in Electromagnetics (CEM 2011), IET 8th<br />
International Conference on , vol., no., pp.1-2, 11-14 April 2011<br />
Mohr, M.; Bíró, O.; Stermecki, A.; Diwoky, F.; ,” An Improved<br />
Physical Phase Variable Model for Wound Rotor Induction<br />
Machines”, submitted and accepted at CEFC, Oita, Japan, 2012<br />
[9] ANSYS® Academic Research, Release 12.1, Help System, “Low-<br />
Frequency Electromagnetic Analysis Guide”, ANSYS, Inc.<br />
[10] Lekien, F.; Marsden, J.; , “Tricubic interpolation in three<br />
dimensions,” Internat. J. Numer. Methods Engrg., 63 3 (2005),<br />
pp.455-471<br />
[11] Boor, C.D.; , “A practical guide to splines,” New York: Springer-<br />
Verlag, 1978, p39f., ISBN: 9780387903569
- 238 - 15th IGTE Symposium 2012<br />
Post Insulator Optimization Based on Dynamic<br />
Population Size<br />
Peter Kitak, Arnel Glotic, Igor Ticar<br />
<strong>University</strong> <strong>of</strong> Maribor, Faculty <strong>of</strong> Electrical Engineering and Computer Science, Smetanova 17, SI-2000 Maribor,<br />
Slovenia<br />
E-mail: peter.kitak@uni-mb.si<br />
Abstract—This paper suggests the use <strong>of</strong> dynamic population size throughout the optimization process which is applied on the<br />
numerical model <strong>of</strong> a medium voltage post insulator. The main objective <strong>of</strong> the dynamic population is reducing population<br />
size, to achieve faster convergence. Change <strong>of</strong> population size can be done in any iteration by proposed method. The multiobjective<br />
optimization process is based on the PSO algorithm, which is suitably modified in order to operate with the principle<br />
<strong>of</strong> the optimal Pareto front.<br />
Index Terms—Dynamic population size, Insulation elements, Multi-objective optimization, particle swarm optimization.<br />
reductions in the population size are presented in fifth<br />
section. The sixth section is a conclusion with<br />
fundamental summarizing thoughts.<br />
I. INTRODUCTION<br />
Principal function <strong>of</strong> medium voltage insulator is<br />
electrical insulation <strong>of</strong> the conductive parts from the<br />
earthed parts <strong>of</strong> the device, and the mechanical fixing <strong>of</strong><br />
equipment and conductors which have different electrical<br />
potential. This element <strong>of</strong>ten contains built-in capacitive<br />
voltage divider and thus is capable <strong>of</strong> performing voltage<br />
indication function.<br />
Particle swarm optimization method [1] is a very<br />
efficient algorithm and it is applied onto many<br />
engineering problems. In comparison to the original<br />
version, many modifications have been made to the<br />
algorithm, which has gained many improvements [2], [3].<br />
Also, the PSO algorithm is extended into a multiobjective<br />
particle swarm MOPSO [4], [5], on the basis <strong>of</strong> a nondominant<br />
solution sorting (Pareto concept) [6]. Although,<br />
the PSO algorithm does not contain genetic operators in<br />
its fundamentals, it has been proven that the introduction<br />
<strong>of</strong> the mutation is very useful [7]. The presented method<br />
enables the enhancement <strong>of</strong> the solution space. The more<br />
recent modifications <strong>of</strong> the algorithm introduce the<br />
dynamic population size throughout the optimization<br />
process [8]. A variable [9], [10] or fixed [11] change <strong>of</strong><br />
the population size, during the evolution, is possible. The<br />
method that includes variable changes <strong>of</strong> population size<br />
enables the change in any iteration. The method with<br />
fixed change <strong>of</strong> population is determined with a<br />
predefined step <strong>of</strong> iteration.<br />
Reduction <strong>of</strong> the population is desired, when the lasting<br />
time <strong>of</strong> the optimization needs to be shortened. However,<br />
the efficiency and robustness <strong>of</strong> the modified algorithm<br />
must not change.<br />
The remainder <strong>of</strong> this paper is organized as follows.<br />
Section two describes numerical model <strong>of</strong> medium<br />
voltage post insulator and also a multi-objective<br />
optimization model. Section three presents a classical<br />
Particle Swarm Optimization (PSO) algorithm and also an<br />
updated version <strong>of</strong> such algorithm with the Cauchy<br />
mutation operator. Section four describes a dynamic<br />
population PSO algorithm, where two procedures for<br />
population reduction are presented. Results <strong>of</strong> the<br />
classical optimization PSO algorithm and for all dynamic<br />
II. NUMERICAL MODEL AND OPTIMUM<br />
SIGNIFICATION<br />
Post insulator is used as a voltage indicator in medium<br />
voltage switchgear. Post insulators are installed in a<br />
switchgear device or in any other input element, where<br />
voltage is present. Fig. 1 shows parametrically written 3D<br />
numerical model <strong>of</strong> a post insulator. The metal fitting <strong>of</strong><br />
the insulator (upper connection) for fastening to the<br />
conductive part <strong>of</strong> the upper side, is elongated with a<br />
special electrode <strong>of</strong> the divider, which has the same<br />
potential as the conductive part. The metal fitting for<br />
insulator fastening to the earthed part (lower connection)<br />
is situated at the bottom <strong>of</strong> the insulator. An electrically<br />
separated cylindrical metal mesh is mounted around this<br />
metal connection, which is the other divider’s electrode.<br />
Modeling <strong>of</strong> the mentioned switchgear element<br />
demands design regarding the exact value <strong>of</strong> capacitance,<br />
which is calculated from electric energy. Modeling also<br />
requires the lowest possible magnitude <strong>of</strong> electric field<br />
inside insulation material, because increased values <strong>of</strong><br />
electric field lessen the life-time <strong>of</strong> these switchgear<br />
elements. Optimization provides electric field strength<br />
reduction on electrodes’ edges and on other critical points<br />
on its crossing between insulation and air. Switchgear<br />
stable performance requires that switchgear elements<br />
must be constructed to endure the highest voltages, such<br />
as lightning strike (125 kV).<br />
Requirements described above, which have an opposite<br />
tendency, are presented with two objective functions (fC,<br />
fE). Every objective function presents an individual<br />
electric characteristic with different quantities, therefore<br />
objective functions must be written relatively. Both<br />
objective functions are written with bell shaped fuzzy sets<br />
with the maximal value <strong>of</strong> 1. Function fE is presented in<br />
the Fig. 2a, whereas function fC is in Fig. 2b. The<br />
numerical calculation is based on FEM analysis, with<br />
solver <strong>of</strong> the EleFAnT program package [12].
upper electrode<br />
epoxy insulation<br />
cylindrical metal mesh<br />
lower electrode<br />
Figure 1: Post insulator illustrative model with<br />
optimization parameters.<br />
The entire model is parametrically written and<br />
described with eight parameters (Fig. 1). It is necessary to<br />
perform FEM calculation for each evaluation <strong>of</strong> the<br />
objective functions.<br />
f E<br />
a)<br />
f c<br />
1,2<br />
1<br />
0,8<br />
0,6<br />
0,4<br />
0,2<br />
0<br />
1 2 3 4 5<br />
1,2<br />
1<br />
0,8<br />
0,6<br />
0,4<br />
0,2<br />
p7<br />
p5<br />
p4<br />
p2<br />
E (MV/m)<br />
p8<br />
p6<br />
p3<br />
p1<br />
0<br />
54 59 64 69 74 79 84 89 94<br />
b)<br />
C (μF)<br />
Figure 2: Determination <strong>of</strong> objective function: a) fE, b) fC.<br />
PSO algorithm [1] has been used as a multi-objective<br />
optimization algorithm. Among other methods, the<br />
weighted sum method is used to solve the multiobjective<br />
optimization problem. Equation (1) describes how the<br />
functions fE and fC are merged into a unified objective<br />
function f<br />
f wEfE wCfC (1)<br />
where wE and wC are the weights <strong>of</strong> the individual<br />
quantities.<br />
The weighted sum method requires special attention<br />
when selecting the objective functions weights, which<br />
enable transformation to optimization with a composed<br />
single objective function. This method also requires a<br />
detailed knowledge <strong>of</strong> the applicative problem. Knowing<br />
the presented problem, according to (1), the authors have<br />
selected the following weights: wE = 0.6 and wC = 0.4.<br />
- 239 - 15th IGTE Symposium 2012<br />
III. PSO ALGORITHM AND CAUCHY MUTATION<br />
The PSO algorithm is placed in a population-based<br />
stochastic search technique, that imitates social behavior<br />
<strong>of</strong> the birds while they fly and does not contain genetic<br />
operators. Instead <strong>of</strong> the genetic operators, the population<br />
members are exposed to the cooperation between each<br />
other, at the same time, they compete with each other<br />
throughout generations.<br />
Each and every particle adjusts its flying ability to the<br />
leading particle - the best individual. Each particle <strong>of</strong> the<br />
population, which represents a possible solution to the<br />
problem, is treated as a point in the D-dimensional space.<br />
The i th particle is presented as xi ( xi1, xi2,..., xiD)<br />
. The<br />
best former position (position, which gives the best result<br />
in the previous iteration) <strong>of</strong> the each and every particle is<br />
stored and presented as pi ( pi1, pi2,..., piD)<br />
. The velocity<br />
<strong>of</strong> the i th particle is presented as vi ( vi1, vi2,...., viD)<br />
.<br />
The velocity changes vi and the new position xi <strong>of</strong> the<br />
i th particle changes in accordance with the (2) and (3):<br />
vi( t1) wvi() t c1rand() ( pi() t xi()) t <br />
(2)<br />
c2Rand() ( pg( t) xi(<br />
t))<br />
x( t1) x( t) v( t<br />
1)<br />
(3)<br />
i i i<br />
where t indicates the iteration, c1 and c2 are positive<br />
constants, rand() and Rand() are random functions <strong>of</strong> the<br />
dimension [0,1]. Index g represents the position <strong>of</strong> the<br />
best particle among other particles from the optimization<br />
process. Equation (2) is used for calculation <strong>of</strong> the new<br />
particle velocity on the basis <strong>of</strong> the previous particle<br />
velocity and the distance between its instantaneous<br />
distance and distance <strong>of</strong> the leading particle. Equation (3)<br />
represents a flight <strong>of</strong> the particle towards a new position.<br />
When the new population is entirely formed, the<br />
algorithm is being carried out until the interruption<br />
criterion is reached. Two approaches are used in this<br />
paper: a classical approach with the static population<br />
(number <strong>of</strong> the population members is always the same)<br />
and a dynamic approach, where the population changes<br />
the number <strong>of</strong> members throughout the optimization<br />
process. The quality <strong>of</strong> each particle is evaluated on the<br />
basis <strong>of</strong> the defined objective function.<br />
With the intention to prevent too fast convergence and<br />
consequentially to trap into a local minimum, the classical<br />
PSO algorithm has been upgraded with the Cauchy<br />
mutation [13]. Cauchy mutation operator that is used in<br />
the PSO is determined with a weighted vector.<br />
1 NP<br />
W v , (4)<br />
i ji<br />
NP j1<br />
where vji is velocity <strong>of</strong> vector j th particle in the<br />
population. The best particle is mutated from (2)<br />
according to the following equation:
p () i p () i W N( X , X ) , (5)<br />
'<br />
g g i<br />
min max<br />
where N is Cauchy distribution function over the<br />
interval (Xmin, Xmax).<br />
IV. DYNAMIC POPULATION SIZE IN THE PSO<br />
ALGORITHM<br />
In this paper the optimization procedure considers two<br />
approaches for the dynamic population size, employed in<br />
the multiobjective optimization problem. The first<br />
approach is based on the gradual reduction <strong>of</strong> the<br />
population size by half (subsection 4A). In the second<br />
approach, a dynamic reduction <strong>of</strong> population size is<br />
proposed, which is described in subsection 4B.<br />
A. Gradual reduction <strong>of</strong> the population size by half<br />
The original idea is presented by Brest [11] and<br />
proposes gradual reduction <strong>of</strong> the population size by half<br />
in each block <strong>of</strong> a predefined iteration number. This<br />
means that the reduction is not applied throughout all<br />
iterations. Fig. 3 shows the example where the population<br />
reduction has been carried out four times and the<br />
coefficient that defines the reduction is pmax = 4. In each<br />
reduction step, the population is reduced by a half in<br />
comparison to its former size.<br />
p=1<br />
p=2<br />
p=3 NP/4<br />
p=4 NP/8<br />
NP/2<br />
NP<br />
Figure 3: Schematic presentation <strong>of</strong> the population<br />
reduction.<br />
The stopping criterion for the optimization process is a<br />
predefined number <strong>of</strong> function evaluations maxnfeval. In<br />
relation to the population size reduction, there are two<br />
possibilities to determine the size <strong>of</strong> iteration blocks iterp.<br />
First possibility is an equal number <strong>of</strong> function<br />
evaluations throughout each single iteration block.<br />
Therefore the number <strong>of</strong> iterations is defined as:<br />
maxnfeval<br />
iterp<br />
<br />
pmax NP<br />
p<br />
The second possibility <strong>of</strong>fers a constant number <strong>of</strong><br />
iterations iterp, therefore the number <strong>of</strong> function<br />
evaluations for each reduction block is:<br />
(6)<br />
nfeval NPp iterp<br />
(7)<br />
For easier explanation, the Table I shows values for<br />
number <strong>of</strong> objective function evaluations (NP times iterp).<br />
These values are valid for individual population size<br />
blocks with a constant number <strong>of</strong> iterations iterp = 10 and<br />
the number <strong>of</strong> population reduction pmax = 4.<br />
- 240 - 15th IGTE Symposium 2012<br />
TABLE I<br />
RUN DATA, MAXNFEVAL=1200, PMAX=4<br />
p 1 2 3 4<br />
NP 56 28 14 7<br />
iterp 10 10 10 10<br />
NP x iterp 560 280 140 70<br />
Selection procedure is based on the idea from the<br />
selection mechanism in DE optimization algorithm [11].<br />
Individual from the first half <strong>of</strong> the population xi(t) and<br />
the suitable individual from the second half xNP/2+i(t) are<br />
compared based on the corresponding objective function<br />
values. Afterwards, the individual with a better objective<br />
function value takes the position i and therefore becomes<br />
the member <strong>of</strong> the new and reduced population. After the<br />
last step <strong>of</strong> selection is performed, the new and reduced<br />
population is obtained<br />
xNP/2<br />
i( t) if f( xNP/2i( t)) f( xi( t))<br />
xi( t1)<br />
<br />
xi(<br />
t)<br />
other<br />
B. Dynamic reduction <strong>of</strong> population size in individual<br />
iteration<br />
Alteration <strong>of</strong> population size through an optimization<br />
process is realized based on objective function<br />
evaluations, respectively according to (9):<br />
avr i best<br />
NP() t NPmax<br />
f( xmax<br />
)<br />
(8)<br />
f ( x ) f( x )<br />
, (9)<br />
where the favr(xi) is average objective function value <strong>of</strong><br />
the observed population. f(xbest) and f(xmax) are the<br />
objective function values <strong>of</strong> the best and worst particle<br />
ever found up to the observed iteration. NPmax is initial<br />
(max) population size.<br />
As the optimization algorithm approaches to optimal<br />
solution, the value <strong>of</strong> the objective function alters.<br />
Generally, the population’s average value <strong>of</strong> the objective<br />
function value is getting smaller along with iteration<br />
number. However, this does not hold true for all<br />
iterations, because the particles move also through the<br />
non-promising areas <strong>of</strong> the search space. Because <strong>of</strong> this,<br />
the population size is, according to (9), generally<br />
reducing its size; however there are also iterations, where<br />
the population size has been extended. Each population<br />
extension in individual iterations appear usually when the<br />
average objective function value is increased. The<br />
missing particles are obtained from the set <strong>of</strong> particles<br />
that have been discarded on account <strong>of</strong> population<br />
reduction in previous iterations. This improves the<br />
algorithms reliability.<br />
V. RESULTS<br />
Optimization processes with the PSO algorithm are<br />
performed under the following settings: c1=0.5, c2=1.2,<br />
w=0.8. Maximal number <strong>of</strong> iterations for all calculations<br />
is maxiter = 40.
The optimization results are shown in Table II, where<br />
first two examples are showing results obtained by<br />
standard PSO algorithm and different size <strong>of</strong> population,<br />
third one showing results <strong>of</strong> standard PSO algorithm<br />
upgraded with Cauchy mutation and last two are showing<br />
results obtained by using dynamical population size in<br />
standard PSO algorithm. This paper presented two<br />
different concepts <strong>of</strong> dynamical population size, gradual<br />
reduction and dynamic reduction proposed in section IV.<br />
TABLE II<br />
OPTIMIZATION RESULTS OBTAINED WITH PSO ALGORITHM<br />
description min f NP maxnfeval<br />
standard PSO 0.345 30 1200<br />
standard PSO 0.328 56 2240<br />
standard PSO + mutation 0. 326 56 2240<br />
dyn. populated PSO<br />
0. 326 56/28/14/7 1050<br />
(gradual reduction)<br />
dyn. populated PSO<br />
(proposed reduction)<br />
0. 326<br />
max 56<br />
min 10<br />
829<br />
Optimization process convergences for all mentioned<br />
examples in Table II are shown in Fig. 4.<br />
Objective function value<br />
0,6<br />
0,55<br />
0,5<br />
0,45<br />
0,4<br />
0,35<br />
standard PSO (NP=56)<br />
standard PSO (NP=30)<br />
standard PSO + mutation<br />
dyn. populated PSO (gradual reduction)<br />
dyn. populated PSO (proposed reduction)<br />
0,3<br />
0 10 20<br />
Iteration<br />
30 40<br />
Figure 4: Objective function values <strong>of</strong> PSO algorithm<br />
during the optimization process<br />
Algorithm with using small population size has not<br />
reached global solution, because <strong>of</strong> stuck in local<br />
optimum. Global solution can be reached by increasing<br />
population size which leads increased number <strong>of</strong> function<br />
evaluations and longer computation time. By using<br />
proposed dynamical population size algorithm achieved<br />
global minimum with decreased number <strong>of</strong> function<br />
evaluation and computation time.<br />
Size <strong>of</strong> population<br />
60<br />
50<br />
40<br />
30<br />
20<br />
10<br />
Standard procedure with a static population size<br />
Dynamically populated PSO<br />
(proposed reduction)<br />
Dynamically populated PSO<br />
(gradual reduction)<br />
0<br />
0 10 20<br />
Iteration<br />
30 40<br />
Figure 5: Changing <strong>of</strong> population size during the<br />
optimization process.<br />
Changing population size along the optimization process<br />
is shown on Fig. 5 – for the gradual reduction, proposed<br />
reduction and fixed population size (static size).<br />
- 241 - 15th IGTE Symposium 2012<br />
VI. CONCLUSION<br />
Results show comparison <strong>of</strong> the optimization process for<br />
different population size reduction methods. Reduction <strong>of</strong><br />
the population is desirable, when computation time<br />
should be decreased and although efficiency and<br />
robustness <strong>of</strong> algorithm should not be changed.<br />
The important impact <strong>of</strong> proposed PSO algorithm with<br />
using dynamical population size can be seen in decreased<br />
number <strong>of</strong> function evaluation.<br />
In each iteration is tendency to decrease the population<br />
size. However, the population number can be also<br />
increased by adding the new members, which refreshes<br />
the population. Therefore, the algorithm’s ability to<br />
search the minima is increased. It is important, already at<br />
the beginning, to select the appropriate, respectively<br />
enough large population. Therefore, the global search <strong>of</strong><br />
environment is enabled. Smaller population size is<br />
sufficient just for local search solutions.<br />
[1]<br />
REFERENCES<br />
J. Kennedy and R. C. Eberhart, “Particle swarm optimization,”<br />
Proc. IEEE International Conference on Neural Networks, Vol.<br />
IV, Piscataway, NJ, pp. 1942-1948, 1995.<br />
[2] S.L. Ho, Y. Shiyou, Ni Guangzheng and H.C. Wong, “A particle<br />
swarm optimization method with enhanced global search ability<br />
for design optimizations <strong>of</strong> electromagnetic devices,” IEEE Trans.<br />
on Magn., vol. 42, no.4, pp. 1107-1110, 2006.<br />
[3] G. Toscano Pulido and C.A. Coello Coello, “Using clustering<br />
techniques to improve the performance <strong>of</strong> a particle swarm<br />
optimizer,” <strong>Proceedings</strong> <strong>of</strong> Genetic and Evolutionary<br />
[4]<br />
Computation Conference, Seattle, WA, pp. 225-237, 2004.<br />
L. dos Santos Coelho, H.V.H. Ayala and P. Alotto, “A<br />
Multiobjective Gaussian Particle Swarm Approach Applied to<br />
Electromagnetic Optimization,” IEEE Trans. on Magn., vol. 46,<br />
no.8, pp. 3289-3292, 2010.<br />
[5] L. dos Santos Coelho, L.Z. Barbosa and L. Lebensztajn,<br />
“Multiobjective Particle Swarm Approach for the Design <strong>of</strong> a<br />
Brushless DC Wheel Motor,” IEEE Trans. on Magn., vol. 46,<br />
no.8, pp. 2994-2997, 2010.<br />
[6] U. Baumgartner, C. Magele and W. Renhart, “Pareto optimality<br />
and particle swarm optimization,” IEEE Trans. on Magn., vol. 40,<br />
no.2, pp. 1172-1175, 2004.<br />
[7] L. Jize, S. Ping and L. Kejie, “A Modified Particle Swarm<br />
Optimization with Adaptive Selection Operator and Mutation<br />
Operator,” International Conference on Computer Science and<br />
S<strong>of</strong>tware Engineering CSSE 2008, Vol. 1, Wuhan, China, pp.<br />
1199-1202, 2008.<br />
[8] W. F. Leong and G. G. Yen "PSO-based multiobjective<br />
optimization with dynamic population size and adaptive local<br />
archives", IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38,<br />
no. 5, pp.1270 - 1293 , 2008.<br />
[9] M. Greeff and AP. Engelbrecht, “Dynamic multi-objective<br />
optimisation using PSO,” Studies in Computational Intelligence,<br />
(Series Ed: Kacprzyk, Janusz), pp. 105-123, 2010.<br />
[10] G. G. Yen and W. F. Leong "Dynamic multiple swarms in<br />
multiobjective particle swarm optimization", IEEE Trans. Syst.,<br />
Man, Cybern. A, Syst. Humans, vol. 39, p.890 - 911, 2009.<br />
[11] J. Brest and M. Sepesy Maucec, “Population Size Reduction for<br />
the Differential Evolution Algorithm,” Applied Intelligence, 29(3)<br />
pp. 228–247, 2008.<br />
[12] Program tools ELEFANT. <strong>Graz</strong>, Austria: Inst. Fundam. Theory<br />
Elect. Eng., Univ. Technol. <strong>Graz</strong>, 2000.<br />
[13] H. Wang, Y. Liu, S. Zeng, H. Li, and C. Li, "Opposition-based<br />
Particle Swarm Algorithm with Cauchy Mutation", Proc. <strong>of</strong> the<br />
2007 IEEE Congress on Evolutionary Compulation, 2007, pp.<br />
4750-4756.
- 242 - 15th IGTE Symposium 2012<br />
Simulation <strong>of</strong> the Absorbing Clamp Method for<br />
Optimizing the Shielding <strong>of</strong> Power Cables<br />
Szabolcs Gyimóthy∗ ,József Pávó∗ ,Péter Kis † , Tomoaki Toratani ‡ , Ryuichi Katsumi ‡ and Gábor Varga∗ ∗Budapest <strong>University</strong> <strong>of</strong> <strong>Technology</strong> and Economics, Egry J. u. 18, H-1111 Budapest, Hungary<br />
† Furukawa Electric Institute <strong>of</strong> <strong>Technology</strong>, Vasgolyó u. 2-4, H-1158 Budapest, Hungary<br />
‡ Furukawa Electric R&D Center for Automotive Systems & Devices, Hiratsuka, Japan<br />
E-mail: gyimothy@evt.bme.hu<br />
Abstract—An efficient numerical simulation tool based on FEM is proposed, by which the EMC shielding effect<br />
characteristics <strong>of</strong> power cables can be predicted in the 30–1000 MHz frequency range, as if it would be measured by the<br />
Absorbing Clamp Method. The proposed simulation method is based on decomposition: a 2D axisymmetric RF FE model<br />
is used for describing the whole measurement setup, while a 3D quasistatic FE model is used for the symmetry cell <strong>of</strong> the<br />
shielding layer in order to capture the effect <strong>of</strong> its fine geometric details. The two models are coupled via the concept <strong>of</strong><br />
the equivalent shielding layer obtained by homogenization. Comparison with real measurements show that the shielding<br />
characteristics can be reliably predicted this way, with some deviation in the low end <strong>of</strong> the frequency range though. This<br />
simulation tool can be applied in the design and optimization <strong>of</strong> braided cable shields to be used in automotive industry.<br />
Index Terms—EMC testing, cable shielding, homogenization, automotive industry<br />
I. INTRODUCTION<br />
The effective reduction <strong>of</strong> emitted radio frequency (RF)<br />
disturbances in electric vehicles –generated mainly by<br />
power semiconductors having high slew rates– becomes<br />
more and more important nowadays [1]. Partly for<br />
this reason, the network configuration <strong>of</strong> cars is being<br />
changed from unshielded single core multi wire harnesses<br />
into coaxial conductor layouts. Consequently, optimization<br />
<strong>of</strong> the shielding <strong>of</strong> such cables is a current topic.<br />
The absorbing clamp method (ACM) is a well known<br />
technique for measuring electromagnetic interference<br />
(EMI) generated by electric cables in the range 30–<br />
1000 MHz [2]. Nowadays it is commonly used in the<br />
automotive industry for testing electromagnetic compatibility<br />
(EMC) <strong>of</strong> braided shields and connectors <strong>of</strong><br />
wire harnesses [3]. The measurement set-up is shown in<br />
Fig. 1. The central element <strong>of</strong> the device is the “clamp”<br />
consisting <strong>of</strong> a set <strong>of</strong> split lossy ferrite rings (see Fig. 2)<br />
and a sensing loop [4]. The shielding effect (SE) to be<br />
measured is –in essence– the ratio <strong>of</strong> the output signal<br />
<strong>of</strong> the unshielded cable to that <strong>of</strong> the shielded.<br />
This method became de facto an industry standard in<br />
many areas. For instance –in addition to RF compliance<br />
testing <strong>of</strong> cables– it is already used for the quantitative<br />
Fig. 1. ACM device for measuring the efficiency <strong>of</strong> cable shielding.<br />
evaluation and comparison <strong>of</strong> the performance <strong>of</strong> shields<br />
through their measured SE characteristics. Therefore,<br />
although there are other practical parameters by which<br />
one may characterize and optimize a shielding, the design<br />
cycle would be much more efficient if the SE curve <strong>of</strong><br />
a shield prototype (as taken by ACM) could be directly<br />
predicted by numerical simulation.<br />
ACM is simple and relatively cheap, but it was not<br />
developed –in the late 60’s– with numerical modeling<br />
in mind. Although there are some theoretical studies on<br />
its operation [5], they are far from being applicable for<br />
quantitative prediction. Actually, the simulation <strong>of</strong> this<br />
measurement is quite challenging from the numerical<br />
point <strong>of</strong> view: a) the set-up consists <strong>of</strong> several components,<br />
among others ferrite; b) a wide frequency range is<br />
studied; c) the arrangement is “large” but has some very<br />
fine details.<br />
Several types <strong>of</strong> numerical models have been worked<br />
out to simulate this measurement, based on e.g. coupled<br />
transmission line system (TLS), finite element method<br />
(FEM) and method <strong>of</strong> moments (MoM), with little success<br />
though [6][7]. Surprisingly, none <strong>of</strong> them was able<br />
to catch even the main tendency <strong>of</strong> the characteristics.<br />
Anyone might conclude from the physical picture that<br />
the shielding effect gets better at higher frequencies,<br />
and actually the above mentioned computing models<br />
just confirmed this behavior. However, the real ACM<br />
Fig. 2. Absorbing clamp <strong>of</strong> type R&S R○ MDS-21 [4].
measurement <strong>of</strong> a typical braided cable shield usually<br />
results in a decaying SE characteristics as frequency<br />
increases (c.f. Fig. 13). Whether this behavior is intrinsic<br />
to the shielding, or rather just a “side-effect” <strong>of</strong> the<br />
measurement method, was a question.<br />
II. SUMMARY OF THE MODELING APPROACH<br />
It was realized that detailed 3D modeling <strong>of</strong> the measurements<br />
is not doable because <strong>of</strong> the enormous computational<br />
resources needed for correct analysis <strong>of</strong> the<br />
complicated arrangement.<br />
Our idea is to use decomposition and homogenization.<br />
Different models are used for the braided shield details<br />
and the overall set-up, respectively. The ACM measurement<br />
at higher frequencies tend to show little dependence<br />
on the larger environment (e.g. support, ground, walls,<br />
etc.). This suggests the use <strong>of</strong> a simplified axisymmetric<br />
2D finite element (FE) model <strong>of</strong> the arrangement, which<br />
can be analyzed efficiently.<br />
On the other hand, realistic cable shields do not<br />
exhibit such symmetry. To overcome this difficulty, we<br />
introduced the concept <strong>of</strong> equivalent homogeneous (bulk)<br />
shielding layer that may have frequency dependent complex<br />
conductivity, such that its shielding characteristics<br />
approximates that <strong>of</strong> the original braided shield. Also a<br />
method was developed by which the equivalent conductivity<br />
parameter can be identified. Although this latter<br />
requires a 3D FE model <strong>of</strong> the shield, the domain <strong>of</strong><br />
computation extends to only a single symmetry cell <strong>of</strong> the<br />
geometry. As a result, this 3D analysis is manageable in a<br />
relatively moderate computing environment, too. Putting<br />
the equivalent shield with its properly selected parameters<br />
into the 2D model <strong>of</strong> the experimental set-up provides the<br />
simulated output <strong>of</strong> the measurement.<br />
III. THE 2D AXISYMMETRIC RF FE MODEL<br />
By omitting the support and the surroundings <strong>of</strong> the<br />
measurement device and taking the two reflector plates as<br />
disc-shaped, we get an axially symmetric arrangement, <strong>of</strong><br />
which a 2D longitudinal section is enough to be considered.<br />
We used the RF Module <strong>of</strong> Comsol Multiphysics R○<br />
in the application mode “Electromagnetic Waves / TM<br />
Waves / Harmonic propagation” and with model space<br />
dimension “Axial symmetry (2D)” [8].<br />
The geometry <strong>of</strong> the 2D model can be seen in Fig. 3.<br />
Cable conductor and reflector plates are both modeled<br />
as perfect electric conductor (PEC) boundary conditions.<br />
The terminal impedance is modeled by means <strong>of</strong> a<br />
50 Ω coaxial cable section with non-reflecting boundary<br />
condition at its end. The arrangement is considered<br />
open in the radial direction, which is modeled by a<br />
cylindrical perfectly matching layer (PML). Finally, the<br />
excitation is realized as a 50 Ω coaxial cable, fed by an<br />
incident coaxial-mode TEM wave, which is prescribed as<br />
“port” boundary condition and characterized by the input<br />
voltage Uin.<br />
- 243 - 15th IGTE Symposium 2012<br />
Fig. 3. The 2D axisymmetric FE model built in Comsol RF.<br />
The output signal is the induced voltage Uout <strong>of</strong> an<br />
assumed loop encircling the first ferrite ring (the one<br />
which is closest to the feed). The transfer function is<br />
defined as the ratio <strong>of</strong> output voltage to input, and<br />
normally its gain k versus the frequency f is used,<br />
k(f) = 20 log10 |Uout/Uin| [dB]. (1)<br />
The shielding effect (SE) <strong>of</strong> a given shield is defined here<br />
as the ratio <strong>of</strong> the output voltage measured for the bare<br />
cable core (i.e. for the cable with its shielding removed)<br />
to that one measured for the shielded cable. Provided the<br />
input voltage is kept fixed, the SE characteristics (given<br />
in dB, as function <strong>of</strong> the frequency) can be expressed as<br />
follows,<br />
SE(f) =kun(f) − ksh(f), (2)<br />
where “un” and “sh” stand for the unshielded and<br />
shielded configurations, respectively.<br />
Figure 4 is a snapshot about the circumferential component<br />
<strong>of</strong> the magnetic field, Hϕ, taken in the unshielded<br />
case. It can be observed how two waves –one along the<br />
line with a longer wavelength and one along the series<br />
<strong>of</strong> ferrite rings with shorter– are coupled with each other.<br />
The damping effect <strong>of</strong> the ferrite is also observable on<br />
the plot. Notably, this FE model performs well even in<br />
the reconstruction <strong>of</strong> the stray field <strong>of</strong> bulk aluminum<br />
shielding layers, for which SE can be as high as 130 dB.<br />
A. Parameter Dependency <strong>of</strong> the 2D Model<br />
We have carefully investigated the sensitivity <strong>of</strong> the<br />
computed results on several model parameters –including<br />
some <strong>of</strong> the numerical implementation– in order to filter<br />
out all possible sources <strong>of</strong> inconsistency between the<br />
model and the real measurement. First comes a list <strong>of</strong><br />
those parameters which have little or no effect: value<br />
<strong>of</strong> the terminal impedance; the implemenation <strong>of</strong> open<br />
boundary (e.g. PML, absorbing boundary condition, etc.);<br />
fluctuation <strong>of</strong> the input voltage.
Fig. 4. Snapsot <strong>of</strong> the circumferential magnetic field.<br />
The latter needs some explanation. Since the device is<br />
not matched with the signal generator source, the voltage<br />
along the feeding cable becomes location dependent due<br />
to reflections, moreover this dependence is function <strong>of</strong> the<br />
frequency. In the FE model we consider a chunk <strong>of</strong> the<br />
feeding cable <strong>of</strong> fixed length, which implicitly defines the<br />
quantity Uin for the model. However, this may not be the<br />
same as its real (measured) counterpart. Fortunately, the<br />
computed SE curves do not show significant dependence<br />
on the “definition” <strong>of</strong> Uin, as simulations testified.<br />
Two factors that really affect the results are the permeability<br />
characteristics <strong>of</strong> the ferrite rings, as well as the<br />
assumption <strong>of</strong> axial symmetry. These are detailed in the<br />
next two subsections.<br />
B. Permeability <strong>of</strong> the Ferrite Rings<br />
The permeability <strong>of</strong> the ferrite material plays a dominant<br />
role in forming the SE characteristics predicted by<br />
the model, hence its accurate description is critical. Since<br />
permeability data for the given absorbing clamp were<br />
not available, we made measurements. The obtained frequency<br />
dependence <strong>of</strong> the complex relative permeability<br />
(real and imaginary parts) can be seen on the graph <strong>of</strong><br />
Fig. 5. Note that the data above 100 MHz are extrapolated<br />
values.<br />
C. Limitations <strong>of</strong> Assuming Axial Symmetry<br />
Considering the field plot in Fig. 4 one can conceive<br />
that the ferrite clamp acts a waveguide and –at higher<br />
- 244 - 15th IGTE Symposium 2012<br />
Relative permeability<br />
800<br />
700<br />
600<br />
500<br />
400<br />
300<br />
200<br />
100<br />
0<br />
10 7<br />
10 8<br />
Frequency (Hz)<br />
real part<br />
negative imaginary part<br />
Fig. 5. Complex relative permeability <strong>of</strong> the ferrite material.<br />
frequencies– can give rise to higher order modes. Although<br />
with the use <strong>of</strong> coaxial feeding the axisymmetric<br />
mode(s) are deemed predominant, non-axisymmetric<br />
modes can appear as well wherever symmetry is violated<br />
in the cross section <strong>of</strong> the device.<br />
For studying this behavior we carried out the mode<br />
analysis for the cross section <strong>of</strong> the device using FEM.<br />
In order to break axial symmetry, a large grounded<br />
conducting plate –being parallel with the axis <strong>of</strong> the cable<br />
under test– was added to the arrangement. In addition<br />
to this, the full 3D FE model <strong>of</strong> the arrangement was<br />
analyzed. Note that an unshielded cable was examined<br />
in both cases for the sake <strong>of</strong> simplicity.<br />
Figure 6 compares the transfer functions obtained from<br />
the full 3D and the 2D axisymmetric model, respectively.<br />
Although several possible propagating modes exist towards<br />
1GHz, there is only a little deviation between the<br />
curves, mostly limited to the lower frequencies. From<br />
this we concluded that the effect <strong>of</strong> higher order nonaxisymmetric<br />
modes is negligible, and that the axisymmetric<br />
model is acceptable within this frequency range.<br />
Fig. 6. Comparison <strong>of</strong> the transfer functions computed by the 2D and<br />
3D FE models, respectively, for the case <strong>of</strong> an unshielded cable.<br />
10 9
IV. HOMOGENIZATION AND EQUIVALENT SHIELDING<br />
The goal is to replace the shield having complex geometry<br />
with a homogeneous cylindrical shielding layer,<br />
which is called hereinafter the “equivalent shield”. The<br />
task is to determine the parameters <strong>of</strong> the latter so that<br />
its shielding effect be the same as if it was measured<br />
on the real cable shielding. Of course, this equivalence<br />
is allowed to satisfy approximately, and only on (and<br />
outside <strong>of</strong>) an observation surface, i.e. at and beyond a<br />
certain distance from the shield (Fig. 7).<br />
3D Observation surface 1D<br />
Real texture<br />
Wire<br />
Shield<br />
Isolation<br />
Fig. 7. Illustration <strong>of</strong> the concept <strong>of</strong> homogenization.<br />
Equivalent<br />
homogeneous material<br />
Figure 8 demonstrates how the roughness <strong>of</strong> the stray<br />
field distribution is smoothing as we increase the distance<br />
taken from the shield. (the radial component <strong>of</strong> the<br />
electric field, Er, is plotted along a line parallel with<br />
the cable, at 1GHz). This smoothing behavior justifies<br />
the use <strong>of</strong> the homogeneous shield as replacement in the<br />
2D model. Note that the shield tested here is not braided<br />
but leaky (c.f. Fig. 10), hence for braided shields one can<br />
expect stronger homogenization effect.<br />
There are several parameters that can be varied in order<br />
to find an equivalence, like for instance the inner and<br />
outer radii <strong>of</strong> the layer, or the specific electric conductivity<br />
and magnetic permeability <strong>of</strong> the material. In order to<br />
simplify and regularize the inverse problem we decided<br />
to keep the geometry fixed, and choose non-magnetic<br />
material. Hence only the complex valued conductivity,<br />
σ, <strong>of</strong> the layer remained to be determined. The proposed<br />
values for the inner and outer radius <strong>of</strong> the equivalent<br />
shield are r1 =10mmand r2 =11mm, respectively.<br />
The observation surface is at the radius robs =23.5mm<br />
which coincides with the inner radius <strong>of</strong> the ferrite rings.<br />
A. Evaluation <strong>of</strong> the Scalar Conductivity Parameter<br />
For those shielding configurations which show certain<br />
symmetry in the ϕ direction, like the one on the left<br />
hand side in Fig. 9, the azimuthal components <strong>of</strong> the<br />
shielding currents are symmetric too. As a consequence,<br />
the axial component <strong>of</strong> the magnetic fields caused by<br />
those currents are compensated, and thereby vanish.<br />
This allows us to investigate only the “axial electric –<br />
azimuthal magnetic” field mode, and hence to describe<br />
the conductivity σ by a complex scalar value.<br />
If the current in the wire is given and the parameter σ is<br />
known, the electric and magnetic fields can be determined<br />
from Maxwell’s equations. Since the problem with homogeneous<br />
shielding is essentially one dimensional (see<br />
- 245 - 15th IGTE Symposium 2012<br />
Fig. 8. Smoothness <strong>of</strong> the field at various distances from the shield.<br />
Fig. 7, right), the solution can be given analytically. We<br />
can describe the electric and magnetic fields, each, by<br />
one single component,<br />
E(r, ϕ, z, t) =Re Ez(r)e −jωt ez (3)<br />
H(r, ϕ, z, t) =Re {Hϕ(r)e −jωt eϕ (4)<br />
where time harmonic fields <strong>of</strong> angular frequency ω are<br />
assumed, and j denotes the imaginary unit. Using the<br />
quasi static approximation in the conducting region,<br />
one can easily derive the following Bessel’s differential<br />
equation from Maxwell’s equations:<br />
d2Ez 1 dEz<br />
+<br />
dr2 r dr − jωμ0σEz =0, r1
• Hϕ is specified at r0 (this is equivalent to prescribing<br />
the total current <strong>of</strong> the wire),<br />
• both Ez and Hϕ are continuous at r1 and r2,<br />
• the asymptotic behavior <strong>of</strong> the fields for r →∞is<br />
known.<br />
We omit further details <strong>of</strong> the solution as they can<br />
be found in several textbooks on electromagnetism [9].<br />
The closed formula expressing the magnetic field was<br />
implemented as a Matlab R○ function, where the input is<br />
the complex conductivity σ, and the output is the complex<br />
amplitude (phasor) <strong>of</strong> Hϕ at r = robs.<br />
We use the following procedure for the evaluation <strong>of</strong><br />
the equivalent conductivity <strong>of</strong> the homogeneous shielding<br />
layer (let us denote it with σeq in the following):<br />
1) We create the 3D FE model <strong>of</strong> the real shielding<br />
together with an exciting wire centered in its axis.<br />
We compute the magnetic field at some selected<br />
frequencies between 30 − 1000 MHz.<br />
2) We take samples <strong>of</strong> the azimuthal magnetic field on<br />
the observation surface, and calculate its average.<br />
This is what we call the “observed magnetic field”<br />
and denote with Hϕ,obs.<br />
3) In an optimization loop we attempt to find σeq<br />
for which Hϕ (a function <strong>of</strong> σ) and Hϕ,obs match<br />
the best (this is carried out for each frequency<br />
separately):<br />
σeq =argmin|Hϕ(σ)<br />
− Hϕ,obs| (8)<br />
σ<br />
Some remarks on the procedure. Notably, in step 1) it<br />
is enough to take one symmetry cell <strong>of</strong> the arrangement<br />
as Fig. 10 demonstrates. We used the AC/DC module<br />
<strong>of</strong> Comsol Multiphysics R○ for computing the fields with<br />
quasi static approximation [10]. In step 2) we also verify<br />
whether the z component magnetic field, Hz itself, as<br />
well as the fluctuation <strong>of</strong> Hϕ on the observation surface<br />
are really negligible. Finally, in step 3) we used the<br />
Matlab R○ function fminsearch for the purpose.<br />
B. Evaluation <strong>of</strong> the Conductivity Tensor<br />
For shielding configurations like the spiral-structure in<br />
Fig. 9 on the right, the above described method is not<br />
suitable because the magnetic field is no longer pure azimuthal.<br />
In this case we can still use the anisotropic conductivity<br />
tensor in the equivalent homogeneous shielding<br />
layer to imitate a similar phenomenon. The conductivity<br />
tensor is assumed to have the following form:<br />
⎡<br />
σ = ⎣ σrr<br />
⎤<br />
0 0<br />
⎦ (9)<br />
0 σϕϕ σϕz<br />
0 σzϕ σzz<br />
That is, the r − ϕ and r − z cross-effects are supposed to<br />
have negligible contribution to the observable magnetic<br />
field. Moreover, σϕz = σzϕ is expected.<br />
To demonstrate the treatment <strong>of</strong> anisotropy we derive<br />
the equations for the field components in the conductive<br />
region. First, the governing Maxwell’s equations are<br />
- 246 - 15th IGTE Symposium 2012<br />
Fig. 10. Symmetry cell <strong>of</strong> the shielded cable used for determining<br />
the σ parameter <strong>of</strong> the equivalent homogeneous shielding layer. This<br />
structure was inspired by leaky coaxial (LCX) cables.<br />
written in the quasi static approximation, in the time<br />
harmonic regime:<br />
∇×H = σE, ∇×E = −jωμ0H (10)<br />
Taking into account the obvious symmetries <strong>of</strong> the configuration<br />
with homogeneous shielding layer, i.e. ∂/∂z =<br />
0 and ∂/∂ϕ =0, the resolution <strong>of</strong> (10) written for the<br />
three cylindrical components is the following:<br />
r : 0 = σrrEr, 0=−jωμ0Hr (11)<br />
⎧<br />
⎪⎨ −<br />
ϕ :<br />
⎪⎩<br />
∂Hz<br />
∂r = σϕϕEϕ + σϕzEz<br />
− ∂Ez<br />
(12)<br />
= −jωμ0Hϕ<br />
⎧<br />
∂r<br />
⎪⎨<br />
1 ∂<br />
z :<br />
r ∂r<br />
⎪⎩<br />
(rHϕ) =σzϕEϕ + σzzEz<br />
1 ∂<br />
r ∂r (rEϕ)<br />
(13)<br />
=−jωμ0Hz<br />
It can be seen that the radial components vanish, and<br />
also that σrr does not play a role. However, there are no<br />
longer separable Hϕ − Ez and Hz − Eϕ modes, as in<br />
the isotropic case. The solution <strong>of</strong> the equations (11-13)<br />
is not easy, but as the domain is one dimensional, fast<br />
solution method may be established. The algorithm given<br />
in section IV-A should slightly be modified here:<br />
1) The same as in the isotropic case, but we have<br />
to carry out the FE analysis for two different<br />
excitations: one in which Hϕ is prescribed at the<br />
wire surface (r = r0), and one in which Hz is<br />
prescribed there (whatever would be the physical<br />
meaning <strong>of</strong> the latter). These two solutions are<br />
marked with ( ′ ) and ( ′′ ) in the following.<br />
2) We take samples <strong>of</strong> the axial and azimuthal<br />
magnetic field on the observation surface from<br />
both FEM solutions, and calculate their average.<br />
This way we obtain the observed magnetic fields,<br />
, H′′<br />
H ′ ϕ,obs<br />
ϕ,obs , H′ z,obs<br />
and H′′<br />
z,obs respectively.
3) We attempt to optimize the components <strong>of</strong> σ as<br />
above. The generalization <strong>of</strong> the scalar case is<br />
straightforward:<br />
σeq =argmin<br />
σ<br />
<br />
H ′<br />
ϕ − H ′ <br />
<br />
ϕ,obs + H ′′<br />
ϕ − H ′′<br />
+ H ′ z − H ′ <br />
<br />
z,obs + H ′′<br />
z − H ′′<br />
V. TEST RESULTS<br />
ϕ,obs<br />
z,obs<br />
<br />
+<br />
<br />
(14)<br />
For testing the method we chose a simple shield structure<br />
(Fig. 11) inspired by leaky coaxial (LCX) cables. The<br />
shield is is made <strong>of</strong> aluminum; the inner radius <strong>of</strong> the<br />
tube is 10 mm; the wall thickness is 1mm. The shield<br />
has circular holes <strong>of</strong> 5mmradius in a regular distribution;<br />
there are 4 holes along the circumference.<br />
Fig. 11. Leaky aluminum shield used for testing the method.<br />
Since the geometry is symmetric with respect to the<br />
azimuthal (ϕ) direction, we are allowed to use the equivalent<br />
scalar conductivity (c.f. section IV-A). By solving<br />
(8) we obtained the σeq curves presented in Fig. 12.<br />
Note that these curves are not the only suitable ones,<br />
because the solution <strong>of</strong> (8) is not unique. However, we<br />
experienced that quite different σeq curves resulted in<br />
the same SE characteristics at the end, so they can be<br />
considered equally good in this respect.<br />
Figure 13 shows the SE curve computed by building<br />
the σeq characteristics <strong>of</strong> Fig. 12 into the 2D axisymmetric<br />
FE model. For comparison, the figure also shows the<br />
curve obtained by real measurement. Obviously, the SE<br />
Fig. 12. The computed equivalent complex conductivity, σeq.<br />
- 247 - 15th IGTE Symposium 2012<br />
Shielding Effect (dB)<br />
60<br />
55<br />
50<br />
45<br />
40<br />
35<br />
30<br />
25<br />
20<br />
15<br />
10 7<br />
10 8<br />
Frequency (Hz)<br />
measured<br />
simulated<br />
Fig. 13. Comparison <strong>of</strong> measured and simulated shielding effects.<br />
characteristics has been reliably predicted by the model,<br />
with some deviation in the low end <strong>of</strong> the frequency<br />
range.<br />
VI. CONCLUSIONS<br />
A numerical simulation method has been elaborated that<br />
can be used to predict the shielding effect <strong>of</strong> shields with<br />
various patterns. Using this tool the designer can predict<br />
the usability <strong>of</strong> a given shield construction. This simulation<br />
method has been thoroughly verified by theoretical<br />
considerations, numerical experiments, and also by some<br />
experimental data. In the authors’ opinion, the error <strong>of</strong><br />
prediction at low frequency might be due to either the<br />
inexact knowledge <strong>of</strong> the ferrite permeability characteristics<br />
or the insufficiency <strong>of</strong> the 2D axisymmetric model.<br />
REFERENCES<br />
[1] M. Reuter, S. Tenbohlen, W. Köhler, and A. Ludwig, “Impedance<br />
analysis <strong>of</strong> automotive high voltage networks for EMC measurements,”<br />
in 10th Int. Symposium on Electromagnetic Compatibility<br />
(EMC Europe), York (UK), 26-30 Sept 2011.<br />
[2] A. Tsaliovich, Electromagnetic Shielding Handbook for Wired and<br />
Wireless EMC Applications, ser. Kluwer international series in<br />
engineering and computer science. Kluwer Academic, 1999. [Online].<br />
Available: http://books.google.hu/books?id=4vl0S6fZo-IC<br />
[3] S. Miyazaki, S. Kihira, and T. Nozaki, “New shielding construction<br />
<strong>of</strong> high-voltage wiring harnesses for Toyota Prius – winning<br />
<strong>of</strong> Toyota Superior Award for cost reduction,” Sumitomo Electric<br />
Industries Technical Review, no. 61, pp. 21–23, Jan 2006.<br />
[4] Rohde & Schwarz R○ MDS-21 Absorbing Clamp – Data<br />
sheet. [Online]. Available: http://www2.rohde-schwarz.com/file/<br />
MDS-21 EZ-24 dat en.pdf<br />
[5] D. Williams and S. Jones, “Time domain characterization and<br />
modelling <strong>of</strong> the absorbing clamp. a device for measuring radiated<br />
radio frequency power,” in Eighth International Conference on<br />
Electromagnetic Compatibility, 21-24 Sept 1992, pp. 149–159.<br />
[6] L. Fejérvári, “Simulation <strong>of</strong> wire harness radiation,” Furukawa<br />
Electric Institute <strong>of</strong> <strong>Technology</strong>, Budapest, Tech. Rep., March<br />
2009.<br />
[7] P. Kis, “Simulation <strong>of</strong> wire harness radiation,” Furukawa Electric<br />
Institute <strong>of</strong> <strong>Technology</strong>, Budapest, Tech. Rep., March 2010.<br />
[8] Comsol Multiphysics RF Module User’s Guide, COMSOL AB,<br />
November 2008.<br />
[9] K. Simonyi, Foundations <strong>of</strong> Electrical Engineering: Fields, Networks,<br />
Waves. London: Pergamon, 1963.<br />
[10] Comsol Multiphysics AC/DC Module User’s Guide, COMSOL<br />
AB, November 2008.<br />
10 9
- 248 - 15th IGTE Symposium 2012<br />
A Neural Network Approach to Sizing an Electrical<br />
Machine<br />
Steven Bielby, David A. Lowther<br />
Electrical and Computer Engineering Department, McGill <strong>University</strong>, 3480 <strong>University</strong> Street, Montreal, Quebec,<br />
Canada. H3A 2A7<br />
E-mail: david.lowther@mcgill.ca<br />
Abstract—The first stage in the design <strong>of</strong> an electrical machine, or an electromagnetic device, is usually referred to as<br />
“sizing”. It produces an approximate description (or design) <strong>of</strong> the desired device in terms <strong>of</strong> its major physical dimensions.<br />
This is traditionally performed using simple magnetic circuits or electric equivalent circuits. This paper proposes an<br />
approach based on a data base <strong>of</strong> device performance data and a neural network to estimate the values <strong>of</strong> the parameters<br />
necessary to provide a complete initial description <strong>of</strong> the device.<br />
Index Terms—Design, electrical machines, sizing, neural networks.<br />
Not only is the speed <strong>of</strong> the process an issue in that the<br />
I. INTRODUCTION<br />
design can take too much time, many <strong>of</strong> the parameters<br />
The design <strong>of</strong> any artifact is a process <strong>of</strong> searching an that are needed for a complete field solution <strong>of</strong> the device<br />
appropriate space at increasing levels <strong>of</strong> complexity. This have little or no effect on the main performance<br />
space is usually referred to as the design space. It is, parameters. For example, the magnitude <strong>of</strong> the torque in<br />
fundamentally, a high dimensional space relating the an electrical machine is dependent on the variation <strong>of</strong> the<br />
performance <strong>of</strong> a device to the values <strong>of</strong> a set <strong>of</strong> stored energy in the airgap <strong>of</strong> the device as the rotor<br />
parameters that describe the structure and physical changes position. The energy, in turn, depends on the<br />
properties <strong>of</strong> the device. The performance, <strong>of</strong> course, fields in the airgap and, to a first approximation, these<br />
might not be a single variable but several. For example, may be related to the current densities and the size <strong>of</strong> the<br />
the cost, size and weight as well as output or functional airgap. This information can be expressed through a basic<br />
capabilities might all be performance variables. The magnetic circuit.<br />
physical parameters could be the actual dimensions Consequently, the process usually employed for<br />
describing the physical structure; the material properties; relatively conventional structures relies on the knowledge<br />
or the external environment including the excitation <strong>of</strong> the designer and simple equivalent circuits for the<br />
sources, the mechanical loads, etc. Thus the design space initial steps. The design starts at an extremely coarse high<br />
can be extremely complex and the design process for a level and works downwards. The equivalent circuit<br />
device needs to be able to balance output objectives and model <strong>of</strong> an electrical machine provides a way <strong>of</strong> relating<br />
input specifications. The difficulty in design is that the electrical, mechanical and, possibly, thermal performance<br />
challenge posed by the specification is, in general, an <strong>of</strong> a device to the values <strong>of</strong> a relatively small number <strong>of</strong><br />
inverse problem, i.e. the designer is asked to produce a components and thus is easy to work with as well as<br />
physical structure which will meet objectives which are providing fast initial iterations. However, it is not always<br />
the input to the design. For example, for an electrical easy to relate the equivalent circuit values to the physical<br />
machine, the design input might well be the torque structures themselves and the knowledge base <strong>of</strong> the<br />
required from the device over a particular speed range. In designer helps to bridge this gap at the early stages <strong>of</strong> the<br />
addition, there might be electrical constraints imposed by design process. Fig. 1 illustrates the hierarchical process<br />
the form <strong>of</strong> the power supply.<br />
involved where the initial search for a design solution<br />
While it might be possible to achieve the design <strong>of</strong> an takes place at a high level with a limited set <strong>of</strong> parameters<br />
electrical machine from first principles, i.e. starting from but a large space to explore. As the design space is<br />
the equations <strong>of</strong> physics and knowledge <strong>of</strong> the properties narrowed down, the number <strong>of</strong> parameters is expanded to<br />
<strong>of</strong> various materials, this involves working at the provide a more detailed examination <strong>of</strong> the local space.<br />
maximum detail <strong>of</strong> the device. To use this approach, the At each level <strong>of</strong> the process, the analysis performed<br />
problem requires the solution <strong>of</strong> the field equations and becomes more complex and, consequently, more<br />
these require that all the physical parameters are expensive. If design is considered to be an optimization<br />
identified and given values. This generates a huge design process, then the phases <strong>of</strong> exploration and exploitation<br />
space to work in. The approach has been demonstrated re-occur at each level. At some point, the “virtual”<br />
on simple models but is computationally extremely (computer based) design is terminated because an<br />
expensive [1]. Searching such a space without an initial increase in the level <strong>of</strong> detail will have no effect on the<br />
idea <strong>of</strong> where a solution might be is extremely slow on<br />
existing computational platforms. In addition, such an<br />
required accuracy <strong>of</strong> the performance parameters .<br />
approach has difficulty including economic,<br />
manufacturing, and other constraints on the design.
Figure 1. Hierarchical Design Process<br />
The starting point at the highest level can, clearly,<br />
affect the convergence and time taken to complete the<br />
process. In an industrial organization, determining the<br />
starting point is performed through one or both <strong>of</strong> two<br />
processes. The first involves a database <strong>of</strong> previous<br />
designs [2], while the second uses simple magnetic<br />
circuit models (mentioned earlier) which vary in their<br />
effectiveness. However, if little or no design experience<br />
exists and a magnetic circuit is ineffective, for example in<br />
the case <strong>of</strong> systems with non-linearity and eddy currents,<br />
these two approaches may fail. The approach suggested<br />
in this paper is to use a neural network to map the desired<br />
machine performance onto a set <strong>of</strong> parameter values for<br />
the device.<br />
II. THE CONCEPT OF SIZING<br />
Following from the above discussion, the first stage in<br />
the design process <strong>of</strong> an electrical machine is to estimate<br />
the approximate size <strong>of</strong> the device needed to meet a set <strong>of</strong><br />
specifications. Traditionally, this is based on a few,<br />
relatively basic, rules. For example, the torque that can be<br />
delivered by an electrical machine is given in Equation 1:<br />
2<br />
T 0.<br />
5D<br />
L.(<br />
B.<br />
J )<br />
(1)<br />
Where D is the outer diameter <strong>of</strong> the rotor, J is the<br />
effective stator current sheet in amps per meter <strong>of</strong><br />
circumference, L is the length <strong>of</strong> the rotor, and B is the<br />
flux density crossing the air gap.<br />
How these quantities are created is the job <strong>of</strong> the rotor<br />
and stator structures. In a simple machine, the flux<br />
density in the airgap can be estimated from a basic<br />
magnetic circuit where the only component that provides<br />
a magnetic “resistance” to the magnetic flux is the airgap.<br />
The benefit <strong>of</strong> this approach is that it can provide<br />
approximate values for many <strong>of</strong> the key parameters in the<br />
machine design and it, in effect, locates the most likely<br />
area in the design search space for a solution. The levels<br />
<strong>of</strong> accuracy needed here are relatively low – the goal is to<br />
find a “ball park” estimate for the parameters so a<br />
solution within 10% or 20% <strong>of</strong> the real answer is good<br />
enough to begin the design process. The issue is speed –<br />
these results can be obtained very quickly and the<br />
exploration <strong>of</strong> a possible design space can be completed<br />
at a much reduced cost.<br />
However, given a need for only an approximate<br />
- 249 - 15th IGTE Symposium 2012<br />
solution but to deliver it extremely quickly, there are<br />
several alternate paradigms which could achieve this on a<br />
relatively unsophisticated computing device. In modern<br />
terms, there are two approaches which can be used. The<br />
first is to generate a surrogate model [3]. This is, in<br />
effect, the approach taken by the equivalent circuit<br />
approach, i.e. the real model is replaced by a simplified<br />
structure which performs in much the same way. The<br />
effectiveness and accuracy <strong>of</strong> the surrogate can be<br />
controlled relatively easily. The relationship between the<br />
output performance and the input parameters is <strong>of</strong>ten<br />
referred to as the “Response Surface” [4] and the<br />
accuracy or fidelity <strong>of</strong> the response surface depends on<br />
the surrogate model chosen. An alternate approach is to<br />
generate a series <strong>of</strong> points on the response surface and<br />
then to develop a curve fitting or interpolation system to<br />
estimate other points on the surface. In this case, the<br />
response surface can be modeled in a way that estimating<br />
a new point and determining the values <strong>of</strong> the input<br />
parameters for this point can be achieved very quickly.<br />
This is a variant <strong>of</strong> the approach suggested in [5].<br />
However, the development <strong>of</strong> the surrogate can be<br />
computationally expensive since full field solutions may<br />
be necessary. The gain, <strong>of</strong> course, over the simple<br />
magnetic circuit approach is that the synthesized initial<br />
design is likely to be much more detailed and to take into<br />
account more <strong>of</strong> the real behavior <strong>of</strong> the device than the<br />
simple assumption <strong>of</strong> perfect magnetic materials and only<br />
an air gap.<br />
The approach being proposed in this paper is a<br />
combination <strong>of</strong> the two “conventional” systems. The<br />
surrogate model is based on a neural network and an<br />
existing database <strong>of</strong> solutions is used to train the<br />
network, thus providing an approximation to the response<br />
surface. Such a system can deliver an initial estimate <strong>of</strong> a<br />
design with minimal computational effort and has the<br />
added advantage <strong>of</strong> improving its capabilities after each<br />
design as the new design can be added to the training<br />
database.<br />
III. NEURAL NETWORKS<br />
A neural network is an interconnected system <strong>of</strong> basic<br />
processing elements [6]. Each element performs a simple<br />
computation based on its inputs and produces an output.<br />
Each neuron “sees” a different weighted set <strong>of</strong> inputs and<br />
the outputs are combined to generate the output <strong>of</strong> the<br />
network. Fig. 2 illustrates the process.
Figure 2. Basic Neural Network Architecture<br />
The architecture shown in Fig.2. is <strong>of</strong>ten referred to as a<br />
“feed-forward network” in that the input data is fed in<br />
one direction through the network and the network<br />
operates synchronously.<br />
The operation <strong>of</strong> the network is controlled through the<br />
values <strong>of</strong> the weights on the neuron inputs and the<br />
combination <strong>of</strong> the neuron outputs. If a vector <strong>of</strong> input<br />
values, representing a point in a multi-dimensional space,<br />
is presented at the inputs, the network will respond with<br />
an appropriate output. In a sense, the network operates<br />
like an active read-only memory. The output from the<br />
memory system being determined not by addressing a<br />
particular location in the memory but based on the<br />
content <strong>of</strong> a memory location. Thus, neural networks are<br />
<strong>of</strong>ten referred to as “auto-associative” or “contentaddressable”<br />
memories. The feed-forward network is<br />
only one <strong>of</strong> several possible neural interconnections and<br />
is interesting in this application because it is able to<br />
interpolate between examples it has been shown.<br />
The process <strong>of</strong> defining the operation <strong>of</strong> a particular<br />
network is known as “training”. In this phase, the<br />
network is provided with a set <strong>of</strong> input vectors and the<br />
output corresponding to each vector. The weights seen by<br />
each neuron and their final combination into the output<br />
are adjusted to minimize the error between the required<br />
output and the one that is actually generated. This can<br />
<strong>of</strong>ten be a difficult process depending on the form <strong>of</strong> the<br />
neurons themselves.<br />
The neurons can be based around several different<br />
functions. Traditionally, neurons have used a paradigm<br />
based on a summing amplifier. Each neuron provides the<br />
weighted sum <strong>of</strong> its inputs which is then processed by a<br />
thresholding function. The outputs <strong>of</strong> all the neurons are<br />
then summed at the output. Each neuron operates over<br />
the entire input space. In this case, a neuron is said to<br />
“fire”, i.e. produce an output, when the weighted sum <strong>of</strong><br />
the inputs exceeds some threshold value. Thus the<br />
evaluation <strong>of</strong> the weights requires the satisfaction <strong>of</strong> a set<br />
<strong>of</strong> inequalities and the solution is non-unique. In<br />
addition, in order to model sophisticated functions,<br />
several layers <strong>of</strong> neurons may be needed and this can lead<br />
to difficulties in the training operation.<br />
- 250 - 15th IGTE Symposium 2012<br />
For the work described in this paper, neurons based<br />
on radial basis functions are used. In this case, the<br />
output function <strong>of</strong> the network is described by:<br />
Where ci represents the center <strong>of</strong> the area <strong>of</strong> interest <strong>of</strong> a<br />
single neuron and x is the position <strong>of</strong> the current input<br />
point in the parameter space being considered. Wi<br />
represents the trained weight <strong>of</strong> neuron i.<br />
The function, , is given by:<br />
2<br />
y<br />
<br />
2<br />
<br />
(3)<br />
(<br />
y) e<br />
Where controls the domain <strong>of</strong> influence <strong>of</strong> the neuron.<br />
The training process can thus determine the values <strong>of</strong><br />
W and for each neuron. Each neuron thus has a local<br />
effect. The determination <strong>of</strong> the weights for a network<br />
based on these functions can be expressed as an<br />
optimization problem and the approach results in a<br />
network that is easier to train.<br />
Once trained, the network can reproduce the examples<br />
that it was shown. However, there is also an emergent<br />
property in that it is able to “generalize”, i.e. it can<br />
generate outputs for input vectors it has not “seen”<br />
before. The process <strong>of</strong> building a neural network can be<br />
considered similar to fitting a surface in a multidimensional<br />
space to a set <strong>of</strong> data points. In fact, there is<br />
some commonality here with methods used in meshless<br />
systems to evaluate field solutions [7]. The network can<br />
function extremely quickly since the individual neuron<br />
operations are computationally simple and it acts as a<br />
look-up table for the unknown surface.<br />
How well the network can match the input data and<br />
corresponding outputs and how good the generalization<br />
capabilities are depends on the network design. The<br />
number <strong>of</strong> neurons can be considered to be similar to the<br />
number <strong>of</strong> basis functions used to represent the surface.<br />
If too few are used, the network will have a problem<br />
training to the presented data with sufficient accuracy; if<br />
too many are used, the network will have difficulty<br />
generalizing and may generated large errors between the<br />
known data points (a sort <strong>of</strong> high frequency oscillation<br />
between the points). For this reason, the training set is<br />
generally split into two pieces: the first is used to train<br />
the network; the second, which has not been seen by the<br />
network during training, is used to test the generalization<br />
capabilities. This can then lead to a higher level process<br />
where the network architecture, i.e. the number <strong>of</strong><br />
neurons used, is modified during the training process to<br />
try to improve the generalization performance.<br />
IV. THE PROPOSED SIZING PROCESS<br />
From the above, the process <strong>of</strong> sizing an electromagnetic<br />
device, in particular, an electrical machine, could be<br />
implemented using a neural network. This is based on the<br />
fact that the process <strong>of</strong> sizing is usually fairly limited, e.g.<br />
(2)
for a specific torque requirement and architecture <strong>of</strong><br />
machine, determine the key diameter values and the air<br />
gap size. If several designs <strong>of</strong> a specific class <strong>of</strong> machine<br />
already exist, then a neural network can be trained on this<br />
data and the generalization capability will allow it to<br />
estimate the “size” <strong>of</strong> the new device. As stated above,<br />
the goal <strong>of</strong> sizing is not to produce a perfect solution to<br />
the design problem, rather it is to get within a reasonable<br />
range in the design space <strong>of</strong> a possible solution. Thus the<br />
system does not have to be highly accurate; an error <strong>of</strong> 10<br />
or 20 percent in the performance <strong>of</strong> the proposed design<br />
is probably acceptable since a conventional optimization<br />
system can take the design from that point to completion.<br />
The process <strong>of</strong> developing a neural network based<br />
sizing system is shown in Fig. 3.<br />
Figure 3. Sizing Network Development Process.<br />
Since the goal <strong>of</strong> the sizing process is to develop an<br />
approximate synthesized prototype, the database shown<br />
in Fig.3 is used primarily to identify the major features<br />
and parameter values. Hence, in fact, a database <strong>of</strong><br />
existing designs is (a) probably too limited and will not<br />
cover the design space particularly well and (b) is<br />
unlikely to be structured to provide the information<br />
needed for sizing. Instead, a more controlled database can<br />
be constructed by using existing analysis programs.<br />
Using this approach, the database can be developed to<br />
provide effective coverage <strong>of</strong> the design space. In<br />
addition, certain parameters <strong>of</strong> the device, e.g. the<br />
- 251 - 15th IGTE Symposium 2012<br />
number <strong>of</strong> poles, the maximum frequency, etc., can be<br />
fixed and thus the network can be trained on a subset <strong>of</strong><br />
the machine design space. This lowers the dimensionality<br />
<strong>of</strong> the space and hence, simplifies the network and<br />
reduces the training time. It also simulates most existing<br />
sizing processes where certain key parameters are set in<br />
the specifications. In the event that these are not set, a<br />
higher level network can be developed to first make the<br />
choice <strong>of</strong> these key parameters before moving into the<br />
sizing process.<br />
V. A SIMPLE SIZING TEST<br />
Given the issues facing machines designers due to the<br />
costs <strong>of</strong> permanent magnets, a possible design scenario<br />
for demonstrating the effectiveness <strong>of</strong> the neural network<br />
approach is the replacement <strong>of</strong> a permanent magnet rotor<br />
with that for an induction machine while keeping the<br />
stator design constant. Thus the goal is to design a rotor<br />
structure that can produce a specific torque-speed<br />
performance. Note that, since the native torque-speed<br />
curve for an induction machine is very different to that<br />
for a permanent magnet design, the substitution is only<br />
possible with the additional use <strong>of</strong> power electronics and<br />
an external control system.<br />
Conventional sizing approaches, which work well for<br />
permanent magnet machines, are not very effective in<br />
dealing with induction machine sizing and somewhat<br />
more sophisticated models based around equivalent<br />
circuits are needed. Thus the induction machine is an<br />
ideal candidate for the process being described in this<br />
paper.<br />
The proposed system was tested on two different rotor<br />
architectures. The first was a drag-cup servo rotor where<br />
the rotor architecture is a conducting (copper) cylinder<br />
around a permeable (iron) core. The design parameters<br />
here are simple: just the thickness and radius <strong>of</strong> the<br />
conducting cylinder. The second design involved a<br />
squirrel-cage rotor which increased both the<br />
electromagnetic complexity <strong>of</strong> the problem and the<br />
number <strong>of</strong> design parameters.<br />
A. The Drag-Cup Rotor<br />
Fig.4 shows the basic design <strong>of</strong> the drag-cup rotor<br />
being considered.<br />
Figure 4. A Drag-Cup Rotor for an Induction Machine.
The typical torque-speed curve for this device is<br />
shown in Fig.5.<br />
Figure 5. Typical Torque-Speed Curve for a Drag-Cup<br />
Rotor.<br />
TABLE II Drag-Cup Rotor from Neural Net (Torque in<br />
Nm)<br />
Test#<br />
TABLE I Drag-Cup Rotor Simulations<br />
Test# Inner Outer Starting Maximum<br />
<br />
Radius<br />
(mm)<br />
Desired<br />
Start<br />
Torque<br />
Radius<br />
(mm)<br />
Desired<br />
Max<br />
Torque<br />
Start<br />
Torque<br />
Torque<br />
(Nm)<br />
Max<br />
Torque<br />
Torque<br />
(Nm)<br />
1 25 27 6.12 17.93<br />
2 25 28 6.74 25.56<br />
3 25 29 6.82 32.84<br />
4 25 30 6.65 39.25<br />
<br />
Averag<br />
eError<br />
1 6.12 17.93 6.52 17.87 3.40%<br />
2 6.74 25.56 7.02 24.85 3.44%<br />
3 6.82 32.84 6.76 31.4 2.57%<br />
4 6.65 39.25 6.58 36.31 4.23%<br />
Table I shows a typical set <strong>of</strong> parameters for the drag-cup<br />
rotor. A large range <strong>of</strong> values over each parameter was<br />
used to generate the training and testing sets for the<br />
neural network and the torque results computed using a<br />
finite element code (MagNet [7]). The network was<br />
constructed and trained using the MatLab Neural<br />
Network toolbox. Once trained, the network predictions<br />
were tested on a set <strong>of</strong> 50 samples. Each sample was also<br />
evaluated using the finite element analysis and the results<br />
were compared. The average error over the whole set was<br />
4%. Table II shows some typical results.<br />
Following on from these results, the complexity was<br />
increased by considering a squirrel-cage rotor, i.e. a<br />
structure consisting <strong>of</strong> a set <strong>of</strong> conducting bars in slots on<br />
the rotor.<br />
- 252 - 15th IGTE Symposium 2012<br />
B. The Squirrel-Cage Rotor.<br />
A range <strong>of</strong> squirrel-cage rotor designs were<br />
constructed to work with a 4 pole, 3 phase stator, shown<br />
in Fig. 6. The variables in the rotor were the number <strong>of</strong><br />
bars, the size <strong>of</strong> the bars and the diameter <strong>of</strong> the rotor.<br />
Fig. 7 shows the architecture <strong>of</strong> the squirrel-cage. A<br />
number <strong>of</strong> combinations <strong>of</strong> these parameters were<br />
produced and the torque-speed curves generated, again<br />
using the MagNet s<strong>of</strong>tware. Results were generated for a<br />
range <strong>of</strong> values <strong>of</strong> each parameter resulting in 144<br />
models in the database Table III shows the parameters<br />
and the ranges used. The number <strong>of</strong> conduction bars was<br />
set to an integer corresponding to the most commonly<br />
used values for a 4 pole system. Each rotor geometry was<br />
simulated for a range <strong>of</strong> frequencies from 0 to 60 Hz and<br />
the starting and peak torques recorded, as well as the<br />
torque-speed curve.<br />
Table III Ranges <strong>of</strong> Parameters for Squirrel-<br />
Cage Rotor<br />
Parameter Minimum Maximum<br />
Radius<strong>of</strong><br />
Conduction<br />
Bars(mm) 0.5 2<br />
Radius<strong>of</strong><br />
Rotor(mm) 28 36<br />
Thenumber<strong>of</strong>conductionbarswassettoone<br />
<strong>of</strong>15,20,30,35<br />
Figure 6. The 4 Pole, 3 Phase Stator Design used with<br />
the Sizing System.<br />
The network was developed following the process<br />
described in Fig. 3 and, once trained, was used to size a<br />
rotor for a particular specification. The neural network<br />
sizing estimates were then compared with an analysis <strong>of</strong><br />
the designed rotor in MagNet and an average error <strong>of</strong><br />
around 9% was generated over all the samples for a<br />
network with 20 neurons. Thus it is reasonable to state
that the proposed system provided a “sizing” estimate for<br />
the rotor design which was within the tolerance expected<br />
at this point in the design process.<br />
As a last test, the network architecture was varied, i.e.<br />
to determine the effect <strong>of</strong> the number <strong>of</strong> neurons on the<br />
error in prediction. The resulting errors are shown in Fig.<br />
8 as a function <strong>of</strong> the number <strong>of</strong> neurons. The data in Fig<br />
8 show the lack <strong>of</strong> approximation capability <strong>of</strong> the<br />
network for low numbers <strong>of</strong> neurons and the inability to<br />
generalize for high numbers. The ideal number for this<br />
problem appeared to be around 20 neurons in the<br />
network. It is not clear what caused the slight increase in<br />
error for a 15 neuron network and this bears further<br />
investigation.<br />
Figure 7. Basic Conductor Layout for a Squirrel Cage<br />
Rotor.<br />
Figure 8. Error between the Finite Element and Neural<br />
Network Solutions against the Number <strong>of</strong> Neurons in the<br />
Network for Starting and Maximum Torques<br />
VI. CONCLUSIONS<br />
The paper has described an approach to developing an<br />
initial prototype <strong>of</strong> an electromagnetic device based on a<br />
limited number <strong>of</strong> specifications. This is conventionally<br />
known as “sizing”. The use <strong>of</strong> a neural network together<br />
with a pre-computed database <strong>of</strong> examples, developed<br />
from a finite element analysis <strong>of</strong> a range <strong>of</strong> devices<br />
covering the design space, has been shown to be effective<br />
in developing an initial solution. The process <strong>of</strong> training<br />
the network is similar to developing the response surface<br />
- 253 - 15th IGTE Symposium 2012<br />
for the particular machine examples. The neural network<br />
acts as a form <strong>of</strong> surrogate but it is capable <strong>of</strong> providing a<br />
solution to the inverse problem unlike the more<br />
conventional usage <strong>of</strong> these techniques where the goal is<br />
to develop an effective forward model. The accuracy <strong>of</strong><br />
the neural network is within the range <strong>of</strong> existing sizing<br />
approaches and can probably be improved with a better<br />
training database.<br />
REFERENCES<br />
[1] Dyck, D.N., Lowther, D.A., “Automated Design <strong>of</strong> Magnetic<br />
Devices by Optimizing Material Distribution,” IEEE Transactions<br />
on Magnetics, Vol.32, 3, 1996, pp. 1188-1193.<br />
[2] Ouyang, J., Lowther, D.A., “A Hybrid Design Model for<br />
Electromagnetic Devices,” IEEE Transactions on Magnetics, Vol.,<br />
45, 3, 2009, pp. 1442-1445.<br />
[3] Hawe, G,I,. Sykulski, J.K., “The Consideration <strong>of</strong> Surrogate<br />
Model Accuracy in Single-Objective Electromagnetic Design<br />
Optimization,” <strong>Proceedings</strong> <strong>of</strong> the 6 th International Conference on<br />
Computational Electromagnetics, 2006, pp.1-2.<br />
[4] Wang, L., Lowther, D.A., ”Reducing the Design Space <strong>of</strong><br />
Standard Electromagnetic Devices using Bayesian Response<br />
Surfaces,” IEEE Transactions on Magnetics, Vol. 46, 2010, pp.<br />
2884-2887.<br />
[5] Hawe, G., Sykulski, J., “Considerations <strong>of</strong> Accuracy and<br />
Uncertainty with Kriging Surrogate Models in Single-Objective<br />
Electromagnetic Optimisation,” IET <strong>Proceedings</strong> on Science,<br />
Education and <strong>Technology</strong>, Vol. 1, 2007, pp.37-47.<br />
[6] Aleksander, I., Morton, H., “An Introduction to Neural<br />
Computing,” London, UK, International Thomson Computer<br />
Press, 1991.<br />
[7] Benbouza, N, Louai, F.Z., Nait-Said, N. “Application <strong>of</strong> Mexhless<br />
Petrov Galerkin (MLPG) Method in Electromagnetics using<br />
Radial Basis Functions,” <strong>Proceedings</strong> <strong>of</strong> the 4 th IET Conference on<br />
Power Electronics, Machines and Drives, 2008, pp. 650-655.<br />
[8] MagNet Users Manual, Infolytica Corporation, 2012.
- 254 - 15th IGTE Symposium 2012<br />
Exploring and Exploiting Parallelism in the Finite<br />
Element Method on Multi-core Processors: an<br />
Overview<br />
Hussein Moghnieh and David A. Lowther<br />
Department <strong>of</strong> Electrical and Computer Engineering, McGill <strong>University</strong> Montreal, Quebec, H3A 2A7, Canada<br />
E-mail: hussein.moghnieh@mail.mcgill.ca<br />
Abstract—Exploring parallelism requires identifying parts <strong>of</strong> a method or a kernel that can run concurrently. Exploiting<br />
parallelism involves utilizing techniques aimed at devising an efficient parallel implementation on a given processor.<br />
Different stages <strong>of</strong> the Finite Element Method have been found to require different approaches to explore and exploit their<br />
parallelism. While data locality is essential to gain performance, many approaches to parallelism have been found to not<br />
exhibit data locality by nature.<br />
Index Terms—Finite Element Method, incomplete Cholesky preconditioner, matrix assembly, mesh generation, multi-core<br />
processor, sparse matrix-vector multiplication.<br />
structure (i.e. maximum number <strong>of</strong> non-zeros per row and<br />
I. INTRODUCTION<br />
the average number <strong>of</strong> non-zeros per row) has been<br />
examined. The resulting matrices are shown in TABLE I.<br />
The matrix naming convention used is an indicator <strong>of</strong> the<br />
problem, element mesh size and the type <strong>of</strong> finite element<br />
formulation applied. For instance, BDC-1-0.07, indicates<br />
that the matrix is generated from the BDC problem, and a<br />
first order (i.e. 1) nodal formulation has been applied on a<br />
mesh where the maximum size <strong>of</strong> any triangular element<br />
is 0.07mm, while BDC-0-1 denotes a matrix that was<br />
assembled by applying an edge element formulation on a<br />
mesh where the maximum size <strong>of</strong> any triangular element<br />
is 1mm.<br />
Further, an initial 3D mesh <strong>of</strong> a transformer (ET)<br />
model has been refined multiple times and a first-order<br />
nodal finite element formulation has been applied on each<br />
<strong>of</strong> the refined meshes. The resulting matrices are denoted<br />
by ET and are shown in TABLE I.<br />
The introduction <strong>of</strong> the multi-core processor by IBM<br />
(i.e. the POWER4) in 2001, and later by Intel and AMD,<br />
has rekindled the interest in using parallel computing to<br />
accelerate computations in an electromagnetic (EM) field<br />
simulation s<strong>of</strong>tware running on a desktop computer.<br />
Since then, a considerable amount <strong>of</strong> research effort has<br />
been invested in investigating the methods and kernels<br />
executed in field simulation s<strong>of</strong>tware; these include mesh<br />
generation, matrix assembly, sparse matrix-vector<br />
multiplication (SMVM) and iterative solver<br />
preconditioning techniques such as incomplete LU<br />
factorization (ILU). Despite having achieved a degree <strong>of</strong><br />
performance gain, several shortcomings have reduced the<br />
effectiveness <strong>of</strong> those techniques in achieving the<br />
ultimate performance goal <strong>of</strong> a field analysis s<strong>of</strong>tware,<br />
which is the reduction <strong>of</strong> the overall time to design a<br />
device. These impediments include the problem size and<br />
structure as well as the architecture <strong>of</strong> the multi-core<br />
processor.<br />
This paper intends to illustrate the degree <strong>of</strong><br />
parallelism which might be expected in each <strong>of</strong> the design<br />
and analysis stages <strong>of</strong> a process based around the finite<br />
element method (FEM), in addition to discussing several<br />
issues and bottlenecks that arise while exploiting<br />
parallelism on a multi-core processor. In particular, it is<br />
intended to examine the gains due to parallelism on<br />
realistic electromagnetic design examples, i.e. a 2D<br />
brushless DC motor model and a 3D transformer model.<br />
II. METHODOLOGY<br />
An initial 2D mesh <strong>of</strong> a brushless DC (BDC) motor<br />
model has been refined multiple times, by setting an<br />
upper limit on the area <strong>of</strong> the elements in each refinement<br />
step, in order to create a range <strong>of</strong> typical meshes and<br />
mesh sizes. Subsequently, first order and second order<br />
nodal formulations, in addition to an edge formulation<br />
have been applied on each mesh and a matrix has been<br />
assembled in each case. The effect <strong>of</strong> applying different<br />
formulations and mesh sizes on the matrix size (i.e.<br />
degrees <strong>of</strong> freedom and number <strong>of</strong> non-zeros) and matrix<br />
TABLE I<br />
MATRICES PROPERTIES<br />
Matrix DOF NNZ Ave.<br />
(Max)<br />
nnz/row<br />
CSR size<br />
(MB)<br />
BDC-1-0.5 38,084 259,188 7 (12) 3.2<br />
BDC-1-0.07 1,194,044 8,334,798 7 (22) 100<br />
BDC-1-0.04 3,152,216 22,000,128 7 (33) 264<br />
BDC-2-3 48,031 407,733 9 (37) 5<br />
BDC-2-0.07 4,787,651 40,664,669 9 (43) 484<br />
BDC-2-0.04 12,660,592 107,560,044 9 (60) 1,280<br />
BDC-0-1 55,772 278,168 5 (5) 3.4<br />
BDC-0-0.07 3,492,389 17,931,763 5 (5) 219<br />
BDC-0-0.04 9,492,389 47,437,511 5 (5) 579<br />
ET-1-0.08 38,234 549,047 15 (36) 6.4<br />
ET-1-0.04 409,531 5,999,230 15 (31) 70<br />
ET-1-0.01 1,975,427 28,927,159 15 (39) 339<br />
Subsequently, the parallel performance and bottlenecks<br />
encountered in an efficient implementation <strong>of</strong> important<br />
FEM kernels, particularly matrix assembly, sparse<br />
matrix-vector multiplication, and preconditioning<br />
techniques based on incomplete LU factorizations, are<br />
investigated.
III. PARALLEL MATRIX ASSEMBLY<br />
The process <strong>of</strong> matrix assembly is not considered to be<br />
time consuming. It is an process, since it consists <strong>of</strong><br />
iterating once over all mesh elements. For each element,<br />
two operations are performed. The first is to approximate<br />
the solution <strong>of</strong> the field within each element which would<br />
result in a dense matrix structure for each mesh<br />
element e where u depends upon the formulation and the<br />
number <strong>of</strong> unknowns in an element. The second operation<br />
is to map each entry <strong>of</strong> the dense matrix to a global<br />
matrix A. The latter step constitutes a significant portion<br />
<strong>of</strong> the total assembly cost mainly because the global<br />
matrix A is sparse. Inserting and updating entries in a<br />
sparse matrix, even when its structure is a priori known,<br />
is not trivial, such is the case when using compressed<br />
sparse row (CSR).<br />
In the case <strong>of</strong> matrix assembly in FEM, the maximum<br />
number <strong>of</strong> non-zeros in any row can be roughly estimated<br />
since it depends on the FEM formulation used. In such a<br />
case, a more suitable choice <strong>of</strong> a sparse storage than the<br />
CSR is to use the ELLPACK sparse storage scheme [1].<br />
The ELLPACK sparse format stores a sparse matrix into two dense data structures<br />
(ELL_values and ELL_column_ind) as shown in Figure 1.<br />
ELL_values stores the values <strong>of</strong> non-zeros in each row in<br />
a condensed form and pads the remaining spaces with<br />
zeros. ELL_column_ind stores the column index <strong>of</strong> each<br />
corresponding non-zero in the ELL_values and “-1” for<br />
the padded non-zeros. The size W corresponds to the<br />
maximum number <strong>of</strong> non-zeros per row. When the<br />
number <strong>of</strong> non-zeros per row is less than W, zeros are<br />
padded to fill the remaining locations.<br />
Mutex<br />
objects<br />
Values per<br />
row counter<br />
00 01<br />
2 00 01 0 0 0 1 1 1<br />
11 14<br />
2 11 14 0 0 1 4 1 1<br />
20 22 25<br />
3 20 22 25<br />
0<br />
0 2 5 1<br />
32 33 35<br />
3 32 33 35<br />
0<br />
2 3 5 1<br />
44 45<br />
2 44 45 0 0 4 5 1 1<br />
50 51 52 55<br />
4 50 51 52 55 0 1 2 5<br />
nxn sparse matrix nxw values nxw<br />
column indices<br />
Figure 1: Synchronized ELLPACK sparse storage.<br />
ELLPACK sparse format<br />
ELL_values ELL_colum_ind<br />
The performance <strong>of</strong> parallel matrix assembly using<br />
atomic operations on multi-core processors has been<br />
investigated. Mutual exclusion (mutex) objects from the<br />
POSIX threads (Pthread) library were used to<br />
synchronize access <strong>of</strong> multiple threads to a shared<br />
resource, which in our case, is the matrix A. For this<br />
purpose, an array <strong>of</strong> mutex objects was created where<br />
each object corresponds to a row in the global matrix as<br />
shown in Figure 1. Typically, in order for a thread to add<br />
or modify entries on a row <strong>of</strong> the global matrix, it must<br />
acquire a lock on the mutex object corresponding to that<br />
row. After the thread finishes its modifications, it releases<br />
the lock to make it available for other threads. For<br />
example, a thread that is assembling an element <strong>of</strong> 3<br />
unknowns (1, 2, 3) must aggregate the total <strong>of</strong> <br />
entries in the global matrix. Each 3 <strong>of</strong> these entries is<br />
added onto the same row <strong>of</strong> the global matrix; hence, a<br />
- 255 - 15th IGTE Symposium 2012<br />
total <strong>of</strong> 3 locks are required on 3 different mutex objects.<br />
This is illustrated in Algorithm 1 (line 6).<br />
Algorithm 1: Parallel matrix assembly using atomic operations.<br />
Figure 2 shows the runtimes in seconds <strong>of</strong> the parallel<br />
assembly <strong>of</strong> 3 matrices using a first-order nodal finite<br />
element formulation on a quad-core Intel i7 processor.<br />
The sequential runtimes are small (a few seconds) despite<br />
the fact that these matrices are considered to be those <strong>of</strong><br />
realistic average size problems. The runtimes were<br />
reduced by more than 50% relative to 1-thread execution<br />
when the number <strong>of</strong> threads was 4. Notice the difference<br />
in runtimes between sequential execution (no<br />
synchronization) shown in horizontal lines and runtimes<br />
<strong>of</strong> 1 thread. This difference highlights the cost <strong>of</strong> calling<br />
the Pthread Application Programming Interface (API)<br />
times. The overhead <strong>of</strong> calling a Pthread API<br />
although it appears to be large in here, is not the main<br />
concern in multi-threaded applications. Instead it is the<br />
wait time that could incur when a thread is waiting for a<br />
mutex object to be released by another thread. In matrix<br />
assembly, this occurs when threads are simultaneously<br />
processing mesh elements that share vertices and edges.<br />
In the case <strong>of</strong> FEM, the possibility <strong>of</strong> threads waiting to<br />
acquire a lock is small since the number <strong>of</strong> shared<br />
vertices or edges is low; it is related to the average<br />
number <strong>of</strong> non-zeros per row.<br />
Figure 2: Parallel matrix assembly timings in seconds on an<br />
Intel quad-core i7-860 processor.
A. Parallel Matrix Assembly Synchronization and<br />
Cache Data Locality<br />
The time it has taken to complete the matrix assembly<br />
process in the previous experiments was very small (only<br />
a few seconds), hence, it was not possible to accurately<br />
measure the total time spent on synchronization (i.e.<br />
calling the Pthread API and waiting to acquire a mutex<br />
lock). Instead, Intel’s VTune Amplifier [2] was used to<br />
count the number <strong>of</strong> execution cycles spent on<br />
synchronization. In the case <strong>of</strong> matrix assembly using 1<br />
thread, this number constituted around 9% <strong>of</strong> the total<br />
cycles spent on matrix assembly (see Figure 3). This<br />
number reflects only the time to call the Pthread API,<br />
since there was no time or cycles wasted waiting to<br />
acquire a lock (no other threads were competing to<br />
acquire a lock). When using 4 threads, more cycles were<br />
halted during synchronization, and in this case the<br />
percentage <strong>of</strong> time wasted increased to 24% (see Figure<br />
4).<br />
91%<br />
Matrix assembly execution cycles<br />
Pthreads Lock / Unlock execution cycles<br />
Figure 3: Execution cycles <strong>of</strong> matrix assembly using 1 thread.<br />
76%<br />
Figure 4: Execution cycles <strong>of</strong> matrix assembly using 4 threads.<br />
IV. SPARSE MATRIX-VECTOR MULTIPLICATION<br />
It is well established that matrix-vector multiplication<br />
( ) exhibits a low floating-point operations<br />
(FLOP) count to memory access ratio, regardless <strong>of</strong><br />
whether A is dense or sparse [3, 4]. This low ratio <strong>of</strong><br />
FLOP/BYTE makes SMVM a memory bandwidth<br />
limited problem requiring the use <strong>of</strong> optimization<br />
techniques which efficiently use the memory hierarchy<br />
system (main memory, caches and registers).<br />
The experiments conducted and presented in this<br />
section aim at analyzing both the effectiveness and the<br />
limitation <strong>of</strong> the commonly used SMVM optimization<br />
techniques when applied on the matrix set described in<br />
TABLE I.<br />
Instead <strong>of</strong> using the ELLPACK storage described<br />
above which could incur a large number <strong>of</strong> padded zeros<br />
in matrices arising from a high order finite element<br />
formulation, a variation <strong>of</strong> this storage, known as the<br />
Hybrid (HYB) storage, is used instead. In this storage<br />
scheme, some <strong>of</strong> the non-zeros are stored in a coordinate<br />
list format (COO) so as to minimize the number <strong>of</strong><br />
padded zeros in the ELLPACK storage as illustrated in<br />
9%<br />
Matrix assembly execution cycles<br />
Pthreads Lock / Unlock execution cycles<br />
24%<br />
- 256 - 15th IGTE Symposium 2012<br />
Figure 5.<br />
00 01<br />
11 14<br />
20 22 25<br />
40<br />
41<br />
32 33 35<br />
44 45<br />
50 51 52 55<br />
ELLPACK<br />
sparse format<br />
ELL_values ELL_colum_ind<br />
00 01<br />
11 14<br />
0<br />
0<br />
0<br />
1<br />
1<br />
4<br />
1<br />
1<br />
Coordinate (COO) list<br />
sparse format<br />
20 22 25 0 2 5<br />
COO_values 45 55<br />
32 33 35 2 3 5 COO_row_ind 4 5<br />
40 41 44 0 1 4 COO_col_ind 5 5<br />
50 51 52 0 1 2<br />
Fillin<br />
Nonzeros stored in<br />
COO<br />
Figure 5: Hybrid (HYB) storage scheme. Some non-zeros are<br />
stored in a coordinate list format (COO) in order to reduce the<br />
total number <strong>of</strong> padded zeros.<br />
In order to evaluate the magnitude <strong>of</strong> the impact <strong>of</strong><br />
accessing X on SMVM performance, the multiplication<br />
by X[column] was replaced by X[i] (Algorithm 2, line 7).<br />
Although this multiplication yielded an incorrect result,<br />
the aim was to show an upper bound on performance gain<br />
in cache blocking (i.e. no cache misses on X).<br />
Algorithm 2: Modified SMVM to eliminate the effect <strong>of</strong> cache<br />
misses on .<br />
Figure 6: BDC-1: SMVM performance when using cache<br />
blocking on .<br />
Eliminating the cache misses <strong>of</strong> has increased the<br />
performance <strong>of</strong> SMVM significantly (as anticipated)<br />
when the matrix was unstructured (i.e. BDC-1) as shown<br />
in Figure 6 (Natural ordering). To further validate the<br />
results, the set <strong>of</strong> matrices in TABLE I (BDC-1) were<br />
ordered to reduced their bandwidth using the Reverse
Cuthill-McKee (RCM) technique [5]. When the matrices<br />
were ordered using RCM, the performance <strong>of</strong> SMVM<br />
using cache blocking was close to the performance <strong>of</strong><br />
SMVM without cache blocking (Figure 6), since cache<br />
misses were reduced due to the ordered access pattern on<br />
.<br />
A. Loop Setup Overhead<br />
One <strong>of</strong> the factors that has been argued to be contributing<br />
to reducing the performance <strong>of</strong> SMVM is the low number<br />
<strong>of</strong> non-zeros per row[6]. For each row <strong>of</strong> the matrix A,<br />
the inner loop <strong>of</strong> the SMVM code, whether using the<br />
CSR storage (as shown in line 5 <strong>of</strong> Algorithm 2) or using<br />
the HYB storage, iterates over the row's non-zeros and<br />
multiplies them by the corresponding entries in . When<br />
only a few non-zeros are present, the inner loop setup<br />
overhead time would dominate the calculation time and<br />
would not be able to be amortized over the short<br />
calculation time <strong>of</strong> a few non-zeros. Since the set <strong>of</strong> FEM<br />
matrices used in this work falls within this category (i.e.<br />
low per row) a test examining the degradation <strong>of</strong><br />
the SMVM performance due to the inner loop setup<br />
overhead has been carried out by replacing the inner loop<br />
<strong>of</strong> SMVM with a set <strong>of</strong> instructions which explicitly<br />
multiply each element <strong>of</strong> by its corresponding element<br />
in ; this technique is <strong>of</strong>ten referred to as “loop<br />
unrolling”. “Loop unrolling” has been made possible by<br />
the use <strong>of</strong> the ELLPACK (or Hybrid) sparse format since<br />
the number <strong>of</strong> non-zeros per row is fixed, hence the<br />
number <strong>of</strong> times an inner loop executes its inner<br />
instruction is fixed. In such a case, the inner loop can be<br />
eliminated and the instruction within the inner loop can<br />
be replaced by explicitly writing the set <strong>of</strong> instructions<br />
that would have been executed by the inner loop.<br />
Algorithm 3 illustrates a sparse matrix-vector<br />
multiplication using the ELLPACK storage. Assuming<br />
that the width <strong>of</strong> the ELLPACK storage is 7, the inner<br />
loop which multiplies the non-zeros <strong>of</strong> a row by the<br />
corresponding locations in is replaced by seven<br />
instructions. The effect <strong>of</strong> this technique on the<br />
performance <strong>of</strong> SMVM when applied on BDC-1 matrix<br />
test set is shown in Figure 7. It can be seen that while loop<br />
unrolling did increase SMVM performance, it was not as<br />
significant as the performance gain obtained from<br />
eliminating cache misses on .<br />
Algorithm 3: SMVM loop unrolling using NVIDIA's Hybrid<br />
sparse storage.<br />
- 257 - 15th IGTE Symposium 2012<br />
Figure 7: BDC-1: Loop unrolling and cache blocking (singleprecision<br />
floating-point operations).<br />
B. SMVM memory bandwidth<br />
Figure 8 shows the memory bandwidth when executing<br />
SMVM using different optimization techniques. The<br />
sustainable memory bandwidth obtained from executing<br />
the STREAM benchmark [7] on an Intel i7-860 processor<br />
is also shown on the same figure. The widely used<br />
STREAM benchmark serves as an indicator <strong>of</strong> the<br />
realistic performance <strong>of</strong> the memory subsystem <strong>of</strong> a<br />
particular processer. In this benchmark, a set <strong>of</strong> kernels is<br />
applied on a dense data structure chosen to be larger than<br />
the available cache <strong>of</strong> a particular processor.<br />
Figure 8: BDC-1: SMVM sustainable memory bandwidth<br />
(MB/s) on Intel i7-860 processor.<br />
In general, a naïve implementation <strong>of</strong> SMVM (i.e. no<br />
optimization) would work well below 50% <strong>of</strong> the<br />
STREAM benchmark sustained memory bandwidth,<br />
while an optimized SMVM (with cache blocking)<br />
attained 70% <strong>of</strong> the STREAM benchmarks. The<br />
implication <strong>of</strong> these results highlights the effect <strong>of</strong> using<br />
sparse storage, which introduces additional memory<br />
fetches due to indirect addressing which also prevents<br />
efficient memory pre-fetching by the processor. A similar<br />
observation has been found when running the same<br />
experiments on an older generation <strong>of</strong> quad-core
processor; AMD’s dual-socket, dual-core Opteron 2214<br />
processor.<br />
C. Parallel SMVM<br />
This section compares the sequential and parallel<br />
performance <strong>of</strong> SMVM kernels when applied to matrices<br />
obtained from the set described in TABLE I and a<br />
miscellaneous matrix test set obtained from “the<br />
<strong>University</strong> <strong>of</strong> Florida Sparse Matrix Collection” [8]<br />
shown in TABLE II. The latter set has been widely used in<br />
the past few years by researchers to evaluate the<br />
performance <strong>of</strong> SMVM algorithms. The results <strong>of</strong> our<br />
evaluation are shown in Figure 9.<br />
TABLE II<br />
MISCELLANEOUS MATRIX TEST SET<br />
Matrix DOF NNZ Ave. (Max)<br />
nnz/row<br />
CSR<br />
size<br />
(MB)<br />
Protein 36,417 4,344,765 120 (204) 50<br />
Sphere 83,334 6,010,480 73 (81) 69<br />
Cant. 62,451 4,007,383 65 (78) 46<br />
Tunnel 217,918 11,524,432 53 (180) 133<br />
CFD 46,835 2,374,001 50 (145) 27<br />
Ship. 140,874 7,813,404 26 (68) 42<br />
Econ. 206,500 1,273,389 7 (74) 16<br />
Epidem. 525,825 2,100,225 4 (4) 26<br />
Circuit 170,998 958,936 6 (353) 12<br />
The following observations were concluded from the<br />
results shown in Figure 9:<br />
The sequential performance <strong>of</strong> SMVM kernels when<br />
the size <strong>of</strong> a matrix fits in the available processor<br />
cache is significantly higher than when the matrix<br />
does not fit in the cache (e.g. BDC-1-0.5 and ET-0-<br />
0.5).<br />
Figure 9: Parallel SMVM using HYBRID storage (doubleprecision<br />
floating-point operations)<br />
- 258 - 15th IGTE Symposium 2012<br />
Matrices that have a high percentage number <strong>of</strong> nonzeros<br />
per row attained higher GFLOPS than matrices<br />
with short row lengths. This is not due to the overhead<br />
caused by the inner-loop <strong>of</strong> SMVM (as demonstrated<br />
in section III.A), but to the ratio <strong>of</strong> the DOF and<br />
NNZ. In general, matrices arising in FEM have high<br />
ratios <strong>of</strong> DOF over NNZ, which explains the low<br />
performance relative to other matrices. This explains<br />
also why matrices obtained from 3D first-order finite<br />
element analysis attained higher GFLOPS than<br />
matrices obtained from 2D analysis.<br />
The performance <strong>of</strong> parallel SMVM is affected by the<br />
distribution <strong>of</strong> non-zeros in a row. Matrices arising<br />
from FEM have a balanced distribution <strong>of</strong> the number<br />
<strong>of</strong> non-zeros per row, leading to better thread<br />
utilization and subsequently to higher GFLOPS.<br />
V. PRECONDITIONING TECHNIQUES:INCOMPLETE<br />
CHOLESKY AND INCOMPLETE CHOLESKY WITH FILL-INS.<br />
There are two techniques to solve a system <strong>of</strong> linear<br />
equations where is the coefficient matrix and <br />
is the right hand side vector. The first is to use direct<br />
solver methods and the second is to use iterative methods.<br />
The direct solver methods rely on decomposing the<br />
coefficient matrix, , into upper and lower triangular<br />
matrices and, where . This is a robust<br />
method. However, it is not useful for large systems, since<br />
the triangular matrices L and U lose their sparsity, as zero<br />
entries in the coefficient matrix turn into non-zero<br />
entries in and . Those new entries are referred to as<br />
fill-ins.<br />
A less robust technique is based on iterative<br />
approaches, such as the conjugate gradient method (CG).<br />
This method requires a large number <strong>of</strong> iterations over<br />
the system <strong>of</strong> linear equations to reach the solution. The<br />
number <strong>of</strong> iterations depends upon the condition number<br />
<strong>of</strong> the matrix in . This condition number can be<br />
reduced (i.e. leading to less CG iterations) if a<br />
preconditioner that is based on the incomplete<br />
factorization <strong>of</strong> is applied to the CG method[9].<br />
Incomplete factorization derives its name from the<br />
direct method discussed above. It uses the same<br />
elimination algorithm to decompose the matrix into an<br />
and , which are an approximation <strong>of</strong> and,<br />
obtained by dropping some fill-in entries. One <strong>of</strong> the<br />
dropping strategies during ILU factorization is to drop all<br />
fill-ins so that the sparsity <strong>of</strong> and matches that <strong>of</strong> the<br />
original matrix A. This dropping rule gives rise to an<br />
ILU(0) or IC(0) (incomplete Cholesky in the case <strong>of</strong><br />
symmetric matrices) preconditioner [10], where the zero<br />
denotes that no fill-ins are allowed. Incomplete Cholesky<br />
with no fill-ins has been the preconditioner <strong>of</strong> choice on a<br />
desktop computer mainly due to its ability to reduce the<br />
number <strong>of</strong> iterations <strong>of</strong> a PCG while being inexpensive to<br />
produce and to compute on a desktop computer. The<br />
structures <strong>of</strong> the factors and are a priori known,<br />
making it easy to pre-allocate the storage requirement,<br />
without the need for symbolic factorization. An efficient<br />
implementation would be to duplicate the lower part <strong>of</strong> A<br />
and then perform an in-place factorization by going in an
ordered manner over the entries <strong>of</strong> each row. A very<br />
efficient implementation is found in the SparseLib++<br />
library [11]. TABLE III shows the execution times <strong>of</strong><br />
creating an IC(0) preconditioner.<br />
Matrix Degrees <strong>of</strong><br />
freedom<br />
TABLE III<br />
INCOMPLETE CHOLESKY PERFORMANCE<br />
Upper<br />
triangle<br />
NNZ<br />
CSR size<br />
(MB)<br />
IC(0) time<br />
(sec.)<br />
BDC-1-0.5 38,084 147,636 1.9 0.0275<br />
BDC-1-0.1 632,883 2,521,428 31.3 0.4987<br />
BDC-1-0.04 3,152,216 12,576,171 155 2.594<br />
ET-0.08 38,324 293,643 3.5 0.1359<br />
ET-0.04 409,531 3,204,372 38.2 1.554<br />
ET-0.01R 2,666,039 21,100,983 252 10.3244<br />
In order to improve the convergence rate <strong>of</strong> PCG<br />
beyond that provided by using the IC(0) preconditioner,<br />
much research has focused on extending the idea <strong>of</strong> the<br />
incomplete Cholesky preconditioner by allowing fill-ins<br />
to occur. There are two heuristics used to control the<br />
amount <strong>of</strong> fill-in. The first is based on a drop tolerance<br />
criterion, known as the Incomplete LU Threshold (ILUT)<br />
through which entries are dropped if their values are<br />
below a preset threshold. The second is based on the level<br />
<strong>of</strong> fill-in known as ILU, where symbolic factorization,<br />
using graph theory, is carried out to identify the locations<br />
<strong>of</strong> the fill-ins and their level in the graph. The fill-in<br />
entries that exceed a given level are dropped. Matrix<br />
elements are assigned a level 0, hence IC(0) discards all<br />
fill-ins and the resulting factorized matrix has the same<br />
sparsity pattern as the original matrix. One way to<br />
calculate the level <strong>of</strong> a fill-in is to use the sum rule as<br />
shown in (1). This rule gives rise to a symbolic<br />
factorization algorithm described by Hysom [12] that is<br />
amenable to parallelization. The sparsity <strong>of</strong> each row in<br />
the final preconditioner can be evaluated independently<br />
from the other rows. Figure 10 demonstrates the<br />
scalability <strong>of</strong> this algorithm. Despite that, the runtime <strong>of</strong><br />
the symbolic factorization is considered to be a<br />
bottleneck in our case mainly due to the large number <strong>of</strong><br />
fill-ins that incurred in the final ILU preconditioners<br />
(where =1, 2 or 3) as shown in TABLE IV.<br />
level(i, j) min {level(i, k) level(k, j) 1} (1)<br />
1hmin{i, j}<br />
- 259 - 15th IGTE Symposium 2012<br />
Figure 10: Execution times <strong>of</strong> parallel symbolic factorization <strong>of</strong><br />
BDC-1-0.1 where . The results demonstrate that the multithreaded<br />
implementation <strong>of</strong> Hysom’s algorithm is highly<br />
scalable.<br />
Matrix IC(0)<br />
TABLE IV<br />
FILL-INS<br />
ILU(1) ILU(2) ILU(3)<br />
BDC-1-0.5 148,636 225,244 321,101 429,566<br />
(51%) (116%) (189%)<br />
BDC-1-0.1 2,521,428 3,848,266 5,544,448 7,380,993<br />
(52%) (120%) (193%)<br />
ET-0.08 293,643 700,914 1,368,930 2,555,737<br />
(139%) (366%) (770%)<br />
ET-0.04 3,204,372 7,927,979 15,963,746 30,758,154<br />
(147%) (398%) (860%)<br />
VI. PRECONDITIONER BACKWARD-FORWARD<br />
SUBSTITUTION<br />
The next step is to investigate the degree <strong>of</strong> parallelism<br />
(i.e. the number <strong>of</strong> operations that can be executed<br />
simultaneously) that can be attained when solving a<br />
preconditioner (by backward and forward substitution)<br />
within a PCG iteration obtained from the matrix BDC-1-<br />
0.5 (i.e. a 2D problem) and the matrix ET-0.08 (i.e. a 3D<br />
problem). A histogram will be used to depict the<br />
maximum degree <strong>of</strong> parallelism and the number <strong>of</strong> steps<br />
required to solve each <strong>of</strong> the preconditioners. The x-axis<br />
shows the number <strong>of</strong> steps required to solve a matrix, and<br />
the y-axis <strong>of</strong> the histogram shows the number <strong>of</strong> rows<br />
that can be solved simultaneously at a given step. Figure<br />
11 and Figure 12 show the rows dependency histograms <strong>of</strong><br />
ILU(1) and ILU(3) <strong>of</strong> the matrix BDC-1-0.5 respectively.<br />
The maximum degree <strong>of</strong> parallelism <strong>of</strong> ILU(1) was 1,151<br />
and the number <strong>of</strong> steps required to solve the<br />
preconditioner was 196. On the other hand, the ILU(3)<br />
preconditioner <strong>of</strong> the same problem had a maximum<br />
degree <strong>of</strong> parallelism equal to 453 and 399 steps were<br />
required to solve it. The more fill-ins that existed in a<br />
preconditioner, the less parallelism could be exploited.<br />
Solving a preconditioner obtained from a 3D problem<br />
is less amenable to parallelism than that obtained from a<br />
2D problem. For instance, an ILU(1) preconditioner<br />
obtained from ET-0.08 (3D electric transformer problem)<br />
can be solved in 12,559 steps where the maximum<br />
number <strong>of</strong> rows that could be solved simultaneously is<br />
only 16 (see TABLE V) and an ILU(3) preconditioner <strong>of</strong><br />
the same problem can be solved in 24,125 steps where the
maximum attainable degree <strong>of</strong> parallelism is only 16 (see<br />
TABLE VI). A 2D problem that has the same number <strong>of</strong><br />
degrees <strong>of</strong> freedom as ET-0.08 (i.e. BDC-1-0.5) was<br />
more amenable to parallelism.<br />
1200<br />
1000<br />
800<br />
600<br />
400<br />
200<br />
0<br />
0 20 40 60 80 100 120 140 160 180 200<br />
Figure 11: Degree <strong>of</strong> parallelism (y-axis) attained when solving<br />
ILU(1) <strong>of</strong> BDC-1-0.5. The x-axis represents the number <strong>of</strong><br />
sequential steps to finish the solve stage.<br />
500<br />
450<br />
400<br />
350<br />
300<br />
250<br />
200<br />
150<br />
100<br />
50<br />
0<br />
0 50 100 150 200 250 300 350 400<br />
Figure 12: Degree <strong>of</strong> parallelism (y-axis) when solving ILU(3)<br />
<strong>of</strong> BDC-1-0.5. The x-axis represents the number <strong>of</strong> sequential<br />
steps to finish the solve stage.<br />
TABLE V<br />
ILU(1) OF ET-0.08<br />
Preconditioner NNZ Max. Solving<br />
level and ordering<br />
degree <strong>of</strong><br />
parallelism<br />
steps<br />
ILU(1)-Natrual 700,914 69 12,559<br />
ILU(1)-AMD 673,511 207 569<br />
ILU(1)-RCM 585,646 36 3,491<br />
TABLE VI<br />
ILU(3) OF ET-0.08<br />
Preconditioner NNZ Max. Solving<br />
level and ordering<br />
degree <strong>of</strong><br />
parallelism<br />
steps<br />
ILU(3)-Natrual 2,555,737 16 24,125<br />
ILU(3)-AMD 1,937,345 119 1,690<br />
ILU(3)-RCM 1,984,093 12 10,548<br />
Since the preconditioner obtained from the 3D<br />
transformer problem exhibited a low degree <strong>of</strong><br />
parallelism, approximate minimum degree (AMD) [13]<br />
and Reverse Cuthill–McKee (RCM) orderings were first<br />
applied on the ET-0.08 matrix before generating ILU(1)<br />
and ILU(3) preconditioners. Although, it has been<br />
established in the literature that orderings to reduce fillins<br />
or increase parallelism (RCM and AMD) degrade the<br />
quality <strong>of</strong> the preconditioner and lead to more PCG<br />
- 260 - 15th IGTE Symposium 2012<br />
iterations than when using Natural ordering [14], [15], the<br />
aim <strong>of</strong> this experiment was to only focus on the<br />
implication <strong>of</strong> ordering in terms <strong>of</strong> reducing the overall<br />
solver time.<br />
The number <strong>of</strong> non-zeros in the upper triangle<br />
preconditioner, the maximum degree <strong>of</strong> parallelism and<br />
the solving steps <strong>of</strong> both preconditioners ILU(1) and<br />
ILU(3) using different orderings are summarized in<br />
TABLE V and TABLE VI respectively. AMD ordering<br />
resulted in a relatively higher parallelizable<br />
preconditioner solver than Natural and RCM orderings.<br />
On the other hand, RCM exhibited a similar degree <strong>of</strong><br />
parallelism to that <strong>of</strong> the Natural ordering but required<br />
less solve steps. The reason being that RCM reduces the<br />
bandwidth <strong>of</strong> the matrix and balances the distribution <strong>of</strong><br />
non-zeros between rows. This implies that there is a<br />
balance in the degree <strong>of</strong> parallelism among steps, which<br />
will translate into balanced threads utilization.<br />
VII. CONCLUSION<br />
A. Matrix Structure<br />
Given the dependency <strong>of</strong> the sparse storage upon the<br />
problem structure, it is important to devise matrix test<br />
sets that are relevant to the problem domain (i.e. low<br />
frequency electromagnetic analysis using the Finite<br />
Element Method). One <strong>of</strong> the advantages <strong>of</strong> matrices<br />
generated in FEM is the absence <strong>of</strong> a large discrepancy in<br />
the number <strong>of</strong> non-zeros between rows. This enables the<br />
use <strong>of</strong> a sparse storage technique such as ELLPACK that<br />
pre-allocate memory to store the coefficient matrix.<br />
B. 2D vs 3D Analysis<br />
Matrices generated from 2D finite element analysis are<br />
less dense that those generated from 3D problems.<br />
SMVM attained more GFLOPS in the case <strong>of</strong> a denser<br />
matrix (i.e. first-order finite element formulation <strong>of</strong> a 3D<br />
problem). However, ILU preconditioner generated in the<br />
case <strong>of</strong> a 3D problem was less amenable to parallelism<br />
than a preconditioner <strong>of</strong> a 2D problem.<br />
C. Matrix Ordering<br />
Overall, although parallelism can be explored and<br />
exploited in most <strong>of</strong> the examined FEM kernels, the main<br />
bottleneck remains the solver part <strong>of</strong> the FEM process.<br />
There are many sub-kernels that are executed within a<br />
large number <strong>of</strong> loops. This places a stringent<br />
requirement on sparse data structures as there is no gain if<br />
these structures change in between sub-kernels. Further,<br />
in each sub-kernel within the solver, there is a large<br />
number <strong>of</strong> simple operations to be executed. The number<br />
<strong>of</strong> operations is related to the degrees <strong>of</strong> freedom <strong>of</strong> the<br />
matrix and the operations’ complexity is related to the<br />
number <strong>of</strong> non-zeros per row. These simple operations<br />
are memory bandwidth limited requiring that each<br />
operation be optimized in terms <strong>of</strong> memory access.<br />
Hence, single thread optimization remains the most<br />
essential part <strong>of</strong> the solver’s optimization.<br />
Reverse Cuthill-McKee (RCM) ordering has been<br />
found to be beneficial assuming that it will not degrade<br />
the performance <strong>of</strong> PCG. It balances threads utilization
when solving a preconditioner and also in enhances cache<br />
performance in SMVM.<br />
REFERENCES<br />
[1] R. G. Grimes, D. M. Young, and D. R. Kincaid, "ITPACK 2.0:<br />
User's Guide," CNA-150, Center for Numerical<br />
Analysis,<strong>University</strong> <strong>of</strong> Texas,Austin,Texas<br />
,August 1979.<br />
[2] Intel, "Intel VTune Amplifier XE 2011," 2011 ed: Intel<br />
Corporation, 2011.<br />
[3] S. Toledo, "Improving the memory-system performance <strong>of</strong> sparsematrix<br />
vector multiplication," IBM Journal <strong>of</strong> Research and<br />
Development, vol. 41, pp. 711-725, 1997.<br />
[4] R. Vuduc, "Automatic performance tuning <strong>of</strong> sparse matrix<br />
kernels," PhD thesis, <strong>University</strong> <strong>of</strong> California,Berkley, 2003.<br />
[5] E. Cuthill and J. McKee, "Reducing the bandwidth <strong>of</strong> sparse<br />
symmetric matrices," presented at the <strong>Proceedings</strong> <strong>of</strong> the 1969 24th<br />
national conference, 1969.<br />
[6] S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J.<br />
Demmel, "Optimization <strong>of</strong> sparse matrix-vector multiplication on<br />
emerging multicore platforms," Parallel Computing, vol. 35, pp.<br />
178-194, 2009.<br />
[7] J. D. McCalpin, "STREAM: Sustainable memory bandwidth in<br />
high performance computers," STREAM: Sustainable Memory<br />
Bandwidth in High Performance Computers, 1991.<br />
[8] T. A. Davis and Y. Hu, "The <strong>University</strong> <strong>of</strong> Florida sparse matrix<br />
collection," ACM Transactions on Mathematical S<strong>of</strong>tware (TOMS),<br />
vol. 38, p. 1, 2011.<br />
[9] H. A. Van der Vorst, "Preconditioning by incomplete<br />
decompositions," Ph.D. Thesis, <strong>University</strong> <strong>of</strong> Utrecht, 1982.<br />
[10] J. Meijerink and V. DER VORST, "An iterative solution method<br />
for linear systems <strong>of</strong> which the coefficient matrix is a symmetric<br />
M-matrix," Mathematics <strong>of</strong> Computation, vol. 31, pp. 148-162,<br />
1977.<br />
[11] R. Pozo and K. Remington, "Sparselib++ v. 1. 5 sparse matrix class<br />
library. reference guide," NASA, 1996.<br />
[12] D. Hysom and A. Pothen, "Level-based incomplete LU<br />
factorization: Graph model and algorithms," Preprint UCRL-JC-<br />
150789, US Department <strong>of</strong> Energy, p. 17, 2002.<br />
[13] P. R. Amestoy, T. A. Davis, and I. S. Duff, "An approximate<br />
minimum degree ordering algorithm," SIAM Journal on Matrix<br />
Analysis and Applications, vol. 17, pp. 886-905, 1996.<br />
[14] I. S. Duff and G. A. Meurant, "The effect <strong>of</strong> ordering on<br />
preconditioned conjugate gradients," BIT Numerical Mathematics,<br />
vol. 29, pp. 635-657, 1989.<br />
[15] S. Doi and T. Washio, "Ordering strategies and related techniques<br />
to overcome the trade-<strong>of</strong>f between parallelism and convergence in<br />
incomplete factorizations," Parallel Computing, vol. 25, pp. 1995-<br />
2014, 1999.<br />
- 261 - 15th IGTE Symposium 2012
- 262 - 15th IGTE Symposium 2012<br />
Diagnosis <strong>of</strong> real cracks from the three spatial<br />
components <strong>of</strong> the eddy current testing signals<br />
M. Rebican∗ , L. Janousek † ,M.Smetana † , T. Strapacova † ,A.Duca∗and G. Preda∗ ∗ <strong>University</strong> Politehnica <strong>of</strong> Bucharest, Spl. Independentei 313, Bucharest 060042, Romania<br />
† Faculty <strong>of</strong> Electrical Engineering, <strong>University</strong> <strong>of</strong> Zilina, Univerzitna 1, 010 26 Zilina, Slovakia<br />
E-mail: mihai.rebican@upb.ro<br />
Abstract—This paper presents a novel approach for diagnosis <strong>of</strong> real cracks from two-dimensional eddy current testing<br />
signals by means <strong>of</strong> a stochastic method, such as tabu search. A new testing probe driving uniformly distributed eddy<br />
currents is employed for the inspection. Three spatial components <strong>of</strong> the perturbation field due to partially conductive<br />
cracks are sensed as the response signals in order to enhance information level <strong>of</strong> the inspection. The signals are simulated<br />
by a fast forward FEM-BEM solver using a database. Two crack models are proposed for the inversion: a crack with cuboid<br />
shape and a crack with more complex shape. In the both cases, the cracks have uniform conductivity. The length, depth,<br />
width and conductivity <strong>of</strong> the crack are unknown in the inversion process. Numerical results <strong>of</strong> the 3D reconstruction <strong>of</strong><br />
partially conductive cracks from simulated 2D signals with added noise are presented and discussed.<br />
Index Terms—partially conductive cracks, diagnosis, eddy current testing, tabu search.<br />
I. INTRODUCTION<br />
Eddy current testing (ECT) is one <strong>of</strong> the most common<br />
electromagnetic methods employed in non-destructive<br />
evaluation <strong>of</strong> conductive materials. The principle <strong>of</strong><br />
ECT underlies in the interaction <strong>of</strong> induced eddy currents<br />
with a structure <strong>of</strong> an examined conductive body<br />
based on the electromagnetic induction phenomena. The<br />
method is widely applied in various fields accounting for<br />
measurements <strong>of</strong> material thickness, proximity measurements,<br />
corrosion evaluation, sorting <strong>of</strong> materials based on<br />
the electromagnetic properties. However, the most wide<br />
spread area <strong>of</strong> its application in present is the detection<br />
and possible diagnosis <strong>of</strong> discontinuities.<br />
Real cracks, such as stress corrosion cracks (SCC),<br />
usually appear in steam generator (SG) tubes <strong>of</strong> pressurized<br />
water reactor (PWR) <strong>of</strong> nuclear power plants.<br />
Recently, quite satisfactory results are reported by several<br />
groups for automated evaluation <strong>of</strong> artificial slits [1] and<br />
even for several parallel notches [2] using eddy current<br />
testing (ECT). However, evaluation <strong>of</strong> real cracks from<br />
ECT signals remains still very difficult.<br />
In the case <strong>of</strong> artificial EDM notches, the width is<br />
usually considered fixed in the inversion process <strong>of</strong> ECT<br />
signals. However, for cracks with non-zero conductivity<br />
the width affects the signal and it has to be considered<br />
unknown during reconstruction [3]. It means that the<br />
additional variable should be taken into account for<br />
evaluation <strong>of</strong> a detected SCC what considerably increases<br />
ill-posedness <strong>of</strong> the inverse problem [4]. Thus, many unsatisfactory<br />
results are reported when the automated procedures<br />
originally developed for non-conductive cracks<br />
are employed in the evaluation <strong>of</strong> SCCs. It is stated<br />
that one <strong>of</strong> the possible reasons is lack <strong>of</strong> sufficient<br />
information [1].<br />
Several studies <strong>of</strong> the authors focused on enhancing<br />
information level <strong>of</strong> eddy current testing signals and on<br />
decreasing uncertainty in evaluation [5], [6]. Promising<br />
results create new challenges concerning development <strong>of</strong><br />
automatic procedures for diagnosis <strong>of</strong> real cracks.<br />
In a previous work [2], the authors developed an<br />
algorithm for reconstruction <strong>of</strong> multiple artificial slits<br />
from ECT signals by means <strong>of</strong> a stochastic optimization<br />
methods, such as tabu search. The reconstruction <strong>of</strong> multiple<br />
cracks was a 3D one. Therefore, the scheme is also<br />
appropriate for reconstruction <strong>of</strong> a partially conductive<br />
crack, when the width is not considered fixed.<br />
The paper proposes a novel approach for the threedimensional<br />
reconstruction <strong>of</strong> partially conductive cracks<br />
from simulated two-dimensional ECT signals, consisting<br />
<strong>of</strong> all the three spatial components <strong>of</strong> the perturbation<br />
field. Two crack models are proposed for the inversion.<br />
The first one has a cuboid shape and the other reflects<br />
a more complex geometry. Both the crack models consider<br />
uniform distribution <strong>of</strong> the partial conductivity. The<br />
length, depth, width and conductivity <strong>of</strong> the cracks are<br />
unknown in the inversion process <strong>of</strong> the signals. The<br />
validity <strong>of</strong> the approach is proved using perturbed ECT<br />
signals by added noise in the inversion process.<br />
II. EDDY CURRENT TESTING PROBLEM DEFINITION<br />
A plate specimen having the electromagnetic parameters<br />
<strong>of</strong> a stainless steel SUS316L is inspected in this<br />
study. The specimen has a thickness <strong>of</strong> t =10mm, a<br />
conductivity <strong>of</strong> σ =1.35 MS/m and a relative permeability<br />
<strong>of</strong> μr =1.<br />
Figure 1 shows the configuration <strong>of</strong> the plate (region<br />
Ω0) with a single surface breaking crack (shadow zone)<br />
located inside the region Ω1. The crack region Ω1 (22 ×<br />
2 × 10 mm3 ) is uniformly divided into a grid composed<br />
from nx × ny × nz (11×5×10) cells defining a possible<br />
crack geometry. The dimensions <strong>of</strong> each cell are 2.0 mm<br />
in length, 0.4 mm in width, and 1.0 mm in depth.<br />
A new eddy-current probe proposed by the authors<br />
is employed for the near-side inspection <strong>of</strong> the plate<br />
[7]. It consists <strong>of</strong> two circular exciting coils positioned<br />
apart from each other and oriented normally regarding
2<br />
1<br />
1<br />
nz<br />
Crack<br />
n y<br />
Ω 1<br />
Probe<br />
1 2 nx<br />
y<br />
x<br />
Scanning<br />
Fig. 1. Configuration <strong>of</strong> the plate specimen with a crack.<br />
the plate surface. The circular coils are connected in<br />
series but magnetically opposite to induce uniformly<br />
distributed eddy currents in the plate. The exciting coils<br />
are supplied from a harmonic source with a frequency<br />
<strong>of</strong> 5kHz and the current density 1A/mm 2 . A detection<br />
system <strong>of</strong> the probe is composed <strong>of</strong> three small circular<br />
coils oriented along three axes perpendicularly to each<br />
other [5]. The detection system is located in the center<br />
between the exciting coils to gain high sensitivity as<br />
the direct coupling between the exciting coils and the<br />
detectors is minimal at this position.<br />
Figure 2 shows the configuration <strong>of</strong> the new probe.<br />
Dimensions <strong>of</strong> the detecting coils are as follows: an inner<br />
diameter <strong>of</strong> 1.2mm, an outer diameter <strong>of</strong> 3.2mm and a<br />
winding height <strong>of</strong> 0.8mm.<br />
Two-dimensional scanning, so called C-scan, is performed<br />
over the cracked surface with a lift-<strong>of</strong>f <strong>of</strong> 1mm.<br />
The real and imaginary parts <strong>of</strong> the induced voltages in<br />
all three detecting coils corresponding to three spatial<br />
components <strong>of</strong> the perturbation electromagnetic field are<br />
sensed and recorded during the inspection.<br />
III. PARTIALLY CONDUCTIVE CRACK MODELS<br />
Partially conductive cracks with a uniform conductivity<br />
smaller than the conductivity <strong>of</strong> the base material are<br />
considered in this paper. Two crack models are proposed<br />
for the partially conductive cracks.<br />
In the first model shown in Figure 3, the crack has a<br />
cuboid shape. The crack parameter vector c consists <strong>of</strong><br />
z<br />
23<br />
x<br />
plate<br />
35<br />
14<br />
Fig. 2. ECT probe configuration.<br />
exciting<br />
coils<br />
10<br />
detectors<br />
Ω 0<br />
- 263 - 15th IGTE Symposium 2012<br />
1<br />
1<br />
2<br />
n<br />
z<br />
2<br />
ny<br />
1 2 n<br />
x<br />
Fig. 3. Crack region division - cuboid shape <strong>of</strong> the crack (model 1).<br />
6integers,c =[ix1,ix2,iy1,iy2,iz,s], whereix1 and<br />
ix2 are the indices <strong>of</strong> the first and last cells <strong>of</strong> the crack<br />
along the length <strong>of</strong> crack, iy1 and iy2 are the indices <strong>of</strong><br />
the first and last cells <strong>of</strong> the crack along the width <strong>of</strong><br />
crack, iz is the number <strong>of</strong> cells <strong>of</strong> the crack along the<br />
depth <strong>of</strong> crack, and σc = s%σ (σc - the conductivity <strong>of</strong><br />
crack, σ - the conductivity <strong>of</strong> base material).<br />
In Figure 3, for a uniform grid with 13×5×10 cells, the<br />
parameter vector is c =[6, 13, 1, 3, 4, 20]. Thus, 8×3×4<br />
cells form the crack, and the crack conductivity is σc =<br />
20%σ.<br />
The second crack model shown in Figure 4 adopts<br />
a more complex shape. The crack depth is considered<br />
as variable along the crack length. The crack parameter<br />
vector c consists <strong>of</strong> nx +3 integers, c =<br />
[iz1,iz2,...,iznx,iy1,iy2,s], whereizk, k = 1,nx is<br />
the number <strong>of</strong> cells <strong>of</strong> the crack along the depth <strong>of</strong> crack,<br />
iy1 and iy2 are the indices <strong>of</strong> the first and last cells <strong>of</strong><br />
the crack along the width <strong>of</strong> crack, and σc = s%σ (σc<br />
- the conductivity <strong>of</strong> crack, σ - the conductivity <strong>of</strong> base<br />
material).<br />
In Figure 4, for a uniform grid with 13 × 5 × 10<br />
cells, the parameter vector contains 16 integers, as c =<br />
[0, 0, 0, 0, 0, 8, 4, 1, 2, 5, 3, 6, 4, 1, 3, 30]. Thus, (8+4+1+<br />
2+5+3+6+4)× 3 cells form the crack, and the crack<br />
conductivity is σc = 30%σ.<br />
In the both models, the cracks have the same orienta-<br />
1<br />
1<br />
2<br />
n<br />
z<br />
2<br />
ny<br />
1 2 n<br />
x<br />
Fig. 4. Crack region division - complex shape <strong>of</strong> the crack (model 2).
tion. The width <strong>of</strong> crack can have the values: 0.4, 0.8,<br />
1.2, 1.6, 2 mm.<br />
IV. DIAGNOSIS OF PARTIALLY CONDUCTIVE CRACKS<br />
The fast-forward FEM-BEM analysis solver using<br />
database [8], [9] is adopted here for the ECT signals simulation.<br />
Actually, a version <strong>of</strong> the algorithm <strong>of</strong> database<br />
upgraded by the authors in previous works [2], [10],<br />
for the computation <strong>of</strong> the ECT signals due to multiple<br />
cracks is used in this paper. The database is designed for<br />
a three-dimensional defect region, and not as usually for a<br />
two-dimensional one where the crack width is considered<br />
fixed. Thus, the ECT response signals can be simulated<br />
also for partially conductive cracks with variable width<br />
using the same database generated in advance.<br />
The authors have already developed an algorithm for<br />
the reconstruction <strong>of</strong> multiple cracks from ECT signals<br />
by means <strong>of</strong> a stochastic optimization method, such as<br />
tabu search [2]. The reconstruction <strong>of</strong> multiple cracks<br />
validated by experimental data was a 3D one. Therefore,<br />
the scheme is also appropriate for the reconstruction<br />
<strong>of</strong> a partially conductive crack, when the width is not<br />
considered constant. It is well known that the width<br />
significantly affects the signal for cracks <strong>of</strong> non-zero<br />
conductivity [3].<br />
Tabu search is employed for the three-dimensional<br />
diagnosis <strong>of</strong> a partially conductive crack [2]. The error<br />
function ε to be minimized is defined as:<br />
ε(c) = <br />
j=X,Y,Z<br />
N<br />
i=1<br />
|ΔVij(c) − ΔV m<br />
ij |2<br />
, (1)<br />
N<br />
i=1<br />
|ΔV m<br />
ij |2<br />
where c is the crack parameter vector <strong>of</strong> the crack,<br />
ΔVij(c) and ΔV m<br />
ij are the simulated (reconstructed) and<br />
true (measured) induced pick-up voltages <strong>of</strong> the coils<br />
(ECT signal) for each spatial component (X, Y and Z<br />
according to the coordinate system shown in Figures 1<br />
and 2) at the i-th scanning point respectively, and N is<br />
the number <strong>of</strong> scanning points.<br />
Figures 5-7 show the simulated ECT signal for each<br />
spatial component (X, Y, Z) caused by a partially conductive<br />
crack with a cuboid shape, which has the parameters:<br />
lc =6mm, wc =0.8mm, dc =4mm, σc =5%<strong>of</strong> σ.<br />
In this paper, the simulated signals are affected by<br />
added noise before the inversion process in order to prove<br />
the validity and robustness <strong>of</strong> the proposed approach. The<br />
perturbed signal is computed as:<br />
(ΔV m<br />
i )ns =ΔV m<br />
i (1 ± ns%), (2)<br />
where ΔV m<br />
i and (ΔV m<br />
i )ns are the initial and perturbed<br />
true signals at the i-th scanning point respectively, ns is<br />
a random value <strong>of</strong> an imposed maximum level, NOISE.<br />
Figure 8 shows the perturbed ECT signal for Z component<br />
when noise <strong>of</strong> maximum level 40% is added to<br />
the simulated signal shown in Figure 7.<br />
- 264 - 15th IGTE Symposium 2012<br />
Absolute voltage [mV]<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
1<br />
1.8<br />
1.6<br />
1.4<br />
1.2<br />
15<br />
10<br />
5<br />
-20 -15 0<br />
-10 -5 0<br />
-5 y [mm]<br />
5<br />
x [mm] 10 -10<br />
15 20-15<br />
Fig. 5. X component <strong>of</strong> the simulated ECT signal.<br />
Absolute voltage [mV]<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
1<br />
1.8<br />
1.6<br />
1.4<br />
1.2<br />
15<br />
10<br />
5<br />
-20 -15 0<br />
-10 -5 0<br />
-5 y [mm]<br />
5<br />
x [mm] 10 -10<br />
15 20-15<br />
Fig. 6. Y component <strong>of</strong> the simulated ECT signal.<br />
V. NUMERICAL RESULTS AND DISCUSSION<br />
The numerical simulations <strong>of</strong> the cracks reconstruction<br />
are performed using an ordinary PC: Intel Core 2 Quad<br />
2.4GHz, 3GB RAM.<br />
In Table I the numerical results <strong>of</strong> the reconstruction<br />
are presented, when a partially conductive crack is<br />
modeled as a cuboid shape (crack model 1, Figure 3).<br />
The column denoted ”Real” gives the true dimensions<br />
(lc × wc × dc) and partial conductivity (σc in % <strong>of</strong> σ)<br />
<strong>of</strong> the crack. The results <strong>of</strong> the diagnosis are provided<br />
in the column labelled as ”Reconstructed” for various<br />
maximum levels <strong>of</strong> the noise added to simulated ECT<br />
signals (2). The error function ε(c) (1) <strong>of</strong> the diagnosis<br />
are reported, too.<br />
The time required for reconstruction <strong>of</strong> one crack is<br />
approximately 90-120 minutes. When there is no noise
Absolute voltage [mV]<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
1<br />
1.8<br />
1.6<br />
1.4<br />
1.2<br />
15<br />
10<br />
5<br />
-20 -15 0<br />
-10 -5 0<br />
-5 y [mm]<br />
5<br />
x [mm] 10 -10<br />
15 20-15<br />
Fig. 7. Z component <strong>of</strong> the simulated ECT signal.<br />
Absolute voltage [mV]<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
1<br />
1.8<br />
1.6<br />
1.4<br />
1.2<br />
15<br />
10<br />
5<br />
-20 -15 0<br />
-10 -5 0<br />
-5 y [mm]<br />
5<br />
x [mm] 10 -10<br />
15 20-15<br />
Fig. 8. Z component <strong>of</strong> the perturbed ECT signal by added noise <strong>of</strong><br />
maximum level 40%.<br />
added to signal (shown in Figures 5-7), the reconstructed<br />
crack is the same with the real crack (the error function<br />
ε =0). In the case <strong>of</strong> diagnosis from signals perturbed<br />
by noise (maximum level NOISE =10, 20, 30, 40%),<br />
TABLE I<br />
RESULTS OF THE PARTIALLY CONDUCTIVE CRACK<br />
RECONSTRUCTION FOR THE CRACK MODEL 1<br />
Crack Real Reconstructed<br />
NOISE (%) 0 10 20 30 40 40<br />
lc [mm] 6.0 6.0 6.0 6.0 6.0 6.0 6.0<br />
wc [mm] 0.8 0.8 0.8 0.8 0.8 0.8 1.2<br />
dc [mm] 4.0 4.0 4.0 4.0 4.0 4.0 4.0<br />
σc[%] 5 5 5 5 4 4 9<br />
ε · 10 −3 - 0 11 21 32 42 45<br />
- 265 - 15th IGTE Symposium 2012<br />
the crack is exactly reconstructed even if the maximum<br />
level <strong>of</strong> noise is high (30, 40%). Starting from another<br />
initial population <strong>of</strong> tabu search, and for a perturbed<br />
signal by added noise <strong>of</strong> highest level, NOISE =40%<br />
(the Z component is shown in Figure 8), the result is<br />
slightly different (the last column in Table I): the length<br />
and depth are equal to the true values; the width and<br />
partial conductivity are estimated not exactly but with<br />
good precision. However, the last two parameters are not<br />
very important from structural integrity point <strong>of</strong> view.<br />
The results clearly show that the crack parameters are<br />
estimated quite precisely from the noisy ECT signals<br />
using the proposed approach.<br />
Figures 9 and 10 show the results <strong>of</strong> three-dimensional<br />
diagnosis <strong>of</strong> the partially conductive crack described in<br />
Table I (column ”Real”) from 2D ECT signals without<br />
and with added noise <strong>of</strong> maximum level 20%, respectively,<br />
when the complex crack model (Figure 4) is<br />
employed for the inversion. The inversion procedure<br />
takes around 5-7 hours.<br />
In the case <strong>of</strong> the reconstruction from signal without<br />
noise, the crack is precisely localized and also its length<br />
width<br />
depth<br />
real reconstructed<br />
σ c=5%σ<br />
σ c=3%σ,<br />
ε=0.004<br />
Fig. 9. Reconstruction <strong>of</strong> a conductive crack from signal without noise.<br />
width<br />
depth<br />
real reconstructed<br />
σ c=5%σ<br />
σ c=4%σ,<br />
ε=0.027<br />
Fig. 10. Reconstruction <strong>of</strong> a conductive crack from perturbed signal<br />
by added noise <strong>of</strong> maximum level 20%.
width<br />
depth<br />
real reconstructed<br />
σ c=8%σ σ c=6%σ,<br />
ε=0.024<br />
Fig. 11. Reconstruction <strong>of</strong> an elliptical conductive crack.<br />
and width are exactly estimated. The depth pr<strong>of</strong>ile does<br />
not perfectly copy the true one. However, the maximum<br />
depth is accurately assessed. But, for the reconstruction<br />
from signal with added noise <strong>of</strong> maximum level <strong>of</strong> 20%,<br />
the width is smaller with a minimum value <strong>of</strong> 0.4 mm<br />
than real width.<br />
A crack with elliptical pr<strong>of</strong>ile is also reconstructed<br />
from signal without noise. The crack opening has a value<br />
<strong>of</strong> wc =0.4 mm, its surface length is lc =14mm, the<br />
maximum depth is dc = 4mm and the crack partial<br />
conductivity is adjusted to σc =8%<strong>of</strong> the base material<br />
conductivity σ. The reconstruction result is shown<br />
in Figure 11. The crack width and its surface length<br />
are accurately assessed. The estimated crack position is<br />
minimally shifted (0.4mm) in the crack width direction<br />
comparing the true position. The maximum depth is<br />
slightly overestimated <strong>of</strong> 1mm. When the signal caused<br />
by the elliptical conductive crack is perturbed by added<br />
noise <strong>of</strong> maximum level <strong>of</strong> 20%, and then is used in<br />
reconstruction, the maximum depth is overestimated <strong>of</strong><br />
2mm, but the crack width and its surface length are<br />
precisely estimated, too.<br />
The presented results proved effectiveness <strong>of</strong> the proposed<br />
novel approach <strong>of</strong> three-dimensional diagnosis <strong>of</strong><br />
partially conductive cracks, even if cracks with complex<br />
shape and signals with added noise are considered. ECT<br />
response signals gained during C-scan together with<br />
acquiring all three spatial components <strong>of</strong> the perturbation<br />
electromagnetic field significantly improve the preciseness<br />
<strong>of</strong> inversion process using tabu search stochastic<br />
method.<br />
VI. CONCLUSION<br />
A novel approach for three-dimensional diagnosis <strong>of</strong><br />
partially conductive cracks has been proposed in the<br />
paper. A special eddy current probe driving uniformly<br />
distributed eddy currents was used for the inspection <strong>of</strong><br />
a plate specimen. A detection system <strong>of</strong> the probe was<br />
designed in such a way that all three spatial components<br />
<strong>of</strong> the perturbation electromagnetic field were acquired.<br />
- 266 - 15th IGTE Symposium 2012<br />
The tabu search stochastic method was employed for the<br />
reconstruction <strong>of</strong> partially conductive cracks pr<strong>of</strong>ile from<br />
eddy current response signals gained during the C-scan<br />
<strong>of</strong> the probe. The signals were perturbed by added noise.<br />
Two crack models were proposed: a crack with cuboid<br />
shape and the other one with more complex shape.<br />
The length, depth, width and conductivity <strong>of</strong> the crack<br />
were considered unknown in the inversion process. The<br />
conductivity <strong>of</strong> the crack was uniform.<br />
The presented results proved that the proposed approach<br />
allows quite precisely reconstructing threedimensional<br />
pr<strong>of</strong>ile <strong>of</strong> a crack together with its partial<br />
conductivity from signals with added noise.<br />
Further work <strong>of</strong> the authors will concern more realistic<br />
shapes <strong>of</strong> cracks and validation with measured data from<br />
natural cracks (SCC).<br />
ACKNOWLEDGEMENTS<br />
This work has been co-funded by the Sectoral Operational<br />
Programme Human Resources Development<br />
2007-2013 <strong>of</strong> the Romanian Ministry <strong>of</strong> Labour, Family<br />
and Social Protection through the Financial Agreement<br />
POSDRU/89/1.5/S/62557.<br />
This work was supported by the Slovak Research and<br />
Development Agency under the contracts No. APVV-<br />
0349-10 and APVV-0194-07, and by grants <strong>of</strong> the Slovak<br />
Grant Agency VEGA, projects No. 1/0765/11, 1/0927/11.<br />
REFERENCES<br />
[1] N. Yusa, “Development <strong>of</strong> computational inversion techniques to<br />
size cracks from eddy current signals,” Nondestructive testing and<br />
evaluation, vol. 24, pp. 39–52, 2009.<br />
[2] M. Rebican, Z. Chen, N. Yusa, L. Janousek, and K. Miya, “Shape<br />
reconstruction <strong>of</strong> multiple cracks from ECT signals by means <strong>of</strong><br />
a stochastic method,” IEEE Transactions on Magnetics, vol. 42,<br />
pp. 1079–1082, 2006.<br />
[3] Z. Chen, M. Rebican, N. Yusa, and K. Miya, “Fast simulation <strong>of</strong><br />
ECT signal due to a conductive crack <strong>of</strong> arbitrary width,” IEEE<br />
Transactions on Magnetics, vol. 42, pp. 683–686, 2006.<br />
[4] N. Yusa, H. Huang, and K. Miya, “Numerical evaluation <strong>of</strong> the illposedness<br />
<strong>of</strong> eddy current problems to size real cracks,” NDT&E<br />
International, vol. 40, pp. 185–191, 2007.<br />
[5] L. Janousek, M. Smetana, and K. Capova, “Enhancing information<br />
level in eddy-current non-destructive inspection,” International<br />
Journal <strong>of</strong> Applied Electromagnetics and Mechanics, vol. 33, pp.<br />
1149–1155, 2010.<br />
[6] L. Janousek, M. Smetana, and M. Alman, “Decreasing uncertainty<br />
in size estimation <strong>of</strong> stress corrosion cracking from eddy-current<br />
signals,” Studies in Applied Electromagnetics and Mechanics,<br />
vol. 35, pp. 53–60, 2011.<br />
[7] L. Janousek, M. Smetana, and M. Alman, “Decline in ambiguity<br />
<strong>of</strong> partially conductive cracks depth evaluation from eddy current<br />
testing signals,” International Journal <strong>of</strong> Applied Electromagnetics<br />
and Mechanics, vol. 34, 2012 (in press).<br />
[8] Z. Chen, K. Miya, and M. Kurokawa, “Rapid prediction <strong>of</strong> eddy<br />
current testing signals using A−φ method and database,” NDT&E<br />
International, vol. 32, pp. 29–36, 1999.<br />
[9] Z. Chen, K. Aoto, and K. Miya, “Reconstruction <strong>of</strong> cracks with<br />
physical closure from signals <strong>of</strong> eddy current testing,” IEEE<br />
Transactions on Magnetics, vol. 36, pp. 1018–1022, 2000.<br />
[10] M. Rebican, N. Yusa, Z. Chen, K. Miya, T. Uchimoto, and<br />
T. Takagi, “Reconstruction <strong>of</strong> multiple cracks in an ECT roundrobin<br />
test,” International Journal <strong>of</strong> Applied Electromagnetics and<br />
Mechanics, vol. 19, no. 1-4, pp. 399–404, 2004.
- 267 - 15th IGTE Symposium 2012<br />
An Adaptive Galaxy-Based Search Approach for<br />
Electromagnetic Optimization Problems<br />
* Θ Leandro dos Santos Coelho, Θ Teodoro Cardoso Bora and † Piergiorgio Alotto<br />
* Industrial and Systems Eng. Graduate Program, Pontifical Catholic <strong>University</strong> <strong>of</strong> Parana, Curitiba, PR, Brazil<br />
Θ Department <strong>of</strong> Electrical Engineering (PPGEE), Federal <strong>University</strong> <strong>of</strong> Parana (UFPR), Curitiba, PR, Brazil<br />
† Dip. Ingegneria Industriale, Università di Padova, Italy, E-mail: piergiorgio.alotto@dii.unipd.it<br />
Abstract—Optimization metaheuristics have become very popular methods for electromagnetic device design. The Galaxybased<br />
search algorithm (GBSA) is a recently proposed algorithm, inspired by the movement <strong>of</strong> the arms <strong>of</strong> spiral galaxies in<br />
outer space. In this work, a standard and an adaptive version <strong>of</strong> GBSA (AGBSA) based on historic knowledge are applied to<br />
an analytical testcase and to Loney’s solenoid benchmark problem, showing the suitability <strong>of</strong> this technique for<br />
electromagnetic optimization. Furthermore, both algorithmic variants are compared with other well-known stochastic<br />
optimizers.<br />
Index Terms— Electromagnetic optimization, Galaxy-based search algorithm, Loney’s solenoid.<br />
I. INTRODUCTION<br />
Optimization algorithms which include stochastic<br />
components are nowadays commonly classified as<br />
metaheuristics and many <strong>of</strong> them, e.g. Particle Swarm<br />
Optimization (PSO), Genetic Algorithms (GA) and<br />
Evolution Strategies (ES), Differential Evolution (DE),<br />
just to name a few well-known ones, are known to be<br />
powerful techniques for the solution <strong>of</strong> optimization<br />
problems related to the design <strong>of</strong> electromagnetic<br />
devices. Such methods have been studied extensively in<br />
the last decades with growing interest in recent years (see<br />
e.g. [1]-[4]).<br />
A recently introduced metaheuristic which has not yet<br />
received much attention in the electromagnetic<br />
optimization community and which is starting to show<br />
interesting performances in other application areas is the<br />
the Galaxy-based search algorithm (GBSA) [6],[7].<br />
GBSA is a nature-inspired optimization method which<br />
mimics the movement <strong>of</strong> the arms <strong>of</strong> spiral galaxies in<br />
outer space.<br />
The objective <strong>of</strong> this paper is to review the basic<br />
algorithmic features <strong>of</strong> the relatively uncommon GBSA<br />
optimizer and to present a modified and improved<br />
adaptive GBSA (AGBSA) variant. Both algorithms are<br />
then tested on Loney’s solenoid benchmark problem [5],<br />
which features a rough objective function surface typical<br />
<strong>of</strong> many electromagnetic problems in which the direct<br />
problem is solved by numerical methods.<br />
The rest <strong>of</strong> this paper is organized as follows. Section<br />
II provides a detailed description <strong>of</strong> the GBSA algorithm,<br />
while section III is devoted to the application <strong>of</strong> GBSA to<br />
a multiminima analytical test problem. In Section IV, we<br />
describe Loney’s solenoid benchmark problem and<br />
presents the optimization results for the GBSA and<br />
AGBSA algorithmic variants and comparisons with other<br />
metaheuristics, finally the paper concludes with a brief<br />
discussion in Section V.<br />
II. FUNDAMENTALS OF THE GBSA ALGORITHM<br />
GBSA searches the input space using a spiral chaotic<br />
movement approximating the behavior <strong>of</strong> one arm <strong>of</strong> a<br />
spiral galaxy. This movement is driven by a chaotic<br />
process using a logistic map [7]. The main steps <strong>of</strong><br />
GBSA are given in Fig. 1, where S represents the current<br />
solution. The algorithm consists <strong>of</strong> two main<br />
S ← GenerateInitialSolution<br />
S ← LocalSearch (S)<br />
While (termination condition is not met) do<br />
Flag ← False<br />
SpiralChaoticMove (S, Flag)<br />
If Flag then<br />
S ← LocalSearch (S)<br />
Endif Endif<br />
End while while<br />
Fig. 1. Pseudo code <strong>of</strong> classical GBSA.<br />
componentes which are repeated in sequence:<br />
SpiralChaoticMove, shown in Fig. 2, and LocalSearch,<br />
shown in Fig. 3. The SpiralChaoticMove has the role <strong>of</strong><br />
searching around the current solution denoted by S. When<br />
the SpiralChaoticMove procedure finds an improved<br />
solution, it updates S with the improved solution, and the<br />
variable Flag is set to true. When Flag is true, the<br />
LocalSearch component <strong>of</strong> GBSA is activated in order to<br />
locally search around the current optimal solution.<br />
The SpiralChaoticMove component is iterated for<br />
MaxRep times. However, whenever it finds a solution<br />
better than the current optimal solution,<br />
SpiralChaoticMove is terminated and the control <strong>of</strong> the<br />
algorithm is transferred to the main procedure <strong>of</strong> GBSA.<br />
The SpiralChaoticMove component searches the space<br />
around the current best solution using a spiral movement<br />
enhanced by a chaotic variable generated by the logistic<br />
map:<br />
(1)<br />
Where, λ = 4 and x 0 = 0.19 (a sample output for the<br />
case <strong>of</strong> two degrees <strong>of</strong> freedom is given in Fig. 4). It<br />
should be mentioned that the first 5000 iterations <strong>of</strong> the<br />
logistic map are discarded in order not to include in the<br />
generating sequence the transient motion leading to the<br />
chaotic attractor.<br />
The LocalSearch component <strong>of</strong> GBSA may either<br />
find a locally optimal solution or it will exceed the<br />
maximum number <strong>of</strong> iterations kMax without<br />
improvement.<br />
The SpiralChaoticMove component <strong>of</strong> GBSA is the
input:<br />
S, the current best solution (Si is the ith component<br />
i=1:N)<br />
output:<br />
SNext is the first found solution better than S.<br />
Flag if set to true indicates that a better solution has<br />
been found.<br />
parameters:<br />
Each θi is initialised by (–1 + 2 NextChaos()).<br />
Δθ =0.01.<br />
r =0.001.<br />
Δr is set by NextChaos() at each procedure call.<br />
MaxRep is the maximum number <strong>of</strong> local iterations in<br />
SpiralChaoticMove. (e.g. 100)<br />
θ = –π<br />
While rep < MaxRep<br />
For i = 1 to N<br />
SNexti ← Si + NextChaos() r cos(θi)<br />
End<br />
If (f(SNext) ≥ f(S)) then<br />
Flag ← true<br />
Return<br />
Endif<br />
For i = 1 to N<br />
SNexti ← Si - NextChaos() r cos(θi)<br />
End<br />
If (f(SNext) ≥ f(S)) then<br />
Flag ← true<br />
Return<br />
Endif<br />
r ← r + Δr<br />
For i = 1 to N<br />
θi ←θi +Δθ<br />
End<br />
For i = 1 to N<br />
If(θi > π) then<br />
θ ← –π<br />
Endif<br />
End<br />
rep←rep +1<br />
Endwhile<br />
mechanism which is used for exploring the search space<br />
Fig. 2. Pseudo-code <strong>of</strong> the SpiralChaoticMove component <strong>of</strong><br />
GBSA<br />
in order to find the promising area which may include the<br />
optimal solution. In contrast, the LocalSearch is the<br />
GBSA component which is used to explore the promising<br />
area to find within this area the minimum <strong>of</strong> the objective<br />
function. In summary, exploration is conducted by<br />
SpiralChaoticMove while exploitation is carried out by<br />
LocalSearch.<br />
Both exploration and exploitation mechanisms are<br />
necessary for the success <strong>of</strong> any metaheuristic: without<br />
the exploitation mechanism, the metaheuristic may not be<br />
able to obtain accurate solutions whereas without the<br />
exploration mechanism, the metaheuristic may get easily<br />
trapped into a local optimum.<br />
The advantage <strong>of</strong> using the LocalSeach component<br />
with respect to some other exploitation mechanisms, such<br />
as mutation typical <strong>of</strong> Genetic Algorithms, is that the<br />
- 268 - 15th IGTE Symposium 2012<br />
input:<br />
S, the current best solution (Si is the ith component<br />
i=1:N)<br />
output:<br />
SNext is a solution better than S<br />
parameters:<br />
ΔS is the step size<br />
α is a dynamic parameter .<br />
KMax is the maximum number <strong>of</strong> local iterations in<br />
LocalSearch. (e.g. 100).<br />
For i = 1 to N<br />
a←1<br />
k←0<br />
while k < kMax<br />
SLi ←Si –α·ΔS·NextChaos()<br />
SUi ←Si +α·ΔS·NextChaos()<br />
If f(SL) < f(S) and f(SU) < f(S) then<br />
Goto Endrepeat<br />
Endif<br />
If f(SU) > f(S) then<br />
Si ← SUi<br />
SLi ← Sui<br />
α ← α + 0.01 × NextChaos()<br />
k←k+1<br />
ElseIf f(SL) > f(S) then<br />
Si ← SLi<br />
SUi ← SLi α ← α + 0.01 × NextChaos()<br />
k←k+1<br />
Else<br />
α ← α + 0.05 × NextChaos()<br />
k←k+1<br />
Endif<br />
Endwhile<br />
SLi ← Si<br />
SRi ← Si Endrepeat<br />
SNext ← S<br />
proposed local search never allows the algorithm lose the<br />
current best solution, thus increasing the greadyness <strong>of</strong><br />
the algorithm.<br />
1<br />
0.9<br />
0.8<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
Fig. 3. Pseudo-code <strong>of</strong> the LoccalSearch component <strong>of</strong> GBSA<br />
0.2<br />
0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />
Fig. 4. Sample output <strong>of</strong> the logistic map for the two dimensional<br />
case
Moreover, also the SpiralChaoticMove never loses the<br />
best solution found so far. In addition, it uses chaotic<br />
movements in order to reduce the chance <strong>of</strong> getting<br />
trapped into local optima (although this may still happen<br />
as will be seen in the paragraph devoted to the analytical<br />
test case). Due to the chaotic process, GBSA does not<br />
return to the same solution and thus diversity <strong>of</strong> the found<br />
solutions is kept high. Keeping diversity high is<br />
obviously especially important in dealing with<br />
multimodal problems.<br />
The tuning <strong>of</strong> the step size ΔS in the LocalSearch<br />
procedure is a very delicate task in the classical GBSA<br />
and the convergence properties <strong>of</strong> the method strongly<br />
depend on its specific value, which is also problemdependent.<br />
The proposed AGBSA efficiently tunes the<br />
step size using history knowledge <strong>of</strong> mean distances for<br />
the best solution in the previous iteration. The use <strong>of</strong><br />
history knowledge is typical <strong>of</strong> cultural algorithms [9].<br />
III. ANALYTICAL BENCHMARK<br />
The analytical benchmark refers to minimization<br />
<strong>of</strong> the so-called six-hump camel back function<br />
/3+ <br />
. The function has features which are typical <strong>of</strong><br />
many real problems, i.e. a bowl-shaped large-scale<br />
behavior, shown in Fig. 5a, which incorporates a<br />
relatively flat plateau with a rather rough small-scale<br />
behavior with several local minima, shown in Fig. 5b.<br />
The function has two global minima, i.e. at [-<br />
0.089842, 0.712656] and [0.089842, -0.712656] with<br />
value f=-1.031628453 and an additional four local<br />
minima.<br />
f(x1,x2)<br />
f(x1,x2)<br />
−50<br />
2<br />
6<br />
5<br />
4<br />
3<br />
2<br />
1<br />
0<br />
−1<br />
2<br />
200<br />
150<br />
100<br />
50<br />
0<br />
1<br />
1<br />
0<br />
x2<br />
0<br />
x2<br />
−1<br />
−1<br />
−2<br />
−2<br />
−2<br />
−4<br />
Fig. 5: a) large-scale and b) small-scale behavior <strong>of</strong> the six-hump camel<br />
back function<br />
−1<br />
−2<br />
x1<br />
x1<br />
0<br />
0<br />
2<br />
1<br />
4<br />
2<br />
160<br />
140<br />
120<br />
100<br />
80<br />
60<br />
40<br />
20<br />
0<br />
5<br />
4<br />
3<br />
2<br />
1<br />
0<br />
- 269 - 15th IGTE Symposium 2012<br />
Fig 6. shows the position <strong>of</strong> the optimal<br />
solutions obtained over 30 runs <strong>of</strong> the algorithm with<br />
maximum number <strong>of</strong> objective function evaluation set to<br />
100 (Fig. 6a) and 500 (Fig. 6b).<br />
The picture clearly shows that even with a rather<br />
small number <strong>of</strong> function evaluations, the areas <strong>of</strong> the<br />
global minima are usually correctly identified by the<br />
algorithm, while a larger number <strong>of</strong> function evaluations<br />
allows a good precision. The ability <strong>of</strong> the algorithm to<br />
escape local minima is also clearly shown, since none <strong>of</strong><br />
the runs terminated in one <strong>of</strong> the four local minima at a<br />
higher number <strong>of</strong> function evaluations, while some<br />
trapping in local minima is to be seen with a lower<br />
number <strong>of</strong> function evaluations.<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
−0.2<br />
−0.4<br />
−0.6<br />
−0.8<br />
−1<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
−0.2<br />
−0.4<br />
−0.6<br />
−0.8<br />
−1<br />
−1.5 −1 −0.5 0 0.5 1 1.5<br />
−1.5 −1 −0.5 0 0.5 1 1.5<br />
Fig. 6: Optimal solution over 30 runs : a) max. 500 function evaluations b)<br />
max. 3000 function evaluations<br />
IV. LONEY’S SOLENOID DESIGN<br />
The objective in Loney’s solenoid benchmark problem<br />
[5] is to produce a uniform magnetic flux density B within<br />
a given interval on the axis <strong>of</strong> a main solenoid (Fig. 7). The<br />
problem is described by two degrees <strong>of</strong> freedom (the<br />
separation s and the length l <strong>of</strong> the correcting coils) both<br />
constrained by box bounds
Fig. 7. Axial cross-section <strong>of</strong> Loney’s solenoid (upper half-plane).<br />
Three different basins <strong>of</strong> attraction can be recognized in<br />
the domain <strong>of</strong> the objective function F with values <strong>of</strong> F ><br />
4·10 -8 (high level region: HL), 3·10 -8 < F < 4·10 -8 (low<br />
level region: LL), and F < 3·10 -8 (very low level region -<br />
global minimum region: VL). The very low level region is<br />
a small ellipsoidally shaped area within the thin low level<br />
valley. In both VL and LL areas, small changes in one <strong>of</strong><br />
the parameters result in changes in objective function<br />
values <strong>of</strong> several orders <strong>of</strong> magnitude, as shown in Fig. 8.<br />
In the numerical tests, a stopping criterion <strong>of</strong> 3000<br />
objective function evaluations in each run was used. Tables<br />
I and II show the results over 30 runs. Table II also shows a<br />
comparison with other metaheuristics [10], [11].<br />
It can be noticed that the proposed improvement allows<br />
GBSA to become almost as good as some other wellknown<br />
stochastic optimizers, especially as far as the best<br />
solution is concerned, while improvements in the standard<br />
deviation and mean value are still required.<br />
Fig. 8. Objective function landscape and detail <strong>of</strong> the VL area<br />
Optimization<br />
Method<br />
TABLE I<br />
SIMULATION RESULTS OF F IN 30 RUNS<br />
F(s, l)·10 -8<br />
Maximum Mean Minimum StandardD<br />
(Worst)<br />
(Best) eviation<br />
GBSA 1510.646 95.960 2.121 282.727<br />
AGBSA 1510.631 88.313 2.068 283.834<br />
SOMA [10] 3.8761 3.2671 2.0595 0.5078<br />
Tribes [11] 3.9526 3.4870 2.0574 0.5079<br />
- 270 - 15th IGTE Symposium 2012<br />
TABLE II<br />
BEST SOLUTIONS FOR LONEY’S SOLENOID IN 30 RUNS<br />
Optimization separation length F(s, l)·10<br />
Method s (cm) l (cm)<br />
-8<br />
GBSA 11.8212 1.6519 2.121<br />
AGBSA 1.6072 1.5145 2.068<br />
V. CONCLUSION<br />
This paper proposes an improved AGBSA based on<br />
historic knowledge. Results on Loney’s solenoid design<br />
problem, a benchmark featuring many <strong>of</strong> the<br />
characteristics <strong>of</strong> typical electromagnetic design problems<br />
show promising results. Since the main remaining<br />
weakness <strong>of</strong> the algorithm, compared with other<br />
metaheuristics, is a rather large standard deviation <strong>of</strong><br />
optimal solutions, future research will be targeted at<br />
introducing additional improvements in order to decrease<br />
the spread <strong>of</strong> solutions.<br />
ACKNOWLEDGMENTS<br />
This work was supported by the National Council <strong>of</strong><br />
Scientific and Technologic Development <strong>of</strong> Brazil —<br />
CNPq — under Grant 476235/2011-1/PQ.<br />
REFERENCES<br />
[1] N. Al-Aaawar, T. M. Hijazi, and A. A. Arkadan, “Particle swarm<br />
optimization <strong>of</strong> coupled electromechanical systems,” IEEE<br />
Transactions on Magnetics, vol. 47, no. 5, 2011, pp. 1314-1317.<br />
[2] G. Crevecoeur, P. Sergeant, L. Dupré, and R. Van de Walle, “A<br />
two-level genetic algorithm for electromagnetic optimization,”<br />
IEEE Transactions on Magnetics, vol. 46, no. 7, 2010, pp. 2585-<br />
2595.<br />
[3] K. Watanabe, F. Campelo, Y. Iijima, K. Kawano, T. Matsuo, T.<br />
Mifune, and H. Igarashi, “Optimization <strong>of</strong> inductors using<br />
evolutionary algorithms and its experimental validation,” IEEE<br />
Transactions on Magnetics, vol. 46, no. 8, 2010, pp. 3393-3396.<br />
[4] P. Alotto. A hybrid multiobjective differential evolution method<br />
for electromagnetic device optimization. COMPEL, Vol. 30, No.<br />
6, 2011, pp.1815 – 1828.<br />
[5] P. Di Barba and A. Savini, “Global optimization <strong>of</strong> Loney’s<br />
solenoid by means <strong>of</strong> a deterministic approach,” Int. J. <strong>of</strong> Applied<br />
Electromagnetics and Mechanics, vol. 6, no. 4, pp. 247-254, 1995.<br />
[6] H. Shah-Hosseini, “Principal components analysis by the galaxybased<br />
search algorithm: a novel metaheuristic for continuous<br />
optimization,” Int. J. <strong>of</strong> Comp. Sci. and Eng., vol. 6, no. 1-2, pp.<br />
132-140, 2011.<br />
[7] H. Shah-Hosseini, “Otsu’s criterion-based multilevel thresholding<br />
by a nature-inspired metaheuristic called galaxy-based search<br />
algorithm,” Proc. <strong>of</strong> 3rd World Congr. on Nature and Biologically<br />
Inspired Computing, Salamanca, Spain, pp. 383-388, 2011.<br />
[8] G. Ciuprina, D. Ioan and I. Munteanu, “Use <strong>of</strong> intelligent-particle<br />
swarm optimization in electromagnetics,” IEEE Transactions on<br />
Magnetics, vol. 38, no. 2, pp. 1037-1040, 2002.<br />
[9] R. L. Becerra and C. A. C. Coello, “Cultural differential evolution<br />
for constrained optimization,” Comp. Methods in Appl. Mechanics<br />
and Engineering, vol. 195, no. 33-36, pp. 4303-4322, 2006.<br />
[10] L. S. Coelho and P. Alotto, “Electromagnetic optimization using a<br />
cultural self-organizing migrating algorithm approach based on<br />
normative knowledge,” IEEE Trans. on Magnetics, vol. 45, no. 3,<br />
pp. 1446-1449, 2009.<br />
[11] L. S. Coelho and P. Alotto, “Tribes optimization algorithm applied<br />
to the Loney’s solenoid,” IEEE Trans. on Magnetics, vol. 45, no.<br />
5, pp. 1526- 1529, 2009.
Abstract—Finite element modeling <strong>of</strong> a magnetic circuit used in<br />
automotive technologies is presented. A 3D magnetic analysis<br />
was performed in order to calculate the field distribution on the<br />
surface <strong>of</strong> giant magnetoresistance (GMR) sensors. Model<br />
results were compared with experiments, which showed good<br />
agreement. The validated model was further used to optimize<br />
the magnetic circuit design and to improve the working<br />
performance sensors.<br />
Index Terms— Sensors, Finite element method, Magnetic<br />
circuits, Magnetic fields, Giant Magnetoresistance..<br />
I. INTRODUCTION<br />
Magnetic sensors play an important role in automotive<br />
applications. They are reliable, cost effective with high<br />
performance and provide contactless measurements. They are<br />
majorly employed for applications such as measuring pedal<br />
position, engine transmission control, rotational speed <strong>of</strong> the<br />
wheels, and for anti-lock braking system (ABS) [1].<br />
A new type <strong>of</strong> magnetic sensor based on the Giant<br />
Magneto-resistance phenomenon (GMR) was developed by<br />
Infineon [2]. They <strong>of</strong>fer key benefits such as high sensitivity,<br />
linear operation over the sensing range, good temperature<br />
stability over a wide range and low field detection<br />
capabilities. Therefore, they are capable <strong>of</strong> being more precise<br />
on measuring the position or operating at large distances<br />
from the gear wheel in applications. Another benefit <strong>of</strong> using<br />
GMR elements is the low resistance noise. Presently, GMR<br />
sensors can be used in small fields such as 10 nT at 1 Hz and<br />
up to 10 8 nT. They can operate under temperatures between -<br />
55°C up to 150°C. Unfortunately due to GMR’s high<br />
sensitivity and their low field detection capability GMR<br />
elements can easily drive on saturation, if the detected<br />
magnetic field reached a crucial strength value. Therefore it<br />
is very important to ensure that GMR sensors always stay in<br />
their linear range.<br />
This can be achieved using an experimental procedure to<br />
measure the field distribution <strong>of</strong> the magnetic circuit.<br />
Unfortunately the experimental method is time consuming<br />
and cost expensive. In order to overcome these problems, this<br />
paper presents a model development <strong>of</strong> GMR sensors based<br />
on finite element method which can predict the field strength<br />
on the surface <strong>of</strong> GMR elements with high accuracy.<br />
- 271 - 15th IGTE Symposium 2012<br />
Implementation <strong>of</strong> a 3D magnetic circuit model<br />
for automotive applications<br />
Ioannis Anastasiadis 1, 3 , Andreas Buchinger 1 , Tobias Werth 2 ,Lukas Bellwald 1 and Kurt Preis 3<br />
1 KAI Kompetenzzentrum Automobil- und Industrieelektronik GmbH, Europastrasse 8, Villach, 9524 Austria<br />
2 Infineon Technologies Austria AG, Siemensstrasse 2, 9500 Villach, Austria<br />
3 Institute for Fundamentals and Theory in Electrical Engineering, Kopernikusgasse 24/3, A-8010 <strong>Graz</strong>, Austria<br />
II. GMR SENSOR CONCEPT<br />
Magnetoresistance is the change in resistance <strong>of</strong> a<br />
ferromagnetic material caused by an external magnetic field.<br />
The measure <strong>of</strong> magnetoresistance is usually given by the<br />
ratio ΔR/R, where R is the resistance for zero magnetic field<br />
and ΔR is the change in resistance when magnetic field<br />
changes by an amount ΔH. Usually ΔR/R value is small and<br />
hence the change in DC voltage remains low. In applications<br />
by using a Wheatstone bridge configuration to place the<br />
magnetoresistance elements, it is possible to minimize the DC<br />
<strong>of</strong>fset.<br />
In some cases <strong>of</strong> ferromagnetic multilayer’s stack (Fe/Cr)n,<br />
it was reported that at low temperatures their resistance can<br />
change up to 50 % [3]. Due to this major change in<br />
resistance, this phenomenal behavior was named as Giant<br />
Magneto-Resistance (GMR). GMR elements consist <strong>of</strong> a<br />
sequence <strong>of</strong> ferromagnetic and antiferromagnetic layers,<br />
which drastically change their resistance under an external<br />
magnetic field. The simplest GMR technology structure is the<br />
spin valve consisting <strong>of</strong> three layers, <strong>of</strong> which two<br />
ferromagnetic layers are separated by an antiferromagnetic<br />
layer [4]. One layer has a fixed magnetization direction called<br />
pinned layer-hard layer and the other layer is free to rotate<br />
with external fields and magnetization direction, termed as<br />
free layer-s<strong>of</strong>t layer. For industrial use, this pinned layer can<br />
be created in two ways. The first, using the current flow<br />
which provides heating to the layer and the second method is<br />
to use laser pulses for heating the selected layer. During<br />
cooling, the pinned magnetization is formed. In sensor<br />
technology, the pinned layer has its magnetization direction<br />
perpendicular to the free axis <strong>of</strong> the free layer. This setup<br />
gives a linear response <strong>of</strong> the change in GMR resistance when<br />
an external magnetic field is applied.<br />
GMR is a quantomechanic phenomenon created due to the<br />
orientation <strong>of</strong> conducting electrons while they pass through<br />
the GMR stack. If the spin orientation <strong>of</strong> the electrons is<br />
parallel to the magnetic orientation <strong>of</strong> the layer, they move<br />
freely and the resistance remains low. If the spin orientation<br />
is antiparallel to the orientation <strong>of</strong> the layer, resistance<br />
increases due to collisions with the atoms <strong>of</strong> the layers. For<br />
application purposes, the trigger/external field should have a<br />
magnitude bigger than the saturation field <strong>of</strong> the free layer
and smaller than the stand<strong>of</strong>f field <strong>of</strong> the pinned layer. If this<br />
is not the case, then the magnetization direction <strong>of</strong> the layers<br />
will be affected, which will change the overall characteristics<br />
<strong>of</strong> the magnetic sensor. The equation that describes the<br />
change in the resistance R <strong>of</strong> the GMR element is related to<br />
the angle θ between the magnetization directions <strong>of</strong> the free<br />
and pinned layer. In the simplest form <strong>of</strong> the GMR elements<br />
the change in resistance is proportional to the cosine <strong>of</strong> angle<br />
θ between the magnetization layers [5]:<br />
R − R||<br />
ΔR<br />
ΔR<br />
− ( 1−<br />
m1<br />
⋅ m2<br />
) = [ 1−<br />
cosθ<br />
] (1)<br />
R||<br />
2 R||<br />
2R||<br />
Where R is the resistance <strong>of</strong> the stack, R|| is the resistance<br />
<strong>of</strong> the stack in the parallel state, ΔR is the difference between<br />
the resistance <strong>of</strong> the stack in parallel and antiparallel state,<br />
m1 and m2 are the unit magnetization vectors.<br />
The magnetic-sensor consists <strong>of</strong> 4 GMR elements situated<br />
at the two edges <strong>of</strong> the sensor chip. Typical size <strong>of</strong> the GMR<br />
stacks is approximately 1 μm in length, with 1 mm depth and<br />
a thickness <strong>of</strong> few nanometers. They are connected to each<br />
other with a Whitestone bridge configuration to measure the<br />
speed signal. In addition, an extra GMR element is placed at<br />
the center <strong>of</strong> the IC to calculate the directional movement.<br />
The GMR element configuration is shown in figure 1.<br />
Fig. 1: GMR element configuration<br />
By using the above bridge configuration, it is possible to<br />
compensate the DC-<strong>of</strong>fset signal coming from the magnetic<br />
sources. The output signal is given by the following equation:<br />
R4<br />
R2<br />
Vsign = Vleft<br />
−Vright<br />
= VDD<br />
−VDD<br />
R + R R + R<br />
3 4<br />
2 1<br />
≈ Bxleft − Bxright<br />
(2)<br />
Calculating the magnetic field will provide an indication <strong>of</strong><br />
the sensor output signal. The above equation is valid, since<br />
the change in resistance <strong>of</strong> the GMR elements has a linear<br />
response with the change in magnetic field. This assumption<br />
is correct for fields around zero, but for larger applied fields<br />
the overall characteristics <strong>of</strong> GMR elements will change as<br />
they will saturate.<br />
≈<br />
- 272 - 15th IGTE Symposium 2012<br />
III. MAGNETIC CIRCUITS<br />
Typical magnetic circuits used in automotive technologies<br />
consist <strong>of</strong> a gear wheel, sensor and a magnet. This magnet,<br />
termed as back-bias magnet is the source for the circuit. The<br />
sensor and the back-bias magnet are fixed, while the wheel is<br />
subjected to rotation. The ferromagnetic gear wheel acts as an<br />
accumulator <strong>of</strong> the magnetic field –passive target- and the<br />
fluxes bend according to the position <strong>of</strong> the gear wheel, either<br />
if the static part <strong>of</strong> the circuit faces a tooth or not. This<br />
difference <strong>of</strong> the field distribution is sensed by the GMR<br />
elements and is transformed to an electrical signal as the<br />
output <strong>of</strong> the magnetic sensor. A schematic <strong>of</strong> this typical<br />
circuit is shown in Figure 2.<br />
magnetic sensor<br />
Fig.2. Basic magnetic circuit application<br />
back-bias magnet<br />
Previously, investigations and optimization process for<br />
back-bias magnets and gear wheels geometries was carried<br />
out for GMR magnetic sensors applications. [6, 7]. This paper<br />
presents the investigation and model development <strong>of</strong> the<br />
circuit as shown in Figure 3. The gear wheel consists <strong>of</strong> 44<br />
teeth with a circular pitch <strong>of</strong> 8°.18. The gear wheel is 10 mm<br />
long in y-axis dimension. The back-bias magnet structure<br />
consists <strong>of</strong> two magnets formed together with magnetization<br />
directions on the xz plane tilted at an angle <strong>of</strong> 20° in the z<br />
axis as shown in figure 3. The dimensions <strong>of</strong> the magnet are<br />
10 x 10 x 4 mm. The magnet is a ferrite with a remanence <strong>of</strong><br />
287 mT.<br />
y<br />
z<br />
x<br />
3 mm<br />
20° 20°<br />
3mm<br />
airgap<br />
4 mm<br />
3mm<br />
Fig.3: Magnetic circuit under investigation<br />
The magnetic sensor is placed between the gear wheel and<br />
the magnet. The distance from the end <strong>of</strong> the sensor to the top<br />
<strong>of</strong> a tooth is the airgap distance <strong>of</strong> the circuit. When the gear<br />
wheel is rotated, change in magnetic field distribution on the
GMR element surface takes place. By the rotation <strong>of</strong> the<br />
wheel with respect to the MS location and for a distance <strong>of</strong><br />
one pitch, the field distribution along x-axis has a sinusoidal<br />
form. It is <strong>of</strong> interest to calculate this field distribution and to<br />
compare with experimental results.<br />
IV. MODEL CREATION<br />
The field distribution on the xy plane along the surface <strong>of</strong><br />
the GMR elements was investigated. Additionally, it is<br />
necessary to check also the field strength in the normal<br />
direction (z-direction) in order to determine the sensor circuit<br />
response due to change in the airgap distance. For the above,<br />
it is necessary to investigate the magnetic circuit’s field<br />
distribution in three dimensions (3D). Because <strong>of</strong> the<br />
complexity <strong>of</strong> the problem, it is not possible to derive an<br />
analytical 3D solution. Therefore, finite element method was<br />
used to for this purpose [8]. Within this method the geometry<br />
<strong>of</strong> the problem is discretized in smaller regions, where the<br />
field distribution is calculated by means <strong>of</strong> approximated<br />
polynomial shape functions. The approach <strong>of</strong> the problem is<br />
bottom-up. Model was first created in two dimensions and<br />
then extracted in the third direction. 3D scalar magnetic<br />
element was used for this model. Figure 4 shows the field<br />
distribution for an airgap <strong>of</strong> 2 mm.<br />
Fig.4: field distribution for an airgap <strong>of</strong> 2 mm<br />
For the model creation, back-bias magnet and the gear<br />
wheel were created surrounded by air. Since the GMR<br />
elements do not interfere in the field distribution but only<br />
used to measure the magnetic field they are ignored in the<br />
model. Precautions have to be made for the magnet<br />
surrounding free space. The dimensions should be taken<br />
around 5 times the respective dimensions <strong>of</strong> the magnet for<br />
convergence reasons. Larger the model size will increase the<br />
computational time and smaller may lead to distorted<br />
calculations <strong>of</strong> the field due to calculation errors. All<br />
simulations reported in the paper were carried out using a<br />
commercial FEM tool [9].<br />
A. MS attached to back-bias magnet<br />
Initially, investigations for the case where magnetic sensor<br />
is attached to the back-bias magnet were performed. The air<br />
gap was 2 mm. For a rotation <strong>of</strong> 1 pitch, the field distribution<br />
- 273 - 15th IGTE Symposium 2012<br />
was calculated on the surface <strong>of</strong> the GMR elements. The Bx<br />
field distribution on the surface <strong>of</strong> the left GMR and center<br />
GMR element measured at a point in the center <strong>of</strong> their<br />
surface and for a rotation <strong>of</strong> 1 pitch is shown in figure 5.<br />
Bx(mT)<br />
Bx(mT)<br />
0<br />
-2<br />
-4<br />
-6<br />
-8<br />
-10<br />
-12<br />
-14<br />
2<br />
1.5<br />
1<br />
0.5<br />
0<br />
-0.5<br />
-1<br />
-1.5<br />
-2<br />
Bx field on the left GMR<br />
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.18<br />
angle(°)<br />
Fig. 5a:Bx field distribution on the left GMR<br />
Bx field on the center GMR<br />
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.18<br />
angle(°)<br />
Fig. 5b:Bx field distribution on the center GMR<br />
As it can be seen from Figure 5a, the Bx field on the left<br />
GMR element is approximately -10 mT which is big enough<br />
to drive GMR elements on saturation. Therefore MS should<br />
not be attached to the back-bias magnet but kept a distance<br />
between MS and magnet to prevent driving GMR elements on<br />
saturation.<br />
B. MS placed in a distance from the magnet<br />
In this case, a distance <strong>of</strong> 2 mm is kept between the<br />
magnet and the magnetic sensor as shown in figure 6.<br />
20°<br />
20°<br />
Fig.6: The new circuit geometry under investigation<br />
By setting the airgap distance also to 2 mm and considering<br />
that the package <strong>of</strong> the sensor has in normal direction 1 mm<br />
length, the total distance between the magnet back face and<br />
gear wheel teeth was 5 mm.<br />
y<br />
z<br />
x
Hence magnetization <strong>of</strong> the free layer <strong>of</strong> GMR stack follows<br />
the magnetization direction <strong>of</strong> the external plane field<br />
distribution on the surface <strong>of</strong> the element. We calculated the<br />
plane field Bx and By distribution along the surface <strong>of</strong> GMR<br />
stripes. GMR stripes have a length <strong>of</strong> approximately 1 mm in<br />
y-direction. For investigations <strong>of</strong> the plane field at the length<br />
<strong>of</strong> GMR stripe along y-direction, the Bx and By distributions<br />
were calculated at points which were equidistant spaced. The<br />
bottom point at the surface <strong>of</strong> GMR stripe is denoted to be the<br />
0 point and the next point is spaced by 0.05 mm until the last<br />
point <strong>of</strong> calculations on the top point <strong>of</strong> the stripe. The Bx and<br />
By filed distributions for those points and for two GMR<br />
elements, one which is located on the left half-bridge and<br />
another on the center <strong>of</strong> the sensor (GMR5) can be seen on<br />
figures 7 and 8.<br />
Bx(mT)<br />
By(mT)<br />
Bx(mT)<br />
1<br />
0.5<br />
0<br />
-0.5<br />
-1<br />
-1.5<br />
-2<br />
-2.5<br />
-3<br />
-3.5<br />
2<br />
1.5<br />
1<br />
0.5<br />
0<br />
-0.5<br />
-1<br />
-1.5<br />
-2<br />
0<br />
1<br />
1.5<br />
2<br />
2.5<br />
3<br />
3.5<br />
angle(°)<br />
4<br />
4.5<br />
5<br />
5.5<br />
6<br />
6.5<br />
7<br />
7.5<br />
8<br />
8.1818<br />
Bx @ 0mm<br />
Bx @ 0.05mm<br />
Bx@ 0.1mm<br />
Bx@ 0.15mm<br />
Bx@ 0.2mm<br />
Bx@ 0.25mm<br />
Bx@ 0.3mm<br />
Bx@0.35mm<br />
Bx@0.4mm<br />
Bx@0.45mm<br />
Bx@0.5mm<br />
Bx@ 0.55mm<br />
Bx@ 0.6mm<br />
Bx@ 0.65mm<br />
Bx@ 0.7mm<br />
Bx@ 0.75mm<br />
Bx@ 0.8mm<br />
Bx@ 0.85mm<br />
Bx@ 0.9mm<br />
Bx@ 0.95mm<br />
Bx@ 1mm<br />
Fig. 7a: Bx field distribution on a GMR on the left half-bridge<br />
0 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.182<br />
angle(°)<br />
By@ 0mm<br />
By@ 0.05mm<br />
By@ 0.1mm<br />
By@ 0.15mm<br />
By@ 0.2mm<br />
By@ 0.25mm<br />
By@ 0.3mm<br />
By@ 0.35mm<br />
By@ 0.4mm<br />
By@ 0.45mm<br />
By@ 0.5mm<br />
By@ 0.55mm<br />
By@ 0.6mm<br />
By@ 0.65mm<br />
By@ 0.7mm<br />
By@ 0.75mm<br />
By@ 0.8mm<br />
By@ 0.85mm<br />
By@ 0.9mm<br />
By@ 0.95mm<br />
By@ 1mm<br />
Fig. 7b: By field distribution on a GMR on the left half-bridge<br />
2<br />
1.5<br />
1<br />
0.5<br />
0<br />
-0.5<br />
-1<br />
-1.5<br />
-2<br />
0<br />
1<br />
1.5<br />
2<br />
2.5<br />
3<br />
3.5<br />
4<br />
4.5<br />
5<br />
5.5<br />
6<br />
6.5<br />
7<br />
7.5<br />
8<br />
8.18<br />
angle(°)<br />
Bx@ 0mm<br />
Bx@ 0.05mm<br />
Bx@ 0.1mm<br />
Bx@ 0.15mm<br />
Bx@ 0.2mm<br />
Bx@ 0.25mm<br />
Bx@ 0.3mm<br />
Bx@ 0.35mm<br />
Bx@ 0.4mm<br />
Bx@ 0.45mm<br />
Bx@ 0.5mm<br />
Bx@ 0.55mm<br />
Bx@ 0.6mm<br />
Bx@ 0.65mm<br />
Bx@ 0.7mm<br />
Bx@ 0.75mm<br />
Bx@ 0.8mm<br />
Bx@ 0.85mm<br />
Bx@ 0.9mm<br />
Bx@ 0.95mm<br />
Bx@ 1mm<br />
Fig. 8a: Bx field distribution on a center GMR element<br />
- 274 - 15th IGTE Symposium 2012<br />
By(mT)<br />
2<br />
1.5<br />
1<br />
0.5<br />
0<br />
-0.5<br />
-1<br />
-1.5<br />
-2<br />
0<br />
1<br />
1.5<br />
2<br />
2.5<br />
3<br />
3.5<br />
4<br />
4.5<br />
5<br />
5.5<br />
6<br />
6.5<br />
7<br />
7.5<br />
angle(°)<br />
8<br />
8.1818<br />
By@ 0mm<br />
By@ 0.05mm<br />
By@ 0.1mm<br />
By@ 0.15mm<br />
By@ 0.2mm<br />
By@ 0.25mm<br />
By@ 0.3mm<br />
By@ 0.35mm<br />
By@ 0.4mm<br />
By@ 0.45mm<br />
By@ 0.5mm<br />
By@ 0.55mm<br />
By@ 0.6mm<br />
By@ 0.65mm<br />
By@ 0.7mm<br />
By@ 0.75mm<br />
By@ 0.8mm<br />
By@ 0.85mm<br />
By@ 0.9mm<br />
By@ 0.95mm<br />
By@ 1mm<br />
Fig. 8b: By field distribution on a center GMR element<br />
The Bx fields along the GMR stripes have always the same<br />
response for a rotation <strong>of</strong> 1 pitch. On the other hand the By<br />
field is shifted each time we move from the bottom side <strong>of</strong> the<br />
stripe towards the upper side. The By fields are<br />
homogeneously distributed along the stripes and have the<br />
same values on the surfaces <strong>of</strong> all GMR stripes. Due to<br />
symmetry reasons, a GMR element situated on the right halfbridge<br />
should also have along their surface similar By and Bx<br />
distribution.<br />
C. Experimental results<br />
Experiments were performed for the configuration shown in<br />
figure 6. To compare with simulations, the gear wheel speed<br />
was set to 1.5 rpm. Such a small speed was chosen, because<br />
the finite element model was built for every corresponding<br />
magnet positions over the rotated angle. As the gear wheel<br />
rotates, the magnetic sensor provides speed and the<br />
directional signal. These measured signals are compared with<br />
simulation results. Drawback <strong>of</strong> the experimental procedure is<br />
that there is no possibility to directly derive the Bx and By<br />
distribution along the GMR stripe, but only the plane field<br />
distribution on the surface <strong>of</strong> GMR elements while gear wheel<br />
is rotated towards magnetic sensor. This experimental field<br />
distribution along the GMR surface is compared with the field<br />
distributions derived from the model.<br />
Hence Bx and By distributions are changed along the GMR<br />
stripes we have to calculated the average field that the<br />
elements are sensed and that field we have to compare it with<br />
the experimental results. Substituting the signal from the left<br />
and right Whetstone bridge, we measure the speed signal<br />
while the center GMR element shows the directional signal.<br />
The results are shown in figure 9. The signal is calculated on<br />
field distribution along the surface <strong>of</strong> the stripes.<br />
Fig. 9a: comparison <strong>of</strong> directional signal
Fig. 9b: comparison <strong>of</strong> speed signal<br />
Again, results are shown for a rotation <strong>of</strong> 1 pitch.<br />
Comparisons for the speed signal between experimental<br />
and simulation results reveal a small deviation <strong>of</strong><br />
approximately 3%. On the other hand, the comparison for the<br />
directional signal which comes from the measurements <strong>of</strong> the<br />
middle GMR element shows a bigger deviation with a mean<br />
value <strong>of</strong> 9%. Differences between simulation and<br />
experimental results are due to inaccuracies in the simulation<br />
model, such as how dense the model is. Another important<br />
issue is that in reality the geometry <strong>of</strong> gear wheel has<br />
deviations from the theoretical geometry due to construction<br />
reasons, for example each pitch distance has not exactly the<br />
same dimensions <strong>of</strong> 6mm or there may be a small deviation<br />
on the height <strong>of</strong> all the teeth <strong>of</strong> the gear wheel. Those<br />
problems can bring an error on the calculated signal.<br />
V. CONCLUSION<br />
A 3D model describing the rotation <strong>of</strong> a GMR sensor<br />
around a gear wheel was developed and verified with<br />
experiments. The model is based on finite element analysis<br />
and is used to calculate the variations <strong>of</strong> the field distribution,<br />
when the gear wheel is rotated around the stator <strong>of</strong> the<br />
magnetic circuit, MS and back-bias magnet. Calculation <strong>of</strong><br />
the field distribution was performed along the GMR element<br />
surface. In parallel, experiments were performed for the same<br />
configuration to support model development.<br />
Comparison shows a small deviation between the compared<br />
values given an indication <strong>of</strong> the valid <strong>of</strong> the model. Such a<br />
model can give us a fast and accurate estimation on the<br />
magnetic circuit’s functionality showing also the maximum<br />
airgap performance <strong>of</strong> this circuit.<br />
ACKNOWLEDGEMENTS<br />
This work was jointly funded by the Federal Ministry <strong>of</strong><br />
Economics and Labour <strong>of</strong> the Republic <strong>of</strong> Austria (contract<br />
98.362/0112-C1/10/2005) and the Carinthian Economic<br />
Promotion Fund (KWF) (contract 98.362/0112-C1/10/2005).<br />
REFERENCES<br />
[1] C.P.O. Treutler, “Magnetic sensors for automotive applications,” Elsevier,<br />
Sensors and Actuators A, vol. 91,2001, pp. 2-6<br />
[2] Dirk Hammersdchmidt, Ernst Katzmaier, et. all, “Giant magneto resistorssensor<br />
technology & automotive applications”, SAE 2005 World Congress<br />
& Exhibition, SAE, Detroit, 01-01-2005, pp. 1-16.<br />
- 275 - 15th IGTE Symposium 2012<br />
[3] M.N.Baibich, M.Broto, A. Fert, F. Nguyen Van Dau, F. Petr<strong>of</strong>f, P.<br />
Etienne, G. Creuzei, A. Frederick and J. Chazelas, “Giant<br />
Magnetoresistance <strong>of</strong> (001) Fe/(001) Cr Magnetic Superlattices,” Phys.<br />
Rev. Lett., vol. 61, num. 21, Nov. 1988, pp. 2472-2475.<br />
[4] Robert L. White, “Giant Magnetoresistance: A Primer” IEEE Trans. on<br />
Magn., vol. 28, Sept. 1992, pp. 2482-2487.<br />
[5] S.E. Russek, R.D: McMichael, M.J: Donahue and S. Kaka, “High Speed<br />
Switching and Rotational Dynamics in Small Magnetic Thin Film<br />
Devices,” Springer, Spin Dynamics in Confined Magnetic Structures 2,<br />
vol.87, 2003, pp. 93-156<br />
[6] I. Anastasiadis, T. Werth, K. Preis, “Evaluation and optimization <strong>of</strong> backbias<br />
magnets for automotive applications using Finite Element Methods”,<br />
IEEE Transactions on Magnetics, vol. 45 (March 2009) no.3, pp. 1332-<br />
1335.<br />
[7] I. Anastasiadis, T. Werth, K. Preis, “Investigation and optimization <strong>of</strong><br />
magnetic sensor gear wheels for automotive applications”, 14 th IGTE<br />
Symposium on Numerical Field Calculation in Electrical Engineering,<br />
19-22 Sept. 2010, submitted.<br />
[8] A. Bonderson,T. Rylander, P. Ingelström, Computational<br />
Electromagnetics (Book style), Springer Inc. 2005.<br />
[9] Tutorial, Electromagnetic Field Analysis Guide, ANSYS Release 11.0,<br />
ANSYS Inc Book International Inc., Canonsburg, PA, 2006
- 276 - 15th IGTE Symposium 2012
- 277 - 15th IGTE Symposium 2012
- 278 - 15th IGTE Symposium 2012
- 279 - 15th IGTE Symposium 2012
- 280 - 15th IGTE Symposium 2012
- 281 - 15th IGTE Symposium 2012
- 282 - 15th IGTE Symposium 2012<br />
Mixed Order Edge-based Finite Element Method<br />
by Means <strong>of</strong> Nonconforming Mesh Connection<br />
Yoshifumi Okamoto and Shuji Sato<br />
*Department <strong>of</strong> Electrical and Electronic Systems Engineering, Utsunomiya <strong>University</strong><br />
7-1-2 Yoto, Utsunomiya, Tochigi 321-8585, Japan<br />
E-mail: okamotoy@cc.utsunomiya-u.ac.jp<br />
Abstract—The first order element is widely used as discretized order <strong>of</strong> edge-based finite element method. The second order<br />
discretization has a tendency <strong>of</strong> escape from the practical magnetic field analysis for the reason <strong>of</strong> many nonzero entries in<br />
global matrix and convergence deterioration <strong>of</strong> a linear solver. Therefore, the performance <strong>of</strong> higher order elements should<br />
be successfully utilized by the restriction <strong>of</strong> analyzed region. This paper presents the mixed order finite element method with<br />
reasonable computational costs by using nonconforming mesh connection. The higher order discretization is applied to the<br />
main region with higher accuracy, and is connected with the outer space discretized by first order element using<br />
nonconforming connection. This paper shows the detailed characteristics <strong>of</strong> nonconforming mixed order edge-based finite<br />
element analysis.<br />
Index Terms— Higher order element, higher order interpolation, mixed order edge-based finite element analysis,<br />
nonconforming mesh connection.<br />
linear combination [5] – [8]. We applied the linear<br />
combination to nonconforming connection to retain the<br />
symmetry <strong>of</strong> the global matrix. Furthermore, we newly<br />
propose the finite element analysis using the<br />
nonconforming connection between 2nd order elements,<br />
and comparisons have been made with conventional<br />
conforming analysis and mixed-order analysis.<br />
I. INTRODUCTION<br />
The edge-based finite element method is widely<br />
recognized as a powerful and practical numerical method<br />
in the design environment for the evaluation <strong>of</strong> temporal<br />
changes <strong>of</strong> electromagnetic field and various<br />
characteristics <strong>of</strong> electrical machine. Furthermore, the<br />
mesh generator has been advanced in order to product the<br />
finite element mesh for the complicated target. From<br />
these technical background, 1st order discretization is<br />
widely adopted as an edge-based finite element method.<br />
On the other hand, the 2nd order discretization has the<br />
tendency <strong>of</strong> escape from the practical magnetic field<br />
analysis owing to the many nonzero entries in global<br />
matrix and convergence deterioration <strong>of</strong> a linear solver<br />
for the algebraic equation. However, 2nd order element<br />
has better convergence characteristics to the true value <strong>of</strong><br />
physical quantity such as the magnetic energy than the<br />
1st order element when the element size is shortened.<br />
Then, it is assumed that 2nd order element should be<br />
adopted at the region where the accuracy is required and<br />
1st order element is adopted at other region. This<br />
combinatorial technique might be capable <strong>of</strong> improving<br />
the convergence characteristics <strong>of</strong> linear solvers and<br />
shortening the elapsed time with lower computational<br />
cost than conventional 2nd order discretization. The<br />
reference [1] describes the mixed order analysis based on<br />
the hierarchical elements. Furthermore, the mixed order<br />
analysis in the discontinuous Galerkin method is applied<br />
to the eddy current problem [2]. These references show<br />
the effectiveness <strong>of</strong> mixed order analysis. However, the<br />
combinatorial technique between higher order elements is<br />
not reported and the degree <strong>of</strong> nonconforming mesh<br />
connection between 2nd and 1st order is not verified.<br />
This paper shows the detailed effectiveness <strong>of</strong> mixed<br />
order edge-based finite element analysis which is realized<br />
using nonconforming mesh connection. The<br />
nonconforming mesh connection is mainly classified into<br />
two methods, in which one is the method based on<br />
Lagrange multiplier [3], [4], and the other is based on the<br />
II. MIXED ORDER FINITE ELEMENT METHOD BY MEANS<br />
OF NONCONFORMING MESH CONNECTION<br />
A. Weak form for edge-based finite element method<br />
The weak form Gi for Maxwell equation in<br />
magnetostatic field is given as follows:<br />
G<br />
i<br />
<br />
<br />
V<br />
( <br />
N ) (<br />
<br />
A)<br />
dV N J 0dV<br />
i<br />
<br />
<br />
Vm<br />
<br />
Vc<br />
( N ) <br />
B dV 0,<br />
where Ni is the edge-based shape function, is the<br />
reluctivity, A is magnetic vector potential, J0 is the<br />
current density vector, Br is the remanence, respectively.<br />
The domain for volume integral V, Vc, and Vm denote the<br />
whole region, the region for magnetizing winding, and<br />
the permanent magnet, respectively. When the magnetic<br />
nonlinearity is taken into account, Newton-Raphson (NR)<br />
method supported by the line-search based on functional<br />
(0, 1.0) [9] is adopted as the nonlinear analysis method.<br />
The ICCG method with shifted parameter [10] is used as<br />
a linear solver for the algebraic equation derived from<br />
edge-based finite element method.<br />
B. Nonconforming mesh connection<br />
This subsection describes the nonconforming mesh<br />
connection using the linear combination, which is<br />
formulated as follows:<br />
<br />
i<br />
i<br />
r<br />
(1)<br />
A N A dl<br />
,<br />
(2)<br />
b<br />
ab<br />
a<br />
k<br />
k<br />
k<br />
where Aab is the vector potential <strong>of</strong> nonconforming edge<br />
ab as shown in Figure 1 (a). In following formulation,<br />
suppose that the global coordinate at the node k is (xk, yk),
the global coordinate <strong>of</strong> node a and b is (xa, ya), (xb, yb)<br />
and the local coordinate <strong>of</strong> node a and b is (a, a), (b,<br />
b). The element shape on master side is assumed as a<br />
square or rectangle. Hence, the global coordinate (x, y)<br />
on master side element is as follows:<br />
x1<br />
x2<br />
x1<br />
x2<br />
x ,<br />
(3)<br />
2 2<br />
y1<br />
y4<br />
y1<br />
y4<br />
y ,<br />
(4)<br />
2 2<br />
where and are local coordinate on the master side<br />
element. , is given as follows:<br />
b<br />
<br />
a<br />
<br />
i ,<br />
(5)<br />
x x<br />
b<br />
a<br />
b<br />
<br />
a<br />
<br />
j ,<br />
(6)<br />
yb ya<br />
where i and j are the unit vectors in x- and y-direction.<br />
The linear combination using 1st and 2nd order mesh on<br />
the master side is mentioned below.<br />
Coefficients for the 1st order nonconforming mesh<br />
connection<br />
Adopting the 1st order element as the discretization <strong>of</strong><br />
master side, (2) becomes the expression:<br />
4<br />
A ab I ke Ake<br />
,<br />
(7)<br />
k 1<br />
where Ike is a coefficient which is evaluated by a line<br />
integral <strong>of</strong> edge-based shape function Nke along the edge<br />
ab as follows:<br />
b<br />
1<br />
dl<br />
I ke N d ( ) d<br />
,<br />
a<br />
ke l N<br />
1<br />
ke tab<br />
d<br />
(8)<br />
where tab is the unit vector in the direction ab, dl/d is the<br />
Jacobian,and is an additional parameter to perform the<br />
analytical line integrals, respectively. Using , the local<br />
coordinates and are transformed into<br />
b<br />
<br />
a a b<br />
,<br />
2 2<br />
(9)<br />
b<br />
<br />
a a<br />
<br />
b<br />
.<br />
2 2<br />
(10)<br />
Subsequently, tab and dl/d for the analytical evaluation<br />
<strong>of</strong> (8) are as follows:<br />
1<br />
tab { ( xb<br />
xa<br />
) i ( yb<br />
ya<br />
) j}<br />
,<br />
l<br />
(11)<br />
2e<br />
N 4e<br />
ab<br />
a<br />
l ab<br />
N 1e<br />
b<br />
1e<br />
N 3e<br />
3e N2e 4e<br />
2e<br />
N 7e<br />
8e<br />
N 8e<br />
a<br />
N 2e<br />
N 9e<br />
l ab<br />
5e<br />
N 10e<br />
N 1e<br />
b<br />
1e<br />
N 5e<br />
7e<br />
N 6e<br />
3e N4e 6e N3e 4e<br />
(a) (b)<br />
Figure 1. Interpolation to the slave edge ab from master<br />
side element. (a) 1st order master side element and (b)<br />
Serendipity 2nd order master side element.<br />
- 283 - 15th IGTE Symposium 2012<br />
2<br />
dl<br />
<br />
d<br />
x<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
x<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
lab<br />
,<br />
2<br />
(12)<br />
where lab is the length <strong>of</strong> edge ab. Then, the 1st order<br />
edge-based shape functions on master side surface are as<br />
follows:<br />
1<br />
( 1<br />
ke<br />
) <br />
N 4<br />
ke <br />
1<br />
( 1<br />
ke<br />
) <br />
4<br />
( 1 k 2)<br />
,<br />
( 3 k 4)<br />
(13)<br />
where ke and ke are the local coordinates on the edge k<br />
according to TABLE I. Substituting (11), (12), and (13),<br />
are substituted for (8), all components <strong>of</strong> Ike are<br />
analytically evaluated as follows:<br />
I<br />
ke<br />
1<br />
( b<br />
<br />
a ){ 2 <br />
ke ( a<br />
<br />
b )}<br />
8<br />
<br />
1<br />
( b<br />
<br />
a ){ 2 ke ( a b<br />
)}<br />
8<br />
2<br />
( 1<br />
( 3<br />
TABLE I<br />
LOCAL COORDINATES FOR 1ST ORDER<br />
EDGE-BASED SHAPE FUNCTION ON MASTER SIDE<br />
k 1 2 3 4<br />
ke 0 0 1.0 -1.0<br />
ke 1.0 -1.0 0 0<br />
k 2)<br />
. (14)<br />
k 4)<br />
Coefficients for the 2nd order nonconforming mesh<br />
connection<br />
Even in the case <strong>of</strong> the linear combination adopting the<br />
2nd order elements as the master side mesh, the<br />
derivation <strong>of</strong> the coefficients for linear combination is in<br />
the same way as 1st order nonconforming connection.<br />
Firstly, 2nd order edge-based shape functions <strong>of</strong><br />
Serendipity type [10] on the master side surface shown in<br />
Figure 1 (b) as follows:<br />
1<br />
( 1<br />
ke<br />
) ( 4<br />
ke<br />
<br />
ke<br />
) <br />
( 1 k 4)<br />
4<br />
1<br />
( 1<br />
ke<br />
) ( 4<br />
ke<br />
ke<br />
) <br />
( 5 k 8)<br />
4<br />
N ke <br />
, (15)<br />
1 2<br />
( 1<br />
) <br />
( k 9)<br />
2<br />
1 2<br />
( 1<br />
) <br />
( k 10)<br />
2<br />
where ke and ke are the local coordinates on the edge k<br />
according to TABLE II. Then, adopting the 2nd order<br />
element as master side discretization, (2) becomes next<br />
expression:<br />
10<br />
ab ke<br />
k 1<br />
A I A .<br />
(16)<br />
ke<br />
When the shape <strong>of</strong> master side element is a square or<br />
rectangle, the order <strong>of</strong> coordinates x and y can be defined<br />
TABLE II<br />
LOCAL COORDINATES FOR 2ND ORDER<br />
EDGE-BASED SHAPE FUNCTION ON MASTER SIDE<br />
k 1 2 3 4 5 6 7 8 9 10<br />
ke 0.5 -0.5 0.5 -0.5 1.0 1.0 -1.0 -1.0 0 0<br />
ke 1.0 1.0 -1.0 -1.0 0.5 -0.5 0.5 -0.5 0 0
as 1st order. Therefore, , , tab, dl/d are all same<br />
as (5), (6), (11), and (12). Substituting these equations<br />
and (15) for (8), all components <strong>of</strong> Ike are analytically<br />
obtained as follows:<br />
<br />
b <br />
a<br />
[ ke<br />
( b<br />
<br />
a ){ 4<br />
ke ( b<br />
<br />
a )<br />
48<br />
<br />
<br />
ke ( b<br />
<br />
a )} 3{<br />
2 <br />
ke ( a<br />
<br />
b )}<br />
{ 4<br />
ke ( a<br />
b<br />
) <br />
ke ( a<br />
<br />
b )}]<br />
<br />
<br />
( 1 k 4)<br />
b<br />
<br />
a<br />
[ ke ( b<br />
<br />
a ){ 4<br />
ke ( b<br />
<br />
a )<br />
48<br />
<br />
ke ( b<br />
<br />
a )} 3{<br />
2 ke ( a b<br />
)}<br />
I ke <br />
. (17)<br />
<br />
{ 4<br />
ke ( a<br />
b<br />
) ke ( a b<br />
)}]<br />
<br />
( 5 k 8)<br />
<br />
<br />
b <br />
a<br />
2<br />
{ 12 3(<br />
a<br />
<br />
b ) ( b<br />
<br />
a )}<br />
24<br />
<br />
<br />
( k 9)<br />
b<br />
<br />
a<br />
2<br />
{ 12 3(<br />
a b<br />
) ( b<br />
<br />
a )}<br />
24<br />
<br />
( k 10)<br />
In the case <strong>of</strong> nonconforming connection between 1st<br />
order elements, 1st order connection (14) is adopted.<br />
Similarly, in the case between 2nd order elements, 2nd<br />
order connection (17) is adopted. When the<br />
nonconforming connection between same order elements<br />
is performed, the coarser side is adopted as the master<br />
side <strong>of</strong> the linear combination and the finer side is<br />
adopted as slave side.<br />
On the other hand, 1st order connection (14) is adopted<br />
as the connection order for mixed order analysis, in<br />
which 1st and 2nd order elements are connected. The<br />
reasons are follows: If the 2nd order connection (17) is<br />
applied to the interface for mixed order connection, the<br />
coefficients I9e and I10e to the edges on 1st order edge 1e-<br />
4e become zero. Therefore, the diagonal component<br />
related to the connection interface may be zero, and the<br />
difficulty for solving the algebraic equation causes.<br />
III. ANALYSIS MODEL<br />
Figure 2 shows a square coil model in order to verify<br />
the performance <strong>of</strong> various nonconforming connections.<br />
The current density is determined by the electric scalar<br />
potential to be I = 1000 AT in the current input surface.<br />
The meshes for the region including a square coil and the<br />
outer space are nonconforrmally connected. The<br />
nonconforming connection is performed at the three<br />
surfaces composed <strong>of</strong> 1st surface (x = 120, y: [0, 120], z:<br />
[0, 120]), 2nd surface (x: [0, 120], y = 120, z: [0, 120]),<br />
and 3rd surface (x: [0, 120], y: [0, 120], z = 120). The<br />
range <strong>of</strong> whole region is set to x: [0, 300], y: [0, 300], and<br />
z: [0, 300], and the all element shapes are the cube to<br />
remove the error caused by the element distortion.<br />
Figure 3 shows an open type MRI model [12], in which<br />
the main object is to compute the uniform magnetic flux<br />
distribution in the imaging region with high accuracy.<br />
The nonconforming connection is performed at the three<br />
- 284 - 15th IGTE Symposium 2012<br />
surfaces composed <strong>of</strong> 1st surface (x = 400, y: [0, 400], z:<br />
[0, 400]), 2nd surface (x: [0, 400], y = 400, z: [0, 400]),<br />
and 3rd surface (x: [0, 400], y: [0, 400], z = 400). The<br />
magnetic nonlinearity <strong>of</strong> SS400 is considered in the yoke<br />
and pole piece. The remanence <strong>of</strong> two facing magnets is<br />
set to 1.2 T, and the nonlinear magnetostatic analysis is<br />
performed by NR method.<br />
IV. NUMERICAL RESULTS<br />
A. Verification using square coil<br />
Figure 4 shows the two examples <strong>of</strong> finite element<br />
meshes. The element coefficient matrix is computed by<br />
Gaussian quadrature 3×3×3 points. The linear equation is<br />
stopped when the condition ||rk||2/||b||2 < cg is satisfied,<br />
where ||rk||2 and ||b||2 are 2-norm <strong>of</strong> the residual at the k-th<br />
iteration and right side vector in the algebraic equations<br />
and cg is set to 10 -6 . is the typical element size <strong>of</strong><br />
standard model, and h is the element size <strong>of</strong> target model.<br />
Therefore, (h/) 2 <strong>of</strong> standard model becomes 1.0 as<br />
shown in Figure 4 (a). On the other hand, h in the<br />
nonconforming case (hexa-1st + hexa-1st, hexa-2nd +<br />
hexa-2nd, and hexa-2nd + hexa-1st) is defined as the<br />
element size <strong>of</strong> the inner mesh including magnetizing<br />
winding.<br />
Figure 5 shows the convergence characteristics <strong>of</strong><br />
magnetic energy. All characteristics have an asymptotic<br />
behavior as h shortens. The W values in nonconforming<br />
case (hexa-1st + hexa-1st and hexa-2nd + hexa-2nd) at<br />
(h/) 2 = 1.0 is equivalent to those in conforming case.<br />
These nonconforming characteristics are slightly<br />
detached from conforming characteristics in the range<br />
(h/) 2 < 1.0.<br />
coil(I = 1000 AT)<br />
currentoutput<br />
y<br />
z<br />
Figure 2: Square coil model.<br />
y<br />
400 130 350 400<br />
imagingregion<br />
polepiece(SS400)<br />
z<br />
unit:[mm]<br />
direction<strong>of</strong>current<br />
currentinput<br />
40<br />
x<br />
Figure 3: Open type MRI model.<br />
unit:[mm]<br />
magnet<br />
(Br = 1.2 T)<br />
yoke(SS400)<br />
x
Comparing the nonconforming characteristic (hexa-2nd +<br />
hexa-2nd) with (hexa-2nd + hexa-1st), the (hexa-2nd +<br />
hexa-2nd) characteristic is superior to the result <strong>of</strong> (hexa-<br />
2nd + hexa-1st) from the viewpoint <strong>of</strong> asymptote to the<br />
behavior <strong>of</strong> conforming hexa-2nd.<br />
TABLE III shows the effect <strong>of</strong> the size ratio on the<br />
computational accuracy. shows the ratio <strong>of</strong> the element<br />
size in outer space mesh for the element size in inner<br />
region, for example, becomes 1.5 in Figure 4 (b). In<br />
nonconforming case, the mesh for outer space is<br />
subdivided on the condition that the mesh for inner<br />
region is fixed. The number <strong>of</strong> elements for 2nd order is<br />
set to one eighth <strong>of</strong> 1st order element. When gets<br />
larger, the relative error <strong>of</strong> W tends to be worse owing to<br />
being coarse size <strong>of</strong> outer space element. The results <strong>of</strong><br />
nonconforming connection have the tendency, in which<br />
the accuracy <strong>of</strong> the nonconforming connection using 2nd<br />
order element is superior to 1st order on the whole. The<br />
outerregion<br />
y<br />
currentoutput<br />
innerregion<br />
nonconf.<br />
boundary<br />
y<br />
outerregion<br />
currentoutput<br />
innerregion<br />
z<br />
(a)<br />
z<br />
x<br />
currentinput<br />
x<br />
currentinput<br />
(b)<br />
Figure 4: Finite element meshes <strong>of</strong> a square coil model.<br />
(a) conforming (h/) 2 = 1.0 and (b) nonconforming (h/) 2<br />
= 0.444.<br />
- 285 - 15th IGTE Symposium 2012<br />
elapsed time using (nonconf. 2nd + 1st, = 2.0) is the<br />
shortest among the nonconforming results, in which the<br />
relative error <strong>of</strong> W is less than 0.1 %. Whereas the<br />
accuracy <strong>of</strong> W using (nonconf. 2nd + 2nd, = 2.0) is the<br />
best among the above mentioned nonconforming types,<br />
the elapsed time approximately quintuples against the<br />
case <strong>of</strong> (nonconf. 2nd + 1st, = 2.0). Hence, it is shown<br />
that the enough accuracy is provided by the mesh type<br />
(nonconf. 2nd + 1st, = 2.0) from the viewpoint <strong>of</strong><br />
practical analysis.<br />
Figure 6 shows the z-component <strong>of</strong> magnetic flux<br />
density on z-axis. The all characteristics coincide with the<br />
standard characteristic, and the relative error between<br />
(nonconf. hexa-2nd + hexa-1st) and (conf. hexa-2nd) is<br />
less than 0.05 %. The accuracy <strong>of</strong> magnetic flux in local<br />
area as well as that <strong>of</strong> magnetic energy is retained even in<br />
W [J]<br />
nonconf.hexa2nd+hexa2nd<br />
nonconf.hexa2nd+hexa1st<br />
0.0512<br />
0.0508<br />
0.0504<br />
stand.<br />
conf.hexa2nd<br />
conf.hexa1st<br />
0.0500<br />
0.0 0 0.2 0.4 0.6 0.8 1.0<br />
(h /) 2<br />
nonconf.<br />
hexa1st+hexa1st<br />
Figure 5: Convergence characteristics <strong>of</strong> magnetic<br />
energy.<br />
nonconf.boundary<br />
0.012 12.0<br />
inner outer<br />
B z [mT]<br />
0.010 10.0<br />
0.008 8.0<br />
0.006 6.0<br />
0.004 4.0<br />
0.002 2.0<br />
0.00 0 0.04 40 0.08 80 0.12 120<br />
z [mm]<br />
TABLE III<br />
ANALYZED RESULTS OF SQUARE COIL MODEL<br />
nonconf.hexa2nd+hexa2nd<br />
nonconf.hexa1st+hexa1st<br />
conf.hexa2nd conf.hexa1st<br />
7.72<br />
B z [mT]<br />
7.70<br />
stand.<br />
nonconf.<br />
hexa2nd+hexa1st<br />
7.68<br />
59.8 60.0 60.2<br />
z [mm]<br />
Figure 6: Distributions <strong>of</strong> Bz on z-axis <strong>of</strong> square coil<br />
model.<br />
mesh type<br />
inner<br />
NoE<br />
outer total<br />
size ratio DoF nonzero<br />
time for<br />
global matrix [s]<br />
ICCG<br />
ite.<br />
time for<br />
ICCG [s]<br />
W [mJ]<br />
relative error (%) <strong>of</strong> W<br />
vs. conf. 2nd (stand.)<br />
conf. 2nd (stand.) 37,044 1,120,581 1,157,625 1.0 13,759,410 576,830,675 224.2 *<br />
466 1042.3 *<br />
<br />
51.0553 0<br />
conf. 1st<br />
1,672,704 1,728,000 1.0 5,126,520 86,040,332 67.5 147 67.1 50.9912 0.125<br />
nonconf. 1st + 1st<br />
55,296<br />
209,088<br />
26,136<br />
264,384<br />
81,432<br />
2.0<br />
4.0<br />
775,368<br />
236,424<br />
12,873,180<br />
3,901,596<br />
10.7<br />
3.5<br />
96<br />
59<br />
6.8<br />
1.4<br />
50.9840<br />
50.9548<br />
0.140<br />
0.197<br />
3,267 58,788 8.0 170,289 2,819,055 2.6 49 0.9 50.8334 0.435<br />
conf. 2nd<br />
209,088 216,000 1.0 2,548,920 105,755,060 51.4 263 142.4 51.0505 0.009<br />
nonconf. 2nd + 2nd<br />
26,136<br />
3,267<br />
33,048<br />
10,179<br />
2.0<br />
4.0<br />
383,256<br />
115,971<br />
15,585,392<br />
4,574,483<br />
8.2<br />
2.7<br />
141<br />
98<br />
12.5<br />
3.6<br />
51.0505<br />
51.0179<br />
0.009<br />
0.073<br />
6,912 1,672,704 1,679,615 0.5 5,037,828 86,450,712 67.7 214 96.7 51.0477 0.015<br />
nonconf. 2nd + 1st<br />
209,088<br />
26,136<br />
216,000<br />
33,048<br />
1.0<br />
2.0<br />
693,576<br />
154,632<br />
13,378,746<br />
4,398,882<br />
10.2<br />
2.9<br />
114<br />
84<br />
8.4<br />
2.3<br />
51.0408<br />
51.0117<br />
0.028<br />
0.085<br />
3,267 10,179 4.0 88,497 3,312,651 2.0 84 2.4 50.8903 0.323<br />
CPU: Intel Core i7-2620M 2.7 GHz & 16 GB<br />
CPU * : Intel Core i7-3930K 4.2 GHz with over-clocked & 32 GB
outer region<br />
imaging region<br />
(inner region)<br />
y<br />
outer region<br />
imaging region<br />
(inner region)<br />
y<br />
outer region<br />
hexa-1st<br />
imaging region<br />
(inner region)<br />
hexa-1st<br />
y<br />
outer region<br />
hexa-1st<br />
imaging region<br />
(inner region)<br />
hexa-2nd<br />
y<br />
z<br />
r<br />
(a)<br />
z<br />
r<br />
(b)<br />
z<br />
r<br />
(c)<br />
z<br />
r<br />
(d)<br />
Figure 7: Finite element meshes <strong>of</strong> open type MRI<br />
model. (a) conforming (hexa-1st), (b) conforming (hexa-<br />
2nd isoparametric), (c) nonconforming (hexa-1st + hexa-<br />
1st), and (d) nonconforming (hexa-2nd + hexa-1st).<br />
B z [T]<br />
-0.20<br />
nonconf.boundary<br />
inner outer<br />
-0.22<br />
-0.24<br />
-0.26<br />
-0.28<br />
-0.30<br />
0.0 0.2 0.4 0.6 0.8<br />
r [m]<br />
x<br />
x<br />
x<br />
x<br />
- 286 - 15th IGTE Symposium 2012<br />
the nonconforming connection.<br />
B. Application <strong>of</strong> mixed order finite element analysis to<br />
open type MRI model<br />
This subsection shows the effectiveness <strong>of</strong> mixed order<br />
finite element analysis with nonconforming connection in<br />
the open type MRI model shown in Figure 3. The<br />
convergence criterion cg <strong>of</strong> ICCG method is set to 10 -3 ,<br />
and when the maximum correction <strong>of</strong> magnetic flux<br />
density is to be 10 -3 T, NR iteration is stopped.<br />
Figure 7 shows the finite element meshes for open type<br />
MRI model. Figure 7 (a), (b), (c), and (d) show the mesh<br />
for conforming hexa-1st, conforming hexa-2nd<br />
isoparametric, nonconforming (hexa-1st + hexa-1st), and<br />
nonconforming (hexa-2nd + hexa-2nd), respectively. The<br />
nonconforming mesh connection is performed at the<br />
interface between imaging region and other region in<br />
order to reduce the number <strong>of</strong> elements with keeping the<br />
accuracy <strong>of</strong> magnetic flux density in imaging region.<br />
Number <strong>of</strong> elements in conforming hexa-2nd (b) is one<br />
eighth size in conforming hexa-1st (a). The mesh for<br />
imaging region <strong>of</strong> (c) is exactly the same as that <strong>of</strong> (a),<br />
and the mesh for outer region <strong>of</strong> (c) and (d) is completely<br />
the same as that <strong>of</strong> (b).<br />
Figure 8 shows the distributions <strong>of</strong> z-direction<br />
magnetic flux density Bz on 45° direction r-axis which is<br />
located on the surface (x, y) = (0, 0) shown in Figure 7.<br />
There are some noise spikes in the characteristics <strong>of</strong><br />
(conf. hexa-1st), (nonconf. hexa-1st + hexa-1st), and<br />
(nonconf. hexa-2nd + hexa-1st). The generation <strong>of</strong> noise<br />
is likely to be caused by the element distortion <strong>of</strong> 1st<br />
order hexahedral elements. The distributions <strong>of</strong> 1st order<br />
discretization have the concave and convex owing to the<br />
interpolation <strong>of</strong> inner flux using edge-shape function.<br />
The mixed order characteristic <strong>of</strong> (nonconf. hexa-2nd +<br />
hexa-1st) seems to be combined two properties, in which<br />
the property <strong>of</strong> 2nd order is confirmed in the inner region<br />
(imaged region) and 1st order property appeared in the<br />
outer region.<br />
TABLE IV shows the analysis results for open type<br />
MRI model. Even in MRI model, the DoF <strong>of</strong> (conf. 2nd)<br />
is a half <strong>of</strong> (conf. 1st); nevertheless, the elapsed time <strong>of</strong><br />
(conf. 2nd) is longer than that <strong>of</strong> (conf. 1st). There is a<br />
possibility that the condition number <strong>of</strong> the global matrix<br />
B z [T]<br />
-0.26<br />
-0.27<br />
nonconf.hexa1st+hexa1st<br />
nonconf.hexa2nd+hexa1st<br />
conf.hexa2nd<br />
stand.<br />
conf.hexa1st<br />
-0.28<br />
0.50 0.62<br />
r [m]<br />
Figure 8: Distributions <strong>of</strong> Bz on r-axis in open type MRI model.
derived from 2nd order get worse than that <strong>of</strong> 1st order.<br />
On the other hand, the elapsed time <strong>of</strong> (nonconf. 2nd +<br />
1st) is the almost same as that <strong>of</strong> (nonconf. 1st + 1st).<br />
Furthermore, the iteration number for NR method in<br />
mixed order analysis is quite same as other mesh type.<br />
V. CONCLUSION<br />
We proposed a mixed order finite element method<br />
using nonconforming mesh connection technique. The<br />
obtained results are summarized as follows:<br />
1. We propose the nonconforming mesh connection<br />
between 2nd order elements supported by the linear<br />
combination, in which the coefficient for 2nd order<br />
connection can be derived from the line integral.<br />
2. The performances <strong>of</strong> the nonconforming connection<br />
for 2nd order elements and mixed order elements are<br />
verified using a square coil model. The accuracy <strong>of</strong> the<br />
nonconforming connection including the 2nd order<br />
discretization is superior to that <strong>of</strong> the meshes<br />
discretized by only 1st order element.<br />
3. The accuracy <strong>of</strong> mixed order analysis has the good<br />
agreement with that <strong>of</strong> conforming 2nd order<br />
discretized mesh and the mesh with 2nd order<br />
nonconforming connection.<br />
4. Mixed order analysis has superiority than the<br />
conventional conformal mesh from a point <strong>of</strong> view <strong>of</strong><br />
the elapsed time.<br />
Finally, we will investigate the effectiveness <strong>of</strong> mixed<br />
order edge-based finite element analysis including the<br />
distorted finite elements as a future works.<br />
ACKNOWLEDGMENT<br />
The authors would like to thank Mr. Y. Tominaga for<br />
helpful support. This work was supported by Japan<br />
Society for Promotion <strong>of</strong> Science (JSPS) Grant-in-Aid<br />
for Young Scientists (B) (Grant Number: 23760252).<br />
REFERENCES<br />
[1] M. Hano, T. Miyamura, and M. Hotta, “Fast and high-accuracy<br />
finite-element electromagnetic analysis by mixed-order vector<br />
elements,” The Papers <strong>of</strong> Joint Technical Meeting on Static<br />
Apparatus, SA-02-14, RM-02-14, pp. 13-18, Jan. 2002. (in<br />
Japanese)<br />
[2] P. Houston, I. Perugia, and D. Schötzau, “Nonconforming mixed<br />
finite-element approximations to time-harmonic eddy current<br />
problems,” IEEE Trans. Magn., Vol. 40, No. 2, pp. 1268-1273, Feb.<br />
2004.<br />
[3] D. Rodger, H. C. Lai, and P. J. Leonard, “Coupled elements for<br />
problems involving movement,” IEEE Trans. Magn., Vol. 26, No.<br />
2, pp. 548-550, Feb. 1990.<br />
- 287 - 15th IGTE Symposium 2012<br />
TABLE IV<br />
ANALYSIS RESULTS OF OPEN TYPE MRI MODEL<br />
mesh type<br />
inner region<br />
NoE<br />
outer space total<br />
DoF nonzero NR ite.<br />
time for<br />
global matrix [s]<br />
total<br />
ICCG ite.<br />
time for<br />
ICCG [s]<br />
conf. 2nd (stand.)<br />
conf. 1st 8,000<br />
556,528 564,528<br />
6,680,172<br />
1,662,234<br />
278,902,794<br />
27,727,823<br />
<br />
7<br />
1035.0<br />
171.7<br />
2,254<br />
1,714<br />
2268.6<br />
262.6<br />
nonconf. 1st + 1st<br />
223,699 3,663,713 7 23.3 617 14.2<br />
conf. 2nd<br />
nonconf. 2nd + 1st<br />
1,000<br />
69,566 77,566 823,308<br />
212,099<br />
33,753,786<br />
3,709,345<br />
7<br />
7<br />
127.5<br />
23.3<br />
315<br />
624<br />
526.3<br />
14.9<br />
CPU: Intel Core i7-2620M 2.7 GHz & 16 GB<br />
[4] E. Lange, F. Henrotte, and K. Hameyer, “A variational<br />
formulation for nonconforming sliding interfaces in finite element<br />
analysis <strong>of</strong> electric machines,” IEEE Trans. Magn., Vol. 46, No. 8,<br />
pp. 2755-2758, Aug. 2010.<br />
[5] C. Golovanov, J.-L. Coulomb, Y. Marechal, and G. Meunier, “3D<br />
mesh connection techniques applied to movement simulation,”<br />
IEEE Trans. Magn., Vol. 28, No. 2, pp. 3359-3362, Feb. 1992.<br />
[6] H. Kometani, S. Sakabe, and A. Kameari, “3-D analysis <strong>of</strong><br />
induction motor with skewed slots using regular coupling mesh,”<br />
IEEE Trans. Magn., Vol. 36, No. 4, pp. 1769-1773, Apr. 2000.<br />
[7] K. Muramatsu, Y. Yokoyama, N. Takahashi, A. Nafalski, and Ö.<br />
Göl, “Effect <strong>of</strong> continuity <strong>of</strong> potential on accuracy in magnetic field<br />
analysis using nonconforming mesh,” IEEE Trans. Magn., Vol. 36,<br />
No. 4, pp. 1578-1582, Apr. 2000.<br />
[8] Y. Okamoto, R. Himeno, K. Ushida, A. Ahagon, and K. Fujiwara,<br />
“Dielectric heating analysis method with accurate rotational motion<br />
<strong>of</strong> stirrer fan using nonconforming mesh connection,” IEEE Trans.<br />
Magn., Vol. 44, No. 6, pp. 806-809, Jun. 2008.<br />
[9] Y. Okamoto, K. Fujiwara, and R. Himeno, “Exact minimization <strong>of</strong><br />
energy functional for NR method with line-search technique,” IEEE<br />
Trans. Magn., Vol. 45, No. 3, pp. 1288-1291, Mar. 2009.<br />
[10] K. Fujiwara, T. Nakata, and H. Fusayasu, “Acceleration <strong>of</strong><br />
convergence characteristic <strong>of</strong> the ICCG method,” IEEE Trans.<br />
Magn., Vol. 29, No. 2, pp. 1958-1961, Mar. 1993.<br />
[11] A. Kameari, “Calculation <strong>of</strong> transient 3D eddy current using edgeelements,”<br />
IEEE Trans. Magn., Vol. 26, No. 2, pp. 466-469, Mar.<br />
1990.<br />
[12] C. Lee and K. Miyata, “Large-scale magnetic field analysis on<br />
MRI with hysteresis,” The papers <strong>of</strong> Joint Technical Meeting on<br />
Static Apparatus and Rotating Machinery, SA-06-20, RM-06-20,<br />
pp. 25-30, Jan. 2005. (in Japanese)
- 288 - 15th IGTE Symposium 2012<br />
Topology Optimization Using Parallel Search<br />
Strategy for Magnetic Devices<br />
1 Takumi Nagano, 1 Shogo Yasukawa, 1 Shinji Wakao, and 2 Yoshihumi Okamoto<br />
1 Waseda <strong>University</strong>, 3-4-1, Okubo, Shinjuku, Tokyo 169-8555, Japan<br />
2 Utsunomiya <strong>University</strong>, 7-1-2, Yoto, Utsunomiya, Tochigi 321-8585, Japan<br />
E-mail: wakao@waseda.jp<br />
Abstract— In this paper, we propose a topology optimization method using parallel search strategy for magnetic devices.<br />
Here, we use the gradient method to minimize an object function from the viewpoint <strong>of</strong> convergence speed, i.e., steepest<br />
descent method. By applying parallel computing, we will carry out calculations simultaneously for some patterns <strong>of</strong> initial<br />
variables. With these calculation results, new patterns <strong>of</strong> initial variables are efficiently created for restarting new search<br />
processes, which results in the better topologies than previous ones. Compared with the conventional method, the proposed<br />
search method enables us to decrease the whole CPU time with keeping the optimization quality.<br />
Index terms— density method, parallel computing, topology optimization.<br />
I. INTRODUCTION<br />
Structural optimization is categorized into three types<br />
<strong>of</strong> problems, i.e., size optimization, shape optimization,<br />
and topology optimization. Topology optimization is a<br />
useful method especially in terms <strong>of</strong> weight saving. And<br />
there is possibility <strong>of</strong> discovering a new topology<br />
unthinkable by conventional methods. However, the<br />
computational load <strong>of</strong> topology optimization will be<br />
heavier than that <strong>of</strong> other optimization, because <strong>of</strong> the<br />
large search space. In this paper, we applied “density<br />
method with gradient method” to make the optimization<br />
process more efficient. In density method, topology <strong>of</strong> the<br />
target is expressed with the density value <strong>of</strong> elements<br />
which consist <strong>of</strong> the design domain [1]. And gradient<br />
method has good convergence speed as the search<br />
method. However, the result <strong>of</strong> minimization with<br />
gradient method will be mostly local minimum<br />
depending on initial variables. Therefore this paper<br />
proposes an efficient topology optimization method based<br />
on parallel computing for magnetic devices. In the<br />
proposed method, we can efficiently escape local<br />
minimums by simultaneous searches from various initial<br />
variables and creating new initial variables with the<br />
results <strong>of</strong> parallel optimization.<br />
II. PROPOSED METHOD<br />
A. Density method based on sensitivity analysis<br />
In density method, topology <strong>of</strong> the target is expressed<br />
with the density value <strong>of</strong> elements. In this paper, the<br />
target is magnetic circuit. The magnetic permeability i<br />
<strong>of</strong> ith element can be formulated as<br />
2<br />
{ 1<br />
1<br />
(1)<br />
},<br />
i 0 r i<br />
where i is the density value <strong>of</strong> ith element, and r is<br />
relative permeability <strong>of</strong> the material. In this paper, r is<br />
1000, and the property <strong>of</strong> magnetic system is regarded as<br />
linear one. In density method, the number <strong>of</strong> elements in<br />
the design domain should be large to express the detail <strong>of</strong><br />
topologies. When density method is combined with<br />
gradient method, the first derivative <strong>of</strong> object function<br />
with respect to density value (sensitivity) <strong>of</strong> each element<br />
in the design domain needs to be calculated. As the<br />
efficient sensitivity analysis, “adjoint variable method” is<br />
applied [2]. The algebraic equation <strong>of</strong> FEM can be<br />
formulated as<br />
HA G,<br />
(2)<br />
where H is whole coefficient matrix, A is unknown<br />
magnetic vector potential, and G is right side vector.<br />
(3) is obtained by differential calculus <strong>of</strong> (2) with respect<br />
to density vector .<br />
<br />
A<br />
G<br />
H<br />
H = A.<br />
(3)<br />
ρ<br />
ρ<br />
ρ<br />
(4) is obtained by multiplying H -1 to (3).<br />
<br />
A 1<br />
G<br />
H<br />
~ <br />
= H<br />
<br />
A<br />
<br />
,<br />
(4)<br />
i<br />
i<br />
i<br />
<br />
where A ~ is the solution <strong>of</strong> (2).<br />
Sensitivity <strong>of</strong> the object function W is obtained as<br />
T<br />
dW W W<br />
A<br />
= . (5)<br />
d<br />
<br />
A<br />
<br />
i<br />
i<br />
In this paper, W is defined with the value <strong>of</strong> magnetic<br />
flux density vector. Therefore, the 1 st term <strong>of</strong> (5) is<br />
invisible. By substituting (4) into (5), (6) is obtained.<br />
T<br />
W W<br />
1<br />
G<br />
H<br />
~ T G<br />
H<br />
~ <br />
H <br />
A<br />
λ <br />
A<br />
.<br />
(6)<br />
i<br />
A<br />
i<br />
i<br />
i<br />
i<br />
<br />
As the property <strong>of</strong> FEM matrix, H is symmetric. Taking<br />
account <strong>of</strong> this point, (7) is obtained by transforming (6).<br />
Hλ.<br />
A <br />
W<br />
(7) <br />
The is called “adjoint variable”. is obtained by<br />
solving (7). Finally, the whole sensitivity vector is<br />
obtained by substituting into (6).<br />
B. Basic concept <strong>of</strong> proposed method<br />
Here, we carried out parallel computing based on Open<br />
MP for topology optimization by using a computer with 8<br />
cores. The basic concept <strong>of</strong> proposed method is shown in<br />
figure 1.<br />
i
Figure 1: Basic concept <strong>of</strong> proposed method.<br />
The proposed method is based on gradient method, where<br />
the solution depends on initial values. Therefore, we<br />
regard the data set <strong>of</strong> initial topology, optimized<br />
topology, and object function value as the properties <strong>of</strong><br />
one “individual”.<br />
First, we prepare 8 individuals i.e., the number <strong>of</strong><br />
cores, by creating random initial topologies and<br />
computing their optimized topologies. Next, a new initial<br />
topology is created with property information <strong>of</strong> 2<br />
selected individuals. The process <strong>of</strong> creating a new<br />
topology with 2 individuals’ information is named<br />
“intercross”. We prepare new 8 initial topologies for the<br />
next generation, and obtain the data sets by optimizing<br />
new initial topologies. The better solution will be<br />
obtained by repeating the above cycle.<br />
In the next section, we explain how to create initial<br />
topologies in the next generation with 2 individuals.<br />
C. How to select 2 individuals for intercross<br />
Three manners <strong>of</strong> selection <strong>of</strong> 2 individuals for<br />
intercross are proposed.<br />
a) We combined 2 individuals with superior object<br />
function value to give properties <strong>of</strong> superior individuals<br />
to the next generation. In this paper, the combinations <strong>of</strong><br />
individuals in the top 4 from the viewpoint <strong>of</strong> object<br />
function superiority are selected for intercross. We create<br />
3 initial topologies in the next generations based on this<br />
selection.<br />
b) We combined 2 individuals with different optimized<br />
topologies. The subject <strong>of</strong> the selection is to make<br />
diversity for intercross. The difference between 2<br />
topologies is evaluated in the following rule.<br />
As shown in figure 2, design domain is separated into<br />
some areas. Next, we define a reference value in the i th<br />
area with the density values as<br />
area<br />
i<br />
<br />
Ni<br />
<br />
2<br />
j<br />
j (8)<br />
.<br />
N<br />
Ni is the number <strong>of</strong> elements in the i th area. We define a<br />
vector C as in (9).<br />
i<br />
C area , area ,..., area ). (9)<br />
( 1 2<br />
8<br />
- 289 - 15th IGTE Symposium 2012<br />
Figure 2: Separation <strong>of</strong> design domain.<br />
The vector C expresses the character <strong>of</strong> the optimized<br />
topology. The characteristic difference <strong>of</strong> 2 topologies A<br />
and B, is evaluated as (10).<br />
2<br />
B<br />
Diff C C . (10)<br />
AB<br />
The combination <strong>of</strong> 2 individuals with larger value <strong>of</strong><br />
(10) is selected for intercross. We create 3 initial<br />
topologies in the next generations based on this selection.<br />
c) We combined 2 individuals chosen randomly. The<br />
subject <strong>of</strong> the selection is also to make diversity <strong>of</strong><br />
intercross. We create 2 initial topologies in the next<br />
generations based on this selection.<br />
8 initial topologies in the next generation are created<br />
based on the selections a)-c).<br />
D. How to make new initial topologies<br />
Here, 2 individuals selected for intercross are named as<br />
IA and IB, and new individual is named as IC. The initial<br />
topology <strong>of</strong> IC is created in the following three kinds <strong>of</strong><br />
manners, which are adopted randomly.<br />
a) The density values <strong>of</strong> IC’s initial topology are created<br />
as the weighted mean <strong>of</strong> those <strong>of</strong> initial topologies <strong>of</strong> IA<br />
and IB.<br />
b) The density values <strong>of</strong> IC’s initial topology are created<br />
as the weighted mean <strong>of</strong> those <strong>of</strong> optimized topologies <strong>of</strong><br />
IA and IB.<br />
To inherit the properties <strong>of</strong> superior individuals to the<br />
next generation, the weight coefficients <strong>of</strong> IA and IB are<br />
formulated as<br />
,<br />
Ci<br />
Ai<br />
W<br />
<br />
W<br />
3<br />
A<br />
3<br />
3<br />
A WB<br />
,<br />
Bi<br />
A<br />
W<br />
<br />
W<br />
3<br />
B<br />
3<br />
3<br />
A WB<br />
(11)<br />
,<br />
where, WA and WB are object function values <strong>of</strong> IA and IB,<br />
and i stands for the element number.<br />
c) The density values <strong>of</strong> IC’s initial topology are created<br />
as the multiplication <strong>of</strong> initial topology <strong>of</strong> IA and<br />
optimized topology <strong>of</strong> IB.<br />
. (12)<br />
Ci<br />
Ai<br />
Bi
The distribution <strong>of</strong> density value i <strong>of</strong> 0 in optimized<br />
topology will be strongly inherited to the next generation.<br />
The probabilities <strong>of</strong> occurrence <strong>of</strong> a),b), and c) are 3/7,<br />
3/7, 1/7 respectively.<br />
III. NUMERICAL EXAMPLE<br />
A. C-shaped iron core model<br />
To demonstrate the validity <strong>of</strong> the proposed method, we<br />
carried out the optimization <strong>of</strong> the model shown in figure<br />
3.<br />
Figure 3: Magnetic force model. (unit : mm)<br />
The main subject <strong>of</strong> this optimization problem is to<br />
maximize the electromagnetic force generated in the<br />
magnetic bar lying in the right side <strong>of</strong> design domain. The<br />
density values <strong>of</strong> elements in the design domain are<br />
design variables <strong>of</strong> this optimization [3]. To evaluate the<br />
electromagnetic force, we adopt the following equation as<br />
the object function <strong>of</strong> this optimization,<br />
1<br />
2<br />
By<br />
W , (13)<br />
where, By is the y component <strong>of</strong> magnetic flux density<br />
vector generated in the target element. This model is<br />
discretized by triangular elements. The numbers <strong>of</strong><br />
elements, in the design domain and whole domain, are<br />
1,575 and 5,576 respectively.<br />
Next, the detail <strong>of</strong> the proposed method is explained.<br />
Initial topologies <strong>of</strong> the 1 st generation are created as<br />
random values i which range from 0 to 1. The initial<br />
magnetic permeability distribution is determined as to<br />
(1). The best object function value before the n th<br />
generation is Wbest, and the best object function value in<br />
the n+1 th generation is Wn+1. If Wn+1 Wbest , we update<br />
the value <strong>of</strong> Wbest as Wn+1. If the update doesn’t occur over<br />
10 generations, the calculation is terminated.<br />
- 290 - 15th IGTE Symposium 2012<br />
Figure 4: Flow chart <strong>of</strong> proposed method.<br />
Now, for the comparison with the proposed method, the<br />
optimization without intercross is also applied to this<br />
model. In the method, the initial topologies in whole<br />
generation are created as randomly as that <strong>of</strong> the 1 st<br />
generation. This method is named “random method”.<br />
2 pattern <strong>of</strong> initial topologies are prepared as those <strong>of</strong> in<br />
the 1 st generation, which are named as ITA and ITB. An<br />
example <strong>of</strong> initial topology in 1 st generation is shown in<br />
figure 5.<br />
Figure 5: Density distribution example <strong>of</strong> initial topology.
Wbest<br />
best object funciton value<br />
68<br />
67.5<br />
67<br />
66.5<br />
66<br />
65.5<br />
65<br />
64.5<br />
64<br />
63.5<br />
Initial<br />
pattern<br />
IT A<br />
IT B<br />
random method (ITA) proposed method<br />
random method (ITB) proposed method (ITB) 0 10 20<br />
generations<br />
30 40<br />
Figure 6: Convergence characteristic <strong>of</strong> each method.<br />
TABLE I<br />
THE OPTIMIZATION RESULTS OF BOTH METHODS.<br />
method<br />
total<br />
generations<br />
Wbest<br />
(IT A)<br />
(a) The obtained topology with random method<br />
from initial pattern ITA.<br />
CPU<br />
time(sec.)<br />
Random 20 65.0926 764.56<br />
Proposed 34 64.0327 347.91<br />
Random 25 64.2958 954.66<br />
Proposed 17 63.9976 229.98<br />
- 291 - 15th IGTE Symposium 2012<br />
Figure 7: Comparison <strong>of</strong> CPU time.<br />
We carry out the optimizations with these topologies by<br />
the proposed method and the random method. The<br />
convergence characteristics <strong>of</strong> both methods are shown in<br />
figure 6. In random method, the improvement <strong>of</strong> object<br />
function value is obviously inefficient, and the<br />
computational result is in the local minimum. By<br />
contrast, the proposed method enables us to improve the<br />
object function value more efficiently, and to achieve<br />
better solution than that <strong>of</strong> the random method. The<br />
object function values Wbest <strong>of</strong> both methods are shown in<br />
table 1.<br />
(b) The obtained topology with proposed method<br />
from initial pattern ITA.<br />
(c) The obtained topology with random method<br />
(d) The obtained topology with proposed method<br />
from initial pattern ITB. Figure 8: Optimization result <strong>of</strong> each method.<br />
from initial pattern ITB.<br />
frequency<br />
0.7<br />
0.6<br />
0.5<br />
0.4<br />
0.3<br />
0.2<br />
0.1<br />
0<br />
proposed method<br />
random method<br />
2 6 10 14 18 22 26 30 34 38 42 46<br />
calculation time[CPU sec.]
The frequency distribution <strong>of</strong> calculation time for each<br />
individual is shown in Figure 7. Figure 7 indicates that<br />
the calculation time for each individual is sufficiently<br />
reduced in the proposed method because the starting<br />
points for optimization, i.e., the initial topologies, are<br />
effectively created near stationary points. We can<br />
estimate the total CPU time by multiplying the<br />
calculation time for individuals in one generation by the<br />
required number <strong>of</strong> generations. As the results, the CPU<br />
time <strong>of</strong> the proposed method is much less than that <strong>of</strong> the<br />
random method in spite <strong>of</strong> their total generations required<br />
for convergence as shown in table I.<br />
The obtained topologies by both methods are shown in<br />
Figure 8. Black elements correspond to the magnetic<br />
material with i = 1, and white elements to air elements<br />
with i = 0. Gray elements have an intermediate property<br />
between air and magnetic material with 0 < i < 1 which<br />
are named as “gray scale”. The topologies obtained by<br />
the random method strongly depend on their initial<br />
variables and contain many gray scales. On the contrary,<br />
we can obtain the similar topologies without gray scales<br />
by using the proposed method.<br />
B. Magnetic shielding model<br />
Proposed method is applied to the optimization <strong>of</strong> the<br />
shield model shown in Figure 9.<br />
y<br />
x<br />
Figure 9: shield model. (unit : mm)<br />
The main subject <strong>of</strong> the optimization problem is to<br />
minimize the magnetic flux entering into target domain<br />
generated by 2 surrounding coils. The density values <strong>of</strong><br />
elements in the design domain are design variables <strong>of</strong> the<br />
optimization. The object function is defined as (14).<br />
W <br />
2<br />
Bx<br />
<br />
t arg et<br />
domain<br />
2<br />
By<br />
.<br />
(14)<br />
- 292 - 15th IGTE Symposium 2012<br />
This model is discretized by triangular elements. The<br />
number <strong>of</strong> elements, in the design domain and the whole<br />
domain, are 3,800 and 12,106 respectively.<br />
The optimization results <strong>of</strong> the random and the proposed<br />
methods are shown in Figures 10-13.<br />
Wbest<br />
best object funtion value<br />
ferquency<br />
0.0012<br />
0.001<br />
0.0008<br />
0.0006<br />
0.0004<br />
0.0002<br />
0.45<br />
0.4<br />
0.35<br />
0.3<br />
0.25<br />
0.2<br />
0.15<br />
0.1<br />
0.05<br />
0<br />
random method<br />
proposed method<br />
0 20 40 60 80<br />
generations<br />
Figure 10: Convergence characteristic <strong>of</strong> object function.<br />
0<br />
Wbest = 4.0610 -4<br />
0<br />
24<br />
48<br />
72<br />
96<br />
120<br />
144<br />
168<br />
192<br />
216<br />
240<br />
264<br />
288<br />
312<br />
336<br />
360<br />
384<br />
408<br />
432<br />
calculation time (sec.)<br />
Wbest = 6.9010 -5<br />
random method<br />
proposed method<br />
Figure 11: Comparison <strong>of</strong> CPU time.<br />
Fig 12: Result <strong>of</strong> optimization. (random method)
Figure 13: Result <strong>of</strong> optimization. (proposed method)<br />
The computational results demonstrate the effectiveness<br />
<strong>of</strong> the proposed method compared with the random<br />
method as is the case with the magnetic force model.<br />
We can successfully obtain the multilayer structure as<br />
an effective topology <strong>of</strong> the shield as shown in Figure 13.<br />
IV. CONCLUSION<br />
In this paper, a topology optimization using parallel<br />
search strategy for magnetic devices is proposed to<br />
efficiently obtain more global solutions. In the proposed<br />
method, we effectively introduce the novel concept <strong>of</strong><br />
intercross into the density method with sensitivity<br />
analysis, which results in the CPU time reduction with<br />
keeping the optimization quality.<br />
We will investigate various algorithms <strong>of</strong> intercross,<br />
and apply the proposed method to more practical<br />
nonlinear problems as future works.<br />
REFERENCES<br />
[1] H. P. Mlejnek and R. Schirrmacher, "An engineer's approach to<br />
optimal material distribution and shape finding," Comput.<br />
Methods Appl. Mech. Eng., Vol. 106, pp. 1-26 (1993).<br />
[2] S.Gitosusastro, J.L.Coulomb and J.C. Sabonnadiere, "Performance<br />
derivative calculations and optimization process," IEEE Trans.<br />
Magn, Vol.25, No.4, pp. 2834-2839(1989)<br />
[3] Yoshihumi Okamoto, and Norio Takahashi, "Investigation <strong>of</strong><br />
Topology Optimization <strong>of</strong> Magnetic Circuit by Using Density<br />
Method", IEEJ Trans. IA, Vol.124, No.12, pp. 1228-1235(2004).<br />
[4] Jin-kyu Byun, Il-han Park, and Song-yop Hahn, "Topology<br />
optimization <strong>of</strong> electrostatic actuator using design sensitivity,"<br />
IEEE Trans. Magn. Vol.38, No. 2, pp. 1053-1056 (2002).<br />
[5] Jin-kyu Byun and Song-yop Hahn, "Application <strong>of</strong> topology<br />
optimization to electromagnetic system," International journal <strong>of</strong><br />
applied electromagnetics and mechanics, Vol. 13, No. 1-4, pp. 25-<br />
33 (2002).<br />
- 293 - 15th IGTE Symposium 2012
- 294 - 15th IGTE Symposium 2012
- 295 - 15th IGTE Symposium 2012
- 296 - 15th IGTE Symposium 2012
- 297 - 15th IGTE Symposium 2012
- 298 - 15th IGTE Symposium 2012
- 299 - 15th IGTE Symposium 2012
- 300 - 15th IGTE Symposium 2012<br />
Interaction Magnetic Force Calculation <strong>of</strong> Axial<br />
Passive Magnetic Bearing Using Magnetization<br />
Charges and Discretization Technique<br />
*Saša S. Ilić, *Ana N.Vučković and *Slavoljub Aleksić<br />
*<strong>University</strong> <strong>of</strong> Niš, Faculty <strong>of</strong> Electronic Engineering <strong>of</strong> Niš, Aleksandra Medvedeva 14, 18000 Niš, Serbia<br />
E-mail: ana.vuckovic@elfak.ni.ac.rs<br />
Abstract— The paper presents calculation <strong>of</strong> the force between two ring permanent magnets whose magnetization is axial.<br />
Such configuration corresponds to a passive magnetic bearing. The simple and fast analytical approach is used for this<br />
calculation based on magnetization charges and discretization technique. The results for interaction magnetic force obtained<br />
using proposed approach are compared with finite element method using FEMM 4.2 s<strong>of</strong>tware.<br />
Index Terms— Permanent magnet, interaction magnetic force, magnetization charges, discretization technique, Finite<br />
Element Method (FEM).<br />
charges for uniform magnetization do not exist. nˆ is the<br />
unit vector normal to surface.<br />
I. INTRODUCTION<br />
Permanent magnets are used nowadays in many<br />
applications, and the general need for dimensioning and<br />
optimizing leads to the development <strong>of</strong> calculation<br />
methods. Permanent magnets are commonly used in<br />
many electrical devices and their own quality depends on<br />
the magnet material, magnetization and dimensions. Two<br />
major kinds <strong>of</strong> applications can be identified: the ones<br />
which use block magnets and the ones which use<br />
cylindrical magnets. Block permanent magnets are easy<br />
to manufacture and to magnetize, and it’s easier to<br />
calculate magnetic field they create [1-3]. Indeed, most<br />
engineering applications need several ring permanent<br />
magnets and the determination <strong>of</strong> the magnetic force<br />
between them is thus required.<br />
Magnetic bearings are contactless suspension devices<br />
with various rotating and translational applications [4].<br />
Depending on the ring permanent magnet magnetization<br />
direction, the devices work as axial or radial bearings and<br />
thus control the position along an axis or the centering <strong>of</strong><br />
an axis. Knowledge <strong>of</strong> the interaction magnetic force is<br />
required to control devices reliably.<br />
There are numerous techniques for analyzing permanent<br />
magnet devices and different approaches for determining<br />
interaction forces between them [5-8]. Many authors<br />
are proposing simplified and robust formulations <strong>of</strong> the<br />
interaction forces created by permanent magnets. The<br />
authors generally use the Ampere's current model [9],[10]<br />
or the Columbian approach [11],[12]. Several application<br />
examples were previously presented in [13],[14] where<br />
levitation forces for magnetic bearings were calculated.<br />
II. THEORETICAL BACKGROUND<br />
Axial passive magnetic bearing [5] that is considered in<br />
the paper is presented in the Fig.1.<br />
Since the boundary condition for surface magnetization<br />
charges density has to be satisfied for both magnets,<br />
m ˆ<br />
1 n M1<br />
and m ˆ<br />
2 n M2<br />
, (1)<br />
it is obvious that fictitious surface magnetization charges<br />
[2] exist only on the bottom and the top bases <strong>of</strong> each<br />
permanent magnet, because volume magnetization<br />
Figure 1: Axial passive magnetic bearing.<br />
The simplest procedure for interaction magnetic force<br />
determination is to discretize each base <strong>of</strong> both<br />
permanent magnets into system <strong>of</strong> circular loops. The<br />
interaction force between two magnetized circular loops<br />
will be calculated first. That will be performed by<br />
calculating the magnetic field and magnetic flux density<br />
generated by the arbitrary magnetized circular loop <strong>of</strong> the<br />
upper magnet first and then the force that acts on the<br />
arbitrary loop <strong>of</strong> the lower magnet (Figure 2). Magnetic<br />
field <strong>of</strong> the upper loop will be determined by calculating<br />
the magnetic scalar potential. Using results for interaction<br />
magnetic force between two circular loops, magnetic<br />
force <strong>of</strong> the axial magnetic bearing can be obtained by<br />
summing the contribution <strong>of</strong> both magnet bases <strong>of</strong> lower<br />
and upper permanent magnets by using uniform<br />
discretization technique.<br />
The goal <strong>of</strong> this approach is to determine the interaction<br />
magnetic force between two circular loops uniformly<br />
loaded with magnetization charges Qm1 and Q m2<br />
.<br />
Dimensions and positions <strong>of</strong> the loops are presented in<br />
the Figure 2. For determining the interaction force<br />
between two circular loops, magnetic scalar potential,<br />
magnetic field and magnetic flux density generated by<br />
the upper loop will be calculated. Elementary magnetic<br />
scalar potential generated by the elementary point<br />
magnetization charge, dQ m1,is
Figure2: Two circular loops.<br />
dQm1<br />
1<br />
d m . (2)<br />
4<br />
R<br />
Qm1<br />
Qm1<br />
Since dQ<br />
m1<br />
Qm<br />
1 dl<br />
a d '<br />
d '<br />
,<br />
2a<br />
2<br />
elementary magnetic scalar potential has the following<br />
form<br />
Qm1<br />
1<br />
d m<br />
d '<br />
, (3)<br />
2<br />
8<br />
R<br />
and the resulting magnetic scalar potential generated by<br />
the upper circular loop at an arbitrary point P( r,<br />
,<br />
z)<br />
is<br />
2<br />
Qm1<br />
1<br />
m <br />
d '<br />
,<br />
2<br />
8<br />
2 2<br />
2<br />
0 r r0<br />
zz0 2r0r<br />
cos'<br />
(4)<br />
Considering the existing symmetry, in 0 plane,<br />
magnetic scalar potential has the following form<br />
<br />
Qm1<br />
1<br />
m ( r,<br />
z)<br />
<br />
d '.<br />
2<br />
4<br />
2 2<br />
2<br />
0 r r0<br />
zz0 2rr0<br />
cos '<br />
(5)<br />
Substituting θ' 2<br />
in Eq. (5), magnetic scalar<br />
potential is obtained as:<br />
2<br />
Qm1<br />
m ( r,<br />
z)<br />
<br />
2<br />
2<br />
<br />
0<br />
1<br />
2<br />
2<br />
2<br />
( r r0<br />
) 4rr0<br />
sin zz0 d <br />
. (6)<br />
After some simple operations the magnetic scalar<br />
potential can be given in the form:<br />
where<br />
m<br />
( r,<br />
z)<br />
<br />
Qm1<br />
2<br />
2<br />
2<br />
<br />
<br />
K<br />
, k <br />
2 <br />
2<br />
( r r0<br />
) <br />
2 z z<br />
<br />
1<br />
K , k K <br />
,<br />
2 <br />
2 2<br />
1<br />
k sin<br />
0<br />
0<br />
d<br />
<br />
- 301 - 15th IGTE Symposium 2012<br />
(7)<br />
is complete elliptic integral <strong>of</strong> the first kind with modulus<br />
2 4rr0<br />
k .<br />
2<br />
2<br />
( r r0<br />
) ( z z0<br />
)<br />
External magnetic field (magnetic field generated by the<br />
upper loop) at an arbitrary point can be determined as<br />
ext<br />
H ( r, z)<br />
grad<br />
m<br />
( r,<br />
z)<br />
Hr<br />
( r,<br />
z)<br />
rˆ<br />
H z ( r,<br />
z)<br />
zˆ<br />
,<br />
(8)<br />
External magnetic flux density is<br />
ext<br />
ext<br />
ext<br />
ext<br />
B ( r, z)<br />
H ( r,<br />
z)<br />
, (9)<br />
ext<br />
r<br />
ext<br />
r<br />
ext<br />
z<br />
with components<br />
ext<br />
Br <br />
<br />
<br />
<br />
2<br />
<br />
r<br />
<br />
<br />
2r<br />
and<br />
0<br />
B ( , z)<br />
B ( r,<br />
z)<br />
rˆ<br />
B ( r,<br />
z)<br />
zˆ<br />
. (10)<br />
( r,<br />
z)<br />
<br />
ext<br />
ext H<br />
( r,<br />
z)<br />
Br ( r,<br />
z)<br />
0<br />
, (11)<br />
r<br />
2 2<br />
2 <br />
rr0zz0 E,<br />
k <br />
2 <br />
2<br />
2<br />
2<br />
( r r ) zz ( r r ) zz 0<br />
( r r<br />
<br />
K<br />
, k <br />
2 <br />
0<br />
)<br />
B z<br />
0<br />
2<br />
Q<br />
m1<br />
2<br />
2<br />
<br />
<br />
0<br />
zz 0<br />
2<br />
<br />
<br />
<br />
<br />
,<br />
<br />
<br />
<br />
0<br />
ext<br />
0<br />
2<br />
<br />
(12)<br />
ext H<br />
( r,<br />
z)<br />
( r,<br />
z)<br />
0<br />
, (13)<br />
z<br />
ext Qm1<br />
Bz ( r,<br />
z)<br />
0<br />
<br />
2<br />
2<br />
<br />
zz0E, k <br />
2 <br />
2 2<br />
2<br />
2<br />
( r r ) z z ( r r ) z z<br />
0<br />
0<br />
2<br />
<br />
0<br />
0<br />
(14)<br />
<br />
2 2<br />
where E , k E 1<br />
k sin d ,<br />
2 <br />
0<br />
is complete elliptic integral <strong>of</strong> the second kind with<br />
modulus<br />
2 4rr0<br />
k <br />
2<br />
2<br />
( r r0<br />
) ( z z0<br />
)<br />
.<br />
The interaction magnetic force on elementary<br />
magnetization charge <strong>of</strong> lower circular loop<br />
Q<br />
m2<br />
Qm2<br />
dQm<br />
2 Qm2<br />
dl<br />
bd<br />
d<br />
is<br />
2b<br />
2<br />
ext<br />
d m2<br />
m m<br />
F dQ<br />
B ( r , z ) . (15)<br />
Finally, interaction magnetic force components can be<br />
expressed as:
Qm1Qm<br />
2<br />
Fr ( r,<br />
z)<br />
0<br />
<br />
2<br />
2<br />
<br />
<br />
<br />
<br />
2<br />
<br />
rm<br />
<br />
m 0 m 0 m 0<br />
<br />
<br />
K<br />
, k0<br />
<br />
2 <br />
2rm<br />
2<br />
( rm<br />
r0<br />
) m<br />
2 2<br />
2 <br />
rmr0zmz0 E,<br />
k0<br />
<br />
2 <br />
2<br />
2<br />
2<br />
( r r ) zz ( r r ) zz Qm1Qm<br />
2<br />
Fz ( r,<br />
z)<br />
0<br />
<br />
2<br />
2<br />
<br />
zmz0E, k0<br />
<br />
2 <br />
m<br />
zz 0<br />
2<br />
<br />
<br />
<br />
0<br />
2 <br />
0 <br />
<br />
<br />
2 2<br />
2<br />
2<br />
( rm<br />
r0<br />
) zm<br />
z0<br />
( rm<br />
r0<br />
) zm<br />
z0<br />
(16)<br />
- 302 - 15th IGTE Symposium 2012<br />
(17)<br />
Q<br />
( , ) m1Q<br />
F m 2<br />
z r z 0<br />
F ( r0,<br />
rm,<br />
z0,<br />
zm)<br />
2 z<br />
, (18)<br />
p<br />
2<br />
with elliptic integrals modulus<br />
2 4r0rm<br />
k0<br />
2<br />
2<br />
( rm<br />
r0<br />
) ( zi<br />
zm<br />
) <br />
.<br />
The axial component <strong>of</strong> the force (17) presents<br />
interaction force between two magnetized circular loops.<br />
The simplest procedure for levitation magnetic force<br />
determination is to discretize each bases <strong>of</strong> permanent<br />
magnets into system <strong>of</strong> circular loops, where N 1 is the<br />
number <strong>of</strong> discretized segments <strong>of</strong> each bases <strong>of</strong> upper<br />
permanent magnet and N 2 is the number <strong>of</strong> discretized<br />
segments <strong>of</strong> each bases <strong>of</strong> lower permanent magnet.<br />
Figure 3: Discretizing model.<br />
By taking into account the ring geometry <strong>of</strong> permanent<br />
magnets (Figure 3), the radius <strong>of</strong> each discretized<br />
segment <strong>of</strong> both bases <strong>of</strong> upper magnet is<br />
r n<br />
2n<br />
1<br />
a ( b a),<br />
n 1,<br />
2,<br />
,<br />
N1<br />
2N1<br />
, (19)<br />
and magnetization loop charges <strong>of</strong> upper permanent<br />
magnet bases are<br />
b a<br />
Qm n M12rn<br />
, n 1,<br />
2,...,<br />
N1<br />
N1<br />
. (20)<br />
For lower magnet bases the radius <strong>of</strong> each discretized<br />
segments is<br />
r i<br />
2i<br />
1<br />
C ( d c),<br />
i 1,<br />
2,<br />
,<br />
N2<br />
2N<br />
2<br />
. (21)<br />
Magnetization loop charges <strong>of</strong> lower permanent magnet<br />
bases are<br />
d c<br />
Qm i M 2 2ri , i 1,<br />
2,...,<br />
N2<br />
N2<br />
. (22)<br />
Using results for interaction magnetic force between<br />
two circular loops, Eqs. (17), the levitation magnetic<br />
force between two ring permanent magnets can be<br />
obtained. It can be achieved by summing the contribution<br />
<strong>of</strong> both magnet bases <strong>of</strong> lower and upper permanent<br />
magnets by using uniform discretization technique,<br />
<br />
<br />
1 2<br />
20<br />
M1M<br />
2<br />
Fz<br />
( b a)(<br />
d c)<br />
rnri<br />
<br />
N N <br />
<br />
F<br />
F<br />
zp<br />
zp<br />
1<br />
2<br />
( r , r , h,<br />
0)<br />
F<br />
n<br />
( r , r , h L , 0)<br />
F<br />
n<br />
i<br />
i<br />
1<br />
Lh zp<br />
n1<br />
i1<br />
( r , r , h,<br />
L ) <br />
n<br />
zp<br />
<br />
E<br />
, k3<br />
<br />
2 <br />
i<br />
N<br />
2<br />
N<br />
( rn<br />
, ri<br />
, h L1,<br />
L2<br />
)<br />
; (23)<br />
2<br />
0M<br />
1M<br />
2<br />
Fz<br />
<br />
( b a)(<br />
d c)<br />
<br />
N1N<br />
2<br />
<br />
<br />
N1<br />
N2<br />
hE<br />
, k1<br />
<br />
<br />
2 <br />
rn<br />
ri<br />
<br />
<br />
2 2<br />
2 2<br />
n1<br />
i1 <br />
<br />
( ri<br />
rn<br />
) h ( ri<br />
rn<br />
) h<br />
<br />
<br />
L2hE, k2<br />
<br />
2 <br />
<br />
2<br />
2<br />
2<br />
2<br />
( ri<br />
rn<br />
) L2h ( ri<br />
rn<br />
) L2h 2<br />
2<br />
2<br />
( r r ) Lh ( r r ) Lh i<br />
1<br />
LLh <br />
E<br />
, k4<br />
<br />
2 <br />
2<br />
2<br />
2<br />
( r r ) LLh ( r r ) LLh i<br />
n<br />
n<br />
where<br />
2 4rnri<br />
k1<br />
,<br />
2 2<br />
( ri<br />
rn<br />
) h<br />
2 2 4rnri<br />
k2<br />
,<br />
2<br />
ri<br />
rn<br />
L2<br />
h<br />
2 2 4rnri<br />
k3<br />
,<br />
2<br />
ri<br />
rn<br />
L1<br />
h<br />
2 2<br />
4rnri<br />
k4<br />
.<br />
2<br />
ri<br />
rn<br />
L2<br />
L1<br />
h<br />
2<br />
1<br />
2<br />
1<br />
1<br />
i<br />
n<br />
i<br />
n<br />
1<br />
2<br />
2<br />
1<br />
<br />
<br />
<br />
<br />
2 <br />
<br />
<br />
<br />
(24)<br />
III. NUMERICAL RESULTS<br />
We are working under presumption that the both ring<br />
permanent magnets are made <strong>of</strong> the same material and<br />
magnetized uniformly along their axis <strong>of</strong> symmetry, but<br />
in opposite direction, M1 M 2 M .
Distribution <strong>of</strong> magnetic flux density obtained using<br />
FEMM 4.2 s<strong>of</strong>tware [15] is presented in Fig. 6. The<br />
values <strong>of</strong> the geometrical parameters used in the<br />
numerical computation are: 2 1,<br />
L a , 2 2 L b<br />
c L2<br />
3,<br />
d / L2<br />
4,<br />
L1<br />
/ L2<br />
0.<br />
5,<br />
2 1.<br />
5,<br />
L h<br />
2 1mm<br />
L and kA/m 900 M .<br />
Convergence <strong>of</strong> the normalized interaction force,<br />
nor Fz<br />
Fz<br />
obtained using presented approach is<br />
2 2<br />
0M<br />
L2<br />
,<br />
given in Table I for magnetic bearing dimensions:<br />
a L2<br />
1, b L2<br />
2,<br />
2 3,<br />
L c , 4 2 L d , 5 . 0 / L1<br />
L2<br />
<br />
h / L2<br />
0.<br />
1 .<br />
Figure 4: Distribution <strong>of</strong> magnetic flux density for magnetic bearing<br />
obtained using FEMM 4.2 s<strong>of</strong>tware.<br />
TABLE I<br />
CONVERGENCE OF LEVITATION MAGNETIC FORCE FORCE VERSUS<br />
NUMBER OF SEGMENTS.<br />
N tot<br />
nor<br />
F z<br />
nor<br />
F z (FEM)<br />
10 -0.0732327<br />
20 -0.0741579<br />
30 -0.0743319<br />
50<br />
100<br />
-0.0744213<br />
-0.0744591<br />
-0.07491167<br />
200 -0.0744685<br />
300 -0.0744703<br />
500 -0.0744712<br />
In order to save the calculation time, the number <strong>of</strong><br />
segments is limited on N tot N1<br />
N2<br />
200 because it<br />
is not necessary to take a greater number <strong>of</strong> segments to<br />
obtain a desired accuracy.<br />
Compared results for normalized interaction magnetic<br />
force <strong>of</strong> two identical ring permanent magnets, obtained<br />
using presented analytical approach and finite element<br />
method (FEM) versus 2 L h , for parameters: , 1 2 L a<br />
2 2,<br />
L b c L2<br />
3, d L2<br />
4 and 5 . 0 / L 1 L2<br />
are<br />
given in the Table II.<br />
Comparative results for normalized interaction<br />
magnetic force <strong>of</strong> axial passive magnetic bearing versus<br />
ratios 2 L a and 2 L b , obtained using presented approach<br />
and finite element method (FEM), for parameters:<br />
c L2<br />
3, d L2<br />
4,<br />
L1<br />
/ L2<br />
0.<br />
5 and h / L2<br />
1.<br />
5 are<br />
shown in the Table III.<br />
- 303 - 15th IGTE Symposium 2012<br />
TABLE II<br />
COMPARED RESULTS FOR INTERACTION MAGNETIC VERSUS h L2<br />
h / L2<br />
nor<br />
F z<br />
nor<br />
F z (FEM)<br />
0 -0.120071 -0.120515<br />
0.1 -0.074469 -0.074911<br />
0.2 -0.025238 -0.025659<br />
0.3 0.025238 0.024873<br />
0.4 0.074469 0.074099<br />
0.5 0.120071 0.119765<br />
0.6 0.159974 0.159691<br />
0.7 0.192597 0.192327<br />
0.8 0.216978 0.216771<br />
0.9 0.232821 0.232627<br />
1.0 0.240465 0.240293<br />
1.1 0.240766 0.240654<br />
1.2 0.234930 0.234829<br />
1.3 0.224326 0.224240<br />
1.4 0.210322 0.210279<br />
1.5 0.194163 0.194138<br />
TABLE III<br />
COMPARED RESULTS FOR INTERACTION MAGNETIC FORCE VERSUS<br />
a L AND<br />
2 b L2<br />
a / L2<br />
b/<br />
L2<br />
nor<br />
F z<br />
nor<br />
F z (FEM)<br />
1.0 2.0 0.194163 0.194138<br />
1.5 2.5 0.320992 0.321208<br />
2.0 3.0 0.209864 0.210493<br />
2.5 3.5 -0.614301 -0.613222<br />
3.0 4.0 -1.341280 -1.339868<br />
3.5 4.5 -0.714116 -0.712579<br />
4.0 5.0 0.272002 0.273619<br />
4.5 5.5 0.491236 0.492676<br />
5.0 6.0 0.352195 0.353756<br />
IV. CONCLUSION<br />
Determination <strong>of</strong> the interaction forces <strong>of</strong> axial passive<br />
magnetic bearing is presented. It is preformed using<br />
magnetization charges and discretization technique.<br />
Presumption was that both magnets are made <strong>of</strong> the<br />
same material and magnetized uniformly along the<br />
magnet axis <strong>of</strong> symmetry, with the same intensity, but in<br />
opposite directions. The derived algorithm is easily<br />
implemented in any standard computer environment and<br />
it enables rapid parametric studies <strong>of</strong> the interaction<br />
force. The results <strong>of</strong> the presented approach are<br />
successfully confirmed using FEMM 4.2 s<strong>of</strong>tware. Table<br />
I shows that it is not necessary to take a great number <strong>of</strong><br />
segments (not more then 200) to obtain a desired<br />
accuracy so the computational time can be saved.<br />
Interaction forces calculation using presented approach<br />
for mentioned parameters and N tot 200 is performed<br />
with Intel Core 2 Duo CPU at 2.4GHz and 4GB RAM<br />
memory and it took less than two seconds <strong>of</strong> run time.<br />
Interaction forces are also determined on the same<br />
computer using FEMM 4.2 s<strong>of</strong>tware and the computation<br />
time was 14 minutes for about 1.8million finite elements.<br />
Therefore, the advantage <strong>of</strong> presented analytical approach<br />
is its accuracy, simplicity and time efficiency.
V. ACKNOWLEDGEMENT<br />
The work presented here was partly supported by the<br />
Serbian Ministry <strong>of</strong> Education and Science in the frame<br />
<strong>of</strong> the project TR 33008.<br />
REFERENCES<br />
[1] J. S Agashe and D. P Arnold, “A study <strong>of</strong> scaling and geometry<br />
effects on the forces between cuboidal and cylindrical magnets<br />
using analytical force solutions”, J. Phys. D: Appl. Phys. 41<br />
105001, pp.1-9, 2008.<br />
[2] A. N. Vučković, S. R. Aleksić, S. S. Ilić.: “Calculation <strong>of</strong> the<br />
Attraction and Levitation Forces Using Magnetization Charges”,<br />
The 10th International Conference on Applied Electromagnetics –<br />
PES 2011, <strong>Proceedings</strong> <strong>of</strong> full papers (CDROM), pp. 33-55, 25-<br />
29 September, Niš, Serbia, 2011.<br />
[3] G. Akoun, J. P. Yonnet.: “3d Analytical Calculation <strong>of</strong> the Forces<br />
Exerted between two Cuboidal Magnets”, IEEE Transactions on<br />
Magnetics, Vol. 20, No. 5, pp. 1962-1964, September 1984.<br />
[4] S. I. Babic, C. Akyel.: “Magnetic Force Calculation between Thin<br />
Coaxial Circular Coils in Air”, IEEE Transactions on Magnetics,<br />
Vol. 44, No. 4, pp. 445-452, April 2008.<br />
[5] V. Lemarquand, G. Lemarquand.: “Passive Permanent Magnet<br />
Bearings for Rotating Shaft: Analytical Calculation”, Magnetic<br />
Bearings, Theory and Applications, Sciyo Published book, pp. 85-<br />
116, October 2010.<br />
[6] R. Ravaud, G. Lemarquand, V. Lemarquand.: “Force and<br />
Stiffness <strong>of</strong> Passive Magnetic Bearings Using Permanent Magnets.<br />
Part 1: Axial Magnetization”, IEEE Transactions on Magnetics,<br />
Vol. 45, No. 7, pp. 2996-3002, July 2009.<br />
[7] R. Ravaud, G. Lemarquand, S. Babic, V. Lemarquand, C. Akeyel.:<br />
“Cylindrical Magnets and Coils: Fields, Forces and Inductances”,<br />
IEEE Transactions on Magnetics, Vol. 46, No. 9, pp. 3585-3590,<br />
September 2010.<br />
- 304 - 15th IGTE Symposium 2012<br />
[8] M. Greconici, Z. Ž. Cvetković, A. N. Mladenović, S. R. Aleksić,<br />
D. Vesa.: “Analytical-numerical Approach for Levitation Force<br />
Calculation <strong>of</strong> a Cylindrical Bearing with Permanent Magnets<br />
Used in an Electric Meter” <strong>Proceedings</strong> <strong>of</strong> full papers OPTIM<br />
2010, pp. 197-201, 20-21 May, Brasov, Romania, 2010.<br />
[9] Furlani, E. P., S. Reznik, & A. Kroll. 1995. A three-dimensional<br />
field solution for radially polarized cylinders. IEEE Trans. Magn.,<br />
vol. 31, no.1, pp. 844–851.<br />
[10] M. Braneshi, O. Zavalani and A. Pijetri.: “The Use <strong>of</strong> Calculating<br />
Function for the Evaluation <strong>of</strong> Axial Force between Two Coaxial<br />
Disk Coils”, 3 rd International PhD Seminar Computational<br />
Electromagnetics and Technical Application, pp. 21-30, 28<br />
August - 1 September, Banja Luka, Bosnia and Hertzegovina,<br />
2006.<br />
[11] Rakotoarison, H. L., J.-P. Yonnet, & B. Delinchant.2007. Using<br />
Coulombian Approach for Modeling Scalar Potential and<br />
Magnetic Field <strong>of</strong> a Permanent Magnet With Radial Polarization.<br />
IEEE Transactions on Magnetics, Vol. 43, No. 4, pp. 1261-1264.<br />
[12] R. Ravaud, G. Lemarquand, V. Lemarquand.: “Force and<br />
Stiffness <strong>of</strong> Passive Magnetic Bearings Using Permanent Magnets.<br />
Part 2: Radial Magnetization”, IEEE Transactions on Magnetics,<br />
Vol. 45, No. 9, pp. 3334-3342, September 2009.<br />
[13] Ana N. Vučković, Saša S. Ilić & Slavoljub R. Aleksić: Interaction<br />
Magnetic Force Calculation <strong>of</strong> Ring Permanent Magnets Using<br />
Ampere's Microscopic Surface Currents and Discretization<br />
Technique, Electromagnetics, 32:2, pp. 117-134, 2012.<br />
[14] A. N. Mladenović, S. R. Aleksić, S. S. Ilić.: “Levitation Force<br />
Calculation for Permanent Magnet Bearings Using Ampere’s<br />
Currents”, The 14 th International IGTE Symposium on Numerical<br />
Field Calculation in Electrical Engineering, <strong>Proceedings</strong> <strong>of</strong> full<br />
papers (CDROM), pp. 149-153, 19-22 September, <strong>Graz</strong>, Austria,<br />
2010<br />
[15] Meeker, D. n.d. S<strong>of</strong>tware package FEMM 4.2. Available on-line<br />
at http://www.femm.info/wiki/ Download (accessed 2 March<br />
2007).
- 305 - 15th IGTE Symposium 2012<br />
Magnet deviation measurements and<br />
their consideration in<br />
electromagnetic field simulation<br />
Peter Offermann ∗ , Isabel Coenen ∗ , David Franck ∗ and Kay Hameyer ∗<br />
∗ Institute <strong>of</strong> Electrical Machines<br />
RWTH Aachen <strong>University</strong><br />
Schinkelstrasse 4<br />
D-52062 Aachen, Germany<br />
E-mail: Peter.Offermann@IEM.rwth-aachen.de<br />
Abstract—Due to their manufacturing process arc segment magnets for the use in permanent-magnet synchronous machines<br />
(PMSM) may show deviations from their intended ideal magnetization. Using magnets with unfavourable error constellations<br />
in one rotor <strong>of</strong> a PMSM will result in a spatial unsymmetric air gap field, causing undesired parasitic effects as e.g. torque<br />
pulsations. Most manufacturer information only contain the mean values <strong>of</strong> the magnetization as well as certain guaranteed<br />
error bounds, not stating if (and how) the magnetization will vary spatial over a set <strong>of</strong> magnets. In order to allow an<br />
accurate consideration <strong>of</strong> these deviations in the machine simulation, the emitted radial field <strong>of</strong> a set <strong>of</strong> magnets has been<br />
measured and compared to their assumed magnetisation using finite element method (FEM). As a result, the measured<br />
deviations can be quantified and the influence <strong>of</strong> magnet deviations can be estimated using e.g. stochastic collocation<br />
methods in combination with the FEM.<br />
Index Terms—finite element method, magnetization errors, measurements, stochastics variations<br />
I. INTRODUCTION<br />
The simulation <strong>of</strong> an electrical machine employing<br />
the finite element method (FEM) requires the exact<br />
knowledge <strong>of</strong> the machine’s geometry, its excitations and<br />
its material properties. For machines which are manufactured<br />
in mass production, the material or geometry <strong>of</strong><br />
one specific instance <strong>of</strong> the designed machine may vary<br />
from its specified targets [1], leading in the worst case<br />
to a non-fulfilment <strong>of</strong> the rated machine’s data.<br />
For geometry variations a typical cause is the abrasion<br />
<strong>of</strong> the punching tools. Varying material properties<br />
may be caused e.g. by a stochastic jitter in the orientation<br />
<strong>of</strong> the punched stator lamination sheets, which<br />
can be tainted with anisotropy. Causes for variations<br />
in excitations can either arise from the converter or –<br />
in case <strong>of</strong> a permanent-magnet synchronous machines<br />
(PMSM) – from magnet deviations [2] with respect to<br />
their intended ideal magnetization [3]. Using magnets<br />
with unfavourable error constellations in one rotor <strong>of</strong> a<br />
PMSM will result in a spatial unsymmetric air gap field,<br />
causing undesired parasitic effects as torque pulsation [4],<br />
[5].<br />
Most manufacturer information only contain the mean<br />
values <strong>of</strong> the magnetization as well as certain guaranteed<br />
error bounds, not stating if (and how) the magnetization<br />
will vary spatial over a set <strong>of</strong> magnets. The goal <strong>of</strong><br />
this publication hence is to improve the simulation <strong>of</strong><br />
electrical machines by reducing the described epistemic<br />
uncertainty <strong>of</strong> magnet variations. Therefore, a magnet<br />
test-bench has been created, in order to measure the<br />
emitted radial field <strong>of</strong> a set <strong>of</strong> magnets. From this, the<br />
modality and probability distribution <strong>of</strong> the occurring<br />
variations have been deduced.<br />
The comparison <strong>of</strong> the magnets’ FEM-simulations<br />
with their measurements may allow the calculation <strong>of</strong><br />
improved simulation parameters for complete machine<br />
simulations. For the measured magnets, which were<br />
diametrally magnetized, three error-types have been identified:<br />
A general variation <strong>of</strong> the flux-density’s strength<br />
<strong>of</strong> up to 11.6%, a maximal local, angle deviation at the<br />
magnet’s outer borders <strong>of</strong> 8 ◦ and local errors <strong>of</strong> up to<br />
9.1%.<br />
II. MAGNETIZATION MEASUREMENT TEST-BENCH<br />
In order to obtain reliable data about possible magnetisation<br />
errors, a test bench for the evaluation <strong>of</strong> surface<br />
magnets has been built. In the following the sensor<br />
selection (sec. II-A) and the test-bench construction (sec.<br />
II-B) are described.<br />
A. Sensor selection<br />
Typical methods to measure the magnetic flux-density<br />
are Hall-sensors and Helmholtz-coils. In this paper, a<br />
Hall-sensor as depicted in fig. 1 has been selected, due<br />
to the following reasoning:<br />
For best results, both methods require that the measured<br />
magnetic field is oriented perpendicular to the<br />
measuring coil respectively Hall-sensor. This can be<br />
easier accomplished for larger sensors than for very small
devices. Hall-sensors can be miniaturized due to the fact<br />
that an interaction with a given current is measured.<br />
Therefore the concomitant reduction <strong>of</strong> the Hall-constant<br />
CH, being a consequence <strong>of</strong> a reduction in material<br />
volume, can be compensated to certain extents with an<br />
increase in the measurement current (fig. 1). This allows<br />
to measure field components nearly pointwise.<br />
d<br />
ϕ1<br />
I<br />
B<br />
ϕ2<br />
Fig. 1. Hall-sensor and its distinctive input sizes.<br />
Helmholtz-coil configurations – in contrast to Hallsensors<br />
– always measure the the overall magnetic fluxdensity.<br />
Due to this integration over the magnet’s surface<br />
flux-density, however, a pointwise selective resolution <strong>of</strong><br />
the magnetic field is no longer possible. Global angle<br />
<strong>of</strong>fsets in the magnetization can be detected with both<br />
measurement methods by either using multiple sensors<br />
respectively coils or by turning the magnet under test.<br />
For this purpose, coils are preferable, because their<br />
orientation is better adjustable and an integration over<br />
all local values for a single angle value is implemented<br />
intrinsic in the coil. Local angle errors however cannot<br />
be detected using such a setup. Lastly, coil measurements<br />
are less noise sensitive because the integration already<br />
smoothes some measurement noise.<br />
The decisive factor for Hall-sensors was the interest<br />
in local magnet variations, since most publications until<br />
now focus only on global magnet variations [6], [7] in<br />
electrical machines. Furthermore, this selection allows<br />
the analysis <strong>of</strong> possible locational misalignments <strong>of</strong> the<br />
magnets and will enable a later use <strong>of</strong> the measured<br />
variations in conformal mapping Ansatz functions [8],<br />
[9].<br />
B. Test-bench construction<br />
For the construction <strong>of</strong> the magnet test bench, Hallsensors<br />
<strong>of</strong> the type HE-244 [10] were selected. Table II-B<br />
summarizes the main features <strong>of</strong> the selected sensor:<br />
TABLE I<br />
PROPERTIES OF THE USED HALL SENSOR.<br />
value unit<br />
supply current up to 10 mA<br />
sensitivity 90 to 190 V / (A · T)<br />
linearity<br />
hall voltage typical ≤ 0.2 %<br />
Three sensors for the measurement <strong>of</strong> the magnetic<br />
field components Bx, By and Bz are located on an index<br />
- 306 - 15th IGTE Symposium 2012<br />
arm with predefined 90 degree edges, in order to achieve<br />
a good positioning. The sensors are positioned directly<br />
on adjacent edges to measure the field at approximately<br />
one point as depicted in fig. 2.<br />
y x<br />
z<br />
Fig. 2. Positions and labelling <strong>of</strong> the used Hall-sensors on the<br />
measurement anchor.<br />
The index arm itself is mounted on a gibbet, which<br />
is constructed in such a way, that it allows a position<br />
adjustment in all three dimensions. Below the index arm<br />
the magnets under test can be mounted upon a cylindric<br />
shaft which rotates around its symmetry-axis (fig. 3, 4).<br />
z<br />
encoder<br />
y<br />
x<br />
step motor<br />
magnet mounting<br />
rotation axis<br />
hall sensor<br />
magnet under test<br />
Fig. 3. Schematic scetch <strong>of</strong> the created test bench for magnet<br />
measurements.<br />
This allows the use <strong>of</strong> a connected stepper-motor to<br />
measure the field along a circular line over the magnet’s<br />
surface. To avoid field distortion by flux guidance all<br />
relevant test bench components have been constructed<br />
from aluminium. Data acquisition and the stepper-motor<br />
control are implemented using a dSpace-system in combination<br />
with a PC.<br />
III. RESULTS<br />
In this study 52 magnets with diametral magnetization<br />
and a field strength <strong>of</strong> Br = 1.04T were analysed,<br />
consisting <strong>of</strong> two equally sized groups with either northor<br />
south-pole on the outer magnet circumference. For<br />
each magnet, the Hall-voltage <strong>of</strong> the radial outwards<br />
pointing flux-density was measured 1.5mm above the<br />
magnet’s surface. The magnet’s dimensions are given in<br />
fig. 5.<br />
A. Simulations<br />
In the simulations, the magnet (as depicted in fig. 5)<br />
is surrounded by an air layer which measures ten times
Fig. 4. Photograph <strong>of</strong> the constructed magnet test bench.<br />
Br =1.04T<br />
3mm<br />
Fig. 5. Dimensions <strong>of</strong> the measured magnets.<br />
15mm<br />
the magnet’s height in every direction [11]. The applied<br />
solver implements the magnetic vector-potential formulation.<br />
All boundaries were set as Neumann conditions. The<br />
radial flux-density was sampled along a circumference <strong>of</strong><br />
1.5mm above the magnet.<br />
B. Measurements<br />
1) Repetition measurements:<br />
Repetitive measurements were executed to determine the<br />
test-bench’s measurement reproducibility. The average<br />
error between two arbitrary measurements <strong>of</strong> the same<br />
magnet is below 0.5% and mainly caused by very small<br />
positioning errors <strong>of</strong> the magnet in the tangential direction<br />
<strong>of</strong> the measurement shaft. Fig. 6 depicts five<br />
repetitive measurements <strong>of</strong> magnet #7.<br />
2) Post-processing <strong>of</strong> measurements:<br />
For data acquisition, every magnet is inserted, measured,<br />
and removed from the test-bench five times (fig. 6).<br />
Afterwards, the repetitive data <strong>of</strong> each magnet data are<br />
scanned for obvious misplacement errors. If they exist,<br />
the worst deviating measurement is removed. Thereafter,<br />
- 307 - 15th IGTE Symposium 2012<br />
V(Brad)[V ]<br />
2<br />
0<br />
−2<br />
−4<br />
−6<br />
200 220 240 260 280 300 320 340<br />
angle [ ◦ −8<br />
]<br />
Fig. 6. Five repetitive measurements <strong>of</strong> magnet #7, showing the testbench’s<br />
reproduction quality.<br />
the repetitive measurements are aligned to have their<br />
outer minima centred at around fixed value. Ultimately,<br />
the remaining, centred flux-density values <strong>of</strong> the magnet<br />
are averaged. Fig. 7 shows – for the purpose <strong>of</strong> demonstration<br />
exaggerated – examples <strong>of</strong> the described process.<br />
raw measurements<br />
delete errors<br />
x-align measurements<br />
average<br />
Fig. 7. Post-processing <strong>of</strong> measured flux-density curves.<br />
3) Variation measurements:<br />
Figure 8 presents the results <strong>of</strong> the variation measure-
ments for all magnets which have their north pole located<br />
on the outer side. Two obvious variations can be directly<br />
identified:<br />
• Strength variations in the overall remanence fluxdensity<br />
per magnet,<br />
• Strong deformations from the expected curve shape<br />
in terms <strong>of</strong> local variations.<br />
V(Brad)[V ]<br />
8<br />
6<br />
4<br />
2<br />
0<br />
−2<br />
200 220 240 260 280 300 320 340<br />
angle [ ◦ ]<br />
Fig. 8. Measured radial flux-density 1.5mm above each magnet’s<br />
centre in the magnet group ’north-up’.<br />
Fig. 9 shows accordingly the likelihood <strong>of</strong> occurrence<br />
for the radial outwards pointing flux-density over the<br />
magnet angle for the opposite magnet group. Due to the<br />
envelope shape <strong>of</strong> the resulting curve, the strong influence<br />
<strong>of</strong> the variations is even more obvious.<br />
Fig. 9. Probability <strong>of</strong> measured magnetisation strength, probabilities<br />
ranging from low (dark) to high (light).<br />
C. Comparison <strong>of</strong> measurements and simulations<br />
In order to quantify the strength <strong>of</strong> the occurring<br />
deviations in terms <strong>of</strong> changes in excitation (in contrast<br />
to changes in the resulting flux-density), the excitation<br />
<strong>of</strong> each magnet had to be reconstructed from the given<br />
- 308 - 15th IGTE Symposium 2012<br />
measurements. To solve this inverse problem [12], a<br />
straightforward approach was to compare the measured<br />
radial flux-density component <strong>of</strong> each magnet to a set<br />
<strong>of</strong> simulations. In these simulations, the magnet’s remanence<br />
flux-density Br was varied as parameter ξ1,<br />
applying the simulation conditions presented in section<br />
III-A. However, the resulting shapes did not agree to<br />
the measured curves. The employed magnetisation model<br />
was therefore extended to include a second deviation<br />
parameter ξ2, allowing an angle spread in magnetisation<br />
as given in fig. 10 and yealding the excitation given in<br />
eq. 1:<br />
⎛<br />
B(Δα, ξ1,ξ2) =Br(ξ1) · ⎝ cos(αmid<br />
⎞<br />
+Δα(ξ2))<br />
sin(αmid +Δα(ξ2)) ⎠ (1)<br />
0<br />
Δα<br />
Fig. 10. Determined second deviation parameter ξ2 (grey) from the<br />
ideal, unidirectional magnetisation.<br />
Applying both variation types, the magnet excitation<br />
parameters could be reconstructed sufficiently in most<br />
cases using the least-square minimization from eq. 2 for<br />
parameter determination:<br />
<br />
<br />
<br />
min <br />
<br />
ξ1,ξ2<br />
310 ◦<br />
<br />
α=230 ◦<br />
[Brad,sim(α, ξ1,ξ2) − Brad,mes(α)] 2<br />
<br />
<br />
<br />
<br />
(2)<br />
Fig. 11 shows the comparison <strong>of</strong> the measured radial<br />
flux-density (dashed) in comparison to the best fitting<br />
simulated curve (solid). The divergence <strong>of</strong> both curves at<br />
V(Brad)[V ]<br />
8<br />
6<br />
4<br />
2<br />
0<br />
−2<br />
200 220 240 260 280 300 320<br />
angle [ ◦ −4<br />
]<br />
Fig. 11. Measured (dashed) radial outwards pointing flux-density in<br />
comparison to its best fitting siumlation for magnet #1.
the outer side <strong>of</strong> both graphs can safely be neglected here,<br />
because they are caused by effects <strong>of</strong> the 2D-simulation<br />
and are considered as not relevant, as this area is not<br />
above, but beside the magnet.<br />
Figure 12 finally shows the comparison <strong>of</strong> measured<br />
and simulated radial outwards pointing flux-density for a<br />
magnet having a local magnetisation error. As the graph<br />
clearly shows, this behaviour cannot be reproduced by the<br />
applied model yet. The three identified error-types finally<br />
have been identified to: flux-density’s strength variations<br />
<strong>of</strong> up to 11.6%, a maximal local, angle deviation at the<br />
magnet’s outer borders <strong>of</strong> 8 ◦ and local errors <strong>of</strong> up to<br />
9.1%<br />
V(Brad)[V ]<br />
8<br />
6<br />
4<br />
2<br />
0<br />
−2<br />
200 220 240 260 280 300 320<br />
angle [ ◦ −4<br />
]<br />
Fig. 12. Measured (dashed) radial outwards pointing flux-density in<br />
comparison to its best fitting siumlation for magnet #13. Local errors<br />
cannot be reproduced yet.<br />
IV. CONCLUSIONS<br />
The presented methodology allows an accurate determination<br />
<strong>of</strong> remanence flux-density variations above the<br />
surface <strong>of</strong> a set <strong>of</strong> magnets or rotors. A comparison <strong>of</strong><br />
the measured curves with the magnet’s simulated and<br />
intended remanence flux-density reveals, in which way<br />
the used FE-magnet-models have to be adopted to be<br />
used in stochastic considerations <strong>of</strong> parameter variations<br />
in electrical machines. Necessary implementations are a<br />
scalable magnetization strength and an over the magnet<br />
changing deviation angle. Optional, local errors can be<br />
considered as well. The resulting magnet parameters<br />
finally can be used for uncertainty propagation applying<br />
appropriate tools as stochastic collocation [13] or polynomial<br />
chaos approaches [14] to propagate the magnet<br />
deviations onto output sizes <strong>of</strong> interest.<br />
V. ACKNOWLEDGEMENT<br />
The results presented in this paper have been developed<br />
in the research project Propagation <strong>of</strong> uncertainties<br />
across electromagnetic models granted by the Deutsche<br />
Forschungsgemeinschaft (DFG).<br />
- 309 - 15th IGTE Symposium 2012<br />
REFERENCES<br />
[1] M. Ci<strong>of</strong>fi, A. Formisano, and R. Martone, “Stochastic handling<br />
<strong>of</strong> tolerances in robust magnets design,” IEEE Transactions on<br />
Magnetics, vol. 40, no. 2, pp. 1252 – 1255, march 2004.<br />
[2] M.-F. Hsieh, C.-K. Lin, D. Dorrell, and P. Wung, “Modeling<br />
and effects <strong>of</strong> in-situ magnetization <strong>of</strong> isotropic ferrite magnet<br />
motors,” in Energy Conversion Congress and Exposition (ECCE),<br />
2011 IEEE, sept. 2011, pp. 3278 –3284.<br />
[3] K.-C. Kim, S.-B. Lim, D.-H. Koo, and J. Lee, “The shape<br />
design <strong>of</strong> permanent magnet for permanent magnet synchronous<br />
motor considering partial demagnetization,” IEEE Transactions<br />
on Magnetics, vol. 42, no. 10, pp. 3485 –3487, oct. 2006.<br />
[4] D. Torregrossa, A. Khoobroo, and B. Fahimi, “Prediction <strong>of</strong><br />
acoustic noise and torque pulsation in pm synchronous machines<br />
with static eccentricity and partial demagnetization using field<br />
reconstruction method,” IEEE Transactions on Industrial Electronics,<br />
vol. 59, no. 2, pp. 934 –944, feb. 2012.<br />
[5] G. Heins, T. Brown, and M. Thiele, “Statistical analysis <strong>of</strong> the<br />
effect <strong>of</strong> magnet placement on cogging torque in fractional pitch<br />
permanent magnet motors,” IEEE Transactions on Magnetics,<br />
vol. 47, no. 8, pp. 2142 –2148, aug. 2011.<br />
[6] F. Jurisch, “Production process based deviations in the orientation<br />
<strong>of</strong> anisotropic permanent magnets and their effects onto the operation<br />
performance <strong>of</strong> electrical machines and magnetic sensors –<br />
german –,” International ETG-Kontress Tagungsband, (ETG-FB<br />
107), no. 1, pp. 255–261, 2007.<br />
[7] I. Coenen, M. Herranz Gracia, and K. Hameyer, “Influence and<br />
evaluation <strong>of</strong> non-ideal manufacturing process on the cogging<br />
torque <strong>of</strong> a permanent magnet excited synchronous machine,”<br />
COMPEL, vol. 30, no. 3, pp. 876–884, 2011.<br />
[8] M. Hafner, D. Franck, and K. Hameyer, “Accounting for saturation<br />
in conformal mapping modeling <strong>of</strong> a permanent magnet<br />
synchronous machine,” COMPEL, vol. 30, no. 3, pp. 916–928,<br />
May 2011.<br />
[9] D. Zarko, D. Ban, and T. Lipo, “Analytical calculation <strong>of</strong> magnetic<br />
field distribution in the slotted air gap <strong>of</strong> a surface permanentmagnet<br />
motor using complex relative air-gap permeance,” Magnetics,<br />
IEEE Transactions on, vol. 42, no. 7, pp. 1828 – 1837,<br />
july 2006.<br />
[10] H. Electronics, “He244 series analog hall sensor - datasheet,”<br />
Download from www.hoeben.com, downloaded at 15.08.2012,<br />
November 2011.<br />
[11] P. Offermann and K. Hameyer, “Non-Linear stochastic variations<br />
in a magnet evaluated with Monte-Carlo simulation and a polynomial<br />
Chaos META-Model,” in XXII Symposium on Electromagnetic<br />
Phenomena in Nonlinear Circuits,EPNC 2012. Pula,<br />
Croatia: PTETIS Publishers, June 2012, pp. 21–22.<br />
[12] A. Mohamed Abouelyazied Abdallh, “An inverse problem based<br />
methodology with uncertainty analysis for the identification <strong>of</strong><br />
magnetic material characteristics <strong>of</strong> electromagnetic devices,”<br />
Ph.D. dissertation, Ghent <strong>University</strong>, 2012.<br />
[13] E. Rosseel, H. De Gersem, and S. Vandewalle, “Nonlinear<br />
stochastic Galerkin and collocation methods: application to a<br />
ferromagnetic cylinder rotating at high speed,” Communications<br />
in Computational Physics, vol. 8, no. 5, pp. 947–975, 2010.<br />
[14] B. Sudret, “Uncertainty propagation and sensitivity analysis<br />
in mechanical modesl – contributions to structural reliability<br />
and stochastic spectral methods,” Ph.D. dissertation, Universite<br />
BLAISE PASCAL - Clermont II, Ecole Doctorale Sciences pour<br />
l’Ingenieur, 2007.
- 310 - 15th IGTE Symposium 2012<br />
Potential <strong>of</strong> Spheroids in a Homogeneous<br />
Magnetic Field in Cartesian Coordinates<br />
Markus Kraiger∗ and Bernhard Schnizer †<br />
∗Institute for Radiopharmacy - PET Center, Helmholtz-Zentrum Dresden - Rossendorf e.V., Bautzner Landstr. 400,<br />
D-01328 Dresden - Schönfeld/Schullwitz, Germany. Email: m.kraiger@hzdr.de<br />
† Institute for Theoretical Physics - Computational Physics, Technische Universität <strong>Graz</strong>, Petersg. 16, A-8010 <strong>Graz</strong>,<br />
Austria<br />
E-mail: schnizer@itp.tu-graz.ac.at<br />
Abstract—The potential and the field <strong>of</strong> a prolate or an oblate magnetic spheroid in a static homogeneous field are computed<br />
and expressed in Cartesian coordinates. The directions <strong>of</strong> both the primary magnetic field and <strong>of</strong> the symmetry axis are<br />
completely arbitrary. These expressions are used to investigate trabecular structures built from spheroids having different<br />
symmetry axes and positions for Magnetic Resonance (MR-) Osteodensitometry.<br />
Index Terms—Prolate or oblate spheroid in homogeneous field, building flexible models for magnetic resonance imaging or<br />
spectroscopy.<br />
I. INTRODUCTION<br />
In gerneral, the potential <strong>of</strong> a magnetic spheroid in a given<br />
external magnetic field is derived in spheroidal coordinates,<br />
whose symmetry axis is the z-axis. Models <strong>of</strong> biological tissues,<br />
as e.g. trabecular bones, are arrays <strong>of</strong> such spheroids with symmetry<br />
axes having various directions. Having such applications<br />
in mind, we derived potential and field expressions for prolate<br />
and oblate spheroids in a homogeneous field. These expressions<br />
depend on Cartesian coordinates for arbitrary directions <strong>of</strong> both<br />
the field and the symmetry axes.<br />
II. METHOD OF SOLUTION<br />
A spheroid (permeability μi = μ0(1 + χi); semi-axes<br />
a, a, c) is in a medium (permeability μe = μ0(1 + χe))<br />
and a static homogeneous field H0 = (H0x,H0y,H0z) =<br />
H0(sin β cos α, sin β sin α, cos β) <strong>of</strong> arbitrary direction. At<br />
first the problem <strong>of</strong> a prolate spheroid is solved in prolate<br />
spheroidal coordinates ([1], Fig.1.06)<br />
x + iy = ep sinh η sin θe iψ<br />
(1)<br />
z = ep cosh η cos θ<br />
or in the corresponding oblate spheroidal coordinates ([1],<br />
Fig.1.07)<br />
x + iy = eo cosh η sin θe iψ<br />
(2)<br />
z = eo sinh η cos θ.<br />
for an oblate spheroid as shown e.g. in [2] to [4]. The particular<br />
solutions <strong>of</strong> the potential equation are obtained by separation<br />
giving Legendre functions and polynomials <strong>of</strong> cosh η, i sinh η<br />
respectively multiplied by Legendre polynomials <strong>of</strong> cos θ and<br />
by trigonometric functions <strong>of</strong> ψ. A solution <strong>of</strong> this problem<br />
is found by the usual method, namely by expanding the<br />
potential in the interior and in the exterior <strong>of</strong> the spheroid<br />
w.r.t. the particular solutions fulfilling the appropriate boundary<br />
conditions: i) the total potential must be finite at η =0; ii)<br />
the total potential must agree with that <strong>of</strong> the primary field<br />
(5) at η = ∞. The expansion coefficients are determined<br />
by the continuity conditions that the total potential must be<br />
continuous Φ0 +Φ σ e =Φ0 +Φ σ i and the corresponding normal<br />
component <strong>of</strong> the magnetic induction must be continuous at the<br />
interface <strong>of</strong> the two media ((7) with n = ez). The solutions<br />
contain only Legendre funtions and polynomials <strong>of</strong> order 1<br />
since the inhomogeneity (5) is <strong>of</strong> that order. Thereafter the<br />
Legendre functions and polynomials may be replaced with<br />
elementary functions <strong>of</strong> η and θ. These may be in turn expressed<br />
by functions <strong>of</strong> Cartesian coordinates by use <strong>of</strong> (1),<br />
(2) respectively and by cosh η = up(r, ez)/ √ 2, sinh η =<br />
uo(r, ez)/ √ 2, eq.(24) respectively. The expansion coefficients<br />
L σ 0 ,L σ 1 ,M σ 0 ,M σ 1 obtained from matching the two pieces <strong>of</strong><br />
the potential at the interface are first expressed in Legendre<br />
functions and polynomials <strong>of</strong> argument ηp,ηo respectively:<br />
ηp = Arcoth(cp/ap) (3)<br />
ηo = Artanh(co/ao). (4)<br />
The coefficients are also reexpressed in elementary functions<br />
<strong>of</strong> these geometrical parameters and by the magnetic susceptibilities<br />
χe,χi to give eqs.(8) to (11), (13) to (16) respectively.<br />
In the last step the potential in both domains is transformed to<br />
an arbitrary direction n <strong>of</strong> the spheroidal symmetry axis. All<br />
vectors in the potential are decomposed into vectors parallel to<br />
or perpendicular to the z-axis. Finally all vectors ez occuring<br />
in these expressions are replaced by n.<br />
This description is rather concise; full details may be found<br />
in the papers [3] and [4] and in the notebooks at the website<br />
quoted. But the next paragraph gives a complete listing <strong>of</strong> all<br />
formulas needed for the applications.<br />
III. RESULTS<br />
The primary field is homogeneous with the potential<br />
Φ0(x, y, z) = − (H0x x + H0y y + H0z z). (5)<br />
A. The potentials <strong>of</strong> the reaction fields<br />
The presence <strong>of</strong> a spheroid induces a reaction field with<br />
potential (r =(xβ)) :<br />
Φ σ k(x, y, z) =<br />
3X<br />
α,β=1<br />
H0αt σ,k<br />
αβ xβ = H0 · T σ,k · r (6)<br />
with σ = p (= prolate) or = o (= oblate) and k = e (= external)<br />
or i (= internal) to the ellipsoid<br />
Eσ := r2 − (n · r) 2<br />
a 2 σ<br />
+ (n · r)2<br />
c 2 σ<br />
=1. (7)
For p a prolate spheroid, ap < cp, the excentricity is ep =<br />
c2 p − a2 p; for an oblate one, co
the additional contribution, originating from the local field<br />
inhomogeneities, to the effective transversal relaxation rate R ∗ 2.<br />
Further, R ′ 2 ≈ γΔB with ΔB representing the field variation<br />
and γ the gyromagnetic ratio.<br />
B. Theory: Computersimulation<br />
The aim <strong>of</strong> the current simulation is to investigate effects on<br />
the induced line broadening <strong>of</strong> the resonance spectra evoked<br />
through micro cracks as examples <strong>of</strong> trabecular rarefaction.<br />
Thus, the evaluation <strong>of</strong> the magnetic field distribution was<br />
performed utilizing a two-compartment model, consisting <strong>of</strong><br />
marrow and bone. In oder to mimic the known trabecular micro<br />
structure within a vertebra [13] prolate ellipsoids were arranged<br />
appropriately within a three-dimensional unit cell.<br />
The precession frequency <strong>of</strong> spins in a homogeneous magnetic<br />
field is determined through the magnetic induction B.<br />
Hence, in a first step the reaction fields induced by the susceptibility<br />
difference between the ellipsoids (trabeculae) and the<br />
background (bone marrow) were computed [14].<br />
Introducing a sample with a different susceptibility, in the<br />
current experiment trabecular bone (χ2) is surrounded by bone<br />
marrow (χ1), the resulting magnetic induction Bz can be<br />
generally written as:<br />
Bz = μ (H0z + Mz (r)) = μ0(1 + χ)(H0z + Mz (r)) , (31)<br />
with Mz characterising the induced reaction field. Herin the<br />
units are given in the MKS-system, and susceptibility units are<br />
per unit volume.<br />
Since the transversal magnetization decay <strong>of</strong> mineralized<br />
bone is several magnitudes faster comparing to bone marrow,<br />
the received resonance signal in MR-Osteodensitometry is governed<br />
by the magnetization arising within the marrow. Thus Mz<br />
corresponds to the computed reaction fields ΔHr1,z caused by<br />
the difference in magnetic property between bone and marrow.<br />
The resulting magnetic field distribution within the unit cell<br />
was determined as the sum <strong>of</strong> the individual contributions Hzi<br />
originating from all ellipsoids n:<br />
nX<br />
ΔHr1,z (r) = Hzi (r) . (32)<br />
i=1<br />
Interactions between the trabeculae have been neglected. This<br />
assumption is valid, since interactions between such structures<br />
include susceptibility effects <strong>of</strong> the second order, which will<br />
give rise to field contributions <strong>of</strong> the order <strong>of</strong> H0 (Δχ) 2 ,or<br />
≈ H0 · 10 −12 .<br />
In a simple MR experiment, excitation followed by an<br />
acquisition period, the signal <strong>of</strong> the free induction decay (FID)<br />
can be written as:<br />
S(t) =const<br />
Z<br />
VOI<br />
with ω(r) =γBz(r) it follows:<br />
Z<br />
S(t) =const<br />
VOI<br />
d 3 r e −iω(r)t e −T2/t ; (33)<br />
d 3 r e −iγBz(r)t e −T2/t . (34)<br />
Using again expression (31) the following expression in<br />
ΔHr1,z can be found:<br />
Z<br />
S(t) =const<br />
VOI<br />
d 3 r e −iγtμ0(1+χ)(H0z+ΔHr1,z(r)) e −T2/t .<br />
(35)<br />
This integral must be extended over the entire unit cell enclosing<br />
the ellipsoids.<br />
In order to compare the simulation results with MR images<br />
the magnitude <strong>of</strong> S(t) must be found. Except for the dissipative<br />
relaxation phenomenon e −T2/t<br />
the expressions in (35) are<br />
- 312 - 15th IGTE Symposium 2012<br />
purely oscillatory in H0z. Hence, for the analysis <strong>of</strong> the signal<br />
course the essential decay can be expressed as:<br />
Z<br />
|S(t)| = const d 3 r e −iγtμ0(1+χ)ΔHr1,z(r)<br />
. (36)<br />
VOI<br />
ΔHr1,z(r) can be computed according to (32) as the sum<br />
over all the reactions fields <strong>of</strong> the individual ellipsoids, where<br />
μ0(1+χ) describes the magnetic permeability at the location r.<br />
1) Algorithm: Utilizing the expression developed for the<br />
reaction field (28) the simulation was implemented in Mathematica<br />
(Wolfram Research, Inc.). The program computed the<br />
field distribution <strong>of</strong> ΔHr1,z(r) in the sense <strong>of</strong> a histogram and<br />
generated the MR signal curve according to (36).<br />
As input parameters the spacing <strong>of</strong> the trabeculae in x-,<br />
y- and z-direction, the dimensions <strong>of</strong> the ellipsoids and the<br />
position <strong>of</strong> the symmetry axis with respect to the z-axis<br />
<strong>of</strong> the coordinate system had to be defined. Further, the<br />
susceptibilities <strong>of</strong> the bones and the background as well as the<br />
orientation <strong>of</strong> the applied homogenous main magnetic field had<br />
to be set. The results <strong>of</strong> the simulations were the histograms<br />
<strong>of</strong> the magnetic field distribution and the signal curve, which<br />
was further utilized within a fitting-procedure yielding the<br />
relaxation constant R ′ 2.<br />
2) Data fitting: Utilizing the simulated signal curves a<br />
exponential signal model was applied in order to approximate<br />
the relaxation time T ′ 2 [15]. The computed signal intensities (36)<br />
at the echo times ranging from 0 to 50 ms, 5 ms increment, were<br />
used to generate a single T ′ 2 value by means <strong>of</strong> a non linear<br />
least-squares-approximation to a two parameter fit function:<br />
S(t) =Ae −t/T ′ 2 . (37)<br />
C. Model <strong>of</strong> vertebra<br />
The three-dimensional unit cell was composed out <strong>of</strong> thirty<br />
prolate ellipsoids, fifteen aligned along the x- and z-direction<br />
each, mimicing the initial intact trabeculae. The interruptions<br />
were simulated in the way, that each trabecula was replaced by<br />
two ellipsoids, which were displaced along the x/z-axis by 50<br />
μm forming a crack. The configuration <strong>of</strong> the three-dimensional<br />
vertebra model and the applied parameter setting are given in<br />
Fig.1.<br />
Fig. 1. Depiction <strong>of</strong> the 3.75 × 3.75 × 3.75 mm 3 unit cell; the<br />
x/z aligned sets are built up <strong>of</strong> three planes displaced by 750 μm.<br />
The trabeculae in each plane were modelled with a trabecular spacing<br />
and width <strong>of</strong> 500 μm and 120 μm respectively. The trabecular micro<br />
fractures were simulated by replacing each <strong>of</strong> the intact trabeculae with<br />
two opposed shifted versions.
D. Results<br />
The resulting reaction fields Hr1 pre- and post bone rarefaction<br />
are depicted in Fig.2. Note, that the field distribution is<br />
directly affected by the shape <strong>of</strong> the micro cracks, whereby the<br />
resulting field inhomogeneities in the vicinity <strong>of</strong> the spiky edges<br />
lead to the observed major field broadening. Prior rarefaction,<br />
the inital field distribution ranged approximately around ±1<br />
A/m, afterwards field values from almost ±2 A/m were found<br />
within the three-dimensional vertebra model. The effect <strong>of</strong> the<br />
interrupted bone mesh on the MR signal decay and the resulting<br />
estimated relaxation time T ′ 2 is presented in Fig.3. The modelled<br />
cracks gave rise to a change <strong>of</strong> the initial T ′ 2 <strong>of</strong> 26.1 ms to<br />
approximately 14.4 ms.<br />
Fig. 2. Resulting field distribution <strong>of</strong> the reaction field Hr1,z within<br />
the applied three-dimensional vertebra model. The trabecular cracks<br />
causing a broadening <strong>of</strong> the distribution, resulting in a more Lorentzian<br />
like line shape. A main magnetic field H0 =2.38732 · 10 6 A/m with<br />
α =30 ◦ and β parallel z-axes, and values <strong>of</strong> χ1 = −0.62·4·π ·10 −6<br />
and χ2 = −0.9 · 4 · π · 10 −6 were applied.<br />
V. CONCLUSION<br />
The advantage <strong>of</strong> this new approach is that it is very easy<br />
to build and investigate structures built from spheroids with<br />
different axes and positions. There is no need <strong>of</strong> complicated<br />
coordinate transformations.<br />
The analytical solutions <strong>of</strong> the Laplacian potential problem<br />
<strong>of</strong> spheroids in Cartesian coordinates were successfully applied.<br />
Fig. 3. Resulting resonance signal decays affected by the reaction field<br />
Hr1,z <strong>of</strong> the vertebra model in the two situations. As a consequence<br />
<strong>of</strong> the increasing inhomogeneous reaction field a rapid signal decay in<br />
case <strong>of</strong> micro cracks is visible (green curve). The signals are normalized<br />
to the values at the first echo time TE, markers are indicating the<br />
computed signal values at TE.<br />
- 313 - 15th IGTE Symposium 2012<br />
A three-dimensional magnetostatic problem in the area <strong>of</strong> MR-<br />
Osteodensitometry, susceptibility effects in the vicinity <strong>of</strong> micro<br />
cracks, was analysed. Within vertebrae affected by pathologies<br />
such as osteoporosis horizontally arranged structures get typically<br />
interrupted at first. The novel expressions make it possible<br />
to study the bone rarefaction along such pathologies, whereby<br />
either cracks <strong>of</strong> the horizontal, the vertical or arbitrary structures<br />
are accessible for modelling.<br />
In the present work just one application <strong>of</strong> the analytical<br />
expressions, the modelling <strong>of</strong> bone disorders in the area <strong>of</strong><br />
MR-Osteodensitometry, was given. For example in the field<br />
<strong>of</strong> functional MRI the devoloped toolbox eases the analysis <strong>of</strong><br />
the BOLD (blood oxygenation level-dependent) contrast, where<br />
induced reaction fields in the surrounding <strong>of</strong> vascular networks<br />
are <strong>of</strong> great interest [16]. A fast and precise computation <strong>of</strong> the<br />
magnetic distortion is essential for improving the precision <strong>of</strong><br />
the temperature determination in techniques using the proton<br />
resonance frequency (PRF) shift method [17], [18]. Temperature<br />
mapping in the vicinity <strong>of</strong> the needle electrode is a crucial<br />
determinant <strong>of</strong> MRI guided interventional radi<strong>of</strong>requency ablations<br />
[19]. Further, in the field <strong>of</strong> metabolism studies using<br />
NMR spectroscopy (MRS) the expressions can be used in<br />
order to model specific cells introduced in solutes differing in<br />
magnetic susceptibility. [20].<br />
In summary, the authors believe that the novel formulation<br />
<strong>of</strong> solutions depending solely on the Cartesian coordinates will<br />
facilitate the modelling <strong>of</strong> countless magnetostatic problems.<br />
REFERENCES<br />
[1] P. Moon and D.E. Spencer: Field Theory Handbook. Including<br />
coordinate systems, differential equations and their solutions.<br />
Springer 1988.<br />
[2] P. W. Kuchel, and B. T. Birman, ”Perturbation <strong>of</strong> Homogeneous<br />
Magnetic Fields by Isolated Single and Confocal Spheroids. Implications<br />
for NMR Spectroscopy <strong>of</strong> Cells,” NMR in Biomedicine,<br />
vol.2 (4) pp. 151-160, 1989.<br />
[3] M. Kraiger, and B. Schnizer, ”Potential and Field <strong>of</strong> a Homogeneous<br />
Magnetic Spheroid <strong>of</strong> Arbitrary Direction in a Homogeneous<br />
Magnetic Field in Cartesian Coordinates,” to appear in COMPEL,<br />
2012.<br />
[4] M. Kraiger, and B. Schnizer, ”Reaction Fields <strong>of</strong> a Homogeneous<br />
Magnetic Spheroids <strong>of</strong> Arbitrary Direction in a Homogeneous<br />
Magnetic Field. A Toolbox for MRI and MRS <strong>of</strong> Heterogeneous<br />
Tissue.” Report ITPR-2011-021. Institute for Theoretical<br />
and Computational Physics. Technische Universität <strong>Graz</strong>, Austria.<br />
http://itp.tugraz.at/∼schnizer/MedicalPhysics/<br />
[5] F. W. Wehrli, H. K. Song, P. K. Saha, and A. C. Wright, ”Quantitative<br />
MRI <strong>of</strong> the assessment <strong>of</strong> bone structure and function,” NMR<br />
in Biomedicine, vol. 19 pp. 731-764, 2006.<br />
[6] C. A. Davis, H. K. Genant, and J. S. Dunham, ”The effects <strong>of</strong><br />
bone on proton NMR relaxation times <strong>of</strong> surrounding liquids,”<br />
Investigative Radiology, vol. 21 pp. 472-477, 1986.<br />
[7] S. Grampp, S. Majumdar, M. Jergas, P. Lang, A. Gies, and HK.<br />
Genant, ”MRI <strong>of</strong> bone marrow in the distal radius: in vivo precision<br />
<strong>of</strong> effective transverse relaxation times,” European Radiology, vol.<br />
5 pp. 43-48, 1995.<br />
[8] T. M. Link, J. C. Lin, D. Newitt, N. Meier, S. Waldt, and S.<br />
Majumdar, ”Computergestützte Strukturanalyse des trabekulären<br />
Knochens in der Osteoporosediagnostik,” Der Radiologe, vol. 38<br />
pp. 853-859 , 1998.<br />
[9] M. H. Arokoski, J. P. Arokoski, P. Vainio, L. H. Niemitukia,<br />
H. Kroeger, and J. S. Jurvelin, ”Comparison <strong>of</strong> DXA and MRI<br />
methods for interpreting femoral neck bone mineral density,”<br />
Journal <strong>of</strong> Clinical Densitometry, vol. 5 pp. 289-296. 2002.<br />
[10] H. Chung, F. W. Wehrli, J. L. Williams, and S. D. Kugelmass,<br />
”Relationship between NMR transverse relaxation, trabecular bone<br />
architecture and strength,” <strong>Proceedings</strong> <strong>of</strong> the National Academy<br />
<strong>of</strong> Sciences, vol. 90 pp. 10250-10254, 1993.<br />
[11] T. B. Brismar, T. Hindmarsh, and H. Ringertz, ”Experimental<br />
correlation between T2* and ultimate compressive strength in<br />
lumbar porcine vertebrae,” Academic Radiology, vol. 4 pp. 426-<br />
430, 1997.
[12] O. Beuf, D. C. Newitt, L. Mosekilde, and S. Majumdar, ”Trabecular<br />
Structure Assessment in Lumbar Vertebrae Specimens Using<br />
Quantitative Magnetic Resonance Imaging and Relationship with<br />
Mechanical Competence,” Journal <strong>of</strong> Bone and Mineral Research,<br />
vol. 16 pp. 1511-1519, 2001.<br />
[13] T. Hildebrand, A. Laib, R. Müller, J. Dequeker, and P. Regsegger,<br />
”Direct Three-Dimensional Morphometric Analysis <strong>of</strong> Human<br />
Cancellous Bone: Microstructural Data from Spine, Femur, Iliac<br />
Crest, and Calcaneus,” Journal <strong>of</strong> Bone and Mineral Research, vol.<br />
14 pp. 1167-1174, 1999.<br />
[14] C. J. C. Bakker, R. Bhagwandien, M. A. Moerland, and M.<br />
Fuderer, ”Susceptibility artifacts in 2D FT spin-echo and gradientecho<br />
imaging: the cylinder model revisted,” Magnetic Resonance<br />
Imaging, vol. 11 pp. 539-548, 1992.<br />
[15] A. Fransson, S. Grampp, and H. Imh<strong>of</strong>, ”Effects <strong>of</strong> trabecular<br />
bone on marrow relaxation in the tibia,” Magnetic Resonance<br />
Imaging, vol. 17 pp. 69-82, 1998.<br />
[16] S. Ogawa, T. M. Lee, A. R. Kay, and D. W. Tank, ”Brain magnetic<br />
resonance imaging with contrast dependent on blood oxygenation,”<br />
<strong>Proceedings</strong> <strong>of</strong> the National Academy <strong>of</strong> Sciences <strong>of</strong> the United<br />
States <strong>of</strong> America, vol. 87 pp. 9868-9872, 1990.<br />
[17] J. C. Hindman, ”Proton resonance shift <strong>of</strong> water in gas and liquid<br />
states,” Journal <strong>of</strong> Chemical Physics, vol. 44 pp. 4582-4592, 1966.<br />
[18] V. Rieke, and K. B. Pauly, ”MR Thermometry,” Journal <strong>of</strong><br />
Magnetic Resonance, vol. 27 pp. 376-390, 2008.<br />
[19] A. Boss, H. Graf, B. Müller-Bierl, S. Clasen, D. Schmidt, P.<br />
L. Pereira, and F. Schick, ”Magnetic susceptibility effects on the<br />
accuracy <strong>of</strong> MR temperature monitoring by the proton resonance<br />
frequency method,” Journal <strong>of</strong> Magnetic Resonance Imaging, vol.<br />
22 pp. 813-820, 2005.<br />
[20] P. W. Kuchel, ”Red cell metabolism: studies using NMR spectroscopy,”<br />
<strong>Proceedings</strong> <strong>of</strong> Australian Biochemistry Society, vol. 15<br />
pp. P5-P6, 1983.<br />
- 314 - 15th IGTE Symposium 2012
- 315 - 15th IGTE Symposium 2012<br />
Application <strong>of</strong> Signal Processing Tools for Fault<br />
Diagnosis in Induction Motors-A Review<br />
*Jawad Faiz, *Amir Masoud Takbash, *Bashir Mahdi Ebrahimi and †Subhasis Nandi<br />
*Center <strong>of</strong> Excellence on Applied Electromagnetic Systems, School <strong>of</strong> Electrical and Computer Engineering,<br />
College <strong>of</strong> Engineering, <strong>University</strong> <strong>of</strong> Tehran, Tehran, Iran<br />
†Department <strong>of</strong> Electrical and Computer Engineering, <strong>University</strong> <strong>of</strong> Victoria, Victoria, BCV8W 3P6, Canada<br />
E-mail: jfaiz@ut.ac.ir<br />
Abstract—Use <strong>of</strong> efficient signal processing tools (SPTs) to extract proper indices for fault detection in induction motors (IMs)<br />
is the essential part <strong>of</strong> any fault recognition procedure. In this paper, all utilized SPTs employed in fault identification <strong>of</strong> IMs<br />
are analyzed in details. Then, their competency and their drawbacks for extracting indices in transient and steady-state modes<br />
are criticized from different aspects. The considerable experimental results are used to certificate demonstrated discussion.<br />
Different kinds <strong>of</strong> faults including eccentricity, broken bars and bearing faults as major internal faults in IMs, are<br />
investigated.<br />
Index Terms—Fault detection, Fast Fourier Transform, Hilbert, Wavelet Transform.<br />
I. INTRODUCTION<br />
Ever increasing application <strong>of</strong> induction motors (IMs)<br />
and importance <strong>of</strong> its uninterrupted operation in<br />
production lines make it necessary to diagnose internal<br />
faults in IMs quickly and precisely. The internal faults in<br />
IMs consist <strong>of</strong> electrical and mechanical faults. Electrical<br />
faults occur in the stator and rotor. Electrical faults <strong>of</strong><br />
squirrel-cage rotor <strong>of</strong> IM consist <strong>of</strong> bars and end-rings<br />
breakage which are about 10% <strong>of</strong> the internal faults <strong>of</strong><br />
squirrel-cage IM [1]. The reasons for these faults are as<br />
follows:<br />
1. Thermal stresses due to over-load and asymmetrical<br />
dissipation <strong>of</strong> heat which may change the hot spot.<br />
2. Magnetic stresses arising from electromagnetic<br />
forces.<br />
3. Mechanical stresses due to mechanical fatigue <strong>of</strong><br />
different parts, bearing damage, etc.<br />
4. Stresses due to assembling process and centrifugal<br />
forces arising from shaft torque.<br />
Some impacts <strong>of</strong> broken bars on IM are: Increasing<br />
core losses and total losses in faulty machine [2]-[4] and<br />
asymmetrical vector diagram <strong>of</strong> rotor current [2]. Broken<br />
bars and end rings faults have been studied more than<br />
other internal faults <strong>of</strong> IM. The faulty motor is studied<br />
using experimental or modeling and simulation methods.<br />
Following analysis <strong>of</strong> faulty motor by test or modeling, a<br />
proper signal must be selected. The signals used in the<br />
fault diagnosis process, consist <strong>of</strong> mechanical and<br />
electrical signals. There are three following reasons that<br />
make the stator current an appropriate signal for fault<br />
diagnosis:1) unique effect <strong>of</strong> motor internal fault on this<br />
signal. 2) there is no need to have sensor for monitoring<br />
the signal. 3) this method is economical. After testing or<br />
modeling motor and selecting a proper signal, it must be<br />
processed and effect <strong>of</strong> the proposed fault upon the signal<br />
is determined. The signal processing methods are based<br />
on the mathematical transformations. The well-known<br />
transformations are Fourier, Wavelet, Hilbert and<br />
multiple signal classification (Music). These processors<br />
are widely used in the fault diagnosis; however, recently<br />
intelligent methods such as Genetic, Fuzzy and Neural<br />
Network algorithms have been applied to make fault<br />
diagnosis methods more efficient [5]-[7]. Thus<br />
considering fault type, load conditions and the proposed<br />
processor characteristics, a particular processor will be<br />
suitable for each case. To choose an appropriate<br />
processor, different faults and operating conditions such<br />
as load are considered.<br />
II. FAST FOURIER TRANSFORM<br />
Fourier transform expresses signal as a sum <strong>of</strong><br />
sinusoidal functions. This transform expresses a timedomain<br />
signal to frequency-domain signal. This transform<br />
determines the frequency components arising from the<br />
fault. In application <strong>of</strong> Fourier transform to a signal, the<br />
signal must have two basic features: stability and<br />
alternating. A fast Fourier transform (FFT) is a faster<br />
version <strong>of</strong> the discrete Fourier transform<br />
(DFT). Application <strong>of</strong> these processors includes sampling<br />
and applying Fourier transform. Sampling has a series <strong>of</strong><br />
rules and laws as described in [7].<br />
A. Rotor Bars and End Ring Breakage Fault Diagnosis<br />
For broken bars and end ring fault diagnosis in IM,<br />
FFT base processor is <strong>of</strong>ten used and frequency spectrum<br />
<strong>of</strong> torque, speed, instantaneous power, body vibration and<br />
stator current signals are obtained. Torque signal has been<br />
employed as reference for fault diagnosis in [3].<br />
Harmonics 2sfs are produced in the torque frequency<br />
spectrum which used for fault detection [8]. Some<br />
references use the speed signal for fault diagnosis and<br />
here harmonics 2sfs <strong>of</strong> frequency spectrum are again<br />
proposed. Figure 1 exhibits the frequency spectrum <strong>of</strong><br />
speed signal and its variations due to the broken rotor<br />
bars [3]. Torque and speed signals depend on the external<br />
factors such as load and this makes hard to diagnose the<br />
fault. Also the procedure for acquiring these signals is<br />
important, because using sensors and other devices affect<br />
the accuracy <strong>of</strong> the operation. Another signal that is<br />
considered for fault diagnosis is the case vibration signal<br />
The reason for this vibration is air gap radial
Figure 1: Frequency spectrum <strong>of</strong> motor speed for<br />
different numbers <strong>of</strong> rotor broken bars [4].<br />
electromagnetic forces. Broken bars lead to the odd<br />
harmonics in the frequency spectrum <strong>of</strong> vibration signal.<br />
Although signal with twice supply frequency has been<br />
used for fault diagnosis, this signal is not suitable because<br />
it also appears in the healthy motor vibration frequency<br />
spectrum. The above-mentioned signal depends on the<br />
load and its detection requires a sensor [9]. On the other<br />
hand, Fourier transforms application to the transient<br />
signals such as speed and vibration does not yield<br />
accurate results. Pendulum oscillations and increment <strong>of</strong><br />
<br />
[10], [11 <br />
proposed in [11] with some simplifying assumptions. As<br />
seen, broken bars generate 2sfs <br />
[10]. Instantaneous power signal can be also utilized for<br />
broken bars and end rings fault diagnosis. Advantages <strong>of</strong><br />
using instantaneous power spectrum are listed in [12].<br />
Output voltage harmonics <strong>of</strong> motor following the power<br />
supply interruption can be used to diagnose the fault. The<br />
main idea <strong>of</strong> this method is eliminating the harmonics<br />
generated by the voltage supply [13]. However, this<br />
signal is a transient signal and FFT application on this<br />
signal leads to inaccurate fault detection. Intelligent<br />
algorithms can be used to diagnose the fault through<br />
current signal envelop [14]. The drawback <strong>of</strong> this method<br />
is that the harmonics <strong>of</strong> the envelop signal depends on the<br />
severity <strong>of</strong> the rotor bar fault as well as their locations<br />
[13]. Current signal has been considered as the most<br />
appropriate signal for internal fault diagnosis. Some<br />
references [15]-[17] use time signal <strong>of</strong> the line current for<br />
fault diagnosis but this fault is <strong>of</strong>ten detected through line<br />
current harmonics [1]-[3]. The most important harmonics<br />
used for fault diagnosis are (1±2s)fs. The amplitudes <strong>of</strong><br />
these harmonics are larger than that <strong>of</strong> other harmonics<br />
and their diagnosis is easier. Amplitude <strong>of</strong> harmonic (1-<br />
2s)fs depends on the rotor broken bars fault and its<br />
intensity and harmonic (1+2s)fs is mostly depends on the<br />
speed variations [18]. It is noted that harmonic (1-2s)fs<br />
may be disappeared when broken bars has 90 degrees<br />
increased by the broken bars fault. The reason for such<br />
amplitude rising is asymmetry <strong>of</strong> the rotor due to the fault<br />
and consequently generating a negative rotating field<br />
[22]. Table I shows these harmonics before and after the<br />
- 316 - 15th IGTE Symposium 2012<br />
TABLE I<br />
AMPLITUDES OF CURRENT SIDEBANDS FOR MOTOR WITH DIFFERENT ROTOR<br />
BROKEN BARS [4]<br />
NBB fs+2fr fs-2fr<br />
0 -58 -57<br />
1 -54 -55<br />
2 -53 -48<br />
3 -48 -42<br />
4 -46 -40<br />
Figure 2: Frequency spectrum <strong>of</strong> stator current for broken<br />
end-ring [23].<br />
fault versus number <strong>of</strong> broken bars (NBB) [3]. However,<br />
raising the fault degree produces lower changes in the<br />
amplitudes <strong>of</strong> the sidebands. The reason is increasing the<br />
number <strong>of</strong> parallel paths <strong>of</strong> currents and saturation due to<br />
asymmetry <strong>of</strong> the currents passing the bars. Influence <strong>of</strong><br />
bars inner current in the broken bars fault has been<br />
proposed in [23] and its effects consisting <strong>of</strong> harmonics<br />
amplitude reduction has been mathematically proved. In<br />
[22], broken end-ring has been considered. Figure 2<br />
presents stator current frequency spectrum for such a<br />
case. Starting current signal may be used for fault<br />
diagnosis [3] where broken bars generate harmonics<br />
(3±4k)fr in the current spectrum (Figure 3). Frequency<br />
spectrum <strong>of</strong> starting transient current signal is determined<br />
STFT in which the problem <strong>of</strong> processor with the<br />
transient signal is solved. However, dimensions <strong>of</strong> the<br />
window are fixed and therefore it has not good frequency<br />
and time resolution at the same time [24]. Sometimes<br />
current is indirectly used, for instance Park transform <strong>of</strong><br />
stator current has been used for fault diagnosis [25]. Of<br />
course, this method has some drawbacks such as no-clear<br />
fault effect and susceptible to noise, so it seems that<br />
application <strong>of</strong> this method beside other techniques such as<br />
intelligent methods is useful. However, in this case a set<br />
<strong>of</strong> full data is necessary. Park transform <strong>of</strong> stator current<br />
leads to iD+jiQ Modulus and harmonics arising from the<br />
broken bars fault in line current are as 2sfs and 4sfs. The<br />
advantage <strong>of</strong> these harmonics is that these are far from the<br />
fundamental harmonic so its detection is simple.<br />
B. Impacts <strong>of</strong> Load Variation<br />
Side-band components vary with the load torque<br />
fluctuations [26], [27]. Figure 4 shows the impact <strong>of</strong> the<br />
load upon the high and low side-bands <strong>of</strong> the stator<br />
current spatial vector [21]. Load fluctuation decreases the<br />
amplitude <strong>of</strong> low-band and increases the amplitude <strong>of</strong><br />
high-band.<br />
C. Impact <strong>of</strong> Drive<br />
In the presence <strong>of</strong> drive and closed-loop circuits the<br />
situation differs. In PWM-driven motor odd harmonics as
Figure 3: Frequency spectrum <strong>of</strong> starting current: (a)<br />
healthy motor, (b) motor with 4 rotor broken bars [3].<br />
Figure 4: Impact <strong>of</strong> load upon high and low side-bands <strong>of</strong><br />
stator current spatial vector [21].<br />
well as third harmonic are injected to the motor. These<br />
harmonics are fb=(m±2nks)fs (m=supply odd harmonicorders,<br />
n=odd harmonics due to rotor induced currents<br />
and k=integer number) and odd-order harmonics currents<br />
are induced in the rotor, that subsequently produces oddorder<br />
rotor flux in the air gap. Therefore, a new frequency<br />
pattern is introduced in faulty motors under PWM supply<br />
[28]. In the closed-loop drive, mutual effects <strong>of</strong> electrical<br />
and mechanical oscillations amplify each other and<br />
amplitude <strong>of</strong> the above-mentioned frequency spectrum<br />
increases. Figure 5 shows rotor asymmetry signature in<br />
inverter-fed motor line current spectrum for healthy motor<br />
and motor with broken bars [29].<br />
D. Impact <strong>of</strong> Broken Rotor Bars Location<br />
Rotor bar location and its impact upon the fault<br />
diagnosis have been investigated and reported in [30] and<br />
effect <strong>of</strong> the broken bars location on the waveform and<br />
frequency spectrum <strong>of</strong> stator current and side-bands<br />
components (Figure 6) have been given. Influence <strong>of</strong> the<br />
broken bars location on the amplitude <strong>of</strong> the torque<br />
harmonics has been pointed out in [28]. Amplitude <strong>of</strong><br />
torque harmonic is increased by more concentration <strong>of</strong> the<br />
broken bars [31].<br />
III. WAVELET TRANSFORM<br />
Wavelet transform is a method that transforms the<br />
signal to time and frequency spectrum. This transform is<br />
based on transforming a signal to different kinds <strong>of</strong> scaled<br />
and shifted <strong>of</strong> mother wavelet function [13]. Wavelet<br />
transform enables to show some characteristics <strong>of</strong> the<br />
- 317 - 15th IGTE Symposium 2012<br />
Figure 5: Rotor asymmetry signature in inverter-fed<br />
motor line current spectrum (a) around fundamental, (b)<br />
around fifth and seventh harmonics [29].<br />
signal such as non-continuity <strong>of</strong> high-order derivatives <strong>of</strong><br />
the function and sharp point <strong>of</strong> maximum <strong>of</strong> the function<br />
that cannot be shown by other transforms, because they<br />
eliminate these characteristics during transform [13].<br />
Considering the above-mentioned points, wavelet<br />
transform gives a detailed and fully localized view <strong>of</strong> the<br />
function. Having frequency components caused by the<br />
internal fault <strong>of</strong> the motor, this transform can concentrate<br />
on particular regions and this can enhance the precision,<br />
while Fourier series provides a general view over a period<br />
<strong>of</strong> signal [12].<br />
A. Rotor Bars and End Ring Breakage Fault Diagnosis<br />
Various wavelet transforms have been so far used for<br />
fault diagnosis. Most <strong>of</strong> these methods are based on the<br />
sidebands components <strong>of</strong> frequency spectrum <strong>of</strong> the<br />
current signal. In [28], energy <strong>of</strong> a bandwidth is used to<br />
diagnose the fault in which the load impact is also taken<br />
into account. Since discrete wavelet transform (DWT) has<br />
a better clarity over the low frequencies, the use <strong>of</strong> the<br />
current spatial vector which has harmonics with lower<br />
frequencies will yield more precise results [32]. In [33], a<br />
method based on CWT has been used to diagnose the<br />
fault in different drives. However, there is no physical<br />
interpretation for fault diagnosis using the Figurers. In<br />
[34], power spectral density (PSD) values <strong>of</strong> details signal<br />
in any level <strong>of</strong> transform is fault diagnosis criterion.<br />
Figure 7 shows the pattern <strong>of</strong> current signal wavelet<br />
transform <strong>of</strong> healthy and rotor broken bars motor [34]. In<br />
[35], the reason for application <strong>of</strong> DWT in the papers has<br />
been noted. There are not suitable physical description for<br />
results, complicated trend and algorithm <strong>of</strong> other wavelet<br />
methods and ambiguous results. In [35], fault has been<br />
diagnosed using envelope <strong>of</strong> the starting current signal<br />
and procedure has been introduced for extracting the<br />
envelope signal considering the impact <strong>of</strong> the broken bars<br />
on the settling time and amplitude <strong>of</strong> the envelope <strong>of</strong> the<br />
starting current. Also determination <strong>of</strong> wavelet main<br />
function is important in fault diagnosis. Harmonics due to<br />
torque ripples and unbalanced voltage generate harmonics<br />
similar to that <strong>of</strong> the broken bar and this reduces the
- 318 - 15th IGTE Symposium 2012<br />
Figure 6: Impact <strong>of</strong> bars location on amplitude <strong>of</strong> sidebands; (a) three broken bars in one pole and one broken bar in<br />
adjacent pole, (b) Two broken bars in one pole and two broken bars in adjacent pole, (c) One bar under each pole [30].<br />
a b<br />
Figure 7: Pattern <strong>of</strong> current signal wavelet transform, (a)<br />
healthy, (b) rotor broken bar motor [34].<br />
accuracy <strong>of</strong> the fault diagnosis process. However, this can<br />
be solved by application <strong>of</strong> DWT transform [36]. The<br />
analytical wavelet transform (AWT) is one <strong>of</strong> the wavelet<br />
transforms which has been used to diagnose the rotor<br />
broken bars fault. Advantage <strong>of</strong> this wavelet transform is<br />
keeping the characteristics <strong>of</strong> time domain, amplitude and<br />
phase as well as frequency. Amplitude is related to the<br />
proposed signal envelope and the phase is related to the<br />
time characteristics <strong>of</strong> the signal. In [37], AWT has been<br />
used to diagnose the rotor broken bars fault by the help <strong>of</strong><br />
starting signal <strong>of</strong> the IM under low level loads.<br />
B. Impacts <strong>of</strong> Load Variation<br />
Impact <strong>of</strong> load fluctuations on wavelet coefficients <strong>of</strong><br />
the stator current spectrum <strong>of</strong> a motor under broken bars<br />
fault has been studied in [38]. Table II summarizes the<br />
variations <strong>of</strong> D4 coefficient and values <strong>of</strong> a function (that)<br />
defined in the reference. The un-decimated discrete<br />
wavelet transform (UDWT) is a type <strong>of</strong> DWT in which<br />
shift- invariant has been included. This leads to a good<br />
time precision over high frequency harmonics, and good<br />
time and frequency precision over low frequency<br />
harmonics. In addition to DWT and CWT, there is<br />
another wavelet called wavelet packet decomposition<br />
(WPD), which yields more precise results but it is time<br />
consuming method [37], [39]. Sidebands move to higherorder<br />
nodes WPD transform due to load fluctuations [39].<br />
One important point in the application <strong>of</strong> this transform is<br />
the use <strong>of</strong> a proper node for fault diagnosis. For high<br />
loads the low-order nodes and for low loads high-order<br />
nodes are investigated [39]. In [40], the impact <strong>of</strong> the<br />
drive in broken bar diagnosis using wavelet transform has<br />
been proposed. Although fault diagnosis procedure and<br />
load impact have been considered in the above-mentioned<br />
reference, the location <strong>of</strong> the broken bars has not been<br />
taken into account.<br />
TABLE II<br />
VARIATIONS OF D4 COEFFICIENT OF WAVELET TRANSFORM OF CURRENT<br />
SIGNAL OF MOTOR UNDER BROKEN GAR FAULT AGAINST LOAD [38].<br />
% <strong>of</strong> rated<br />
load<br />
Mean Current<br />
(A)<br />
Mean distortion in D4 Index2<br />
0 9.54 0.0923 0.97%<br />
33 8.92 0.3220 3.61%<br />
71 8.81 0.4044 4.59%<br />
100 8.74 0.5469 6.25%<br />
133 8.57 0.5674 6.62%<br />
IV. HILBERT PROCESSOR<br />
Due to the drawbacks <strong>of</strong> the above-mentioned<br />
processors some new processors such as Hilbert processor<br />
have been introduced. Hilbert transform similar with<br />
Fourier transform is orthogonal in respect <strong>of</strong> its main<br />
transform. In addition one function and its Hilbert<br />
transform has identical energy. One type <strong>of</strong> Hilbert<br />
transform is the Hilbert- Huang transform (HHT). In<br />
HHT the energy distribution in time-frequency domain is<br />
obtained by estimation <strong>of</strong> the local energy <strong>of</strong> signal in<br />
different times and frequencies. On contrary to other<br />
time- frequency transforms which depend on the size <strong>of</strong><br />
window and sampling frequency in Fourier transform and<br />
mother wavelet in wavelet transform, HHT is independent<br />
on aforementioned parameters, that is an advantage <strong>of</strong> this<br />
transform. No-load and light load cases in rotor broken<br />
bars and end-rings have been emphasized in [41].<br />
A. Rotor Bars and End Ring Breakage Fault Diagnosis<br />
In no-load IM there is no harmonic arising from the<br />
load, but harmonics are very close to the fundamental<br />
frequency. Here a Hilbert vector is defined for signal and<br />
using this vector instead <strong>of</strong> the proposed signal has some<br />
advantages:<br />
1. Requirement <strong>of</strong> phase current.<br />
2. Generation <strong>of</strong> harmonic components due to fault and<br />
deletion <strong>of</strong> non-applicable harmonics.<br />
Figure 8: Hilbert modulus: (a) healthy motor, (b) motor<br />
with two broken bars [42].<br />
3. Elimination <strong>of</strong> frequency scattering.<br />
4. Nonexistence <strong>of</strong> fundamental frequency that lets to
use linear scale on the vertical axes instead <strong>of</strong><br />
logarithm scale and this clarifies the graphs.<br />
5. No need to sample with twice <strong>of</strong> Nayquest<br />
frequency.<br />
Considering the proposed low frequencies the sampling<br />
speed is reduced to 0.1 <strong>of</strong> the normal case values and this<br />
is useful in practice. In [42], a fault diagnosis method<br />
based on fundamental harmonic deletion and<br />
determination <strong>of</strong> Hilbert Modulus has been introduced.<br />
Figure 8 shows Hilbert modulus for a healthy motor<br />
and a motor with broken bars. By increasing the fault<br />
degree, this modulus becomes larger due to the<br />
harmonics. In the following part a dimensionless<br />
numerical criterion with a low dependency on the load is<br />
introduced.<br />
V. MUSIC PROCESSOR<br />
Recently some methods with high frequency precision<br />
such as Music and Root-Music have been proposed [43].<br />
These methods are used where keeping a particular<br />
frequency is necessary. So in these methods, precise<br />
information on frequency components are required [19].<br />
This mathematical transform is similar with other<br />
transforms and it converts a signal to sum <strong>of</strong> several<br />
signals with identical feature. Music transform consists <strong>of</strong><br />
transform <strong>of</strong> a signal and expressing it based on K<br />
complex sinusoidal pair and a signal e(n).<br />
C. Rotor Bars and End Ring Breakage Fault Diagnosis<br />
Combination <strong>of</strong> FFT and Music methods and a<br />
method <strong>of</strong> fundamental harmonic deletion have been used<br />
for fault diagnosis in [44]; because lonely application <strong>of</strong><br />
Music leads to error. This method provides clearer results<br />
compared to FFT method. In Figure 9 the results <strong>of</strong> two<br />
methods have been compared [44]. In [45], frequency<br />
spectrum <strong>of</strong> output voltage after disconnecting the input<br />
supply obtained by FFT and Music methods and they<br />
compared. It has been shown that a series <strong>of</strong> particular<br />
harmonics in the frequency spectrum is excited due to the<br />
broken bars and variations <strong>of</strong> these variations are seen.<br />
Reliability and low impact <strong>of</strong> noise are the advantages <strong>of</strong><br />
this method compared to FFT method [45]. Music- based<br />
methods similar with Hilbert transform are new methods<br />
which have precise results and computation is quicker<br />
than Wavelet method. However, it needs improved<br />
algorithms for deletion <strong>of</strong> the fundamental harmonic<br />
which complicates these methods. Since these methods<br />
are new, many fault diagnosis indexes have not been yet<br />
modeled by these methods.<br />
VI. CONCLUSIONS<br />
Different methods and processors used for diagnosis<br />
<strong>of</strong> rotor bars and end ring breakage fault in IMs were<br />
investigated briefly. At this end, four types <strong>of</strong> processors<br />
and their advantages and drawbacks were studied. It is<br />
clear that a single method and a common processor<br />
cannot be specified for fault detection in all conditions.<br />
Fourier processor as a most applied processor for broken<br />
- 319 - 15th IGTE Symposium 2012<br />
Figure 9: Comparison <strong>of</strong> FFT and Music-based methods<br />
for motor with 1 broken bar: (a) FFT, (b) Music [44].<br />
bars fault has weak and strong points. Its most important<br />
weakness is in processing <strong>of</strong> transient signals. To<br />
overcome this problem, application <strong>of</strong> wavelet processor<br />
was suggested which provide more detailed time and<br />
frequency view <strong>of</strong> the signal. Following wavelet packet<br />
with simultaneous high precision <strong>of</strong> time and frequency is<br />
commonly used. These processors <strong>of</strong>ten are used for<br />
broken bars fault but there are no appropriate researches<br />
about number <strong>of</strong> broken bars and their location. Other<br />
drawbacks <strong>of</strong> this method are that it is time consuming<br />
and complicated. In recent years, Hilbert-based methods<br />
with high frequency precision methods such as Music<br />
have been proposed. A common point that must be taken<br />
into account in an appropriate fault diagnosis method in<br />
industry beside on-line case; is the method must quick<br />
and at the same time have good accuracy.<br />
REFERENCES<br />
[1] J. Faiz, B. M. Ebrahimi, and M. B. B. Sharifian, “Different faults<br />
and their diagnosis techniques in three-phase squirrel-cage<br />
induction motors-a review, “ Electromagnetics, vol. 26, no. 7, pp.<br />
543–569, October 2006.<br />
[2] X. Ying, “Characteristic performance analysis <strong>of</strong> squirrel cage<br />
induction motor with broken bars,” IEEE Trans. on Magnetics,<br />
vol. 45, no. 2, pp. 759-766, Feb. 2009.<br />
[3] J. Faiz, B. M. Ebrahimi, "A new pattern for detecting broken rotor<br />
bars in induction motors during start-up," IEEE Trans. on<br />
Magnetics, vol. 44, no. 12, pp. 4673-4683 ,December 2008.<br />
[4] J. Faiz, and B. M. Ebrahimi, "Determination <strong>of</strong> number <strong>of</strong> rotor<br />
broken bars and static eccentricity degree in induction motor<br />
under mixed fault," Electromagnetics, vol. 28, no. 6, pp. 433-449,<br />
August 2008.<br />
[5] M. J. Devaney and L. Eren, "Monitoring an induction motors<br />
current and detecting bearing failure," Trans. on Instrumentation<br />
and Measurement, vol.7, no.4, pp. 30- 50 December 2004.<br />
[6] B. Li, M. Y. Chow, Y. Tipusuwan, and J. C. Hung , "Neuralnetwork-based<br />
motor rolling bearing fault diagnosis,", IEEE<br />
Trans. on Industrial Electronics, pp. 1060-1069, vol. 47, no.5,<br />
Oct. 2000.<br />
[7] J. Faiz, and I. Tabatabaei, " Extension <strong>of</strong> winding function theory<br />
for non uniform Air gap in eccentric machinery," IEEE<br />
Transaction on Magnetics, vol. 38, no. 6, pp. 3654-3657,<br />
November 2002.<br />
[8] J. F. Bangura, R. J. Povinelli, N. A. Demerdash, and R. H. Brown,<br />
"Diagnostics <strong>of</strong> eccentricities and bar/end ring connector<br />
breakages in poly phase induction motors through a combination<br />
<strong>of</strong> time-series data mining and time-stepping coupled FE–statespace<br />
techniques," IEEE Trans. on Industry Applications, vol. 39,<br />
no. 4, pp. 1005-1013, Jul./Aug. 2003.<br />
[9] P. J. Rodriguez, A. Belahcen, and A. Arkkio, "Signature <strong>of</strong><br />
electrical faults in the force distribution and vibration pattern <strong>of</strong><br />
induction motors," IEE Proc.-Electr. Power Appl., vol. 153, no. 4,<br />
pp. 523-529, July 2006.<br />
[10] C. Yeh, G. Y. Sizov, A. S. Ahmed, N. A. O. Demerdash, R. J.<br />
Povinelli, E. E. Yaz, and D. M. Ionel, "A reconfigurable motor for<br />
experimental emulation <strong>of</strong> stator winding inter turn and broken<br />
bar faults in poly-phase induction machines," IEEE Trans. on
Energy Conversion, vol. 23, no. 4, pp. 1005-1014, December<br />
2008.<br />
[11] B. Mirafzal, and N. A. O. Demerdash, "Effects <strong>of</strong> load magnitude<br />
on diagnosing broken bar faults in induction motors using the<br />
pendulous oscillation <strong>of</strong> the rotor magnetic field orientation,"<br />
IEEE Trans. on Industry Applications, vol. 41, no. 3, pp. 771-<br />
783, May/June 2005.<br />
[12] G. Didier, E. Ternisien, O. Caspary, and H. Razik, "Fault<br />
detection <strong>of</strong> broken rotor bars in induction motor using a global<br />
fault index," IEEE Trans. on Ind. Appl., vol. 42, no. 1, pp. 79-88,<br />
Jan./Feb. 2006.<br />
[13] J. Milimonfared, H. Meshginkelk, S. Nandi, A. D. Minassians,<br />
and H. A. Toliyat, "A novel approach for broken-rotor-bar<br />
detection in cage induction motors," IEEE Trans. on Ind. Appl. ,<br />
vol.35, no. 5, pp. 1000-1006, Sep./Oct. 1999.<br />
[14] A. M. D. Silva, R. J. Povinelli, and N. A. O. Demerdash,<br />
"Induction machine broken bar and end stator short-circuit fault<br />
diagnostics based on three-phase stator current envelopes," IEEE<br />
Trans. on Industrial Electronics, vol. 55, no. 3, pp. 1310-1318,<br />
March 2008.<br />
[15] C. E. Kim, Y. B. Jung, S. B. Yoon, and D. Im, "The fault<br />
diagnosis <strong>of</strong> rotor bars in squirrel cage induction motors by timestepping<br />
finite element method," IEEE Trans. on Magnetics, vol.<br />
33, no. 2, pp. 2131-2134, March 1997.<br />
[16] N. M. Elkasabgy, A. R. Eastham, and G. E. Dawson, "Detection<br />
<strong>of</strong> broken bars in the cage rotor on an induction machine," IEEE<br />
Trans. on Industry Applications, vol. 22, no. 6, pp. 165-171, Jan./<br />
Feb. 1992.<br />
[17] V. F. Pires, J. F. Martins, and A. J. Pires, "Eigen vector / Eigen<br />
value analysis <strong>of</strong> a 3D current referential fault detection and<br />
diagnosis <strong>of</strong> an induction motor," Energy Conversion and<br />
Management, vol. 51, issue. 5, pp. 901-907, May 2010.<br />
[18] S. Nandi, H. A. Toliyat, and X. Li, "Condition monitoring and<br />
fault diagnosis <strong>of</strong> electrical motors- a review," IEEE Trans. on<br />
Energy Conversion, vol. 20, no. 4, pp. 719-729, December 2005.<br />
[19] M. E. H. Benbouzid, and G. B. Kilman, "What stator current<br />
processing-based technique to use for induction motor rotor faults<br />
diagnosis?," IEEE Trans. on Energy Conversion, vol. 18, no. 2,<br />
pp. 238-244, June 2003.<br />
[20] M. Riera-Guasp, M. F. Cabanas, J. A. Antonino-Daviu, M.<br />
Pineda-Sanchez, and C. H. R Garcia, "Influence <strong>of</strong><br />
Nonconsecutive Bar Breakages in Motor Current Signature<br />
Analysis for the Diagnosis <strong>of</strong> Rotor Faults in Induction Motors,"<br />
IEEE Trans. on Energy Conversion, vol. 25, no.1, pp. 80-89,<br />
Jan./ Feb. 2010.<br />
[21] A. Bellini, F. Filippetti, G. Franceschini, C. Tassoni, and G. B.<br />
Kilman, "Quantitative evaluation <strong>of</strong> induction motor broken bars<br />
by means <strong>of</strong> electrical signature analysis," IEEE Trans. on<br />
Industry Applications, vol. 37, no. 5, pp. 1248-1255, Sep./Oct.<br />
2001.<br />
[22] J. F. Bangura, and N. A. Demerdash, "Diagnosis and<br />
characterization <strong>of</strong> effects <strong>of</strong> broken bars and connectors in<br />
squirrel-cage induction motors by a time-stepping coupled finite<br />
Element-state space modeling approach," IEEE Trans. on Energy<br />
Conversion, vol. 14, no. 4, pp. 1167-1176, December 1999.<br />
[23] R. F. Walliser, and C. F. Landy, "Determination <strong>of</strong> inter bar<br />
current effects in the detection <strong>of</strong> broken rotor bars in squirrel<br />
cage induction motors," IEEE Trans. on Energy Conversion, vol.<br />
9, no. 1, pp. 152-158, March 1994.<br />
[24] P. V. Goode, and M. Chow, "Using a neural/fuzzy system to<br />
extract heuristic knowledge <strong>of</strong> incident faults in induction motors:<br />
part II-application," IEEE Trans. on Industrial Electronics, vol.<br />
42, no. 2, pp. 139-146 , April 1995.<br />
[25] M. Haji, and H. A. Toliyat, "Pattern recognition- a technique for<br />
induction machine rotor broken bar detection," IEEE Trans. on<br />
Energy Conversion , vol. 16, no. 4, pp. 312-317, December 2001.<br />
[26] J. Faiz, B. M. Ebrahimi, and M. B. B. Sharifian, "Time Stepping<br />
Finite Element Analysis Of Broken Bars Fault In A Three-Phase<br />
Squirrel-Cage Induction Motor," Progress in Electromagnetics<br />
Research, PIER, vol. 68, pp. 53-70, 2007.<br />
[27] J. Faiz, B. M. Ebrahimi, H.A. Toliyat, and W.S. Abu-Elhaija,<br />
"Mixed – fault diagnosis in induction motors considering varying<br />
load and broken bars location," Energy Conversion and<br />
Management, vol. 51, issue. 7, pp. 1432-1441, July 2010.<br />
[28] D. F. Pires, V. F. Pires, J.F. Martins, and A.J. Pires, "Rotor cage<br />
fault diagnosis in three-phase induction motors based on a current<br />
- 320 - 15th IGTE Symposium 2012<br />
and virtual flux approach," Energy Conversion and Management,<br />
vol. 50, issue. 4, pp. 1026-1032, April 2009.<br />
[29] B. Akin, U. Orguner, H. A. Toliyat, and M. Rayner, "Low order<br />
PWM inverter harmonics contributions to the inverter-fed<br />
induction machine fault diagnosis," IEEE Trans. on Industrial<br />
electronics, vol. 55, no. 2, pp. 610-619, February 2008.<br />
[30] J. Faiz, and B. M. Ebrahimi, "Locating rotor broken bars in<br />
induction motors using finite element method," Energy<br />
Conversion and Management, vol. 50, pp. 125-131, August 2008.<br />
[31] S. H. Kia, H. Henano, and G. A. Capolino, "Diagnosis <strong>of</strong> Brokenbar<br />
Fault in Induction Machines Using Discrete Wavelet<br />
Transform Without Slip Estimation," IEEE Trans. on Industry<br />
Applications, vol. 45, no. 4, pp. 1395-1404, July/August 2009.<br />
[32] Z. K. Peng, M. R. Jackson , J. A. Rongong , F. L. Chu , and R.<br />
M. Parkin, " On the energy leakage <strong>of</strong> discrete wavelet transform,"<br />
Mechanical Systems and Signal Processing, vol. 29, pp. 330-343,<br />
2009.<br />
[33] I. P. Georgakopoulos, E. D. Mitronikas, and A. N. Safacas, "<br />
Condition Monitoring <strong>of</strong> an Inverter-Driven Induction Motor<br />
Using Wavelets," Advanced Electromechanical Motion Systems<br />
& Electric Drives Joint Symposium, 2009. ELECTROMOTION<br />
2009. 8th International Symposium on, pp.1-5, 1-3 July 2009.<br />
[34] J. Cusido, L. romeral, J. A. Ortega, A. Rosero and A. G.<br />
Espinosa, "Fault Detection in Induction Machines Using Power<br />
Spectral Density in Wavelet decomposition," IEEE Trans. on<br />
Industrial Electronics, vol. 55, no. 2, pp. 633-643, February 2008.<br />
[35] Randy Supangat, NesimiErtugrul, Wen l.Soog, Douglas A.Gray<br />
and Colin Hasen, "Broken Rotor Bar fault Detection in induction<br />
Motors Using Starting Current Analysis," EPE 2005, pp. 1-1,<br />
2005.<br />
[36] J. A. Daviu, M. R. Guasp, J. R. Floch, and P. M. Palomares, "<br />
Validation <strong>of</strong> a New Method for the Diagnosis <strong>of</strong> Rotor Bar<br />
Failures via Wavelet Transform in Industrial Induction<br />
Machines," IEEE Trans. on Industry Applications, vol. 42, no. 4,<br />
pp. 990-996, July/August 2006.<br />
[37] Z. Ye, B. Wu, and A. Sadeghian, "Current Signature Analysis <strong>of</strong><br />
Induction Motor Mechanical Faults by Wavelet Packet<br />
Decomposition," IEEE Trans. on Industrial Electronics, vol. 50,<br />
no. 4, pp. 1217- 1228, December 2003.<br />
[38] J. Faiz, B.M. Ebrahimi, B. Asaie, R. Rajabioun, and H. A.<br />
Toliyat, "A Criterion Function for Broken Bar Fault Diagnosis in<br />
Induction Motor under Load Variation using Wavelet Transform,"<br />
Electrical Machines and Systems, 2007. ICEMS. International<br />
Conference on, pp.1249-1254, 8-11 Oct. 2007.<br />
[39] S. A. Saleh, T.S. Radwan, and M.A. Rahman, "Real- Time <strong>of</strong><br />
WPT-Based Protection <strong>of</strong> Three-Phase Vs PWM Inverter-Fed<br />
Motors," IEEE Trans. on Power Delivery, vol. 22, no. 4, pp.<br />
2108-2115, October 2007.<br />
[40] M. A. S. K. Khan and M. A. Azizur Rahman, "A New Wavelet<br />
Based Diagnosis and Protection <strong>of</strong> Faults in Induction Motor<br />
Drives," Power Electronics Specialists Conference, 2008. PESC<br />
2008. IEEE, pp.1536-1541, 15-19 June 2008.<br />
[41] I. Aydin, M. Karakose, E. Akin, "A new method for early fault<br />
detection and diagnosis <strong>of</strong> broken rotor bars," Energy Conversion<br />
and Management, vol. 52, issue 4, pp. 1790-1799, April 2011.<br />
[42] L. Sun, "Detection <strong>of</strong> Rotor Bar Breaking Fault in Induction<br />
Motors Based on Hilbert Modulus Gyration Radius <strong>of</strong> Filtered<br />
Stator current Signal," Electrical Machines and Systems, 2008.<br />
ICEMS 2008. International Conference on, pp. 877-881, 17-20<br />
Oct. 2008.<br />
[43] L. A. Periera, D. Fernandes, D. S. Gazzana, F. B. Libano, and S.<br />
Haffner, "Application <strong>of</strong> Welch, burg and music methods to the<br />
detection <strong>of</strong> rotor cage faults <strong>of</strong> induction motors," Transmission<br />
& Distribution Conference and Exposition: Latin America, 2006.<br />
TDC '06. IEEE/PES, pp. 1-6, 15-18 Aug. 2006.<br />
[44] B. Xu, S. liu and L. Sun, "A novel method for the early detection<br />
<strong>of</strong> broken rotor bars in squirrel cage induction motors," Electrical<br />
Machines and Systems, 2008. ICEMS 2008. International<br />
Conference on, pp. 751-754, 17-20 Oct. 2008.<br />
[45] S. Hamdani, A. Bouzida, O. Touhami, and R. Ibtiouen,<br />
"Diagnosis <strong>of</strong> Rotor Fault in Induction Motors Using Music<br />
Analyses <strong>of</strong> Terminal Voltage after Switch <strong>of</strong>f,", 18th<br />
International Conference on Electrical Machines, pp. 1-5, 6-9<br />
Sept. 2008.0
†<br />
- 321 - 15th IGTE Symposium 2012<br />
Experimental Calibration <strong>of</strong> Numerical Model<br />
<strong>of</strong> Thermoelastic Actuator<br />
*L. Voracek, *V. Kotlan and *B. Ulrych<br />
*<strong>University</strong> <strong>of</strong> West Bohemia, Faculty <strong>of</strong> Electrical Engineering, Univerzitní 26, 30614, Pilsen, Czech Republic<br />
Abstract—A numerical model <strong>of</strong> the thermoelastic actuator for accurate settings <strong>of</strong> position is compared with the data<br />
obtained by measurements on an experimental prototype. Some disagreements between the results became the reason for a<br />
calibration <strong>of</strong> the model carried out using an appropriate iterative process.<br />
Index Terms—Actuator, FEM, Measurement, Thermoelasticity.<br />
I. INTRODUCTION<br />
Various industrial technologies work with extremely<br />
small and accurate shifts on the order <strong>of</strong> 10 –3 to 10 –6 m.<br />
One way <strong>of</strong> reaching that small shifts and exact positions<br />
is using a thermoelastic actuator. Its theoretical<br />
backgrounds are described in previous papers written by<br />
our group (i.e., [1], [2]). Recently, we also built a<br />
prototype <strong>of</strong> the device and started experimental verifying<br />
the calculated results. Certain discrepancies between the<br />
computations and measurements lead to necessity <strong>of</strong><br />
calibrating the material parameters and their temperature<br />
dependences.<br />
II. FORMULATION OF THE PROBLEM<br />
The arrangement <strong>of</strong> the device is depicted in Fig. 1.<br />
The dilatation element 2 made <strong>of</strong> a suitable electrical<br />
conductive metal is inserted into a coil 3 fixed in frame 4.<br />
The coil is supplied by harmonic current. The whole<br />
system is placed in an insulating shell 1. The device is<br />
clamped by its bottom part 5 in the basement 6 that is<br />
supposed to be perfectly stiff. The time-variable magnetic<br />
field generated by the field coil 3 induces in the dilatation<br />
element 2 eddy currents. These eddy currents produce<br />
heat and consequent geometrical changes (mainly in its<br />
longitudinal direction z ) <strong>of</strong> the dilatation element.<br />
r<br />
6 5 4 3 2 1<br />
Figure 1. The basic arrangement <strong>of</strong> the device<br />
1 –shell, 2 – dilatation element, 3 – field coil, 4 – fixing frame, 5 –front,<br />
6 – stiff wall<br />
III. MATHEMATICAL MODEL<br />
Mathematical modelling <strong>of</strong> the device represents a<br />
triply coupled problem. Its mathematical model consists<br />
<strong>of</strong> three partial differential equations describing the<br />
distribution <strong>of</strong> magnetic field, temperature field and field<br />
<strong>of</strong> thermoelastic displacements.<br />
The device does not contain any ferromagnetic part. If<br />
the field coil 3 (Fig. 1) carries harmonic current <strong>of</strong><br />
z<br />
density J , the magnetic field in the system may be<br />
ext<br />
described by the Helmholtz partial differential equation<br />
for the phasor A <strong>of</strong> the magnetic vector potential A in<br />
the form [3]<br />
curlcurlA j A J<br />
. (1)<br />
Here, symbol denotes the magnetic permeability, is<br />
the electric conductivity, stands for the angular<br />
frequency, and J is the phasor <strong>of</strong> external harmonic<br />
ext<br />
current density in the field coil. The conditions along the<br />
axis <strong>of</strong> the device and artificial boundary placed at a<br />
sufficient distance from the system are <strong>of</strong> the Dirichlet<br />
type ( A 0 ).<br />
Heat power in the system is generated by currents in<br />
the field coil and induced currents in the dilatation<br />
element. The distribution <strong>of</strong> the temperature in the system<br />
can be described by equation [4]<br />
T<br />
div grad T cp pJ<br />
,<br />
t<br />
where stands for the thermal conductivity, is the<br />
specific mass, and c p denotes the specific heat at a<br />
constant pressure. Finally, the symbol p stands for the<br />
J<br />
time average internal sources <strong>of</strong> heat represented by the<br />
volumetric Joule losses. These are given by the formula<br />
2<br />
ext<br />
(2)<br />
J<br />
p J , J J + J , J = j <br />
A , (3)<br />
ext ind ind<br />
<br />
where symbol J denotes the current density in the<br />
ind<br />
electrically conductive parts <strong>of</strong> the device.<br />
The boundary conditions should take into account<br />
both convection and radiation. But since between the<br />
dilatation element 2 and field coil 3 there is a ceramic<br />
tube characterized by a very poor thermal conductivity<br />
and all parts are placed in a Teflon insulating shell<br />
(characterized also by a poor thermal conductivity),<br />
radiation can be – with a practically negligible error –<br />
disregarded.<br />
The last considered field is the field <strong>of</strong> thermoelastic<br />
displacements. The distribution <strong>of</strong> displacements in the<br />
dilatation element 2 follows from the solution <strong>of</strong> the<br />
Lamé equation [5]
grad div <br />
3 2 T grad T<br />
u u <br />
f 0 ,<br />
where 0, 0 are coefficients associated with<br />
material parameters by the relations<br />
E E<br />
<br />
1 1 2 2 1 <br />
<br />
, .<br />
Here E is the modulus <strong>of</strong> elasticity and denotes<br />
the Poisson coefficient <strong>of</strong> the contraction. Finally, symbol<br />
u ur , u, u z represents the vector <strong>of</strong> the displacement,<br />
is the coefficient <strong>of</strong> the linear thermal dilatability <strong>of</strong><br />
T<br />
the material, and f stands for the vector <strong>of</strong> the internal<br />
volumetric forces. These consist (at least in the dilatation<br />
element 2) <strong>of</strong> the gravitational and Lorentz volumetric<br />
forces. But in comparison with the thermoelastic strains<br />
and stresses they are very small and may be neglected.<br />
The boundary conditions depend on the particular<br />
arrangement. In the solved case the displacements <strong>of</strong> the<br />
dilatation element 2 at the place <strong>of</strong> clamping are assumed<br />
to be equal to zero.<br />
IV. NUMERICAL SOLUTION<br />
The numerical solution was performed by a<br />
combination <strong>of</strong> pr<strong>of</strong>essional codes COMSOL<br />
Multiphysics and Matlab that were supplemented with a<br />
lot <strong>of</strong> own procedures and scripts. Special attention was<br />
paid to the convergence <strong>of</strong> the results (dependence <strong>of</strong> the<br />
distribution <strong>of</strong> physical fields on the density <strong>of</strong><br />
discretization meshes and in the case <strong>of</strong> electromagnetic<br />
field also on the position <strong>of</strong> the artificial boundary). Some<br />
part <strong>of</strong> solution was verified with results obtained from<br />
code Agros2D. This is a new program from the category<br />
<strong>of</strong> FEM based s<strong>of</strong>tware developed by group at our<br />
department. This program is very strong for the solution<br />
<strong>of</strong> various physical fields but nowadays it is still in a test<br />
version and some specific parts are not finished yet.<br />
Therefore it cannot be used for the nonlinear systems.<br />
V. ILLUSTRATIVE EXAMPLE AND RESULTS<br />
An illustrative example concerns a prototype <strong>of</strong> the<br />
thermoelastic actuator built in our laboratory. Its<br />
arrangement and dimensions are depicted in Fig. 2. The<br />
field coil is wound from a copper wire with diameter<br />
D w 0,63 mm and has approximately 2400 turns. This<br />
coil is supplied by a harmonic current <strong>of</strong> frequencies<br />
519 Hz, 1005 Hz and 2210 Hz, respectively, whose<br />
RMS values were 0.5 A and 1 A. The dilatation element<br />
is made <strong>of</strong> brass UNS C26000, whose nonlinear<br />
temperature-dependent characteristics <strong>of</strong> most important<br />
physical parameters are depicted in next Figs. 3–5. Other<br />
parts exhibit mainly the function <strong>of</strong> electrical and thermal<br />
insulation. All remaining parameters for computation are<br />
listed in Tab. 1. The entire device is fixed in a firm steel<br />
construction for its correct functioning.<br />
- 322 - 15th IGTE Symposium 2012<br />
(4)<br />
(5)<br />
Figure 2: Arrangement <strong>of</strong> the considered actuator (dimensions in mm):<br />
1–nylon shell, 2–brass core, 3–copper coil, 4–Teflon fixing part <strong>of</strong> the<br />
coil, 5–nylon stand, 6–metallic stand, 7–ceramic tube, 8–nylon cap<br />
Figure 3: Temperature dependence <strong>of</strong> thermal expansion coefficient for<br />
brass UNS C26000 (data from [6]).<br />
Figure 4: Temperature dependence <strong>of</strong> the thermal conductivity for brass<br />
UNS C26000 (data from [6]).<br />
Figure 5: Temperature dependence <strong>of</strong> the electrical conductivity for<br />
brass UNS C26000 (data from [6]).
Table 1: Physical properties <strong>of</strong> particular materials for the initial step <strong>of</strong><br />
simulation.<br />
brass Teflon ceramic nylon copper<br />
rel. permeability [-] 1 1 1 1 1<br />
rel. permittivity [-] 1 1 1 1 1<br />
el. conductivity [S/m] 1.5e7 0 0 0 5.7e7<br />
therm. cond. [W/(mK)] 115 0.24 1.6 0.26 395<br />
density [kg/m 3 ] 8440 2220 2500 1150 8930<br />
spec. heat cap [J/(kgK)] 375 1050 1090 1100 313<br />
Young. modulus [Pa] 9.79e10<br />
Poisson ratio [-] 0.301<br />
therm. expans. coeff. [1/K] 18.7e–6<br />
A. Measurements<br />
The physical model <strong>of</strong> the thermoelastic actuator (see<br />
Fig. 6) was designed by our group at the <strong>University</strong> <strong>of</strong><br />
West Bohemia. This is the first prototype that can help us<br />
to validate the results <strong>of</strong> several projects <strong>of</strong> devices based<br />
on the principle <strong>of</strong> thermoelasticity.<br />
Figure 6: The thermoelastic actuator – manufactured prototype.<br />
We also designed and manufactured a measurement<br />
stand intended for fixing <strong>of</strong> the device, which<br />
significantly contributes to the accuracy <strong>of</strong><br />
measurements. The rigidity <strong>of</strong> the measurement device is<br />
<strong>of</strong> extreme importance for measuring such small shifts.<br />
The measuring stand is supposed to be connected with a<br />
dynamometer in the future with the aim <strong>of</strong> making use<br />
the device for another purpose: the actuator can also act<br />
as a source <strong>of</strong> large forces produced by small shifts <strong>of</strong> the<br />
dilatation element.<br />
Figure 7: Arrangement <strong>of</strong> the measurement.<br />
The measuring circuit (see Fig. 7) consists <strong>of</strong> a<br />
measuring rack, thermoelastic actuator, function<br />
generator, amplifier, capacitance decade, auxiliary<br />
resistor and oscilloscope. The measurements <strong>of</strong> the<br />
thermoelastic actuator are carried out in several regimes<br />
- 323 - 15th IGTE Symposium 2012<br />
characterized by the above frequencies and RMS values<br />
<strong>of</strong> the field current. A sinusoidal signal delivered from<br />
the frequency generator is amplified by the amplifier to<br />
the desired value <strong>of</strong> the current at a given frequency.<br />
A small resistor connected to the thermoelastic<br />
actuator in series is used for measuring the current<br />
through the circuit. We oscilloscopically measured the<br />
voltage at the above resistor and the value <strong>of</strong> the current<br />
was determined using the Ohm law from the measured<br />
voltage and the known resistance. Using <strong>of</strong> an ammeter is<br />
inappropriate due to its internal resistance. This would<br />
increase the total resistive load and the input signal could<br />
not be amplified sufficiently by the used amplifier. The<br />
thermoelastic actuator can be considered an RL circuit<br />
and we obtain the maximum current through it by getting<br />
it into resonance. This can be achieved by adding a serial<br />
capacitor. For given values <strong>of</strong> R and L and prescribed<br />
frequency it is very easy to calculate its capacitance<br />
C using the well-known Thomson relation<br />
2 f <br />
1<br />
LC<br />
.<br />
(6)<br />
For compensation we used rolled capacitors whose<br />
capacitance was determined with respect to the given<br />
frequency. But we had to respect the available types <strong>of</strong><br />
the capacitors, which did not allow reaching exactly the<br />
desired values <strong>of</strong> capacitances from (6). And this explains<br />
the above values <strong>of</strong> frequencies that differ from<br />
“reasonable” values <strong>of</strong> 500 , 1000 , and 2000 Hz.<br />
The displacement <strong>of</strong> the top front <strong>of</strong> the dilatation<br />
element was measured using digital indicator MarCator<br />
1088 (accuracy 0.001mm) in the time interval from 0 to<br />
300 seconds with increments equal to 30 seconds. In the<br />
same intervals we measured the temperature inside the<br />
brass core. The measurements <strong>of</strong> the internal temperature<br />
were performed on the coil. The corresponding values<br />
were plotted into graphs for a comparison with the results<br />
from the mathematical model.<br />
The measurements performed on an experimental<br />
prototype were used for calibration. The measured results<br />
are shown in Figs. 8 and 9. Figure 10 shows the<br />
dependence between the temperature and displacement in<br />
the brass core 2.<br />
Figure 8: Time dependence <strong>of</strong> the temperature for the specified<br />
parameters (current RMS value 1 A and frequencies 519, 1005 and 2210<br />
Hz) <strong>of</strong> the field current).
B. Numerical simulation<br />
For numerical solution we used the previously<br />
mentioned programs and algorithms based on the FE<br />
method. At the beginning we created a geometrical model<br />
according to the technical drawing, which was used for<br />
manufacturing <strong>of</strong> the prototype. In the first step <strong>of</strong><br />
simulation we used the material properties according to<br />
the Tab. 1, whose were found in the base material<br />
datasheets.<br />
Figure 9: Time dependence <strong>of</strong> the displacement for the specified<br />
parameters (current RMS value 1 A and frequencies 519, 1005 and 2210<br />
Hz) <strong>of</strong> the field current.<br />
Figure 10: Dependence <strong>of</strong> the displacement on the temperature for the<br />
specified parameters (current RMS value 1 A and frequencies 519, 1005<br />
and 2210 Hz) <strong>of</strong> the field current.<br />
The time dependence <strong>of</strong> the temperature and<br />
displacement (Figs. 11 and 12) show the large<br />
discrepancy between the measurement and simulation.<br />
Figure 11: Time dependence <strong>of</strong> displacement for the first solution –<br />
current RMS value 1 A and selected frequency 1005 Hz.<br />
- 324 - 15th IGTE Symposium 2012<br />
These discrepancies were obviously brought about by<br />
the temperature-dependent material properties used in the<br />
numerical simulation (that differed from the real<br />
properties <strong>of</strong> those used for building the physical model)<br />
and also differences between the real geometry <strong>of</strong> the<br />
prototype and geometry used for the numerical model.<br />
Therefore, we checked all dimensions <strong>of</strong> the model and<br />
prototype. The next step was usage <strong>of</strong> some iteration<br />
processes to found new material properties to get the<br />
better agreement <strong>of</strong> the results. Especially we focused on<br />
the brass, because we did not know its exact designation<br />
and chemical composition <strong>of</strong> this material. All materials<br />
used in the prototype have the relative permeability near<br />
to one and are not ferromagnetic, therefore all used<br />
nonlinear characteristics are just the temperature<br />
dependencies (see Figs. 3, 4 and 5). After several<br />
corrections we found the satisfying values <strong>of</strong> material<br />
properties and comparing them with available database <strong>of</strong><br />
materials [6] we found that the used brass should be UNS<br />
C26000, whose characteristics are in the mentioned<br />
figures and were used for the final solution.<br />
Figure 12: Time dependence <strong>of</strong> temperature for the first solution –<br />
current RMS value 1 A and selected frequency 1005 Hz.<br />
The next two figures show an acceptable agreement<br />
between the measurement and simulation, for the<br />
frequency f 1005 Hz and RMS value <strong>of</strong> current<br />
I 1A.<br />
in<br />
Figure 13: Time dependence <strong>of</strong> displacement for the final solution –<br />
current RMS value 1 A and selected frequency 1005 Hz.
Figure 14: Time dependency <strong>of</strong> temperature for the final solution –<br />
current RMS value 1 A and selected frequency 1005 Hz.<br />
Figures 15 and 16 depict the distribution <strong>of</strong> the<br />
temperature and displacement in the dilatation element<br />
for time t 300 s , frequency f 1005 Hz and current<br />
I 1A.<br />
in<br />
Figures 17 and 18 show the final comparison <strong>of</strong> the<br />
measured data with the results <strong>of</strong> simulation for all three<br />
frequencies ( 519 Hz , 1005 Hz and 2210 Hz ).<br />
From the results is visible a small discrepancy,<br />
especially for the highest frequency. This can be brought<br />
about by the incorrect temperature-dependent<br />
characteristic <strong>of</strong> the thermal conductivity. Therefore, it is<br />
necessary to perform next steps to find the exact model.<br />
Figure 15: Graphical presentation <strong>of</strong> obtained results for total<br />
displacement <strong>of</strong> the dilatation element in time t = 300 s, for frequency<br />
f = 1005 Hz and current Iin = 1 A.<br />
- 325 - 15th IGTE Symposium 2012<br />
Figure 16: Graphical presentation <strong>of</strong> obtained results for temperature in<br />
the model <strong>of</strong> the thermoelastic actuator in time t = 300 s, for frequency<br />
f = 1005 Hz and current Iin = 1 A.<br />
Figure 17: Comparison <strong>of</strong> the final solution results <strong>of</strong> the time<br />
dependency <strong>of</strong> displacement with the measured data – for current RMS<br />
value Iin = 1 A.<br />
Figure 18: Comparison <strong>of</strong> the final solution results <strong>of</strong> the time<br />
dependency <strong>of</strong> temperature with the measured data – for current RMS<br />
value Iin = 1 A.
VI. CONCLUSION<br />
Thermoelasticity may prove to be a mighty tool in<br />
some applications where setting <strong>of</strong> accurate position is<br />
needed. The process <strong>of</strong> reaching the required dilatation is<br />
slow, but reliable.<br />
Nevertheless, accuracy <strong>of</strong> the results strongly depends<br />
on correctness <strong>of</strong> the input data as is shown in this paper.<br />
Further research will be, therefore, aimed at possibilities<br />
<strong>of</strong> their improvement. At this time we are preparing the<br />
measurements <strong>of</strong> material properties <strong>of</strong> the available<br />
materials and we want to make the brass spectroscopy to<br />
find the correct chemical composition to improve the<br />
model. And, <strong>of</strong> course, we need to improve the material<br />
properties <strong>of</strong> other materials in the model that are not so<br />
important like the brass, but they can affect the results as<br />
well.<br />
Another important aim is to accelerate the heating<br />
process. New possibilities are investigated in this<br />
direction, based on using variable amplitude <strong>of</strong> the field<br />
current, which can be realized, for example, by pulsewidth<br />
modulation.<br />
VII. ACKNOWLEDGMENT<br />
This work was supported by the <strong>University</strong> <strong>of</strong> West<br />
Bohemia grant system (project No. SGS-2012-039) and<br />
by the Grant Agency <strong>of</strong> the Czech Republic (project<br />
102/11/0498).<br />
REFERENCES<br />
[1] I. Dolezel, B. Ulrych and V. Kotlan, “Combined Actuator for<br />
Accurate Setting <strong>of</strong> Position Based on Thermoelasticity Produced<br />
by Induction Heating”, in IEEE Transaction on Industry<br />
Applications, Vol. 47, No. 5, 2011, ISSN 0093-9994, p. 2250–<br />
2256.<br />
[2] Doležel, I., Kotlan, V., Ulrych, B. Electromagnetic-thermoelastic<br />
actuator for accurate wide-range setting <strong>of</strong> position. Przeglad<br />
Elektrotechniczny, 2011, Vol. 87, No. 5, p. 22-27. ISSN: 0033-<br />
2097<br />
[3] Kuczmann, M.: Iványi, A.: The Finite Element Method in<br />
Magnetics. Akademiai Kiado, Budapest, 2008.<br />
[4] Holman, J.P.: Heat Transfer. McGrawHill, NY, 2002.<br />
[5] Boley, B., Wiener, J.: Theory <strong>of</strong> Thermal Stresses. NY, 1960.<br />
[6] MPDB Database <strong>of</strong> materials: www.jahm.com.<br />
- 326 - 15th IGTE Symposium 2012
- 327 - 15th IGTE Symposium 2012<br />
Scattering Calculations <strong>of</strong> Passive UHF-RFID<br />
Transponders<br />
*Thomas Bauernfeind, *Gergely Koczka, *Kurt Preis and *Oszkár Bíró<br />
*Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, 8010 <strong>Graz</strong>, Austria<br />
E-mail: t.bauernfeind@TU<strong>Graz</strong>.at<br />
Abstract—Beside the energy extraction capability from impinging electromagnetic waves provided by the interrogator, the<br />
signal strength <strong>of</strong> the field scattered by the transponder is also a quality criterion <strong>of</strong> UHF-RFID transponder tags. In general,<br />
the transponder antenna is conjugate complex matched to the nominal transponder IC impedance to achieve maximum<br />
energy extraction. The reverse link from the transponder to the reader is commonly not taken into account. Due to the<br />
modulation technique and a voltage limiter circuit at the analog IC frontend, the input impedance <strong>of</strong> the IC is strongly<br />
nonlinear. In the present paper a method is proposed which is able to analyze the influence <strong>of</strong> this nonlinearity on the<br />
scattered signal by separating the scattered field <strong>of</strong> the transponder to a reference scattering problem and a pure radiation<br />
problem.<br />
Index Terms— Antenna Impedance, Radar Cross Section, UHF-RFID.<br />
I. INTRODUCTION<br />
In passive backscattering applications like UHF-RFID<br />
(ultra high frequency-radio frequency identification) the<br />
communication between the interrogator unit (reader) and<br />
the transponder is established by means <strong>of</strong> modulating the<br />
radar cross section (RCS) <strong>of</strong> the transponder. In general,<br />
for UHF-RFID applications, this is done by switching the<br />
analog input impedance between two states in phase with<br />
the data stream to be transmitted, e.g. the EPC (electronic<br />
product code) value [1], [2]. Unfortunately, the analog IC<br />
input impedance is not constant, indeed it has a strong<br />
nonlinear behavior versus applied power e.g. as shown in<br />
Figure 1. This behavior is mainly caused by a voltage<br />
limiter at the transponder IC’s frontend and on the power<br />
consumption <strong>of</strong> the IC which is determined by the actual<br />
mode <strong>of</strong> operation <strong>of</strong> the transponder [3]. Hence, the<br />
transponder input impedance is a function <strong>of</strong> the induced<br />
antenna voltage [4]. To capture this nonlinear behavior,<br />
an iterative full-wave simulation <strong>of</strong> the whole channel<br />
including the reader antenna, the air volume and the tag<br />
antenna as proposed in [5] should be applied, taking into<br />
account the feedback <strong>of</strong> the tag on the reader. The IC<br />
behavior is commonly taken into account by means <strong>of</strong><br />
circuital co-simulation [6]. Due to the huge problem<br />
domain, this technique is unpractical to carry out<br />
optimization investigations. A possibility to reduce the<br />
computational costs is to describe the scattered field <strong>of</strong><br />
the nonlinearly terminated tag antenna in terms <strong>of</strong> a<br />
reference scattering field and a pure radiated field from<br />
the excited transponder antenna [7]. Since, in general, the<br />
effect <strong>of</strong> the tag on the reader field is small, the feedback<br />
on the reader is neglected in a first approximation hence,<br />
the scattered field can be calculated by superimposing<br />
those fields applying the finite element method.<br />
II. GENERALIZED SCATTERING MATRIX<br />
In Figure 2a) a typical UHF-RFID application<br />
consisting <strong>of</strong> a transponder tag and a reader antenna is<br />
shown. Following the approach described in [7], the<br />
situation at the RFID-transponder tag can be modeled as<br />
shown in Figure 2b) where the field situation is described<br />
in terms <strong>of</strong> spherical wave modes. A mathematical<br />
description <strong>of</strong> the simplified transponder model is given<br />
by the generalized scattering matrix [7], [8]:<br />
Resistance in Ohm<br />
100<br />
90<br />
80<br />
70<br />
60<br />
50<br />
40<br />
30<br />
20<br />
10<br />
0<br />
b S S S a<br />
d S c<br />
d S S c<br />
00 01 0 N<br />
1 S 10 ... c 1 . (1)<br />
N N0NN N<br />
Real{Z_IC} un-modulated<br />
Real{Z_IC} modulated<br />
Imag{Z_IC} un-modulated<br />
Imag{Z_IC} modulated<br />
-10 -8 -6 -4 -2 0 2 4 6 8 10<br />
Pa in dBm<br />
Figure 1: Nonlinear transponder IC impedance versus applied power<br />
(NXP Ucode G2X).<br />
In (1), a relationship between the complex applied (a)<br />
and reflected (b) waves at the transmission line<br />
connecting the load impedance to the antenna and the<br />
incoming (cn) and outgoing (dn) spherical wave mode<br />
series is given.<br />
a) b)<br />
Figure 2: a) Typical RFID application. b) Simplified transponder<br />
model.<br />
0<br />
-50<br />
-100<br />
-150<br />
-200<br />
-250<br />
Reactance in Ohm
In case <strong>of</strong> an antenna driven by external waves<br />
(c1,… cN), the mode on the feed transmission line reflects<br />
at the load impedance defined by the load reflection<br />
coefficient given by<br />
Z L Z 0<br />
Z ZL Z 0 Z 0 . (2)<br />
0<br />
So the total field can be written as:<br />
N<br />
b 1 S 00 S0m c m<br />
m 1 m<br />
N<br />
S0 1 0 , (3)<br />
t<br />
dn S nn0 0 b<br />
N<br />
Snmc m<br />
m 1<br />
c<br />
1 m<br />
N<br />
S<br />
1 nm S . (4)<br />
Combining (3) and (4), the total outgoing field dn t can be<br />
calculated as:<br />
t<br />
dn S N<br />
n<br />
0<br />
S<br />
0<br />
m c m<br />
1 S00 00 m 1 N<br />
S nm nmc c m<br />
m<br />
1<br />
m<br />
0<br />
S S0<br />
. (5)<br />
In (5), S00 is the antenna port reflection coefficient given<br />
to be<br />
Z Ant A t Z 0<br />
S00<br />
Z Ant Z0<br />
0 Z 0 . (6)<br />
0<br />
The coefficients S0m are the transmission coefficients<br />
from the antenna to the transmission line, Sn0 the<br />
transmission coefficients from the transmission line to the<br />
antenna and Snm are the mode reflection coefficients<br />
directly connecting the incoming and outgoing wave<br />
modes. The first term in (5) can be thought <strong>of</strong> as only<br />
load impedance dependent and the second term as<br />
structure dependent, respectively.<br />
If one is interested in the scattered field only, one has<br />
to subtract the field in absence <strong>of</strong> the antenna (dn = cn)<br />
from (5) yielding<br />
N N<br />
s S N<br />
N<br />
n<br />
0<br />
dn S 0<br />
m c cm m c cn n S Snm<br />
nm c cm<br />
m<br />
1<br />
S00 00 m 1 m 1<br />
m<br />
0<br />
. (7)<br />
S S0<br />
Since a short circuit condition can easily be achieved, it is<br />
reasonable to use the short circuited case as reference.<br />
With = -1, one can calculate the scattered field in the<br />
short circuit case too:<br />
s S N N<br />
sc n<br />
0<br />
dn S S00m 0<br />
m c cm m c cn n S Snm<br />
nm c m<br />
1 S 00 m 1 m<br />
1<br />
m<br />
0<br />
. (8)<br />
S S0<br />
Using (7) and (8) it is now possible to rewrite (7) in terms<br />
<strong>of</strong> the reference scattered field from the short circuit<br />
condition as:<br />
1 0<br />
N<br />
s s 1 S<br />
sc<br />
n 0<br />
dn d dn n S0mcm 1<br />
S00 1 S00 m 1<br />
sc s<br />
N<br />
S0cm 1 m 0 S0 00 00 m<br />
S 00 1 S0<br />
1<br />
. (9)<br />
In (9) it is assumed that the incoming spherical wave<br />
mode series cm is known. Since the finite element method<br />
should be applied, the description in terms <strong>of</strong> the wave<br />
modes is not practical. Introducing the antenna short<br />
circuit current Isc and describing the radiated field in<br />
terms <strong>of</strong> an antenna driving current IAnt as presented in<br />
[7], one can eliminate the incoming wave mode series cm<br />
from (9) yielding:<br />
- 328 - 15th IGTE Symposium 2012<br />
0 1 1<br />
s s I<br />
sc rad sc<br />
Z 1 S<br />
rad 0 1<br />
00<br />
dn d sc sc 0 1 S<br />
sc rad<br />
00 000<br />
n d n<br />
n<br />
1...<br />
N<br />
. (10)<br />
2 I IAnt Z ZAnt 1<br />
S S00<br />
000<br />
Equation (10) is the key expression enabling the<br />
description <strong>of</strong> a scattered field in terms <strong>of</strong> a reference<br />
scattered field and a pure radiated one. Finally, using (2)<br />
and (6), (10) can be written as:<br />
s s ssc<br />
rad IscZ d L<br />
n d sc<br />
n d<br />
n<br />
I IAnt ZAnt Z L Z . (11)<br />
L<br />
The electric field E is the summation <strong>of</strong> the spherical<br />
wave mode series, so E is given by [7], [11]:<br />
IZ<br />
E E E 0 L<br />
Scattered E EShort Short E EAntenna<br />
Antenna<br />
. (12)<br />
IAnt ZAnt Z L<br />
Since, especially for UHF-RFID applications in<br />
general, the transponder antenna is conjugate complex<br />
matched to the nominal transponder IC impedance it is<br />
advantageous not to use the short circuit case as reference<br />
but rather the conjugate complex matched case. In [7], [8]<br />
it is described how to eliminate the short circuit case from<br />
(12) to get the final relationship:<br />
* *<br />
EScattered ( ZL) E Scattered ( Z Ant ) E EAntenn<br />
Antenna<br />
. (13)<br />
* *<br />
I I<br />
m I<br />
I<br />
m<br />
Ant<br />
In (13), Im * is the current at the terminal <strong>of</strong> the antenna<br />
for a conjugate complex matched transponder antenna in<br />
case <strong>of</strong> a pure scattering problem and * is the conjugate<br />
matched reflection coefficient:<br />
*<br />
* Z ZAnt AAnt t Z L<br />
Z ZAnt ZL<br />
L Z . (14)<br />
L<br />
Finally, this means that the field scattered by an antenna<br />
terminated with a certain load impedance ZL can be<br />
calculated by a superposition <strong>of</strong> a reference scattering<br />
field and a scaled radiated field <strong>of</strong> the antenna driven<br />
with a current IAnt.<br />
III. NUMERICAL INVESTIGATIONS<br />
The basic electromagnetic field problem shown in<br />
Figure 3 is analyzed with a finite element based in-house<br />
code. Introducing the magnetic vector potential A and the<br />
modified scalar potential V the electric field intensity E<br />
and the magnetic field intensity H in the time harmonic<br />
case can be written as:<br />
E j A j V<br />
, (15)<br />
1<br />
H A = A. (16)<br />
Using n1 edge basis functions Ni for the magnetic vector<br />
potential A and n2 nodal basis functions Ni for the<br />
modified electric scalar potential Vh as proposed in [9],<br />
the Galerkin equations to be solved become:<br />
N i A h hd j c c N i i A<br />
h<br />
h d<br />
j l<br />
j c N<br />
i<br />
gradV h hd N<br />
i A<br />
h hd<br />
d 0<br />
Z IC w<br />
SIBC<br />
( i 1,2,..., 12 1,2,..., , , , n )<br />
N c i<br />
c i (17)<br />
1
j c cgradN<br />
gra adNi<br />
A h d<br />
j c gradN gra adNi<br />
gradV h d 0 (18)<br />
( i 1, 12 , 2,..., 22,...,<br />
, , n2).<br />
In (17) and (18), the approximations <strong>of</strong> the potential<br />
functions are given by:<br />
n 1<br />
Ah a i N<br />
i<br />
i 1<br />
N<br />
i 1 i<br />
, (19)<br />
n<br />
2<br />
Vh VVN i<br />
N i<br />
i<br />
1 i<br />
. (20)<br />
The needed truncation <strong>of</strong> the problem domain has been<br />
realized by applying perfectly matched layers (PMLs) as<br />
proposed in [10]. For the first basic scattering<br />
investigations it was refrained from modeling the reader<br />
antenna structure e.g. as shown in Figure 2a). Instead, the<br />
actual scattering problem was excited by means <strong>of</strong> a<br />
Hertz-dipole as proposed in [4] since the Hertz-dipole can<br />
be modeled by a filament current with a given length. The<br />
main advantage <strong>of</strong> this excitation technique beside the<br />
reduction in the degree <strong>of</strong> freedom is, that the radiated<br />
power <strong>of</strong> a Hertz-dipole can be calculated analytically<br />
[11], which <strong>of</strong>fers the possibility <strong>of</strong> validating the quality<br />
<strong>of</strong> the results gathered within the post processing. On the<br />
other hand, the feedback <strong>of</strong> the transponder tag on the<br />
reader is neglected in this case. Since the influence <strong>of</strong> the<br />
transponder on the reader for UHF-RFID applications is<br />
small in general, it is assumed that this effect is negligible<br />
in a first approximation [4].<br />
The excitation <strong>of</strong> the pure radiation problem is done by<br />
impressing a voltage U0 at the feed gap <strong>of</strong> the dipole<br />
structure by prescribing a constant vector potential for the<br />
length y <strong>of</strong> the feed gap:<br />
Ee y y j j A y y U 0 . (21)<br />
As it can be seen from (17), the IC impedance is<br />
modeled with a surface impedance boundary condition<br />
(SIBC) as proposed in [12]:<br />
E Ett1 t 1 dds<br />
s<br />
U E<br />
l<br />
t 1 l l<br />
Z l<br />
t 1<br />
l<br />
IC Z<br />
SIBC , (22)<br />
I H t 2 d ds s K<br />
w<br />
t1<br />
w w<br />
t<br />
c<br />
where l is the length <strong>of</strong> the impedance geometry and w is<br />
the width. The surface impedance ZSIBC in (22) is given<br />
by the relationship <strong>of</strong> the tangential component <strong>of</strong> the<br />
electric field intensity Et1 and the tangential component<br />
Ht2 <strong>of</strong> the magnetic field intensity, assuming constant<br />
tangential components at the surface impedance.<br />
Due to the choice <strong>of</strong> the basis functions, the resulting<br />
system <strong>of</strong> equations (17) and (18) becomes singular<br />
which is not a drawback applying an iterative solver<br />
method. Unfortunately, the resulting system <strong>of</strong> equations<br />
is ill conditioned as described in [13], [14]. Hence, a<br />
direct solver method [14] has to be applied to avoid<br />
impractically long simulation durations. Due to the<br />
singularity <strong>of</strong> the system <strong>of</strong> equations, a tree-gauging [15]<br />
is needed to be able to apply the direct solver method.<br />
- 329 - 15th IGTE Symposium 2012<br />
IV. BASIC EXAMPLE<br />
The proposed method is tested on a very basic example<br />
shown in Figure 3. Applying a electric boundary<br />
condition<br />
E n 0 (23)<br />
(which is a Dirichlet boundary condition for A) in the x-zplane<br />
and a magnetic boundary condition<br />
H n 0 (24)<br />
(which is a Neumann boundary condition for A) in the yz-plane,<br />
only a quarter <strong>of</strong> the problem has to be modeled.<br />
The half wavelength dipole at a frequency <strong>of</strong> f = 1 GHz<br />
with a length <strong>of</strong> 0.5 l Ant 7.5 cm and a width <strong>of</strong><br />
0.5 w Ant 2.5 mm has a thickness <strong>of</strong> d = 1 mm and is<br />
placed at a distance <strong>of</strong> 25 cm to the Hertz-dipole<br />
excitation which is 0.8 times the wavelength . Hence,<br />
far field conditions can be assumed. In Figure 3b) a detail<br />
<strong>of</strong> the feed gap with a width <strong>of</strong> wgap = 200 μm is shown.<br />
The SIBC is connected via perfect electric conductors<br />
(PEC) to the excitation and the antenna structure.<br />
a) b)<br />
Figure 3: a) Typical RFID application. b) Simplified transponder<br />
model.<br />
The input impedance <strong>of</strong> the dipole antenna structure in<br />
the present model is calculated to be<br />
ZAnt = 100.15 + j 44.05 . Hence, the input impedance <strong>of</strong><br />
the fictitious IC must be ZIC = 100.15 – j 44.05 to<br />
fulfill the conjugate complex matching. With the given IC<br />
impedance, the needed reference field can be calculated<br />
by subtracting the field <strong>of</strong> the Hertz-dipole excitation in<br />
absence <strong>of</strong> the dipole structure from the total scattering<br />
problem as proposed in [4]. The results are shown in<br />
Figure 4a) to c).<br />
a) b) c)<br />
Figure 4: a) Scattering problem. b) Hertz-Dipole excitation.<br />
c) Scattered field from the dipole structure.<br />
Next the scattered field in case <strong>of</strong> the modulated IC<br />
impedance should be calculated with the proposed<br />
method. For the modulated IC impedance, it is assumed
that a resistance <strong>of</strong> Rmod = 150 is parallel to the IC<br />
impedance which is a typical value for UHF-RFID<br />
transponder ICs. So the modulated IC impedance is given<br />
to be ZICmod = 62.76 – j 15.6<br />
In Figure 5, the result obtained by the proposed method<br />
is compared with the result gathered by the method <strong>of</strong> [4].<br />
As it can be seen, a good qualitative agreement between<br />
the two methods is obtained. The current distribution<br />
along the dipole structure has also been investigated. This<br />
is done by comparing the magnetic field intensity in the<br />
vicinity <strong>of</strong> the antenna structure along the antenna for<br />
different phase angles in the excitation. The results are<br />
shown in Figure 6. In Figure 7, the relative error between<br />
the two methods is shown. As it can be seen, the good<br />
agreement between the two methods is also quantitative.<br />
The difference can be explained by uncertainties in the<br />
determination <strong>of</strong> the antenna input impedance ZAnt, since<br />
the scaling factor in (13) is directly related to this<br />
number.<br />
a) b)<br />
Figure 5: a) Scattered field calculated with the method from [4].<br />
b) Scattered field calculated with the proposed method.<br />
|H| in A/m<br />
1,6<br />
1,4<br />
1,2<br />
1,0<br />
0,8<br />
0,6<br />
0,4<br />
0,2<br />
0,0<br />
0 0,02 0,04 0,06 0,08<br />
Distance in m<br />
Figure 6: Current distribution along the dipole antenna in terms <strong>of</strong> the<br />
magnetic field intensity.<br />
8<br />
6<br />
4<br />
2<br />
-4<br />
-6<br />
-8<br />
-10<br />
Magnetic field intensity along the dipole antenna<br />
Scattering 0°<br />
Proposed method 0°<br />
Scattering 45°<br />
Proposed method 45°<br />
Scattering 90°<br />
Proposed method 90°<br />
Scattering 180<br />
Proposed method 180<br />
Relative error <strong>of</strong> the magnetic field intensity in %<br />
0<br />
0<br />
-2<br />
0,02 0,04 0,06 0,08<br />
Relative error 0°<br />
Relative error 45°<br />
Relative error 90°<br />
Relative error 180°<br />
Distance in m<br />
Figure 7: Relative error <strong>of</strong> the current distribution along the antenna.<br />
- 330 - 15th IGTE Symposium 2012<br />
V. CONCLUSION<br />
It has been shown on a very basic example that the<br />
scattering from passive objects like UHF-RFID<br />
transponder tags can be described in terms <strong>of</strong> a reference<br />
scattering problem and a pure radiation problem. Hence,<br />
the proposed method <strong>of</strong>fers the possibility <strong>of</strong> a certain<br />
reduction in the computational effort, since if multiple IC<br />
impedances have to be taken into account the whole<br />
channel including the reader antenna has to be simulated<br />
for the reference case only. All other IC states can be<br />
modeled as pure radiation problems without having to<br />
model the reader structure.<br />
REFERENCES<br />
[1] K. V. S. Rao, P. V. Nikitin and S. F. Sander, “Antenna design for<br />
UHF RFID tags: a review and a practical application,” IEEE<br />
Trans. on Ant. and Prop., vol. 53, no. 12, pp. 3870-3876, 2005.<br />
[2] V. Chawla and D. S. Ha, “An overview <strong>of</strong> passive RFID,” IEEE<br />
Communications Magazine, vol. 45, no. 9, pp. 11-17, Sept. 2007.<br />
[3] A. Moretto, E. Colin, C. Ripoll and S. A. Chakra, “Shunt behavior<br />
in RFID UHF tag according to ISO standard and manufacturer<br />
requirements,” <strong>Proceedings</strong> <strong>of</strong> the IEEE, vol. 98, no. 9, pp. 1550-<br />
1554, 2010.<br />
[4] T. Bauernfeind, K. Preis, G. Koczka, S. Maier and O. Biro,<br />
“Influence <strong>of</strong> the Non-Linear UHF-RFID IC Impedance on the<br />
Backscatter Abilities <strong>of</strong> a T-Match Tag Antenna Design,” IEEE<br />
Trans. on Magn., vol. 48, no. 2, pp. 755-758, 2012.<br />
[5] R. Wang and J. Jin, “A Flexible Time-Stepping Scheme for<br />
Hybrid Field-Circuit Simulation Based on the Extended Time-<br />
Domain Finite Element Method,” IEEE Trans. on Advanced<br />
Packaging, vol. 33, no. 4, pp. 769-776, 2010.<br />
[6] G. Manzi and U. Mühlmann, “Passive UHF RFID sensor /<br />
transponder antenna optimization for backscatter operation by<br />
electromagnetic-circuital co-simulation,” <strong>Proceedings</strong> <strong>of</strong> the 11 th<br />
International Conference on Telecommunications, pp. 17-22,<br />
2011.<br />
[7] R. C. Hansen, “Relationships Between Antennas as Scatterers and<br />
as Radiators,” <strong>Proceedings</strong> <strong>of</strong> the IEEE, vol. 77, no. 5, pp. 659-<br />
662, 1989.<br />
[8] R.G. Green, “Scattering from conjugate-matched antennas,” IEEE<br />
Trans. on Ant. and Prop., vol. 14, no. 1, pp. 17-21, 1966.<br />
[9] O. Biro, “Edge element formulations <strong>of</strong> eddy current problems,”<br />
Comput. Methods Appl. Mech. Eng., vol. 169, pp. 391-405, 1999.<br />
[10] I. Bardi, R. Remski, D. Perry and Z. Cendes “Plane wave<br />
scattering from frequency-selective surfaces by the finite-element<br />
method,” IEEE Trans. on Magn., vol. 38, no. 2, pp. 641-644,<br />
2002.<br />
[11] C. Balanis, Antenna Theory: Analysis and Design, Hoboken: John<br />
Wiley & Sons, 2005.<br />
[12] K. Hollaus, O. Biro, K. Preis and C. Stockreiter, “Edge finite<br />
elements coupled with a circuit for wave problems,” International<br />
Conference on Electromagnetics in Advanced Applications, pp.<br />
956-959, Torino, 2007.<br />
[13] G. Koczka, T. Bauernfeind, K. Preis and O. Biro, “Schur<br />
Complement Method Using Domain Decomposition for Solving<br />
Wave Propagation Problems,” The 10th International Workshop<br />
on Finite Elements for Microwave Engineering, FEM2010, pp. 53,<br />
Meredith, 2010.<br />
[14] G. Koczka, T. Bauernfeind, K. Preis and O. Biro, “An Iterative<br />
Domain Decomposition Method for Solving Wave Propagation<br />
Problems,” The 11th International Workshop on Finite Elements<br />
for Microwave Engineering, FEM2012, pp. 66, Estes Park, 2012.<br />
[15] R. Albanese and G. Rubinacci, “Solution <strong>of</strong> three dimensional<br />
eddy current problems by integral and differential methods,” IEEE<br />
Trans. on Magn., vol. 24, pp. 98-101, 1988.
- 331 - 15th IGTE Symposium 2012<br />
Simulation <strong>of</strong> a High Speed Reluctance Machine<br />
Including Hysteresis and Eddy Current Losses<br />
B. Schweigh<strong>of</strong>er∗ , H. Wegleiter∗ , M. Recheis∗ , and P. Fulmek †<br />
∗<strong>Graz</strong> <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Institute <strong>of</strong> Electrical Measurement and Measurement Signal Processing<br />
<strong>Graz</strong>, Austria<br />
† Vienna <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, Institute <strong>of</strong> Sensor and Actuator Systems, Vienna, Austria<br />
E-mail: bernhard.schweigh<strong>of</strong>er@TU<strong>Graz</strong>.at<br />
Abstract—Flywheel energy storage systems in automotive applications require a compact design, which typically uses the<br />
rotor <strong>of</strong> the electrical machine as storage mass. In order to minimise friction losses the rotor is running in vacuum. The<br />
heat generated in the rotor can only be transferred by thermal radiation and thermal conduction through the bearings,<br />
eventually. Therefore, a precise estimation <strong>of</strong> the expected rotor losses is needed to design an efficient thermal management<br />
<strong>of</strong> the whole machine. In this paper a switched reluctance machine is analysed by finite–element simulations with focus on<br />
hysteresis losses and eddy current losses.<br />
Index Terms—Flywheel, Reluctance Motor, Hysteresis, EM Model<br />
I. INTRODUCTION<br />
Flywheel energy storage systems are an important<br />
area <strong>of</strong> research in the automotive industry, to satisfy<br />
the demands for low–emission or zero–emission by<br />
hybridisation and electrification <strong>of</strong> vehicles, trucks and<br />
buses. Typical systems have to be designed for high<br />
power and medium energy content, e.g. 120 kW and<br />
1.5 kWh. Advantages <strong>of</strong> flywheel systems are the high<br />
energy density in comparison to double–layer capacitors,<br />
the higher power density in comparison to batteries, the<br />
State–Of–Charge is always known, almost no degradation<br />
<strong>of</strong> performance with age, etc. Additionally several constraints,<br />
such as size, weight, and costs, have to be fulfilled.<br />
Typically a compact design, in which the rotor <strong>of</strong><br />
the electrical machine acts as the flywheel storage mass<br />
is chosen. Permanent magnet (PM) electrical machines<br />
seem to be the optimum choice for flywheel systems<br />
due to their high efficiency, high power density, and<br />
lower rotor losses in comparison to induction machines.<br />
Especially for automotive applications, however, several<br />
drawbacks have to be considered:<br />
– high power rare earth (RE) magnets (samariumcobalt<br />
or neodymium-iron-boron) are very expensive<br />
– even moderate temperatures may dramatically degrade<br />
the RE–magnets performance by demagnetisation<br />
or even destroy them<br />
– high mechanical stress due to centrifugal forces on<br />
the rotor in flywheel machines necessitate additional<br />
supporting structures to protect the RE–magnets<br />
– the combination <strong>of</strong> a RE–magnet–rotor with an<br />
iron stator leads to significant zero–torque losses,<br />
limiting the storage capabilities <strong>of</strong> the flywheel.<br />
Reluctance machines (RM) generally show a higher mechanical<br />
robustness. They can be produced from high–<br />
strength electrical steels to easily withstand the centrifugal<br />
forces, the high torques, and the pulse accelerations<br />
during operation in the vehicle. In comparison to PM–<br />
machines the rotational speed can be increased, leading to<br />
higher storage densities. Additionally, no electromagnetic<br />
losses have to be expected for the free running flywheel,<br />
which means no zero–torque losses. A comparison<br />
between synchronous (synRM) and switched reluctance<br />
machine (SRM) shows an advantage for the SRM due<br />
to its simple coil design and the higher power density.<br />
For the design <strong>of</strong> the SRM the rotor losses need special<br />
attention. As the rotor is operated in vacuum to avoid<br />
frictional losses, heat dissipation can only happen by<br />
thermal radiation and, eventually, by thermal conduction<br />
through conventional mechanical bearings. The stator is<br />
cooled by conventional water–cooling. For an accurate<br />
modeling <strong>of</strong> the heat flow as well as to predict the<br />
efficiency <strong>of</strong> the machine the knowledge <strong>of</strong> the losses<br />
inside the motor, especially in the rotor is needed.<br />
This paper deals with the analysis <strong>of</strong> the rotor losses in<br />
a switched reluctance machine (SRM). We have chosen<br />
an external rotor 12/8 SRM with a power <strong>of</strong> 120 kW and<br />
a rated speed <strong>of</strong> 25000 rpm.<br />
II. METHODOLOGY<br />
Losses in ferromagnetic materials are divided into the<br />
static, rate–independent hysteresis loss and dynamic rate–<br />
dependent electromagnetic losses (eddy–current losses,<br />
excess losses) [1], [2], [3]. The rate–independent iron–<br />
losses are determined by the materials’ hysteresis loop.<br />
If the course <strong>of</strong> the local vectorial flux–density and<br />
magnetic field is known, the integral over the respective<br />
vectorial hysteresis loops gives the hysteresis loss in the<br />
material. Existing models for ferromagnetic hysteresis<br />
(e.g. Jiles–Atherton, Preisach, Energetic Model) describe<br />
scalar BH–loops, only.<br />
The calculation <strong>of</strong> rate–dependent losses is usually<br />
based on the time variation <strong>of</strong> the flux density obtained<br />
from static finite element (FE) simulations. With increasing<br />
frequencies, however, the skin–depth decreases leading<br />
to an increasing deviation <strong>of</strong> the real flux density from<br />
the static simulation results. For the rate dependent losses
several formulations for sinusoidal flux densities have<br />
been proposed [1], [4], suitable only for linear materials,<br />
in the frequency domain. The correspondence between<br />
simulation results and experiments is not satisfying if the<br />
material is used at high flux densities, at saturation, due<br />
to the assumption <strong>of</strong> linear material behaviour. During the<br />
operation <strong>of</strong> a high performance electrical machine the<br />
magnetic fields and the flux densities in the core material<br />
are neither unidirectional nor sinusoidal.<br />
Loss calculations taking into account the non–linear<br />
materials’ properties, the eddy current distribution and<br />
the skin effect require an exorbitantly high computational<br />
effort for 3D dynamic FE simulations.<br />
We have prepared a 2D FE–model <strong>of</strong> the machine to<br />
calculate the distribution <strong>of</strong> the static flux density for<br />
arbitrary rotor angles and excitation patterns. An external<br />
source current is used to establish the magnetic field. The<br />
model is prepared to analyse the influence <strong>of</strong> different<br />
current pulse patterns, and to develop efficient control<br />
strategies. The non–linear ferromagnetic materials’ properties<br />
have been modelled by the scalar Energetic Model<br />
(EM) [5], which has been parameterised by Epstein frame<br />
experiments and from manufacturers data–sheet. The EM<br />
is used to derive the single valued BH–commutation<br />
curve used to characterise the material in FEM, and<br />
to calculate the static hysteresis loss for arbitrary flux<br />
density waveforms [6], [7]. The variation <strong>of</strong> the flux<br />
density with time, obtained by the FE–simulation, is<br />
used to estimate eddy current losses by approximating<br />
equations [8]. Finally, an expression for the losses in<br />
the machine can be found by integrating these local loss<br />
expressions over the whole volume.<br />
III. RELUCTANCE MACHINE FE–MODEL<br />
Fig. 1. Geometry <strong>of</strong> the reluctance machine RM. Three sets <strong>of</strong> coils<br />
produce a rotating magnetic quadrupole field. With the 12/8 ratio the<br />
principle step angle is 15 Degrees, π/12. The shown position <strong>of</strong> the<br />
external rotor is defined as α =0 ◦ , rotation direction chosen is clock–<br />
wise. Two points are chosen to evaluate the magnetic field, flux density<br />
and hysteresis loss: Point 1 at the center <strong>of</strong> a rotor tooth, Point 2 at the<br />
surrounding yoke area.<br />
We have chosen a 12/8 SRM (double 6/4 machine [9])<br />
with an external rotor. The geometrical dimensions <strong>of</strong><br />
- 332 - 15th IGTE Symposium 2012<br />
the SRM are described in Table I. With the chosen ratio<br />
<strong>of</strong> air–gap–diameter to length <strong>of</strong> the machine ≈ 12:16,<br />
the 3rd dimension has been omitted leading to a more<br />
simple 2D FEM model <strong>of</strong> the SRM Figure 1. Three sets<br />
<strong>of</strong> coils (ABC) build a rotating quadrupole field in the<br />
machine. With 12 stator teeth and 8 rotor teeth the step<br />
angle is 15 ◦ . The central steel shaft <strong>of</strong> the stator is used<br />
for cooling.<br />
Both, stator and rotor, are built up from steel sheet.<br />
The stator is made from a standard electrical steel. For<br />
the rotor material we have chosen the Vacodur50S high<br />
strenght cobalt–steel from Vacuumschmelze.<br />
The rated speed <strong>of</strong> the SRM is 25000 rpm, the stator<br />
coils switching frequency results to 10 kHz.<br />
length <strong>of</strong> motor<br />
external rotor<br />
160.0 mm<br />
No. <strong>of</strong> teeth 8<br />
material Vacodur 50S<br />
outer diameter 180.0 mm<br />
inner diameter 121.0 mm<br />
tooth depth 14.5 mm<br />
gap width<br />
stator<br />
28.0 mm<br />
No. <strong>of</strong> teeth 12<br />
material Armco electrical steel<br />
outer diameter 120.0 mm<br />
inner diameter 60.0 mm<br />
gap depth 17.4 mm<br />
tooth width 14.0 mm<br />
shaft<br />
material construction steel<br />
outer diameter 60.0 mm<br />
TABLE I<br />
GEOMETRIC DIMENSIONS OF SRM.<br />
Fig. 2. FEM: Gmsh/GetDP 2D–model for rotor position α =15 ◦ .<br />
17516 vertices and 37224 elements in a free triagonal mesh.<br />
For the FEM simulation we use two different s<strong>of</strong>tware<br />
packages: the commercial Comsol–Multiphysics [10] and<br />
the general open–source packages Gmsh/GetDP [11].<br />
Figure 2 shows the final mesh <strong>of</strong> the model with 37224<br />
triangle elements. In both FEM–packages we defined the<br />
magnetic materials’ property as BH–table, calculated by<br />
the Energetic Model.
IV. MATERIALS MODEL<br />
The static magnetic materials’ properties are described<br />
by the Energetic Model (EM) <strong>of</strong> ferromagnetic hysteresis<br />
[5]. The EM describes the non–linear, hysteretic behaviour<br />
<strong>of</strong> magnetic polarisation and magnetic field in<br />
the material based on the concept <strong>of</strong> minimising the<br />
total energy in a statistical description <strong>of</strong> the magnetic<br />
domain structure. Consequently, many physical factors<br />
influencing the magnetisation process can be included:<br />
e.g. magnetocrystalline anisotropy, internal demagnetisation,<br />
anisotropy <strong>of</strong> magnetostriction, etc. The EM can<br />
simulate major and minor hysteresis loops, it also simulates<br />
the effect <strong>of</strong> slowly evolving closed minor loops, in<br />
contrast to other models (e.g. Preisach model, wipeout–<br />
property). The simplified scalar formulation <strong>of</strong> the EM<br />
[5], [6] is perfectly prepared for integration into FEM.<br />
The parameters <strong>of</strong> the EM are found by evaluating several<br />
important points <strong>of</strong> a measured BH–loop (e.g. saturation,<br />
coercivity, remanence, initial susceptibility).<br />
The rotor material has to be chosen with respect<br />
to s<strong>of</strong>t–magnetic properties (low coercivity, high flux<br />
density, low losses), and mechanical properties, as well.<br />
A material with optimum properties for high speed<br />
rotors is the s<strong>of</strong>t–magnetic cobalt–iron alloy Vacodur50<br />
manufactured by Vacuumschmelze [12]. Due to the Co–<br />
content it exhibits a very high saturation flux density <strong>of</strong><br />
2.35 T. At a magnetic field strenght <strong>of</strong> 800 A/m more<br />
than 2.0 T are reached. The coercivity is in the range<br />
<strong>of</strong> 100–200 A/m. Figure 3 shows the BH–loop from the<br />
Vacodur50S datasheet and the corresponding results <strong>of</strong><br />
EM–simulations.<br />
Fig. 3. Hysteresis loops <strong>of</strong> Vacodur50S. Red lines: BH–loop from<br />
datasheet (Vacuumschmelze), blue line: BH–loop (virgin curve and<br />
major hysteresis loop) from EM–simulation.<br />
Vacodur50S Armco<br />
Js 2.40 2.10<br />
q 40.12 15.26<br />
k 314.95 16.31<br />
Ne 2.09e-5 3.15e-6<br />
g 11.30 23.41<br />
h 3.40 3.1e-3<br />
TABLE II<br />
EM–PARAMETERS FOR VACODUR50S AND ARMCO<br />
- 333 - 15th IGTE Symposium 2012<br />
For our simulations <strong>of</strong> the stator material we used the<br />
EM parameterised for s<strong>of</strong>t–magnetic Armco electrical<br />
steel sheets (GO, FeSi)[13]. This grain oriented FeSi–<br />
steel exhibits a coercivity as low as 7 A/m, the technical<br />
saturation is limited to 2.0 T. Epstein measurements<br />
provided the data necessary to parameterise the EM.<br />
Table II shows the parameters used for the EM simulations<br />
<strong>of</strong> rotor and stator material. The single valued<br />
BH–function, required for our FEM simulations, has been<br />
found by EM calculations <strong>of</strong> the commutation curves<br />
Fig. 4 and 5. Both FEM packages (Comsol, GetDP)<br />
use tables <strong>of</strong> the simulated BH–commutation curve to<br />
interpolate the required BH–point.<br />
Fig. 4. Hysteresis loops for Armco electrical steel. EM–simulations<br />
for several complete symmetrical loops build the commutation curve.<br />
Fig. 5. Vacodur50S: simulated symmetrical hysteresis loops build the<br />
commutation curve.<br />
V. FEM RESULTS<br />
The above described model setup (geometry and materials)<br />
was used for magnetostatic FEM simulations with<br />
current excitation in Comsol and GetDP. Both packages<br />
gave almost exactly identical results. A series <strong>of</strong> static<br />
calculations has been done to simulate the rotating machine<br />
for various coil currents. Typical results <strong>of</strong> both<br />
FEM simulations are depicted in the following figures.<br />
The excitation current was applied to Coil–set A (see<br />
Fig. 1), producing a quadrupole magnetic field. The<br />
angular rotor position has been changed in 1◦ steps, and<br />
the corresponding flux densities in 2 points <strong>of</strong> the rotor
and the flux in the coil A stator tooth have been evaluated<br />
exemplarily to estimate the rotor losses.<br />
Figure 6 shows the corresponding relative permeability<br />
values, i.e. the quotient B/(μ0H), for a moderate coil<br />
current density <strong>of</strong> 1 A/mm 2 . Different μr–scales are<br />
used for stator and rotor. As the respective range <strong>of</strong> flux<br />
densities Fig. 7 is below 300 mT, the permeability is still<br />
rising with flux density (cmp. Fig. 4 and 5), accordingly<br />
the maximum permeability is at the loci <strong>of</strong> maximum<br />
flux density.<br />
Figure 8 shows Comsol results for a coil current<br />
density <strong>of</strong> 5 A/mm 2 . The flux density at the partly overlapping<br />
stator and rotor teeth approaches the materials<br />
saturation.<br />
Fig. 6. GetDP simulation: relative magnetic permeabilities μr at coil<br />
A current density s =1A/mm 2 (I = 112 A), rotor angle α =10 ◦ .<br />
Different scales are used for rotor (Vacodur50) and stator (Armco).<br />
Fig. 7. GetDP simulation: flux density B at coil A current density<br />
s =1A/mm 2 (I = 112 A), rotor angle α =10 ◦ .<br />
The evaluation <strong>of</strong> the flux in the stator teeth is shown<br />
in Fig. 9. The line integral <strong>of</strong> B over a plane stator tooth<br />
cross–section, multiplied by the length <strong>of</strong> the machine,<br />
gives the total flux in Wb for a quarter <strong>of</strong> the machine.<br />
Perfect alignment <strong>of</strong> stator and rotor teeth at a rotor angle<br />
<strong>of</strong> 22.5 ◦ gives the steepest flux versus field curve. As the<br />
rotor teeth are moved towards the gap between the stator<br />
teeth, the demagnetising effect <strong>of</strong> the increasing length <strong>of</strong><br />
the effective air–gap is clearly visible. When the stator<br />
tooth faces a rotor gap exactly at 0 ◦ the flux vs. field<br />
characteristic becomes a rather flat, almost straight line.<br />
- 334 - 15th IGTE Symposium 2012<br />
Fig. 8. Comsol simulation: flux density B at coil A current density<br />
s =5A/mm 2 (I = 560 A), rotor angle α =10 ◦ .<br />
Fig. 9. Flux in stator tooth versus coil current for varying rotor angle.<br />
Maximum flux for α =22.5 ◦ , stator tooth exactly aligned with rotor<br />
tooth, minimum demagnetisation. Minimum flux for α =0 ◦ , stator<br />
tooth exactly between rotor teeth, maximum demagnetisation, almost<br />
linear dependence.<br />
VI. LOSSES<br />
The conventional three–term iron loss model [1], [8]<br />
contains expressions for hysteresis loss, eddy current loss,<br />
and excess loss. All these loss–contributions depend on<br />
the flux density itself and its time derivative. The local<br />
flux densities are usually estimated by finite element analyses,<br />
completely neglecting hysteresis and eddy currents,<br />
sometimes even the single valued non–linear BH curve.<br />
In a first step FEM calculations approximately determine<br />
the local flux density, from the flux density the iron losses<br />
are determined.<br />
A. Hysteresis loss<br />
In our work we use the EM hysteresis model described<br />
above, to calculate the hysteresis loss for any arbitrary<br />
course <strong>of</strong> flux densities in the material. Series <strong>of</strong> simulations<br />
for constant coil current and varying rotor angle<br />
are used to identify the course <strong>of</strong> the flux density at two<br />
chosen distinct points <strong>of</strong> the rotor (see Fig. 1). Figure 10<br />
shows the evolution <strong>of</strong> the vectorial components <strong>of</strong> the<br />
flux density at point 1 (rotor tooth) versus rotor angle<br />
when coil A is switched on. The symmetry with a change<br />
in sign at 90 ◦ and the periodicity <strong>of</strong> 180 ◦ are evident.
Both components <strong>of</strong> the B–vector, radial and tangential,<br />
and the total <strong>of</strong> the flux density are shown.<br />
Under normal operation each set <strong>of</strong> coils (ABC) is<br />
active during a rotation angle <strong>of</strong> 15 ◦ . Figure 11 shows<br />
the radial component <strong>of</strong> the flux density at point 1 (rotor<br />
tooth) when all coils A–B–C are excited sequentially,<br />
leading to a periodicity <strong>of</strong> 60 ◦ for a fixed point on the<br />
rotor. The total hysteresis loss at point 1 for a complete<br />
revolution <strong>of</strong> 360 ◦ is Wh =6·555.5 J/m 3 = 3332.8 J/m 3<br />
(Fig. 12),<br />
Fig. 10. Flux density at rotor point 1 (tooth) versus rotor angle, coil A<br />
activated with 5 A/mm 2 . The thick lines indicate the normal switched–<br />
on range for coils A.<br />
Fig. 11. Radial component <strong>of</strong> the flux density at rotor point 1 (tooth)<br />
versus rotor angle. Coils A–B–C are excitated sequentially with 5<br />
A/mm 2 .<br />
The same procedure, described above for a rotor tooth,<br />
is applied to point 2 on the yoke part <strong>of</strong> the rotor (see<br />
Fig. 1). Figure 13 shows the evolution <strong>of</strong> the vectorial<br />
components <strong>of</strong> the flux density at point 1 (rotor tooth)<br />
versus rotor angle when coil A is switched on. In the rotor<br />
yoke area there exists only a tangential component <strong>of</strong> the<br />
flux density, the radial component completely vanishes.<br />
Figure 14 shows the tangential component <strong>of</strong> the flux<br />
density at point 2 (rotor yoke) when all coils A–B–C<br />
are excited sequentially, leading to a periodicity <strong>of</strong> 60 ◦<br />
for a fixed point on the rotor. The total hysteresis loss<br />
at point 2 for a complete revolution <strong>of</strong> 360 ◦ is Wh =<br />
6 · 672.1 J/m 3 = 4032.6 J/m 3 (Fig. 15). The two extra<br />
minor loops lead to a significant increase <strong>of</strong> the hysteresis<br />
loss in the rotor’s yoke area.<br />
- 335 - 15th IGTE Symposium 2012<br />
Fig. 12. Simulated BH–loop for a complete period <strong>of</strong> the magnetisation<br />
process point 1, covering 60 ◦ rotational angle.<br />
Fig. 13. Flux density at rotor point 2 (yoke) versus rotor angle, coil A<br />
activated with 5 A/mm 2 . The thick lines indicate the normal switched–<br />
on range for coils A.<br />
Figure 16 shows the hysteresis losses at two points <strong>of</strong><br />
the rotor for a complete rotor revolution in dependence <strong>of</strong><br />
coil current. Although the amplitude <strong>of</strong> the flux density<br />
is significantly larger in the rotor tooth than in the rotor<br />
yoke, the losses in the yoke are dominant due to the<br />
existence <strong>of</strong> a pair <strong>of</strong> extra minor loops. As the yoke<br />
reaches local saturation, however, the flux density becomes<br />
distributed more uniformly, and the amplitude <strong>of</strong><br />
the minor loops decreases, accompanied by a decreasing<br />
hysteresis loss.<br />
Fig. 14. Tangential component <strong>of</strong> the flux density at rotor point 2<br />
(yoke) versus rotor angle. Coils A–B–C are excitated sequentially with<br />
5 A/mm 2 .
Fig. 15. Simulated BH–loop for a complete period <strong>of</strong> the magnetisation<br />
process at point 2, covering 60 ◦ rotational angle.<br />
Fig. 16. Hysteresis loss for one complete rotor revolution versus coil<br />
current. The losses are calculated for two points in the rotor (see Fig. 1).<br />
B. Eddy current loss<br />
Eddy current losses depend on the rate <strong>of</strong> change <strong>of</strong><br />
flux density, the electrical conductivity <strong>of</strong> the material,<br />
and the geometry. As long as the induced eddy currents<br />
are small enough to allow the flux density to completely<br />
penetrate the material, eddy current losses can be locally<br />
calculated straight forward. This criterion would<br />
allow a maximum frequency for sinusoidal excitation<br />
below 1 kHz (Vacodur50: h = 0.35 mm, μr ≈ 4000,<br />
σ =2.83 · 106 S/m). A modified expression [8] can be<br />
used to determine eddy current loss and excess loss:<br />
W ′ e ∼ = κσh2 1<br />
·<br />
12 T ·<br />
T 2 dB<br />
dvdt<br />
0 dt<br />
The time derivative dB/dt is defined by our FEM<br />
results, σ is the electrical conductivity <strong>of</strong> the rotor material,<br />
h is the thickness <strong>of</strong> the rotor steel sheets, κ is the<br />
modified coefficient for excess loss, T is the time period.<br />
Figure 17 shows the resulting eddy current losses for<br />
a complete revolution at 12000 rpm. The modified loss<br />
coefficient was chosen as κ =1. For higher frequencies<br />
the skin–effect has to be taken into account additionally.<br />
VII. CONCLUSIONS<br />
A 2D finite element model <strong>of</strong> a switched reluctance<br />
machine is presented, using the EM–hysteresis model to<br />
- 336 - 15th IGTE Symposium 2012<br />
Fig. 17. Eddy current loss per revolution at 12000 rpm versus coil<br />
current. The losses are calculated for two points in the rotor (see Fig. 1).<br />
derive the non–linear, hysteretic BH–function. The EM<br />
is parameterised from BH–loops from experiment (e.g.<br />
Epstein measurement) or data–sheet information. The<br />
FEM calculations use only the single–valued commutation<br />
curve, derived from EM simulations. Hysteresis loss<br />
is calculated based on the local flux densities resulting<br />
from 2D–FEM, by numerical integration <strong>of</strong> the respective<br />
EM BH–loops. Under the necessary assumption <strong>of</strong> a<br />
negligible influence <strong>of</strong> the skin depth, we can estimate<br />
eddy current and excess losses in the rotor, as well.<br />
VIII. ACKNOWLEDGMENT<br />
Part <strong>of</strong> this research has been supported by the Austrian<br />
FFG, project No. 824164.<br />
REFERENCES<br />
[1] G. Bertotti. Hysteresis in magnetism. Academic Press, 2008.<br />
[2] I. D. Mayergoyz. Mathematical Models <strong>of</strong> Hysteresis. New York,<br />
Springer, 1991.<br />
[3] D. C. Jiles, D. L. Atherton. J. Magn. Magn. Mater. vol. 61, pp.<br />
48–60, 1986.<br />
[4] D. Lin et.al. IEEE Trans. Magn. 40 (2), pp. 1318–1321, 2004.<br />
[5] H. Hauser. J. Appl. Phys. 96 (5), pp. 2753–2767, 2004.<br />
[6] P. Fulmek, P. Haumer, H. Wegleiter, B. Schweigh<strong>of</strong>er. COMPEL<br />
29 (6), pp. 1504–1513, 2010.<br />
[7] P. Fulmek, N. Mehboob, P. Haumer, M. Kriegisch, R. Grössinger.<br />
SMM19, Book <strong>of</strong> Abstracts, B1–16, 2009.<br />
[8] K. Yamazaki, N. Fukushima. IEEE Trans. Magn. 46 (8), 3121 –<br />
3124, 2010.<br />
[9] T. J. E. Miller. Switched Reluctance Motors and their Control.<br />
Magna Physics Publishing and Oxford Science Publications<br />
(1993)<br />
[10] Comsol Multiphysics, http://www.comsol.com.<br />
[11] C. Geuzaine, J.–F. Remacle, Gmsh, http://www.geuz.org/gmsh.<br />
P. Dular, C. Geuzaine, GetDP, http://www.geuz.org/getdp.<br />
[12] Vacodur50, S<strong>of</strong>t magnetic Cobalt Iron http://www.<br />
vacuumschmelze.com/<br />
[13] http://www.aksteel.eu/en/1-products/3-electrical-sheet/
- 337 - 15th IGTE Symposium 2012<br />
An Iterative Domain Decomposition Method for<br />
Solving Wave Propagation Problems<br />
*Gergely Koczka, *Thomas Bauernfeind, *Kurt Preis and *Oszkár Bíró<br />
*Institute for Fundamentals and Theory in Electrical Engineering, Inffeldgasse 18, A-8010 <strong>Graz</strong>, Austria<br />
E-mail: gergely.koczka@TU<strong>Graz</strong>.at<br />
Abstract—Solving wave propagation problems with FEM results in a huge number <strong>of</strong> unknowns due to the large air volume to<br />
be modeled. These equation systems are very ill-conditioned, because <strong>of</strong> the big material differences, element-size changes and<br />
due to the fact that the system matrices are indefinite. Common iterative methods (CG, GMRES) exhibit bad convergence due<br />
to these conditions. The memory requirement is the weakness <strong>of</strong> the direct methods. The aim <strong>of</strong> this paper is to present a<br />
method, which has smaller memory requirement than the direct methods, and converging faster than iterative methods.<br />
Index Terms—no more than 4 in alphabetical order.<br />
I. INTRODUCTION<br />
It is always an open question how to solve huge<br />
indefinite, ill-conditioned equation systems efficiently.<br />
Common iterative methods (CG, GMRES) exhibit bad<br />
convergence due to these conditions [1]. Assembling the<br />
system matrix <strong>of</strong> wave propagation problems in frequency<br />
domain with finite element method (FEM) results in this<br />
kind <strong>of</strong> equation systems because <strong>of</strong> the huge material<br />
differences and element-size changes.<br />
Applying direct solver methods to overcome the<br />
problem <strong>of</strong> the ill-conditioned system <strong>of</strong> equations results<br />
in high memory requirements [2]. The aim <strong>of</strong> this paper is<br />
to present a method with smaller memory requirement<br />
than the direct methods and better convergence quality<br />
than common iterative methods.<br />
The memory requirement <strong>of</strong> the classical LU<br />
decomposition applied to sparse matrices can be reduced<br />
with fill-in reduction algorithms: the minimum degree<br />
algorithm [3] or the nested dissection algorithm [4].<br />
These algorithms are implemented in the Intel® Math<br />
Kernel Library (PARDISO: sparse linear equation system<br />
solver routine). However, the memory requirement <strong>of</strong> the<br />
direct method is higher than first order: between<br />
2<br />
Onlog n and <br />
1.3<br />
1.4<br />
O n , typically On ,<br />
<br />
O n .For<br />
huge systems a method is required which is capable <strong>of</strong><br />
decreasing it.<br />
Fig. 1. The geometrical decomposition <strong>of</strong> the domain<br />
II. DOMAIN DECOMPOSITION<br />
With geometrical domain decomposition it is possible<br />
to decrease the memory requirement <strong>of</strong> the direct method.<br />
The problem domain should be subdivided into n<br />
open disjunctive sub-domains, namely<br />
<br />
n<br />
<br />
,<br />
i1<br />
i<br />
(1)<br />
i, j 1,2,.., n : i j . (2)<br />
i j<br />
The interfaces between the domains are<br />
ij : i j ,<br />
n<br />
(3)<br />
: .<br />
(4)<br />
<br />
i, j1<br />
With these notations, the equation system can be<br />
written in block-form:<br />
AII AIxI bI ,<br />
AIA <br />
x<br />
<br />
b<br />
<br />
<br />
where the sub-matrix AII corresponds to the domains,<br />
ij<br />
(5)<br />
A to the interfaces between the domains and I A and<br />
AI to the connections <strong>of</strong> the sub-domains and the<br />
interface.<br />
The Schur-complement equation system <strong>of</strong> (5) is<br />
obtained as<br />
<br />
A A A A x b A A b .<br />
<br />
1 1<br />
I II I I<br />
II I<br />
<br />
S<br />
c<br />
(6)<br />
To build the Schur-complement matrix S ,<br />
it is<br />
necessary to have regular subsystems corresponding to<br />
the domains (the matrix AII has to be regular).<br />
The memory requirement <strong>of</strong> the method can be<br />
estimated as follows:<br />
Let us assume that the memory requirement <strong>of</strong> the
direct method is One <br />
, where ne is the number <strong>of</strong><br />
equations and 1 2.<br />
If all the sub-domains have the<br />
same number <strong>of</strong> unknowns, then<br />
1<br />
<br />
ne <br />
n nen ,<br />
n<br />
<br />
1<br />
<br />
i.e. the overall memory requirement will be decreased by<br />
a multiplier which depends on the number <strong>of</strong> the subdomains.<br />
Assembling the Schur complement matrix takes<br />
a long time. However, for solving the reduced equation<br />
system, not the full Schur complement matrix is<br />
necessary. Applying the biconjugate gradient method<br />
(BiCG) to solve the Schur complement equation system<br />
will result in an efficient iterative solver (DD-BiCG) for<br />
solving wave propagation problems.<br />
III. ANALYSIS OF THE METHOD<br />
A. Stability<br />
Since the original equation system is symmetric, the<br />
matrices corresponding to the sub-domains and the<br />
interfaces are also symmetric. To improve the<br />
conditioning <strong>of</strong> (6), the following form is used:<br />
T T<br />
I L A A A L L x L b L<br />
A A b<br />
<br />
where L <br />
T A LL 1 1 1 1 1<br />
I II I I<br />
II I<br />
<br />
1T L SL y 1<br />
L c<br />
is the Cholesky decomposition <strong>of</strong> A <br />
. Using (8) with BiCG without<br />
preconditioning is theoretically equivalent to the form (6)<br />
using A as a preconditioner. The practical examples<br />
show that the symmetric form (8) is more stable and<br />
therefore converging faster.<br />
B. Condition number<br />
The convergence speed <strong>of</strong> the BiCG depends on the<br />
condition number <strong>of</strong> the equation system.<br />
Since the A,v formulation is used to solve the<br />
Maxwell’s equations, the resulting linear equation system<br />
is singular. To formulate a regular system a tree has to be<br />
eliminated in the discretized domain. Due to the fact that<br />
the singular system is better conditioned than the regular<br />
one, it is better to work with the singular system.<br />
If and only if the original matrix is singular, the Schurcomplement<br />
matrix is also singular, if the sub-domain<br />
matrix AII is regular.<br />
1<br />
AII :<br />
1<br />
<br />
A AIAII AIv0AII AIvI 0 1<br />
v A I A<br />
<br />
v<br />
<br />
0<br />
.(9)<br />
I AII AIv <br />
<br />
So to let the Schur complement matrix be singular, it is<br />
enough to not eliminate edges on the interface.<br />
A huge advantage <strong>of</strong> this method is that the matrix<br />
,<br />
- 338 - 15th IGTE Symposium 2012<br />
(7)<br />
(8)<br />
corresponding to the sub-domains is block-diagonal, and<br />
its blocks can be inverted in parallel. The speed <strong>of</strong> one<br />
iteration-step <strong>of</strong> the DD-BiCG can be increased by<br />
choosing the sub-domains with about the same number <strong>of</strong><br />
unknowns and using a multi-core computer.<br />
IV. NUMERICAL RESULT<br />
The efficiency <strong>of</strong> this method is shown on the example<br />
<strong>of</strong> a dipole. The dipole has a length <strong>of</strong> 140 mm, a width <strong>of</strong><br />
1 mm and thickness <strong>of</strong> 20 μm. There is a 160 μm air gap<br />
in the middle (see Fig. 3.). An air volume <strong>of</strong> 250 mm<br />
radius is modeled around the antenna (see Fig. 2.). The<br />
air volume is truncated by a first order absorbing<br />
boundary condition (ABC).<br />
Fig. 2. The structure <strong>of</strong> the dipole antenna and the truncation <strong>of</strong> the air<br />
volume. 1/8 model.<br />
The voltage is prescribed in the air gap (1 V, 1.5 GHz).<br />
Modeling an eighth <strong>of</strong> the problem, using A,v formulation<br />
(A is the magnetic vector potential, v is the modified<br />
electric scalar potential), and second order hexahedral<br />
finite elements (20 nodes, 36 edges) the resulting problem<br />
has 1.986.152 edges and 669.398 nodes.<br />
Fig. 3. The structure <strong>of</strong> the dipole antenna (yellow) near the air gap and<br />
the prescribed electric field in the gap (blue). 1/8 model.<br />
The efficiency <strong>of</strong> the DD-BiCG method compared with<br />
the incomplete Cholesky preconditioned Biconjugate<br />
gradient method (IC-BiCG) is shown in Fig. 6. The<br />
convergence criterion was the global relative residual to<br />
become smaller than 10 -7 .<br />
To demonstrate the efficiency <strong>of</strong> the method, two<br />
different decompositions were tested. In the first case the<br />
domain has been subdivided into 5 sub-domains (see Fig.<br />
4.), in the second case 8 sub-domains. (Fig. 5.).
Fig. 4. The problem with five sub-domains.<br />
In the first case the first domain is the antenna, the second<br />
is a small air volume around the antenna, and the huge air<br />
volume is subdivided into three parts.<br />
Fig. 5. The problem with eight sub-domains.<br />
In the second case, the antenna and a small air volume<br />
again build the first two sub-domains, but the air volume<br />
is subdivided into six parts.<br />
Method<br />
name<br />
TABLE I<br />
Comparison <strong>of</strong> the methods<br />
Memory<br />
Requirement<br />
Iterations Run<br />
time (h)<br />
ICCG 9,0 GB 68.750 77,86<br />
DDCG<br />
5 Domains<br />
DDCG<br />
8 Domains<br />
41,8 GB 975 4,41<br />
31,8 GB 1.036 4,66<br />
LU 81,0 GB - -<br />
To solve the problem an “Intel(R) Xeon(R) CPU 2x<br />
X5570@2.93GHz 8 cores 64 GB RAM” computer was<br />
used.<br />
- 339 - 15th IGTE Symposium 2012<br />
Fig. 6. The best residuum <strong>of</strong> the methods during the iterations.<br />
Blue line: DD-BiCG (5 domains) global residuum;<br />
Red line: DD-BiCG residuum on the interface;<br />
Blue dotted line: DD-BiCG (8 domains);<br />
Red dotted line: DD-BiCG (8 domain) residuum on the interface;<br />
Black line: IC-BiCG residuum in the whole domain.<br />
V. CONCLUSION<br />
Applying the domain decomposition method for<br />
solving huge indefinite equation system iteratively results<br />
in an efficient method with reduced memory requirement<br />
compared to direct methods, and accelerates the iteration<br />
by decreasing the number <strong>of</strong> iterations. It enables an<br />
efficient parallelization technique in implementating <strong>of</strong><br />
the algorithm.<br />
The choice <strong>of</strong> the sub-domains is very important to<br />
increase the efficiency <strong>of</strong> the method. The sub-domains<br />
should have about the same number <strong>of</strong> unknowns.<br />
Increasing the number <strong>of</strong> domains decreases the memory<br />
requirement but results in a higher condition number.<br />
REFERENCES<br />
[1] O. Nevanlinna, Convergence <strong>of</strong> iterations for linear equations.<br />
Birkhauser Verlag AG, Basel, 1993, pp. viii+177<br />
[2] G. Koczka , T. Bauernfeind, K. Preis and O. Bíró, "Schur<br />
complement method using domain decomposition for solving<br />
wave propagation problems," presented at The 10th International<br />
Workshop on Finite Elements for Microwave Engineering,<br />
Meredith, New Hampshire United States, Oct. 12-13, 2010.<br />
[3] J.W.H. Liu. Modification <strong>of</strong> the Minimum-Degree algorithm by<br />
multiple elimination. ACM Transactions on Mathematical<br />
S<strong>of</strong>tware, 11(2):141-153, 1985.<br />
[4] G. Karypis and V. Kumar. A Fast and High Quality Multilevel<br />
Scheme for Partitioning Irregular Graphs. SIAM Journal on<br />
Scientific Computing, 20(1):359-392, 1998.
- 340 - 15th IGTE Symposium 2012<br />
On Effectiveness <strong>of</strong> Model Order Reduction<br />
for Computational Electromagnetism<br />
*Yuki Sato, *Hajime Igarashi<br />
*Graduate School <strong>of</strong> Information Science and <strong>Technology</strong>, Hokkaido <strong>University</strong><br />
Kita 14, Nishi 9, Kita-ku, Sapporo, 060-0814<br />
E-mail: yukisato@em-si.eng.hokudai.ac.jp<br />
Abstract— This paper presents the model reduction method based on the method <strong>of</strong> snapshots for time-domain finite element<br />
analysis <strong>of</strong> quasi-static electromagnetic fields. In this method, the snapshots <strong>of</strong> transient electromagnetic fields for relatively<br />
short periods are stored to build the variance-covariance matrix, from whose eigenvalues the basis functions for reduced<br />
analysis are constructed. In this paper, the effect <strong>of</strong> various parameters in the present method such as the number <strong>of</strong><br />
snapshots, snapshot intervals on the results <strong>of</strong> the reduced field computations is discussed.<br />
Index Terms—Model order reduction, finite element method, method <strong>of</strong> snapshots, eddy current problem.<br />
applying it to three dimensional eddy current problems.<br />
Moreover, we discuss a possible method to determine the<br />
adequate values <strong>of</strong> the parameters for this method.<br />
I. INTRODUCTION<br />
In recent years, finite element method (FEM) has<br />
widely been applied to transient analysis <strong>of</strong> quasi-static<br />
and high-frequency electromagnetic fields. However,<br />
since FE equations must be solved at each time step, it<br />
has significant computational burden. Therefore, a lot <strong>of</strong><br />
efforts have been made to reduce the computational times<br />
for analysis <strong>of</strong> transient electromagnetic fields.<br />
One <strong>of</strong> the most promising methods to shorten the<br />
computational time would be the time-period explicit<br />
error correction (TP-EEC) method [1], [2]. It has been<br />
shown that TP-EEC method applied to non-linear eddy<br />
current problems reduces the computational time for<br />
transient analysis and gives correct steady state solutions<br />
[1], [2]. However, TP-EEC method cannot be applied to<br />
analysis <strong>of</strong> non-time-periodic problems or high-frequency<br />
problems.<br />
On the other hand, there is yet another method, called<br />
the model order reduction, which can reduce the<br />
computational time for transient analysis [3], [4]. In this<br />
method, the snapshots <strong>of</strong> transient solution are stored for<br />
initial short period. Then, using these snapshotted<br />
solutions, a variance-covariance matrix is constructed and<br />
the eigenvectors <strong>of</strong> this matrix is computed. The reduced<br />
FE matrix is then constructed using the transform matrix<br />
whose column space is spanned by the dominant<br />
eigenvectors. There are a few merits in this method; it can<br />
accurately analyze the transient solutions and can be<br />
applied to non-time-periodic problems and highfrequency<br />
problems. However, in order to realize accurate<br />
analysis, it is important to determine adequate values <strong>of</strong><br />
the parameters in this method such as snapshot interval<br />
and period, and number <strong>of</strong> the basis vectors that are<br />
chosen from the eigenvectors <strong>of</strong> the variance-covariance<br />
matrix. However, the dependence <strong>of</strong> the accuracy on<br />
these parameters has not been clarified. Moreover, though<br />
the effectiveness <strong>of</strong> the model reduction for twodimensional<br />
eddy current problems has been discussed<br />
[5], that for three dimensional problems has not been<br />
shown yet.<br />
In our study, the dependence <strong>of</strong> the accuracy in the<br />
model reduction method on its parameters is evaluated by<br />
II. REDUCTION TECHNIQUE<br />
A. Time-Domain Finite Element Method<br />
The A-φ (A-V) method is used for FE analysis <strong>of</strong><br />
quasi-static electromagnetic analysis. The governing<br />
equations derived from Maxwell’s equations is expressed<br />
as<br />
A<br />
<br />
<br />
rot rotA<br />
grad J , (1)<br />
t<br />
t<br />
<br />
A <br />
<br />
div grad 0 ,<br />
(2)<br />
t<br />
t<br />
<br />
where ν is magnetic resistivity, is conductivity and J is<br />
forced current density. The vector potential A and scholar<br />
potential φ are discretized as follows<br />
e<br />
<br />
A a N ,<br />
(3)<br />
j<br />
n<br />
<br />
j<br />
j<br />
, (4)<br />
j<br />
j j N<br />
where e and n is the number <strong>of</strong> edges and nodes, and Nj<br />
and Nj are vector and scholar interpolation functions<br />
respectively. The weighted residual method with Galerkin<br />
method applied to (1) and (2) results in the FE equation<br />
given by<br />
K 0a<br />
d N<br />
S a<br />
b<br />
<br />
,<br />
t<br />
0 0<br />
<br />
<br />
d<br />
<br />
S M<br />
<br />
<br />
0<br />
(5)<br />
<br />
t <br />
<br />
where<br />
K rotN<br />
rotN<br />
dV<br />
,<br />
(6)<br />
ij<br />
<br />
V<br />
i<br />
j<br />
Nij <br />
N i N jdV<br />
,<br />
(7)<br />
V<br />
Sij <br />
N i grad N jdV<br />
,<br />
(8)<br />
V<br />
<br />
M grad N grad N dV<br />
, (9)<br />
ij<br />
V<br />
i<br />
j
i <br />
N i JdV.<br />
V<br />
(10)<br />
Moreover, time derivative is approximated by the finite<br />
difference and the unknown variables and right hand<br />
vector are interpolated as<br />
k<br />
x x<br />
k1<br />
( 1<br />
) x ,<br />
(11)<br />
k<br />
b b<br />
k1<br />
( 1<br />
) b ,<br />
(12)<br />
where x = [a φ] t , 0 ≤ θ ≤ 1 and k represents time steps.<br />
Equation (5) now becomes<br />
1 N<br />
<br />
t<br />
t<br />
<br />
S<br />
S K<br />
M<br />
<br />
<br />
0<br />
0<br />
k<br />
0<br />
<br />
<br />
<br />
x <br />
<br />
1 N<br />
<br />
t<br />
t<br />
<br />
S<br />
S K<br />
( 1<br />
)<br />
M<br />
<br />
0<br />
k<br />
k 1<br />
0<br />
k 1<br />
b ( 1<br />
) b <br />
<br />
,<br />
0<br />
<br />
x<br />
<br />
0 <br />
(13)<br />
where Δt is the time step interval. The transient solutions<br />
can be obtained by solving (13) at each time step.<br />
B. Model Reduction<br />
As mentioned above, solution <strong>of</strong> (13) at each time step<br />
is computationally expensive if the number <strong>of</strong> unknowns<br />
is large. To reduce the computational time, the reduced<br />
equation is obtained from (13) using the model reduction<br />
method. To do so, after obtaining snapshots for the initial<br />
periods by solving (13), the variance-covariance matrix<br />
Cm<br />
t<br />
m XX C (14)<br />
is constructed where<br />
1<br />
2<br />
s<br />
X [<br />
x μ x μ x μ]<br />
, (15)<br />
x i , i=1,2,..,s are snapshotted solution vectors, s is the<br />
number <strong>of</strong> snapshots (m>>s) and μ is the mean vector <strong>of</strong><br />
these solutions. Note that the matrix Cm is a dense matrix<br />
whose size is the same as that <strong>of</strong> the FE matrix.<br />
Therefore, numerical solution to the eigenvalue problem<br />
for (14) is computationally prohibitive. In order to<br />
alleviate this problem, we consider the smaller matrix <strong>of</strong><br />
s×s defined by<br />
t<br />
Cs<br />
X X<br />
(16)<br />
instead <strong>of</strong> Cm. The eigenvalues <strong>of</strong> Cm and Cs are identical<br />
except m-s zero eigenvalues <strong>of</strong> Cm as shown below. The<br />
singular value decomposition <strong>of</strong> matrix XR m×s is given<br />
by<br />
X<br />
s<br />
<br />
i1<br />
t<br />
t<br />
u v UV<br />
,<br />
(17)<br />
i<br />
i<br />
i<br />
where UR m×s , VR s×s and<br />
diag[ 1 2 s ] , (18)<br />
σ1 ≥σ2 ≥ ... ≥σs ≥ 0 and σi is singular value <strong>of</strong> X. The<br />
matrices U and V satisfy<br />
t<br />
U I ,<br />
(19)<br />
U s<br />
t<br />
V V Is<br />
,<br />
(20)<br />
where Is denotes the s×s unit matrix. Then, the matrices<br />
Cm and Cs can be decomposed as follows:<br />
C<br />
2 t<br />
U<br />
U ,<br />
(21)<br />
m<br />
- 341 - 15th IGTE Symposium 2012<br />
2 t<br />
Cs V<br />
V . (22)<br />
From eqs. (21) and (22), we find that the eigenvalues <strong>of</strong><br />
Cm and Cs are essentially identical. Moreover, since<br />
X vi iu<br />
(23)<br />
i<br />
holds, the eigenvectors <strong>of</strong> Cm can be easily obtained by<br />
solving the eigenvalue problem for Cs.<br />
Then, the dominant r eigenvectors are chosen to<br />
construct the matrix defined by<br />
W [ 1 2<br />
r ] . w w w <br />
(24)<br />
It is assumed that the original unknown variable x k R m<br />
can be expressed by the linear combination <strong>of</strong> the reduced<br />
variables y k R r in the form<br />
W .<br />
k<br />
k<br />
x y<br />
(25)<br />
Using the transform (25), the original FE equation (13),<br />
which is simply expressed by Ax=b, can be reduced to<br />
t n t n<br />
W AWy<br />
W b .<br />
(26)<br />
Since the size <strong>of</strong> the coefficient matrix W t AW is r×r<br />
(m>>s>r), eq. (26) can be solved much faster than (13).<br />
III. ANALYSIS OF BULK CONDUCTOR MODEL<br />
The bulk conductor model shown in Fig. 1 is analyzed<br />
by using the present method. The FE model has 125000<br />
nodes, 117649 elements, 367500 edges and 369036<br />
unknown variables. The conductivity and relative<br />
permeability in the magnetic material are 0.510 7 S/m<br />
and 1000. The driving frequency is set to 50 Hz and time<br />
step is ∆t=10 -4 sec. Under these conditions, as the time<br />
constant τ is estimated to be about 0.01 sec and the period<br />
T is 0.02 sec, the relation T>τ holds.<br />
A. Dependence on snapshot interval and period<br />
We change the snapshot intervals and the period during<br />
which the snapshots are taken to clarify the dependence <strong>of</strong><br />
the solution on them. In this study, the solution to eq. (13)<br />
is snapshotted from 0 to T with snapshot intervals 4∆t,<br />
2∆t and ∆t. The snapshot period is T, T/2 and T/4.<br />
Moreover, the number <strong>of</strong> the basis functions is set as r=40<br />
and 50.<br />
The time variation in the magnetic flux density |B| at<br />
the center <strong>of</strong> magnetic material is shown in Fig. 2. In this<br />
problem, we set the number <strong>of</strong> basis function as r=40. In<br />
Fig. 2(a), we find that there are no significant differences<br />
between the original solution and the solutions obtained<br />
by the conventional method and model reduction method<br />
with different snapshot intervals. Also, Fig. 2(b) plots the<br />
time changes in |B| for initial period, 0
35<br />
20<br />
15<br />
y(mm)<br />
magnetic<br />
material<br />
J(A/m 2 )<br />
TABLE I<br />
DEPENDENCE OF COMPUTATIONAL TIME AND ERROR ON SNAPSHOT<br />
INTERVAL AND PERIOD<br />
(A) SNAPSHOT INTERVAL<br />
Snapshot intervals Δt 2Δt 4Δt<br />
computational time (%) 43.2 37.3 29.6<br />
error e(%) 0.07 0.19 0.65<br />
Snapshot period<br />
(B) SNAPSHOT PERIOD<br />
T T/2 T/4<br />
computational time (%) 42.5 30.0 24.7<br />
error e(%) 0.02 0.32 0.60<br />
TABLE II<br />
DEPENDENCE OF COMPUTATIONAL TIME AND ERROR ON NUMBER OF<br />
BASIS FUNCTION<br />
Number <strong>of</strong> basis function 20 30 40<br />
computational time (%) 40.2 41.8 43.2<br />
error (%) 2.38 0.29 0.07<br />
TABLE III<br />
DEPENDENCE OF COMPUTATIONAL TIME AND ERROR ON NUMBER OF<br />
BASIS FUNCTION LONGER TIME CONSTANT<br />
Number <strong>of</strong> basis function 40 60 80<br />
computational time (%) 46.6 56.0 65.5<br />
error (%) 0.39 0.15 0.07<br />
conductor, shown in Fig. 4, obtained by the conventional<br />
method and present method in which the snapshot period<br />
is set to T/4. The discrepancy can also be found in the<br />
initial responses shown in Fig. 3(b). This suggests that the<br />
long range errors can be predicted from the initial<br />
responses. That is, by performing the analysis using the<br />
present method for initial short period changing the<br />
snapshot period, we could know the appropriate value for<br />
it.<br />
The error between the original solution and that<br />
obtained by the present method is defined by<br />
e <br />
H i<br />
i<br />
<br />
i<br />
red<br />
H<br />
H<br />
i<br />
i<br />
z(mm)<br />
x(mm)<br />
15 20 35<br />
15<br />
coil<br />
Figure1 : Bulk conductor model<br />
100(%)<br />
- 342 - 15th IGTE Symposium 2012<br />
(27)<br />
where Hi red is the magnetic field obtained by the present<br />
method and Hi is computed from the original solution.<br />
Table I and II summarize the error e evaluated at t=0.08<br />
sec where the solutions sufficiently converge to steady<br />
state and corresponding computational time. We can see<br />
that the errors e become small as the snapshot interval<br />
decreases or the snapshot period increases. However, the<br />
35<br />
5<br />
x<br />
magnetic<br />
material<br />
20<br />
coil<br />
35<br />
x(mm)<br />
Magnetic Density (T)<br />
Magnetic Density (T)<br />
2.00E-04<br />
1.50E-04<br />
1.00E-04<br />
5.00E-05<br />
0.00E+00<br />
Original solution without reduction<br />
T/4<br />
T/2<br />
T<br />
-5.00E-05<br />
0 0.02<br />
Time (s)<br />
0.04 0.06<br />
(a) |B| during 0
(a) Original solution.<br />
(b) Solution obtained by present method.<br />
Figure 4 : Eddy current distribution.<br />
computational time simultaneously increases. This means<br />
that we must determine these parameters considering both<br />
effects.<br />
B. Dependence on number <strong>of</strong> basis functions<br />
In this section, we discuss the dependence <strong>of</strong> the<br />
solutions obtained by the present method on r, the number<br />
<strong>of</strong> the basis functions. The snapshot period and interval<br />
are set to T and Δt, respectively. The analysis results are<br />
shown in Fig. 5. It can be found both in Fig. 5 (a) and (b)<br />
that the solutions obtained by the present results approach<br />
the original solution as r increases.<br />
The computational time and error e depending on r are<br />
summarized in Table II. We can see that e decreases as r<br />
increases while there are little dependence <strong>of</strong> the<br />
computational time on r.<br />
C. Bulk conductor with longer time constant<br />
To test the validity <strong>of</strong> the present method for systems<br />
with longer time constants, we increase the conductivity <br />
- 343 - 15th IGTE Symposium 2012<br />
Magnetic Density (T)<br />
Magnetic Density (T)<br />
2.00E-04<br />
1.50E-04<br />
1.00E-04<br />
5.00E-05<br />
0.00E+00<br />
-5.00E-05<br />
0 0.01 0.02 0.03 0.04 0.05<br />
(a) |B| during 0
(a) Original solution<br />
(b) Primal basis vector w1<br />
(c) Second basis vector w2<br />
(d) Fifth basis vector w5<br />
Figure 7 : Distribution <strong>of</strong> original solution and basis vector.<br />
- 344 - 15th IGTE Symposium 2012<br />
35<br />
20<br />
15<br />
Magnetic Density (T)<br />
2.50E-04<br />
2.00E-04<br />
1.50E-04<br />
1.00E-04<br />
5.00E-05<br />
Original solution without reduction<br />
Δt<br />
0.00E+00<br />
-5.00E-05<br />
2Δt<br />
4Δt<br />
0 0.01 0.02<br />
Time (s)<br />
0.03 0.04 0.05<br />
(a) |B| during 0
sec. The snapshot period is set to T/2, T/4 and T/8. The<br />
snapshot interval and the number <strong>of</strong> basis vectors are set<br />
to Δt and 40, respectively.<br />
The time change in |B| at the center <strong>of</strong> the model is<br />
shown in Fig. 9, where (a) and (b) show the relatively<br />
long-range and initial responses, respectively. Due to the<br />
structure <strong>of</strong> the stacked iron core, the time constant <strong>of</strong> this<br />
system is much smaller than that <strong>of</strong> the bulk iron shown in<br />
Fig. 1. We can see that in Fig. 9 that the solutions are in<br />
good agreement with the original solution except during<br />
the initial short period.<br />
V. CONCLUSION<br />
In this paper, the three dimensional time-domain FE<br />
analysis using the model order reduction based on the<br />
method <strong>of</strong> snapshots has been presented. Effectiveness <strong>of</strong><br />
this present method is shown for bulk conductor and<br />
stacked iron model. It has been found that the snapshot<br />
period and number <strong>of</strong> basis functions have great influence<br />
on the transient solutions obtained by the present method.<br />
It has been suggested that these parameters could be<br />
appropriately determined by performing time marching<br />
for initial some steps for the different parameter values.<br />
In future, we plan to apply the present method to nonlinear<br />
eddy current problems and high-frequency<br />
problems.<br />
REFERENCES<br />
[1] Y. Takahashi, T. Tokumasu, M. Fujita, S. Wakao, T. Iwashita,<br />
and M.Kanazawa, “Improvement <strong>of</strong> convergence characteristic in<br />
nonlinear transient eddy-current analyses using the error<br />
correction <strong>of</strong> time integration based on the time-periodic FEM and<br />
the EEC method,” (in Japanese) IEEJ Trans. PE, vol. 129, no. 6,<br />
2009.<br />
[2] H. Igarashi, Y. Watanabe and Y. Ito, ”Why Error Correction<br />
Methods Realize Fast Computations,” IEEE Trans. Magn., vol.<br />
48, no. 2, pp.415-418, 2012.<br />
[3] Krysl, P., S. Lall, and J. Marsden, “Dimensional Model Reduction<br />
in Non-linear Finite Element Dynamics <strong>of</strong> Solids and Structures,”<br />
International Journal for Numerical Methods in Engineering,<br />
vol. 51, pp479-504, 2001.<br />
[4] G. Kerschen J. Golinval, AF. Vakakis, LA. Bergman, The<br />
method <strong>of</strong> proper orthogonal decomposition for dynamical<br />
characterization and order reduction <strong>of</strong> mechanical systems: an<br />
overview, Nonlinear Dynamics, vol. 41, pp. 147 169, 2005.<br />
[5] S. Rutenkroger, B. Deken, S. Pekarek, Reduction <strong>of</strong> Model<br />
Dimension in Nonlinear Finite Element Approximations <strong>of</strong><br />
Electromagnetic, Computers in Power Electronics, 2004,<br />
<strong>Proceedings</strong>. IEEE Workshop on, pp. 20-27, Aug., 2004.<br />
[6] P. Holmes, JL. Lumley, G. Berkooz, Turbulence; Coherent<br />
Structures; Dynamical Systems and Symmetry. Cambridge<br />
<strong>University</strong> Press: Cambridge, 1996.<br />
- 345 - 15th IGTE Symposium 2012
- 346 - 15th IGTE Symposium 2012<br />
Calculation <strong>of</strong> eddy-current probe signal for a<br />
3D defect using global series expansion<br />
Sándor Bilicz, József Pávó and Szabolcs Gyimóthy<br />
Budapest <strong>University</strong> <strong>of</strong> <strong>Technology</strong> and Economics<br />
Department <strong>of</strong> Broadband Infocommunications and Electromagnetic Theory<br />
Goldmann Gy. tér 3.,1111 Budapest, Hungary<br />
E-mail: bilicz@evt.bme.hu<br />
Abstract—A novel eddy-current modeling technique <strong>of</strong> volumetric defects embedded in conducting plates is presented in<br />
the paper. This problem is <strong>of</strong> great interest in electromagnetic nondestructive evaluation (ENDE) and has already been<br />
exhaustively studied. The defect is modeled by a volumetric current dipole density which satisfies an integral equation.<br />
The latter is solved by the classical method <strong>of</strong> moments. This have been usually based on the volume discretisation <strong>of</strong><br />
the defect. Contrarily, –as a new contribution– we propose the use <strong>of</strong> globally defined, continuous basis functions for the<br />
expansion <strong>of</strong> the current dipole density. This global expansion lets us expect for an improvement <strong>of</strong> the numerical stability<br />
and the performance <strong>of</strong> the simulation. The proposed method is tested against both measured and synthetic data obtained<br />
by a different defect model.<br />
Index Terms—eddy-current modeling, integral equation, global expansion, moment method<br />
I. INTRODUCTION<br />
Eddy-current nondestructive testing (ECT) is a widely<br />
used technique to reveal and characterize in-material<br />
flaws (inclusions, voids, cracks, etc.) within conducting<br />
specimens. The principle <strong>of</strong> ECT is based on the local<br />
changes in the specimen’s electromagnetic (EM) parameters<br />
due to the flaw. These changes result in an EM field<br />
different from the field in the flawless case. Either the<br />
field directly, or a deduced quantity (e.g., impedance <strong>of</strong><br />
a probe coil) is measured during a nondestructive test and<br />
the acquired data are used for the flaw reconstruction.<br />
The inverse problem <strong>of</strong> nondestructive testing can be<br />
ill-posed. This means that any <strong>of</strong> the existence, unicity<br />
and stability <strong>of</strong> the solution is not necessarrily provided.<br />
Beyond these theoretical challenges, the flaw characterization<br />
can be numerically demanding as well: the<br />
inversion algorithms are <strong>of</strong>ten iterative, i.e., several flaws<br />
are to be sequentially simulated in an optimisation loop.<br />
Consequently, a key element <strong>of</strong> the inversion is a fast and<br />
reliable numerical simulation <strong>of</strong> flaws.<br />
Classical attempts <strong>of</strong> flaw modeling are the integral<br />
approaches. They can cope with the difficulties arisen by<br />
the relatively small size <strong>of</strong> flaws compared to the excited<br />
region (yielding discretisation issues). The classical work<br />
[1] presents a flaw simulation where the flawed volume is<br />
discretised by a regular grid. The yielded volume integral<br />
eqution is resolved by the Method <strong>of</strong> Moments (MoM)<br />
[2], assuming a piecewise constant approximation <strong>of</strong> the<br />
EM field over each cell <strong>of</strong> the grid. By now, this method<br />
has been implemented in commercial s<strong>of</strong>twares, e.g., [3],<br />
and has been successfully applied in inversion algorithms<br />
as well [4]. The volume integral method has recently<br />
been revisited in [5], where the EM field is expanded<br />
by means <strong>of</strong> locally defined splines. This provides the<br />
smoothness <strong>of</strong> the field, which is violated in the previous<br />
approach. Variational formalisms have also been tried<br />
with success: in [6], a Finite Element Method (FEM)<br />
scheme is presented for the separated computation <strong>of</strong> the<br />
field in the flawless specimen and the “reaction field”<br />
risen by the presence <strong>of</strong> the flaw. Coupled methods<br />
have been introduced, e.g., in [7]: a FEM code for the<br />
computation <strong>of</strong> the flawless field is coupled with a surface<br />
integral scheme <strong>of</strong> the ideally thin crack model.<br />
In [8], an ideally thin crack is considered which is<br />
modeled by a surface integral equation, again resolved<br />
by MoM, using a piecewise constant approximation.<br />
Though some <strong>of</strong> the works above were carried out<br />
decades ago, several pitfalls are still present in eddycurrent<br />
flaw modeling. Today’s challenges are mainly<br />
related to the increasing needs <strong>of</strong> flaw inversion in the<br />
sense <strong>of</strong> accuracy and speed. Beyond being small, flaws<br />
can have bad aspect ratio as well, making the volumetric<br />
models fail. It is also not straightforward how to choose<br />
between the volumetric and the ideally thin crack models<br />
for an arbitrary defect. The optimisation-based inversion<br />
schemes can badly perform if the sensitivity data are<br />
inaccurate. Another important issue is the convergence <strong>of</strong><br />
the simulation with respect to the discretisation applied.<br />
In case <strong>of</strong> a grid-discretisation, this can only be controled<br />
at the price <strong>of</strong> computational load.<br />
These challenges inspired the improvement <strong>of</strong> the<br />
MoM-based discretisation techniques <strong>of</strong> the integral<br />
equation models. The above cited formalisms are resolved<br />
by using locally defined basis functions for the<br />
expansion <strong>of</strong> the EM field. A new approach has been<br />
presented in [9], where the basis functions were globally<br />
defined (i.e., all over the surface <strong>of</strong> the ideal crack)<br />
harmonic functions. The use <strong>of</strong> such global expansion
provided considerable advantages over local expansions.<br />
In this paper, we present the use <strong>of</strong> global expansion<br />
functions for volumetric flaw modeling. In a certain<br />
sense, this is an extension <strong>of</strong> the method formalized for<br />
the ideally thin flaws in [9].<br />
II. THE VOLUME INTEGRAL METHOD<br />
Let us consider a non-magnetic, conducting specimen<br />
(to be tested against material flaws) with a homogeneous<br />
conductivity σ0. A time-harmonic source (typically, a<br />
coil) near the specimen induces eddy-currents within the<br />
conductive medium. In the presence <strong>of</strong> a flaw embedded<br />
in the volume region V , the otherwise constant conductiviy<br />
<strong>of</strong> the specimen will locally change: σ = σ(r),<br />
r ∈ V , so the EM field will change, too. The EM field<br />
can be decomposed into a so-called incident term and a<br />
defect term :<br />
E(r) =E i (r)+E d (r), (1)<br />
where only Ei (r) would exist in the flawless (σ(r) ≡ σ0)<br />
specimen, whereas Ed (r) rises due to the flaw. The latter<br />
is imagined as the field corresponding to a fictious source<br />
distribution which takes place in the flawless specimen<br />
and has exactly the same effect as imposed by the flaw<br />
[1]. Formally, let the secondary source be a current dipole<br />
density P =(σ(r) − σ0)E, r ∈ V .ThenEd (r) can be<br />
expressed as<br />
E d <br />
(r) =−jωμ0 G(r|r ′ )P(r ′ )dV ′ , (2)<br />
V<br />
where G(r|r ′ ) is the electric-electric dyadic Green’s<br />
function transforming the current density excitation at<br />
the point r ′ to the generated electric field at the point<br />
r. ω is the angular frequency <strong>of</strong> the source and μ0 is<br />
the vacuum permeability. By substituting (2) into (1),<br />
using the definition <strong>of</strong> P, we get a Fredholm-type integral<br />
equation <strong>of</strong> the second kind for the unknown current<br />
dipole density:<br />
<br />
1<br />
P(r)+jωμ0<br />
σ(r) − σ0<br />
G(r|r<br />
V<br />
′ )P(r ′ )dV ′ =<br />
= E i (r).<br />
Once the integral equation is solved, P(r) can be used<br />
to derive quantites that can be measured during the<br />
nondestructive test. In the illustrative cases that we will<br />
present in this paper, a probe coil is used for both the<br />
excitation <strong>of</strong> the field and the acquisition <strong>of</strong> the measured<br />
data via its complex impedance variation ΔZ. As a<br />
consequence <strong>of</strong> the reciprocity principle [10], ΔZ can<br />
be computed as<br />
ΔZ = − 1<br />
I2 <br />
E i (r) · P(r)dV, (4)<br />
V<br />
with I being the amplitude <strong>of</strong> the probe coil’s current.<br />
This decomposition (1) let E i (r) and G(r|r ′ ) be<br />
separately computed, which provides the well-known<br />
advantages from the viewpoint <strong>of</strong> numerical evaluation.<br />
Moreover, the formula (4) can also be easily evaluated.<br />
- 347 - 15th IGTE Symposium 2012<br />
(3)<br />
Let us also highlight that the volume integral method<br />
bears the potential pitfall <strong>of</strong> properly computing the<br />
Green’s function. Due to its singularity, a numerically<br />
stable expression <strong>of</strong> G(r|r ′ ) <strong>of</strong>ten requires special efforts,<br />
as it will be shown in Subsection III-C.<br />
III. SOLUTION OF THE INTEGRAL EQUATION<br />
A. The studied configuration<br />
We restrict our studies to a special, but practically<br />
important configuration, outlined in Fig. 1. The specimen<br />
is assumed to be a homogeneous conducting plate with<br />
a finite thickness. The dimensions <strong>of</strong> the plate in the x<br />
and y directions are assumed to be infinite. The flaw<br />
is <strong>of</strong> cuboid shape and it has four edges perpendicular<br />
to the plate surface. The flaw edges are A, B and D,<br />
respectively, and the volume V is defined as<br />
|x| ≤ A B<br />
, |y| ≤ and |z − C| ≤<br />
2 2<br />
D<br />
, (5)<br />
2<br />
where C is the center <strong>of</strong> the crack along z. The conductivity<br />
within the flaw volume V is known, σ(r), typically,<br />
σ(r) =0.<br />
r 1<br />
r2<br />
Coil<br />
x<br />
c<br />
Coil<br />
Plate<br />
z=C<br />
y<br />
c<br />
A<br />
y<br />
z<br />
B<br />
z=0 l<br />
A<br />
z=−d<br />
D<br />
Plate<br />
Flaw<br />
TOP VIEW<br />
d<br />
h<br />
SIDE VIEW<br />
Figure 1. The studied configuration. An air-cored pancake-type coil<br />
scans above the infinite plate near the flaw. For generality, a burried<br />
flaw is sketched, however, we deal with ID and OD flaws.<br />
The probe coil is actually an air-cored pancake-type<br />
probe, driven by a time-harmonic current. During the<br />
nondestructive test, the coil scans above the damaged<br />
zone and its impedance variation is measured at given<br />
coil positions.<br />
B. Global approximation <strong>of</strong> the current dipole density<br />
Let us approximate the solution P(r) <strong>of</strong> the integral<br />
equation (3) by means <strong>of</strong> a finite series. Let the basis<br />
x<br />
x
functions <strong>of</strong> this expansion be products <strong>of</strong> three factors,<br />
each depending only on one Cartesian coordinate:<br />
P(r) =<br />
M<br />
N<br />
Q<br />
m=−M n=−N q=−Q<br />
Pmnqf m x (x)f n y (y)f q z (z),<br />
(6)<br />
The key idea in this paper is the choice <strong>of</strong> the basis<br />
functions: in contrary with the classical schemes, herein<br />
each basis function is globally defined, i.e., all over the<br />
flaw volume V . Let us note that the special restrictions<br />
for the shape <strong>of</strong> the flaw are needed here. We propose<br />
the use <strong>of</strong> the following harmonic factors in the basis<br />
functions:<br />
f m <br />
1<br />
x (x) =<br />
A exp<br />
<br />
2πj mx<br />
<br />
,<br />
A<br />
f n <br />
1<br />
y (y) =<br />
B exp<br />
<br />
2πj ny<br />
<br />
,<br />
B<br />
f q <br />
1<br />
z (z) =<br />
D exp<br />
(7)<br />
<br />
<br />
q(z − C)<br />
2πj .<br />
D<br />
In fact, this leads to a three-dimensional complex Fourierseries;<br />
the integers m, n and q are the harmonic orders.<br />
The basis functions form an orthonormal set with respect<br />
to the scalar product:<br />
<br />
<br />
g(r) ,h(r) := g(r)h ⋆ (r)dV. (8)<br />
Let us notice that our choice for the basis functions<br />
provides a smooth approximation <strong>of</strong> P, instead <strong>of</strong> the<br />
piecewise constant approximation discussed in [1]. In<br />
Section IV, the advantages provided by the global expansion<br />
are discussed along with numerical examples.<br />
C. Discretisation by the Method <strong>of</strong> Moments; computation<br />
<strong>of</strong> the matrix elements<br />
The R =(2M +1)(2N+1)(2Q+1) unknown vectorial<br />
coefficients Pmnq in the series (6) are determined by<br />
means <strong>of</strong> the Method <strong>of</strong> Moments. The testing functions<br />
are the same as the basis functions (Galerkin-method) and<br />
a linear system <strong>of</strong> R vectorial equations is obtained. For a<br />
handy formalization, let the basis functions be ordered so<br />
as each triplet <strong>of</strong> harmonic orders (m, n, q) has a unique<br />
index k (1 ≥ k ≥ R) and denote the kth basis function<br />
as<br />
wk(x, y, z) =f m x (x)f n y (y)f q z (z). (9)<br />
The elements in the system matrix <strong>of</strong> the linear equations<br />
are in the form<br />
a λκ<br />
lk =(eλ · eκ) wl(r) ,wk(r)/(σ − σ0) +<br />
<br />
<br />
jωμ0 wl(r) , eλ · G(r|r<br />
V<br />
′ )(eκwk(r ′ ))dV ′ ,<br />
(10)<br />
where l and k are the indices <strong>of</strong> the test and basis<br />
functions (l, k =1, 2,...,R), eλ and eκ are the unitvectors<br />
(λ, κ = x, y, z) and , stands for the scalar<br />
product.<br />
V<br />
- 348 - 15th IGTE Symposium 2012<br />
The evaluation <strong>of</strong> the integral with respect to r ′ dV ′<br />
needs special numerical treatment due to the singularity<br />
<strong>of</strong> the Green’s function. However, in the case <strong>of</strong> the<br />
considered planar geometry and <strong>of</strong> the proposed basis<br />
functions, one can cope with the singular kernel by<br />
means <strong>of</strong> the spectral method, presented in detail in<br />
[11]. In brief, by using the 2-dimensional spatial Fouriertransform<br />
in the xy plane, the spectrum <strong>of</strong> the Green’s<br />
function can be represented as a sum <strong>of</strong> planar waves<br />
traveling along z. Due to the product-separation form (7)<br />
<strong>of</strong> the bais functions, the integral with respect to r ′ dV ′<br />
in (10) splits up to a factor depending only on z and z ′<br />
and to an other factor which is represented by its 2D<br />
Fourier-transform. The integral with respect to z ′ can<br />
be analytically evaluated in the spectral domain, and the<br />
remaining factor (to be inverse transformed) is no longer<br />
singular.<br />
As a useful consequence <strong>of</strong> the Galerkin-method,<br />
certain elements <strong>of</strong> the yielded system matrix must be<br />
the same. In (10), the volume integral with respect to<br />
rdV (due to the scalar product) and to r ′ dV ′ can be<br />
commuted. According to the reciprocity theorem, we<br />
have<br />
a λκ<br />
lk ≡ a κλ<br />
k ′ l ′, (11)<br />
for all k and l if wk(r) ≡ wk ′(r)⋆ and wl(r) ≡ wl ′(r)⋆<br />
hold, respectively. This equivalence can be applied to<br />
check the numerical computations and/or to reduce the<br />
computational load.<br />
Finally, let us notice that the presence <strong>of</strong> the probe coil<br />
is neglected in the expression <strong>of</strong> the Green’s function.<br />
This is usual and does not cause considerable error.<br />
D. Computation <strong>of</strong> the incident field<br />
The Ei (r) incident field can be analytically computed<br />
in the studied case. The pancake-type coil generates an<br />
axisymmetric field which depends only on z and r =<br />
(x − xc) 2 +(y − yc) 2 . This field can be expressed in<br />
the form <strong>of</strong> an integral <strong>of</strong> first-order Bessel-functions, as<br />
detailed in the classical work [12].<br />
More complicated probes (e.g., including ferrit core or<br />
having rectangular-shaped turns) can also be considered.<br />
However, in such cases, Ei (r) is obviously more difficult<br />
to compute (e.g., by a Finite Element Method).<br />
Once the incident field is obtained within the flaw<br />
volume V , the excitation vector <strong>of</strong> the linear system <strong>of</strong><br />
equations yielded by the MoM can be assembled from<br />
the entries<br />
b λ l = wl(r) , eλ · E i (r) , (12)<br />
where l =1, 2,...,Ris the index <strong>of</strong> testing function and<br />
eλ is the unit vector (λ = x, y, z). As a consequence <strong>of</strong><br />
the axial symmetry, bz l ≡ 0 holds for all l.<br />
E. Implementation issues<br />
The algorithms are coded in Matlab R○ . The spectral<br />
domain expression <strong>of</strong> the Green’s function is inverse<br />
transformed by a 2-dimensional Fast Fourer Transform
(FFT2) routine. The width <strong>of</strong> the FFT2’s spatial window<br />
in the xy plane is estimated from the skin depth within<br />
the conductive medium, whereas the spectral window is<br />
assigned with respect to the harmonic orders m and n,<br />
respectively.<br />
The integrals involved by the scalar products in (10)<br />
and (12) are evaluated numerically, based on a regular<br />
discretisation <strong>of</strong> the flaw volume. The number <strong>of</strong> samples<br />
along each axis is set with respect to the harmonic orders<br />
m, n and q <strong>of</strong> the basis and testing functions, respectively.<br />
IV. TEST CASES AND COMPARISONS<br />
In this section, the proposed method is illustrated<br />
and its main advantages are highlighted via numerical<br />
examples.<br />
A. Definition <strong>of</strong> the configurations<br />
The illustrative test cases are presented in Fig. 1. The<br />
air-filled rectangular flaw has constant zero conductivity.<br />
Though a buried flaw is outlined in the sketch, we present<br />
cases for ID-type (“inner defect”, opening to the top<br />
surface <strong>of</strong> the plate: C = −D/2) and OD-type (“outer<br />
defect”, C = −d + D/2) flaws only.<br />
Experimental data <strong>of</strong> the variation <strong>of</strong> the coil’s<br />
impedance (ΔZ) are available on the xc = 0 line in<br />
function <strong>of</strong> yc, at discrete coil positions. The cases #1<br />
and #2 are JSAEM Benchmarks [13], whereas case #3 is<br />
an also frequently cited TEAM Benchmark no. 15 [14].<br />
The parameters <strong>of</strong> each case are given in Tab. I.<br />
Table I<br />
NUMERICAL DESCRIPTION OF THE TEST CASES.(NOTATION IS<br />
ACCORDING TO FIG.1.)<br />
#1 #2 #3<br />
Specimen<br />
d (mm) 1.25 1.25 12.22<br />
σ0 (MS/m) 1 1 30.6<br />
Flaw<br />
A (mm) 0.21 0.21 0.28<br />
B (mm) 10 10 12.6<br />
D (mm) 0.75 0.5 5<br />
C (mm) −d + D/2 −D/2 −D/2<br />
Probe coil<br />
r1 (mm) 0.6 0.6 6.15<br />
r2 (mm) 1.6 1.6 12.4<br />
h (mm) 0.8 0.8 6.15<br />
l (mm) 0.5 1.0 0.88<br />
f (kHz) 150 300 0.9<br />
Turns 140 140 3790<br />
B. Convergence <strong>of</strong> the series<br />
One <strong>of</strong> the main advantages <strong>of</strong> the global expansion<br />
method is the easy access to the convergence with respect<br />
to the maximal harmonic orders <strong>of</strong> M, N and Q in the<br />
series <strong>of</strong> P. By adding further terms <strong>of</strong> higher orders,<br />
the previously computed elements <strong>of</strong> the system matrix<br />
remain unchanged. Consequently, convergence studies<br />
can be performed at a much lower computational cost<br />
than in the case <strong>of</strong> local basis functions (the latter needs<br />
a new grid whenever a finer discretisation is set).<br />
- 349 - 15th IGTE Symposium 2012<br />
We have computed ΔZ in the test case #1 using different<br />
MNQ maximal orders. The discrepancy between<br />
two impedance signals is expressed by the norm<br />
<br />
ΔZ := (1/K)ΣK k=1 |ΔZ(yc,k)| 2 , (13)<br />
where the number <strong>of</strong> coil positions is actually K =11<br />
and yc,k =(k−1) mm. In Fig. 2, the normalised discrepancy<br />
between the first impedance signal (MNQ = 121)<br />
and some others obtained by higher order approximations<br />
is shown. A fast convergence is experienced: e.g., there<br />
is ca. 5% discrepancy between the impedance signals<br />
computed with MNQ = 121 and with the higher orders<br />
MNQ = 163. Let us note that the variation <strong>of</strong> the current<br />
dipole density in the x-direction is smooth enough to be<br />
modeled by a first order Fourier series, i.e., the choice<br />
M =1seems to be appropriate.<br />
||ΔZ − ΔZ 121 || / ||ΔZ 121 ||<br />
0.08<br />
0.07<br />
0.06<br />
0.05<br />
0.04<br />
0.03<br />
0.02<br />
0.01<br />
0<br />
121<br />
131<br />
151161<br />
141<br />
122<br />
132<br />
152162<br />
142<br />
123<br />
133<br />
153163<br />
143<br />
Figure 2. Normalised discrepancy between impedance signals obtained<br />
by different maximal harmonic orders MNQ (marked above each bar)<br />
in test problem #1.<br />
The fast convergence is also reasoned by the behavior<br />
<strong>of</strong> the coefficients Pmnq. Again in the test case #1, we<br />
examined the coefficients <strong>of</strong> the x-directed current dipole<br />
density P x (r) (note that P x is much more dominant than<br />
P y and P z in this case) for a centered coil location (xc =<br />
yc =0). Some coefficients <strong>of</strong> the largest magnitude are<br />
plotted in Fig. 3. A fast decrease <strong>of</strong> the magnitudes is<br />
experienced as the harmonic orders increase.<br />
C. Comparison to exparimental data and to the ideally<br />
thin crack model<br />
The volumetric flaw model using global expansion<br />
is a sort <strong>of</strong> extension <strong>of</strong> the model proposed in [9].<br />
Therein, ideally thin cracks are considered and modeled<br />
by a surface layer <strong>of</strong> current dipole density. This can be<br />
imagined as if the A edge length <strong>of</strong> the crack (Fig. 1)<br />
would collapse to zero whereas the x component <strong>of</strong><br />
the total electric field E vanishes on the crack surface.<br />
For the surface current dipole density, certain boundary<br />
conditions must hold, thus, the basis functions in the
x<br />
Normalized |P |<br />
mnq<br />
1.2<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
0 1 0<br />
0 −1 0<br />
1 1 0<br />
−1 1 0<br />
1 −1 0<br />
−1 −1 0<br />
0 −2 0<br />
0 2 0<br />
1 −2 0<br />
−1 −2 0<br />
1 2 0<br />
−1 2 0<br />
0 3 0<br />
0 −3 0<br />
−1 1 1<br />
−1 −1 1<br />
1 1 1<br />
1 −1 1<br />
−1 1 −1<br />
−1 −1 −1<br />
1 1 −1<br />
1 −1 −1<br />
1 1 2<br />
−1 1 2<br />
1 −1 2<br />
−1 −1 2<br />
1 1 3<br />
1 −1 3<br />
−1 1 3<br />
−1 −1 3<br />
1 1 −2<br />
−1 1 −2<br />
1 −1 −2<br />
−1 −1 −2<br />
Figure 3. Normalized magnitudes <strong>of</strong> the coefficients <strong>of</strong> the x-directed<br />
current dipole density P x (r) in test problem #1. The 34 highest magnitudes<br />
are plotted (totally there are (2M + 1)(2N + 1)(2Q + 1) = 273<br />
coefficients, as M =1, N =6and Q =3is chosen); the triplets<br />
mnq (any <strong>of</strong> the indices can be negative as it can be seen) are marked<br />
above each bar.<br />
series expansion –e.g., for ID cracks– are products <strong>of</strong><br />
the following sine and cosine functions:<br />
g n <br />
2<br />
y (y) =<br />
B sin<br />
<br />
y + B/2<br />
nπ ,<br />
B<br />
g q <br />
2<br />
z (z) =<br />
D cos<br />
<br />
(2q − 1)π z<br />
(14)<br />
<br />
,<br />
2D<br />
with the integers n and q, as “harmonic orders”. Herein<br />
we do not deal with the surface model in detail, but<br />
only present some results for comparison, provided by<br />
the authors <strong>of</strong> [9].<br />
In Figs. 4, 5 and 6, comparisons <strong>of</strong> impedances (i)<br />
computed by our volumetric model, (ii) by the surface<br />
model and (iii) measured data (provided with the benchmarks)<br />
are presented. The comparisons let us conclude<br />
the followings:<br />
• The volumetric model can appropriately reconstruct<br />
the measured data at very low maximal harmonic<br />
orders. Let us note again that for instance, when<br />
MNQ = 122, we have only 75 (vectorial) unknowns<br />
in the series (6).<br />
• Though there is no straightforward connection between<br />
the harmonic orders <strong>of</strong> the volumetric and<br />
surface models, in the presented cases, the volumetric<br />
model provides better results in the sense <strong>of</strong> |ΔZ|<br />
than the surface model with more-or-less the same<br />
harmonic orders. In Figs. 4 and 5, the surface model<br />
appears to be unstable at NQ =44. (However, the<br />
surface model also has good convergence properties<br />
but at considerably higher N and Q which is not<br />
presented herein.)<br />
• The volumetric model tends to slightly overestimate<br />
|ΔZ| and underestimate the phase arg{ΔZ}.<br />
Whereas the phase error is acceptable in Figs. 4 and<br />
5, it becomes considerable in Fig. 6.<br />
- 350 - 15th IGTE Symposium 2012<br />
|ΔZ| (mΩ)<br />
arg{ΔZ} (rad)<br />
80<br />
60<br />
40<br />
20<br />
0<br />
1.5<br />
1<br />
0.5<br />
0<br />
JSAEM OD−60 Benchmark 150 kHz<br />
measured<br />
vol 011<br />
vol 122<br />
surf 44<br />
surf 66<br />
surf 88<br />
0 2 4 6 8 10<br />
Coil position y (mm)<br />
c<br />
Figure 4. Impedance variation in the test case #1. Legend: “vol”<br />
and “surf” refer to the volumetric and the surface model. The maximal<br />
harmonic orders MNQ and NQ used in the simulations are also given.<br />
|ΔZ| (mΩ)<br />
arg{ΔZ} (rad)<br />
150<br />
100<br />
50<br />
0<br />
3<br />
2<br />
1<br />
JSAEM ID−40 Benchmark 300 kHz<br />
measured<br />
vol 011<br />
vol 122<br />
surf 44<br />
surf 66<br />
surf 88<br />
0<br />
0 2 4 6 8 10<br />
Coil position y (mm)<br />
c<br />
Figure 5. Impedance variation in the test case #2. Legend notations<br />
are explained in Fig. 4.<br />
The computation times are quite short. The assemblation<br />
<strong>of</strong> the system matrix took, e.g., 285 s for the OD60<br />
flaw and 202 s for the ID40 flaw when a basis function<br />
set with maximal orders M =1, N =4, Q =8was<br />
used for both. Though the number <strong>of</strong> samples within the<br />
flaws for the numerical integration is the same in both<br />
cases, the OD60 flaw computation needs a wider spatial<br />
window for the FFT2 as the lower frequency yields a<br />
higher skin-depth.<br />
V. CONCLUSION AND PERSPECTIVES<br />
The classical integral equation models <strong>of</strong> volumetric<br />
flaws have been used for decades in the simulation<br />
<strong>of</strong> ECT. Though these schemes have many advantages,<br />
several bottlenecks are still present. In this paper, we<br />
proposed a new discretisation technique for the numerical<br />
solution <strong>of</strong> the volume integral equation. Instead <strong>of</strong> the<br />
locally defined, pulse basis functions, we use globally<br />
defined, harmonic basis functions for the expansion <strong>of</strong><br />
the unknown current dipole density distribution. Thanks
|ΔZ| (Ω)<br />
arg{ΔZ} (rad)<br />
20<br />
15<br />
10<br />
5<br />
0<br />
2<br />
1.5<br />
1<br />
0.5<br />
TEAM Benchmark<br />
measured<br />
vol 011<br />
vol 122<br />
surf 22<br />
surf 44<br />
surf 66<br />
0 5 10 15 20<br />
Coil position y (mm)<br />
c<br />
Figure 6. Impedance variation in the test case #3. Legend notations<br />
are explained in Fig. 4.<br />
to this choice, an improvement in the accuracy and<br />
performance <strong>of</strong> the simulation has been experienced in<br />
the test cases. The results obtained so far are promising,<br />
the research certainly needs to be continued, with special<br />
emphasis on the followings:<br />
• More parametric studies are needed for a various<br />
range <strong>of</strong> flaw sizes and frequencies to confirm that<br />
the new scheme outperforms the existing ones, and<br />
at the same time, to point out its limitations. We<br />
have not considered yet, for instance, through-plate<br />
flaws.<br />
• The method could easily be extended to the case<br />
<strong>of</strong> flaws embedded in thick plates (modeled as halfspace).<br />
The extension must be possible to the case<br />
<strong>of</strong> layered medium as well, which might be <strong>of</strong> more<br />
practical interest.<br />
<br />
• As the aspect ratio <strong>of</strong> the flaw (e.g., width length)<br />
•<br />
gets worse, the volumetric model is expected to become<br />
less accurate and the ideally thin crack model<br />
should be applied instead. However, the relation between<br />
the two models has not been exactly revealed<br />
yet. One expects the results <strong>of</strong> the volumetric model<br />
to converge to the results <strong>of</strong> the surface model as the<br />
width <strong>of</strong> the flaw collapses. This should be studied<br />
both in theoretical and in numerical senses as well.<br />
The inverse problem is <strong>of</strong>ten formalised as an<br />
optimisation task <strong>of</strong> minimizing the discrepancy<br />
between the measured and simulated data. The<br />
gradient-based schemes require the sensitivity data<br />
with respect to the parameters <strong>of</strong> the flaw. This<br />
sensitivity is accessible, e.g., via the the adjoint<br />
problem [15]. However, the numerical stability <strong>of</strong><br />
the gradient computation strongly depends on the<br />
precision <strong>of</strong> the EM field calculation near the boundaries<br />
<strong>of</strong> the flaw. The proposed global expansion<br />
<strong>of</strong> P could improve the precision in these cruical<br />
regions.<br />
- 351 - 15th IGTE Symposium 2012<br />
The authors think that the contribution <strong>of</strong> this paper<br />
can be <strong>of</strong> industrial interest as well, if the further numerical<br />
studies remain convincing about its performance.<br />
VI. ACKNOWLEDGEMENTS<br />
This research is supported by the Hungarian Science<br />
Research Fund (OTKA grant no. K105996).<br />
REFERENCES<br />
[1] J. R. Bowler, S. A. Jenkins, L. D. Sabbagh, and H. A. Sabbagh,<br />
“Eddy-current probe impedance due to a volumetric flaw,” Journal<br />
<strong>of</strong> Applied Physics, vol. 70, no. 3, pp. 1107 –1114, 1991.<br />
[2] R. F. Harrington, Field computation by moment methods.<br />
Macmillan, 1968.<br />
[3] CIVA. “CIVA: State <strong>of</strong> the art simulation platform for NDE”.<br />
[Online]. Available: http://www-civa.cea.fr<br />
[4] S. Bilicz, E. Vazquez, M. Lambert, S. Gyimóthy, and J. Pávó,<br />
“Characterization <strong>of</strong> a 3D defect using the expected improvement<br />
algorithm,” COMPEL: The International Journal for Computation<br />
and Mathematics in Electrical and Electronic Engineering,<br />
vol. 28, no. 4, pp. 851–864, 2009.<br />
[5] C. Reboud, D. Prémel, D. Lesselier, and B. Bisiaux, “New discretisation<br />
scheme based on splines for volume integral method:<br />
Application to eddy current testing <strong>of</strong> tubes,” COMPEL: The<br />
International Journal for Computation and Mathematics in Electrical<br />
and Electronic Engineering, vol. 27, no. 1, pp. 288–297,<br />
2008.<br />
[6] Z. Badics, Y. Matsumoto, K. Aoki, F. Nakayasu, M. Uesaka, and<br />
K. Miya, “Accurate probe-response calculation in eddy current<br />
NDE by finite element method,” Journal <strong>of</strong> Nondestructive<br />
Evaluation, vol. 14, pp. 181–192, 1995. [Online]. Available:<br />
http://dx.doi.org/10.1007/BF00730888<br />
[7] Y. Le Bihan, J. Pavo, M. Bensetti, and C. Marchand, “Computational<br />
environment for the fast calculation <strong>of</strong> ect probe signal by<br />
field decomposition,” Magnetics, IEEE Transactions on, vol. 42,<br />
no. 4, pp. 1411 –1414, 2006.<br />
[8] J. R. Bowler, “Eddy-current interaction with an ideal crack. I. The<br />
forward problem,” Journal <strong>of</strong> Applied Physics, vol. 75, no. 12, pp.<br />
8128–8137, 1994.<br />
[9] J. Pávó and D. Lesselier, “Calculation <strong>of</strong> eddy current testing<br />
probe signal with global approximation,” IEEE Transactions on<br />
Magnetics, vol. 42, no. 4, pp. 1419–1422, 2006.<br />
[10] R. F. Harrington, Time-harmonic electromagnetic fields.<br />
McGraw-Hill, 1961.<br />
[11] J. Pávó and K. Miya, “Reconstruction <strong>of</strong> crack shape by optimization<br />
using eddy current field measurement,” IEEE Transactions on<br />
Magnetics, vol. 30, no. 5, pp. 3407–3410, 1994.<br />
[12] C. V. Dodd and W. E. Deeds, “Analytical solutions to eddy-current<br />
probe-coil problems,” Journal <strong>of</strong> Applied Physics, vol. 39, no. 6,<br />
pp. 2829–2838, 1968.<br />
[13] T. Takagi, M. Uesaka, and K. Miya, “Electromagnetic NDE<br />
research activities in JSAEM,” in Electromagnetic Nondestructive<br />
Evaluation, ser. Studies in Applied Electromagnetics and Mechanics,<br />
T. Takagi, J. R. Bowler, and Y. Yoshida, Eds. IOS Press,<br />
1997, vol. 1, pp. 9–16.<br />
[14] T.E.A.M. Benchmark Problems. Accessed on 7.08.2012. [Online].<br />
Available: http://www.compumag.org/jsite/team.html<br />
[15] S. J. Norton and J. R. Bowler, “Theory <strong>of</strong> eddy current inversion,”<br />
Journal <strong>of</strong> Applied Physics, vol. 73, no. 2, pp. 501–512, 1993.
- 352 - 15th IGTE Symposium 2012<br />
Computation <strong>of</strong> the Motion <strong>of</strong> Conducting Bodies<br />
Using the Eddy-Current Integral Equation<br />
*Mihai Maricaru, † Ioan R. Ciric, *Horia Gavrila, *George-Marian Vasilescu and *Florea I. Hantila<br />
*Department <strong>of</strong> Electrical Engineering, Politehnica <strong>University</strong> <strong>of</strong> Bucharest, Spl. Independentei 313,<br />
Bucharest, 060042, Romania, E-mail: mihai.maricaru@upb.ro<br />
† Department <strong>of</strong> Electrical and Computer Engineering, The <strong>University</strong> <strong>of</strong> Manitoba, Winnipeg, MB R3T 5V6, Canada<br />
Abstract—The analysis <strong>of</strong> the motion <strong>of</strong> a system <strong>of</strong> solid conductors in the presence <strong>of</strong> magnetic fields is performed by<br />
solving the classical mechanics equation <strong>of</strong> motion under the action <strong>of</strong> magnetic forces. Application <strong>of</strong> the eddy-current<br />
integral equation and the usage <strong>of</strong> the local coordinates attached to the bodies in motion allow the determination <strong>of</strong><br />
electromagnetic field without being necessary to reconstruct the discretization grid at each new position <strong>of</strong> the conducting<br />
bodies. Only the submatrices associated with the coupling between the bodies in relative motion are modified in the global<br />
system matrix. A time-domain method <strong>of</strong> solution is first presented for the electromagnetic field problem, coupled with the<br />
equation <strong>of</strong> motion, which can be efficiently applied at high frequencies when the time steps are small. The eddy-current<br />
integral equation for the derivative <strong>of</strong> current density contains a term that takes into account the relative motion <strong>of</strong> the<br />
bodies. Since the electromagnetic quantities vary much more rapidly than the mechanical quantities, a second method is also<br />
proposed in this paper, where the eddy-current integral equation is solved in the frequency domain by assuming that the<br />
bodies are motionless, but by adding supplementary terms due to the actual motion <strong>of</strong> the bodies. Thus, only the average<br />
force over a period <strong>of</strong> time is now computed. This method is extremely efficient especially at higher frequencies when the<br />
time steps are very small.<br />
Index Terms—eddy-current integral equation, electrodynamics <strong>of</strong> moving conductors, levitation.<br />
I. INTRODUCTION<br />
The equation <strong>of</strong> translational motion <strong>of</strong> a solid<br />
conducting body <strong>of</strong> mass m under the action <strong>of</strong> the<br />
magnetic force F is<br />
2<br />
d r dr<br />
m F(<br />
r,<br />
, )<br />
G<br />
(1)<br />
2<br />
dt dt<br />
where r is the position vector <strong>of</strong> a point <strong>of</strong> the body, for<br />
instance <strong>of</strong> its center <strong>of</strong> gravity, is a vector<br />
representing the imposed current distribution and G is the<br />
gravitational force acting on the body. Equation (1) is<br />
discretized in time and F is determined at each time step<br />
by solving an electromagnetic field problem in the region<br />
with moving bodies. The application <strong>of</strong> the Finite<br />
Element Method requires a tremendous amount <strong>of</strong><br />
computation since it is necessary to reconstruct the<br />
discretization mesh at each time step. Moreover, the<br />
modifications <strong>of</strong> the discretization mesh are, usually,<br />
accompanied by undesired cumulative errors in the<br />
successive solutions <strong>of</strong> the electromagnetic field. A<br />
substantial improvement can be achieved when adopting<br />
hybrid Finite Element – Boundary Element Methods [1].<br />
Using the “laboratory” frame <strong>of</strong> references complicates<br />
the field problem solution due to the presence <strong>of</strong> the<br />
motional electric field intensity v0 B , where v 0 is the<br />
body velocity in this frame <strong>of</strong> references and B the<br />
magnetic induction. This disadvantage is eliminated<br />
when employing local frames <strong>of</strong> reference, attached to<br />
the bodies in motion [1], [2]. This also allows the usage<br />
<strong>of</strong> the simpler eddy-current integral equation for the<br />
bodies at rest, as it has been done in the case the<br />
velocities <strong>of</strong> the bodies are known [2]. However, in many<br />
situations the velocities <strong>of</strong> the bodies are not known, as,<br />
for instance, in the case <strong>of</strong> the electromagnetic levitation,<br />
their determination constituting one <strong>of</strong> the objectives <strong>of</strong><br />
the present work.<br />
In the case <strong>of</strong> the electromagnetic levitation, to ensure<br />
the stability <strong>of</strong> the solution it is necessary to choose a<br />
sufficiently small time step. Since for same accuracy <strong>of</strong><br />
the results the time period has to be divided practically in<br />
the same number <strong>of</strong> intervals (for example, at 50 Hz in<br />
200 intervals [1]), at higher frequencies the time step<br />
decreases. Unfortunately, as the time step decreases, the<br />
successively computed solutions tend to be very close to<br />
each other and the errors in the solution differences<br />
increase considerably, the computation procedure<br />
becoming inefficient.<br />
In the present paper, a new procedure is described for<br />
the time-domain solution <strong>of</strong> the eddy-current integral<br />
equation applicable to small time steps. As well, a<br />
technique is proposed for accelerating the determination<br />
<strong>of</strong> the trajectory <strong>of</strong> the moving bodies, based on the<br />
frequency-domain solution <strong>of</strong> the eddy-current integral<br />
equation.<br />
II. TIME-DOMAIN SOLUTION OF THE EDDY-CURRENT<br />
INTEGRAL EQUATION<br />
For two-dimensional field problems, the time-domain<br />
eddy-current integral equation for motionless conductors<br />
is<br />
<br />
d<br />
1<br />
J ( r, t)<br />
J ( r',<br />
t)<br />
ln dS'<br />
dt R<br />
<br />
d<br />
1<br />
<br />
Ji<br />
(r',<br />
t)<br />
ln dS'<br />
(2)<br />
dt<br />
R<br />
i<br />
where r and r ' are the position vectors <strong>of</strong> the observation<br />
point and <strong>of</strong> the source point, respectively, is the
egion containing the solid conductors, i is the region<br />
where the imposed current density J i is confined,<br />
0<br />
R |<br />
r r'|<br />
, , 0 being the permeability <strong>of</strong> free<br />
2<br />
space.<br />
In the three-dimensional case, the eddy-current integral<br />
equation has the form<br />
d J(<br />
r',<br />
t)<br />
J(<br />
r,<br />
t)<br />
dV '<br />
2 dt R<br />
<br />
d Ji<br />
( r',<br />
t)<br />
dV ' grad <br />
2 dt R<br />
<br />
i<br />
where is the electric scalar potential.<br />
To simplify the formulation, we consider here a<br />
two-dimensional structure with a single solid conductor.<br />
Using a frame <strong>of</strong> reference attached to the conducting<br />
body in motion, the time discretization <strong>of</strong> (3) leads to<br />
t<br />
1<br />
( J J0<br />
) ( J1<br />
J0<br />
) ln dS '<br />
2<br />
R<br />
<br />
1 <br />
- 353 - 15th IGTE Symposium 2012<br />
(3)<br />
1<br />
1<br />
J0t Ji<br />
ln dS'<br />
ln '<br />
1 Ji<br />
dS (4)<br />
0<br />
R<br />
R<br />
<br />
i1<br />
where the subscript “0” indicates the time t and the<br />
subscript “1” the time t t<br />
. Dividing (4) by t yields<br />
<br />
i0<br />
J<br />
t<br />
J<br />
1<br />
ln dS'<br />
J<br />
t<br />
1 2 t<br />
1 R<br />
2<br />
<br />
2<br />
Ji<br />
1<br />
1<br />
ln ' ( ) 1 dS <br />
ln '<br />
1 n v<br />
Ji<br />
dl (5)<br />
t<br />
R<br />
2 R<br />
i<br />
1<br />
2<br />
2<br />
i<br />
1 t<br />
where the subscript “ ” refers to the time t<br />
2<br />
2<br />
<br />
, i<br />
is the boundary <strong>of</strong> i , v is the velocity <strong>of</strong> the i in the<br />
frame <strong>of</strong> references attached to , and n is the outward<br />
unit vector normal to i<br />
. The last term in (5) is due to<br />
the relative motion <strong>of</strong> and i . Solution <strong>of</strong> (5) gives<br />
the current distribution J 1 at the time step t t<br />
in<br />
terms <strong>of</strong> that at the time step t in the form<br />
1<br />
2<br />
J<br />
<br />
J1 t<br />
J<br />
1<br />
0<br />
t<br />
<br />
2<br />
. (6)<br />
The magnetic force is evaluated by applying Ampère’s<br />
force formula, i.e.,<br />
r ri<br />
F <br />
J<br />
( r,<br />
t)<br />
Ji<br />
( ri<br />
, t)<br />
dSi<br />
dS (7)<br />
2<br />
| r r |<br />
<br />
i<br />
i<br />
0<br />
with r and r i being the position vectors <strong>of</strong> the points <strong>of</strong><br />
and i , respectively.<br />
The spatial discretization grid in the two-dimensional<br />
case is constructed by dividing the region into<br />
polygonal surface elements m , with the induced current<br />
density considered to be constant through each m . The<br />
region i is divided into surface elements i , with the<br />
k<br />
imposed current density being constant through each<br />
i . Integrating (5) over each <br />
k<br />
m yields the following<br />
J<br />
<br />
matrix equation for the vector :<br />
t<br />
t<br />
<br />
J<br />
<br />
J<br />
A B AJ<br />
i <br />
<br />
0 Bi<br />
C i<br />
2 <br />
t<br />
<br />
t<br />
<br />
1<br />
2<br />
1<br />
2<br />
1<br />
2<br />
J1 2<br />
where A is a diagonal matrix with entries Am mSm<br />
,<br />
m being the resistivity <strong>of</strong> the material for m and S m<br />
its area, and B is a symmetric matrix with its entries<br />
corresponding to the elements m <strong>of</strong> having the form<br />
B<br />
<br />
<br />
1<br />
ln dS<br />
dS<br />
S mS<br />
m,<br />
k<br />
k m = k<br />
R<br />
mk <br />
4<br />
1 2<br />
<br />
<br />
m k<br />
(8)<br />
1<br />
( n m nk<br />
) R ln dlkdlm<br />
. (9)<br />
R<br />
The entries <strong>of</strong> the matrix B i are defined as in (9), but<br />
with the elements k <strong>of</strong> being replaced with the<br />
elements i belonging to <br />
k<br />
i , while the entries <strong>of</strong> the<br />
matrix C are<br />
C<br />
m,<br />
i<br />
k<br />
<br />
1<br />
2<br />
<br />
<br />
m ik<br />
1<br />
( n m R)(<br />
ni<br />
v)<br />
ln dl<br />
k<br />
i dl<br />
k m . (10)<br />
R<br />
All integrals in (9) and (10) are evaluated by analytic<br />
expressions, the entries <strong>of</strong> the matrix B being calculated<br />
only once, but those <strong>of</strong> the matrices B i and C are to be<br />
calculated for each new position <strong>of</strong> i .<br />
Taking into account the small dimensions <strong>of</strong> the<br />
elements m , a rapid numerical computation <strong>of</strong> the force<br />
in (7) is performed using the approximation<br />
where the vector<br />
Jm<br />
F Sm Pi<br />
(11)<br />
k<br />
m k<br />
P i is expressed in the form<br />
k<br />
1<br />
P i n<br />
k<br />
i ln dl<br />
k<br />
i (12)<br />
k<br />
| rm<br />
ri<br />
|<br />
Jik <br />
ik<br />
k
with r m being the position vector <strong>of</strong> the center <strong>of</strong> the<br />
element m <strong>of</strong> and r i the position vector <strong>of</strong> the<br />
k<br />
point <strong>of</strong> integration on i<br />
. When the ratio <strong>of</strong> the linear<br />
k<br />
dimensions <strong>of</strong> i to the distance between its center and<br />
k<br />
the center <strong>of</strong> m is sufficiently small, P i can be<br />
k<br />
calculated by subdividing i in a number <strong>of</strong> elements<br />
k<br />
p in terms <strong>of</strong> this ratio and by using the summation<br />
k<br />
rm<br />
rpk<br />
P i J<br />
k ik<br />
S p (13)<br />
2 k<br />
p | rm<br />
rp<br />
|<br />
k<br />
where r p is the position vector <strong>of</strong> the center <strong>of</strong> p<br />
k<br />
k and<br />
S p the area <strong>of</strong> the element p<br />
k<br />
k . The same technique is<br />
applied for a rapid numerical calculation <strong>of</strong> the entries in<br />
the matrices B i and C in (8), making also use <strong>of</strong> the<br />
relation<br />
1 R<br />
( n i v)<br />
ln dl'<br />
v <br />
k dS'<br />
. (14)<br />
R<br />
2<br />
<br />
R<br />
i<br />
i<br />
k<br />
In the case the imposed currents are periodic, the initial<br />
distribution <strong>of</strong> the induced current can be obtained by<br />
performing a Fourier expansion and by employing the<br />
phasor form <strong>of</strong> the eddy-current integral equation (see<br />
Section IV).<br />
III. SOLUTION OF EQUATION OF MOTION<br />
Equation (1) is solved iteratively. We choose an<br />
appropiate time step t and assume that the magnetic<br />
force F has a linear variation during t . At the time t the<br />
body has a position defined by the vector r 0 and a<br />
magnetic force F 0 is exerted upon it. The iterative<br />
process is started by imposing the value F1 F0<br />
at the<br />
time t t<br />
and the position vector r 1 results from<br />
solving (1). The electromagnetic field problem is then<br />
solved for the new r 1 and a new value <strong>of</strong> the force F 1 is<br />
determined for the time t t<br />
. This operation is repeated<br />
until the difference between two successive values <strong>of</strong> the<br />
magnetic force for the time t t<br />
is sufficiently small<br />
and, then, we proceed to the next time step.<br />
IV. FREQUENCY-DOMAIN SOLUTION OF THE<br />
EDDY-CURRENT INTEGRAL EQUATION<br />
Since the region i is moving with the velocity v in<br />
the frame <strong>of</strong> reference attached to , (2) is written in the<br />
form<br />
J<br />
( r',<br />
t)<br />
1 Ji<br />
( r',<br />
t)<br />
1<br />
J ( r, t)<br />
ln dS'<br />
<br />
ln dS'<br />
t R<br />
t<br />
R<br />
<br />
i<br />
k<br />
1<br />
( n v)<br />
Ji ( r',<br />
t)<br />
ln dl'<br />
. (15)<br />
R<br />
i<br />
k<br />
- 354 - 15th IGTE Symposium 2012<br />
If the imposed currents are sinusoidal, the phasor<br />
representation <strong>of</strong> (15) is<br />
1<br />
1<br />
J ( ) j J ( ')<br />
ln dS'<br />
j J i ( ')<br />
ln dS'<br />
R<br />
R<br />
<br />
<br />
r<br />
r r <br />
<br />
1<br />
( n v)<br />
J i ( r')<br />
ln dl'<br />
(16)<br />
R<br />
<br />
i<br />
where 2<br />
f<br />
, f being the frequency, and<br />
re im<br />
J J jJ<br />
is the phasor form <strong>of</strong> the current density,<br />
with j 1<br />
. The two terms on the right side <strong>of</strong> (16)<br />
show the contribution to the induced current density due<br />
to the time variation <strong>of</strong> the imposed currents and that due<br />
to the relative motion. The same technique as in the case<br />
<strong>of</strong> the time-domain analysis is used for the space<br />
discretization <strong>of</strong> (16). One obtains the following<br />
algebraic system with complex coefficients:<br />
re im im re<br />
AJ BJ Bi<br />
Ji<br />
CJi<br />
im re re im<br />
AJ BJ Bi<br />
Ji<br />
CJi<br />
. (17)<br />
The average magnetic force over a period is evaluated<br />
using the relation<br />
<br />
<br />
* r ri<br />
<br />
Fav <br />
ReJ<br />
( r)<br />
J i ( ri<br />
) dSi<br />
dS<br />
2 (18)<br />
<br />
<br />
| r ri<br />
|<br />
i<br />
<br />
where the asterisk indicates the complex conjugate.<br />
For a multiple-conductor systems, one uses local<br />
frames <strong>of</strong> reference attached to each <strong>of</strong> the conductors.<br />
For the conductor q, occupying the region ,<br />
q 1,<br />
2,<br />
,<br />
(16) is written in the form<br />
i<br />
q<br />
1<br />
1<br />
J ( r) j<br />
J ( r')<br />
ln dS'<br />
j<br />
J ( r')<br />
ln dS'<br />
R<br />
R<br />
<br />
pq<br />
<br />
q<br />
( q)<br />
1<br />
1<br />
( n v p ) J ( r')<br />
ln dl'<br />
j<br />
J i ( r'<br />
) ln dS'<br />
R<br />
R<br />
pq<br />
<br />
p<br />
i<br />
( q)<br />
1<br />
( n<br />
vi<br />
) J i ( r')<br />
ln dl'<br />
(19)<br />
R<br />
<br />
i<br />
(q)<br />
(q)<br />
where v p and v i are, respectively, the velocities <strong>of</strong><br />
the conductor p and <strong>of</strong> i with respect to the conductor<br />
q. Equation (1) is always solved separately for each body.<br />
V. SOLUTION ACCELERATION FOR THE EQUATION OF<br />
MOTION<br />
The computation <strong>of</strong> the motion <strong>of</strong> conducting bodies<br />
can be spectacularly accelerated by using the average<br />
value <strong>of</strong> the force over a period, evaluated using the<br />
p
Figure 1: Discretization <strong>of</strong> the levitated plate.<br />
y (m)<br />
0.06<br />
0.05<br />
0.04<br />
0.03<br />
0.02<br />
0.01<br />
0.00<br />
0 1 2 3 4 5 6 7 8 9 10 11 12 13<br />
t (s)<br />
Figure 2: Evolution in time <strong>of</strong> the coordinate y <strong>of</strong> the plate for<br />
f = 2,000 Hz.<br />
y (m)<br />
0.06<br />
0.05<br />
0.04<br />
0.03<br />
0.02<br />
0.01<br />
0.00<br />
0.0 0.1 0.2 0.3 0.4 0.5<br />
t (s)<br />
Figure 3: Detail regarding the motion at the beginning for f = 2,000 Hz.<br />
phasor representation <strong>of</strong> current density. Using the<br />
algorithm described in Section III, the time step is chosen<br />
to be a multiple <strong>of</strong> the period and is adjusted according to<br />
the force value, such that when the force decreases the<br />
time step is increased and when the force increases it is<br />
reduced.<br />
VI. ILLUSTRATIVE EXAMPLE<br />
A copper plate <strong>of</strong> width 80 mm, thickness 4 mm (see<br />
8<br />
Fig. 1), resistivity 210<br />
m<br />
and <strong>of</strong> mass density<br />
3 3<br />
8. 9<br />
10 kg / m is levitated using two coils <strong>of</strong> 200 turns<br />
each, <strong>of</strong> 10 mm 10 mm in cross section and a distance<br />
between the axes <strong>of</strong> their sides <strong>of</strong> 70 mm and 30 mm,<br />
respectively. The current direction is the same in the<br />
outer and inner coils, the current intensity in each turn<br />
being i I 2 sin 2ft<br />
, with I = 10 A and f = 2,000 Hz.<br />
- 355 - 15th IGTE Symposium 2012<br />
y<br />
y (m)<br />
0.050<br />
0.045<br />
0.040<br />
0.035<br />
0.030<br />
0.025<br />
0.020<br />
0.015<br />
0.010<br />
0.005<br />
0.000<br />
0 1 2 3 4 5 6 7<br />
t (s)<br />
Figure 4: Evolution in time <strong>of</strong> the coordinate y <strong>of</strong> the plate for<br />
f = 2,000 Hz, with a direct current <strong>of</strong> 10 A added in the outer coil.<br />
y (m)<br />
0.045<br />
0.040<br />
0.035<br />
0.030<br />
0.025<br />
0.020<br />
0.015<br />
0.010<br />
0.005<br />
0.000<br />
0 1 2 3 4 5 6 7 8 9 10 11 12 13<br />
t (s)<br />
Figure 5: Evolution in time <strong>of</strong> the coordinate y <strong>of</strong> the plate for<br />
f = 200 Hz.<br />
Initially, the conducting plate is located at 10 mm<br />
above the coils. It is assumed that the plate only moves in<br />
the vertical direction, but the procedures described in the<br />
paper are also applicable when more degrees <strong>of</strong> freedom<br />
are considered. The plate cross section is discretized in<br />
180 rectangular elements, as indicated in Fig. 1.<br />
For the time-domain method presented in this paper,<br />
the period was divided in 48 intervals and the motion <strong>of</strong><br />
the plate was observed during 26,000 periods, i.e. for<br />
1,248,000 time steps. The result is presented in Fig. 2,<br />
with the detailed motion at the beginning shown in Fig. 3.<br />
The computation took about 6 hours employing a 2.128<br />
GHz Intel processor notebook. A great reduction in the<br />
amount <strong>of</strong> computation is obtained by approximating the<br />
conducting region with a thin strip <strong>of</strong> thickness equal to<br />
the field depth <strong>of</strong> penetration [3]. The oscillations <strong>of</strong> the<br />
plate can be attenuated by adding a dc component in the<br />
current coils or by using a permanent magnet. If a direct<br />
current <strong>of</strong> 10 A is added in the outer coil, the plate<br />
motion becomes as it is shown in Fig. 4.<br />
For a frequency <strong>of</strong> 200 Hz, the motion <strong>of</strong> the plate is<br />
shown in Fig. 5, the attenuation <strong>of</strong> the mechanical<br />
oscillations being much stronger than for a frequency <strong>of</strong><br />
2,000 Hz.<br />
It should be remarked that the same results in Figs. 2<br />
and 3 were obtained by using the proposed<br />
frequency-domain procedure (see Sections IV and V).<br />
Only 4,393 variable time steps, <strong>of</strong> magnitude between a<br />
period and 50 periods, were necessary for determining<br />
the motion <strong>of</strong> the plate between t = 0 and t = 13 s. The
equired computation time was only 124 s, i.e., about 170<br />
times less than for the time-domain solution.<br />
VII. CONCLUSION AND REMARKS<br />
Two efficient methods are presented for computing the<br />
motion <strong>of</strong> the solid conductors under the action <strong>of</strong><br />
electromagnetic forces. Practically, for the same accuracy<br />
<strong>of</strong> the results, a tremendous reduction in the amount <strong>of</strong><br />
computation is achieved when using a frequency-domain<br />
procedure.<br />
The proposed methods can be extended to nonlinear<br />
media. For the time-domain procedure one can utilize the<br />
polarization method [4], which allows the formulation <strong>of</strong><br />
the eddy-current integral equation [2]. For the<br />
frequency-domain procedure, one can adopt the method<br />
proposed in [5], [6]. Since in some problems, for instance<br />
in electromagnetic levitation problems, the air regions are<br />
relatively large with respect to the conducting or/and<br />
ferromagnetic regions, the weight <strong>of</strong> the fundamental<br />
harmonic in the harmonic spectrum is significant and,<br />
thus, for the convergence acceleration in the polarization<br />
method one can efficiently employ the technique<br />
proposed in [7].<br />
The methods presented can be applied to<br />
three-dimensional structures as well, by adopting the<br />
eddy-current integral equation proposed in [8] and<br />
extended in [2] to nonlinear media and to moving bodies.<br />
In this case, the spatial discretization <strong>of</strong> (16) is done by<br />
decomposing the induced current density using functions<br />
<strong>of</strong> the form W<br />
, where W are edge elements. When<br />
edge elements <strong>of</strong> the first order are used, then the current<br />
density is constant inside the tetrahedral volume elements<br />
and the relations presented in this paper remain valid.<br />
Now, the integrals similar to those in (9) and (10) can<br />
only partially be evaluated analytically.<br />
Finally, it should be remarked that the procedures in<br />
- 356 - 15th IGTE Symposium 2012<br />
[2], [5], [6] and [7] allow the extension to nonlinear<br />
media <strong>of</strong> the proposed frequency-domain technique<br />
which, as illustrated in this paper, could yield a<br />
spectacular reduction in the amount <strong>of</strong> computation<br />
needed.<br />
ACKNOWLEDGMENT<br />
This work was supported in part by the Romanian<br />
Ministry <strong>of</strong> Labour, Family and Social Protection through<br />
the Financial Agreement POSDRU/89/1.5/S/62557 and<br />
by a grant <strong>of</strong> the Romanian National Authority <strong>of</strong><br />
Scientific Research, CNDI-UEFISCDI, project number<br />
PN-II-PT-PCCA-2011-3.2-0373.<br />
REFERENCES<br />
[1] S. Kurz, J. Fetzer, G. Lehner, and W. M. Rucker, “A novel<br />
formulation for 3D eddy current problems with moving bodies<br />
using a Lagrangian description and BEM-FEM coupling,” IEEE<br />
Trans. Magn., vol. 34, no. 5, pp. 3068-3073, Sep. 1998.<br />
[2] R. Albanese, F. Hantila, G. Preda, and G. Rubinacci, “Integral<br />
formulation for 3-D eddy current computation in ferromagnetic<br />
moving bodies,” Rev. Roum. Sci. Techn., Electrotechn. et Energ.,<br />
vol. 41, no. 4, pp. 421-429, 1996.<br />
[3] I. R. Ciric, F. I. Hantila, and M. Maricaru, “Field analysis for thin<br />
shields in the presence <strong>of</strong> ferromagnetic bodies,” IEEE Trans.<br />
Magn., vol. 46, no. 8, pp. 3373-3376, Aug. 2010.<br />
[4] F. Hantila, “A method <strong>of</strong> solving magnetic field in nonlinear<br />
media,” Rev. Roum. Sci. Techn., Electrotechn. et Energ., vol. 20,<br />
no. 3, pp. 397-407, 1975.<br />
[5] I. R. Ciric and F. I. Hantila, “An efficient harmonic method for<br />
solving nonlinear time-periodic eddy-current problems,” IEEE<br />
Trans. Magn., vol. 43, no. 4, pp. 1185-1188, Apr. 2007.<br />
[6] I. R. Ciric, F. I. Hantila, M. Maricaru, and S. Marinescu, “Efficient<br />
analysis <strong>of</strong> the solidification <strong>of</strong> moving ferromagnetic bodies with<br />
eddy-current control,” IEEE Trans. Magn., vol. 45, no. 3, pp.<br />
1238-1241, Mar. 2009.<br />
[7] I. R. Ciric, F. I. Hantila, and M. Maricaru, “Convergence<br />
acceleration in the polarization method for nonlinear periodic<br />
fields,” COMPEL, vol. 30, no. 6, pp. 1688-1700, 2011.<br />
[8] R. Albanese and G. Rubinacci, “Integral formulation for 3D<br />
eddy-current computation using edge elements,” IEE <strong>Proceedings</strong><br />
A , vol.135, no.7, pp.457-462, Sep. 1988.
- 357 - 15th IGTE Symposium 2012<br />
Adaptive Inductance Computation on GPUs<br />
A.G. Chiariello, A. Formisano and R. Martone<br />
Dipartimento di Ingegneria Industriale e dell’Informazione<br />
Seconda Università di Napoli, Via Roma 29, Aversa (CE), Italy<br />
E-mail: Alessandro.Formisano@Unina2.it<br />
Abstract—Inductances computation involving highly complex geometries and linear materials can be tackled by discretizing<br />
coils into simpler elements, whose magnetic behavior is analytically expressible but, to achieve reliable results, very high<br />
numbers <strong>of</strong> elements may be required. In such cases, advantages can be taken from GPU capabilities <strong>of</strong> dealing efficiently<br />
with simple computational tasks. In the paper, a code able to compute self and mutual inductances <strong>of</strong> any 3D coils, taking<br />
advantage <strong>of</strong> GPU capabilities, is presented.<br />
Index Terms— GPU, High Performance Computing, Inductance Computation<br />
I. INTRODUCTION<br />
The computation <strong>of</strong> self and mutual inductances for<br />
complex 3D shaped coils is a demanding task, since no<br />
general formulas exist. Numerical computations, in order<br />
to achieve reliable results, require large discretization<br />
efforts and, in general, multiple runs. As a consequence,<br />
in a number <strong>of</strong> practical cases the computer burden could<br />
be very high.<br />
Accuracy for generally shaped coils and computational<br />
promptness represent conflicting objectives, especially in<br />
optimal design [1, 2], and various computational<br />
paradigms have been proposed to achieve reasonable<br />
trade-<strong>of</strong>fs. One <strong>of</strong> the most promising approaches is the<br />
adoption <strong>of</strong> High-Performance Computing (HPC)<br />
architectures; among HPC approaches, an effective<br />
solution is the use <strong>of</strong> Graphic Processing Units (GPU),<br />
easily available even on desktop class hardware.<br />
On the other hand, in order to exploit at their best these<br />
peculiar architectures, a revising <strong>of</strong> simulation codes is<br />
<strong>of</strong>ten necessary, and new solutions, well suited for CPUbased<br />
computational environments, must be adopted.<br />
If assuming that no magnetic materials are present (i.e.<br />
the relative permeability rel is equal to 1 everywhere),<br />
following well established approximation formulas [3],<br />
self and mutual inductances can be computed using<br />
“segmented” approximations. The basic idea is to<br />
decompose coils into simpler elements, for which self or<br />
mutual inductances can be easily computed, eventually<br />
using closed form expressions. Superposition is then used<br />
to get the final value <strong>of</strong> coils self or mutual inductance.<br />
In this paper a method able to compute self and mutual<br />
inductances in air for generally 3D shaped massive coils,<br />
based on coil decomposition into filamentary elements,<br />
called current sticks, is presented, and its implementation<br />
on HPC environments, based on GPUs, is briefly<br />
discussed. The method is able to adapt the discretization<br />
level to the required accuracy, and was implemented in<br />
the INDIANA code (INDuctance Iterative and Adaptive<br />
Numerical Assessment).<br />
The basic objective <strong>of</strong> INDIANA is the computation <strong>of</strong><br />
self and mutual inductances <strong>of</strong> coils wound with multiple<br />
series-connected turns <strong>of</strong> conductors. In mutual<br />
inductance computation, INDIANA decomposes both<br />
coils into a suited number <strong>of</strong> current sticks, and computes<br />
the line integral along each stick <strong>of</strong> the “target” coil <strong>of</strong> the<br />
vector potential A generated by each stick <strong>of</strong> the “source”<br />
coil, with unit current. Then, the procedure, taking<br />
advantage <strong>of</strong> the concept <strong>of</strong> “partial inductance” [4] and<br />
<strong>of</strong> the linearity assumption, sums all contributions to get<br />
the final, overall required value. Self inductances are<br />
computed by using the same coil for both source and<br />
target, but extracting the singular case <strong>of</strong> “self”<br />
computations for each element, which are treated using<br />
the expression for self inductance <strong>of</strong> a current stick [3].<br />
This scheme suits quite well for GPU-based<br />
computations, as will be further discussed in Sect. III.<br />
In the following, a short overview <strong>of</strong> the relevant<br />
points in the method will be presented (Sect. II), some<br />
comments on the GPU implementations are reported<br />
(Sect. III), and a few examples are discussed to help<br />
assessing the method capabilities (Sect. IV). Finally, in<br />
two final annexes, a brief description <strong>of</strong> the GPUs<br />
architecture and programming paradigms is given.<br />
II. MATHEMATICAL FORMULATION<br />
The decomposition <strong>of</strong> massive coils into elementary<br />
components is performed in two steps, taking into<br />
account the structure <strong>of</strong> the winding, the distance between<br />
the coils, and the local curvature <strong>of</strong> each coil.<br />
As a first step, each coil is decomposed in as many<br />
filamentary conductors as required by accuracy needs.<br />
The typical figure adopted is as many as the actual<br />
conductors in the Winding Pack (WP) <strong>of</strong> each coil.<br />
However the figure can be increased according to the<br />
adopted technology (e.g. in superconducting cable in<br />
conduit conductor technology, the decomposition can be<br />
extended down to petals level) to meet the accuracy<br />
needs.<br />
As a second step, each conductor is described using an<br />
interpolating line, typically a spline, defined by a limited<br />
number <strong>of</strong> parameters, such as the coordinates <strong>of</strong> a<br />
suitable number <strong>of</strong> control points, constraining the shape<br />
<strong>of</strong> the conductor to the required geometrical accuracy. In<br />
addition the continuous curve is reduced to a collection <strong>of</strong><br />
sticks, defined by a number <strong>of</strong> suitable break points (see<br />
Fig. 1 for a schematic view).<br />
The number <strong>of</strong> sticks is selected on the basis <strong>of</strong> the<br />
accuracy required by magnetic field computation, and can<br />
vary depending on the local curvature <strong>of</strong> the interpolating<br />
curve and on the distance from field points. INDIANA<br />
performs a first guess for the distribution <strong>of</strong> break points<br />
along conductors <strong>of</strong> the source coil on the basis <strong>of</strong><br />
average curvature radius and on minimum distance
Actual<br />
Coil<br />
Current<br />
Sticks<br />
Approximation<br />
First coil: source Second coil: target<br />
Figure 1: Segmentation <strong>of</strong> coils centerline into “sticks”: source coil<br />
(left) and target coils (right).<br />
between the midpoint <strong>of</strong> the section <strong>of</strong> the source coil<br />
being considered and the target coil.<br />
If the required accuracy is not fulfilled, the inductance<br />
computation is assessed by increasing the number <strong>of</strong><br />
break points, and performing a new inductance<br />
computation. The process iterates until the convergence,<br />
in Cauchy sense within a prescribed accuracy, is<br />
achieved.<br />
The mutual inductance Mjk between the k-th stick on<br />
the source coil and j-th stick on the target coil can be<br />
computed either using the classical formulas from [3], or<br />
by line integrating (numerically) the vector potential<br />
Ak(x) generated by stick k on stick j:<br />
(1)<br />
ˆ<br />
M jk Akx j tˆ dl j<br />
<br />
j<br />
where j is straight line along the j-th stick, xj is a generic<br />
point along j, and ˆt is the stick unit vector. The<br />
expression for Ak is given in [5]:<br />
1<br />
ˆ<br />
cba A 0 a ln <br />
(2)<br />
4<br />
cba where c=j+1-xi, b=j-xi, and a=j+1-j and the coordinates<br />
<strong>of</strong> the stick tips.<br />
c<br />
xj<br />
Figure 2: Basic elements form computation <strong>of</strong> vector potential using<br />
(2b).<br />
INDIANA implements a slightly modified version <strong>of</strong><br />
(2), to treat the singularity when computing A on points<br />
on the source stick axis [6]. The number <strong>of</strong> Gauss points<br />
for numerical integration <strong>of</strong> (2) is chosen, according to<br />
the requested accuracy, on the basis <strong>of</strong> the distance<br />
between centers <strong>of</strong> source and target points, using a linear<br />
relationship based on the length <strong>of</strong> the target stick and the<br />
distance between mid points <strong>of</strong> source and target sticks.<br />
If source and target sticks are coincident, the self<br />
inductance Mkk <strong>of</strong> the k-th stick can be computed using<br />
the expression for a thin beam [3], providing the value in<br />
Henry if the stick length Lk is given in meters:<br />
4 2Lk <br />
Mkk 210 Lkln<br />
1<br />
(3)<br />
r<br />
<br />
<br />
Lk<br />
where r is the geometric mean distance and is the<br />
a<br />
Ak b<br />
A<br />
- 358 - 15th IGTE Symposium 2012<br />
arithmetic mean distance on the corresponding k-th beam<br />
cross section. Values <strong>of</strong> r and for different cross<br />
sections are given in [3], while INDIANA adopts the<br />
expression for circular cross section and long beams,<br />
where r is the radius <strong>of</strong> the cross section and /Lk is<br />
negligible.<br />
III. GPU IMPLEMENTATION<br />
In this section attention will be focused on the porting<br />
<strong>of</strong> INDIANA code on the peculiar GPUs hardware. In<br />
Appendix A, to the benefit <strong>of</strong> non experts, a short<br />
introduction to GPU’s architecture is given [7-11], while<br />
in Appendix B some hints strictly related to the<br />
peculiarity <strong>of</strong> the GPU’s hardware are provided for the<br />
interested programmers [7, 8].<br />
The typical computational GPU architecture includes a<br />
classical CPU section where the GPUs are grafted. In<br />
order to pr<strong>of</strong>itably use the parallel nature <strong>of</strong> the GPU, any<br />
code implementing numerical computations needs to be<br />
split into sequential parts, which are performed on the<br />
main CPU (or CPUs), and into the numerically intensive<br />
parts, which can be more effectively performed on the<br />
GPUs. In this way the best exploitation <strong>of</strong> GPUs and<br />
CPUs execution capabilities can be achieved.<br />
In the INDIANA code the basic computational task,<br />
i.e. the evaluation <strong>of</strong> (2), the magnetic vector potential A<br />
generated by a single stick in a single field point, can<br />
benefit <strong>of</strong> the GPUs architecture. As a matter <strong>of</strong> fact this<br />
task, which must be repeated a very high number <strong>of</strong><br />
times, can be simply assigned to a thread; then, suitable<br />
grouping <strong>of</strong> treads onto computational blocks, can be<br />
organized to exploit at best the data structure and the<br />
available resources.<br />
In order to achieve the peak performance [9], the<br />
computational kernel needs to use GPU registers; in<br />
addition, it needs to treat a large number <strong>of</strong> independent<br />
instructions to exploit the scheduler capability <strong>of</strong> the<br />
graphic card (further details can be found in Appendix<br />
B). For these reasons the code was structured following<br />
the flowchart:<br />
1) Load the field point associated<br />
to the considered thread. (Global<br />
memory access)<br />
2) Load the start and end points <strong>of</strong><br />
the sticks (Global memory access)<br />
3) Compute the contribution to the<br />
field <strong>of</strong> each stick in the bundle<br />
using (2)<br />
4) Accumulate all the contributions;<br />
in a register<br />
Last<br />
stick?<br />
Yes<br />
5) Store the computed vector field<br />
in the global memory<br />
Figure 3: Flowchart showing the INDIANA kernel computation on<br />
GPU architecture.<br />
No
The final integration step (1) is executed in the CPU<br />
side, since it is well suited to CPU characteristics, and its<br />
impact on the computational burden is quite marginal.<br />
INDIANA was implemented using the MATLAB©<br />
parallel computational toolbox, and the core <strong>of</strong> the code<br />
has been parallelized on the GPU using CUDA©, an<br />
extension <strong>of</strong> the C language for Nvidia© GPU<br />
programming. A few details are given in Appendix B.<br />
IV. EXAMPLES OF APPLICATION<br />
In this section, two groups <strong>of</strong> examples are presented.<br />
In the first group, in order to assess accuracy, results from<br />
INDIANA are compared to standard results for simple<br />
geometries where analytical formulas are available. In the<br />
second group, results from INDIANA are compared to<br />
computations from 3D finite elements to assess speed-up<br />
<strong>of</strong> computations for generally shaped coils.<br />
Computational times are all referred to an intel i7–based<br />
PC, with 8Gb ram, running Matlab© Ver.7 for the CPU<br />
computations, and CUDA© Ver. 4.2 for GPU<br />
computations.<br />
a. Accuracy Assessment<br />
Three filamentary single-turn coils have been<br />
considered in this group. The first one is a circular<br />
coil, while the other two are elliptical coils. Analytical<br />
expressions and reference figures were taken from<br />
[12]. Geometrical details are reported in Table I,<br />
while a comparison <strong>of</strong> results is reported in Table II,<br />
for increasing accuracy (expressed in Parts Per<br />
Million -p.p.m.- <strong>of</strong> the reference value), and<br />
consequently for increasing number <strong>of</strong> sticks in<br />
INDIANA calculations.<br />
z<br />
C3<br />
Figure 4: Coils used for Accuracy Assessment<br />
TABLE I<br />
COILS USED FOR ACCURACY ASSESSMENT<br />
Coil # Centre position [m] Radii (a,b) [m]<br />
C1 (0.0, 0.0, 0.0) 1 (circular)<br />
C2 (0.0, 0.0, 0.5) (1/3, 2/3)<br />
C3 (0.6, 2.0, 0.1) (1/3, 2/3)<br />
Required<br />
Figure<br />
x<br />
b<br />
TABLE II<br />
INDUCTANCES FOR INCREASING ACCURACY<br />
Required<br />
Accuracy<br />
[p.p.m.]<br />
C2<br />
a<br />
C1<br />
Number <strong>of</strong><br />
sticks<br />
Computational<br />
times [s]<br />
Reference<br />
Value [H]<br />
MC1-C2 1.0 10 3 1.13 0.30871178<br />
MC1-C2 0.1 10 4 36.9 0.30871178<br />
MC1-C3 1.0 10 3 1.17 0.03963496<br />
MC1-C3 0.1 10 4 35.8 0.03963496<br />
y<br />
- 359 - 15th IGTE Symposium 2012<br />
b. Speed Assessment<br />
For this second analysis, the mutual inductance <strong>of</strong> two<br />
coaxial circular coils has been considered. This<br />
benchmark case is treated either using INDIANA with<br />
300 sticks and the 3D FEM package COMSOL<br />
multiphysics 4.2a (27524 2 nd order tetrahedral elems.,<br />
neglecting any symmetry for the sake <strong>of</strong> generality). The<br />
two coils are described in Table III, while results are<br />
reported in Table IV.<br />
TABLE III<br />
COILS USED FOR SPEED ASSESSMENT<br />
Coil # Centre position [m] Radius [m]<br />
C1 (0.0, 0.0, 0.0) 0.20<br />
C3 (0.0, 0.0, 0.1) 0.25<br />
Method<br />
TABLE IV<br />
INDUCTANCES FOR SPEED ASSESSMENT<br />
Mutual<br />
Inductance [H]<br />
Computational time<br />
[s]<br />
INDIANA 0.2487 4<br />
3D FEM package 0.2486 490<br />
Reference Value [2] 0.2488 ---<br />
As a second speed test, the computation <strong>of</strong> mutual<br />
inductance between a massive solenoidal source coil<br />
(radius 1.7 m, length 2.0 m, thickness 0.7 m, 14 layers, 38<br />
turns per layer) and a filamentary coil (Rin=4.77 m,<br />
Rout=5.83 m, Zlow=5.02 m, Zup=5.11 m, =17.3°) used to<br />
measure flux across a test surface was considered (See<br />
Fig. 5). This test case is relevant for flux measurements in<br />
magnetic confinement fusion devices [13, 14]. The<br />
accuracy requirements on the flux measurement are rather<br />
severe, in order to achieve that accuracy a high number <strong>of</strong><br />
sticks can be needed, in these cases the GPU speed<br />
enhancement can be very useful to complete the<br />
simulation in a reasonable amount <strong>of</strong> time. All methods<br />
proved able to give the correct result <strong>of</strong> 2.5665048e-3 H.<br />
The massive source coil has been represented by as<br />
many conductors as actually present in its WP (that is,<br />
14×38), while the discretization level along each<br />
conductor has been varied to improve accuracy.<br />
Comparison <strong>of</strong> computational times for various<br />
discretization levels either for GPU computations, and for<br />
purely CPU computations for the sake <strong>of</strong> comparison, are<br />
reported in Table V.<br />
Source Coil<br />
Partial Flux<br />
Measurement<br />
Loop<br />
Figure 5: Sketch <strong>of</strong> source coil for flux generation in Tokamak devices<br />
and Partial Flux measurement loop
TABLE V<br />
SPEED UP FOR MUTUAL INDUCTANCE<br />
Computational Times [s] 1000 sticks 5000 sticks<br />
GPU tGPU 5.31 19.4<br />
CPU tCPU 35.8 125<br />
Speed up (tCPU/ tGPU) 6.74 6.44<br />
V. CONCLUSIONS<br />
A numerical code able to compute mutual inductance<br />
between couples <strong>of</strong> any massive coils has been presented.<br />
The code is called INDIANA, and is able to adaptively<br />
modify its computational parameters to achieve a trade<strong>of</strong>f<br />
between accuracy and computational speed.<br />
INDIANA code benefit <strong>of</strong> a significant acceleration<br />
(up to 7x) thanks to the GPU parallelization. This<br />
performance allows to easily meet high accuracy request<br />
in mutual inductance calculations for complex 3D shaped<br />
coils.<br />
INDIANA performance has been assessed either in<br />
terms <strong>of</strong> accuracy and speed with respect to simple<br />
geometries presented in literature or complex shapes,<br />
compared to FEM computations.<br />
Future activity will be addressed the MPI<br />
parallelization over a computer cluster where each node<br />
is equipped with GPU, in order to analyze more complex<br />
structures.<br />
ACKNOWLEDGEMENTS<br />
Authors wish to thank Mr. M. Nicolazzo from<br />
CREATE and Mr. M. Fatica from Nvidia for fruitful<br />
discussions, and valuable hints and suggestions.<br />
This work was partly supported by Seconda Università<br />
di Napoli under PRIST grant “Generazione distribuita di<br />
energia da fonti tradizionali e rinnovabili: aspetti<br />
ingegneristici e giuridici-economici-ambientali”, partly<br />
by NVIDIA Corporation and partly by<br />
ENEA/EURATOM CREATE association.<br />
REFERENCES<br />
[1] M. Ci<strong>of</strong>fi, A. Formisano, R. Martone, “Increasing design<br />
robustness in evolutionary optimisation“, COMPEL, vol. 23,<br />
pp.187-196, 2004.<br />
[2] M. Ci<strong>of</strong>fi, A. Formisano, R. Martone, G. Steiner, D. Watzenig, ”A<br />
fast method for statistical robust optimization”, IEEE Transactions<br />
on Magnetics , Vol. 42, pp. 1099-1102, 2006.<br />
[3] F. Grover, Inductance Calculation, New York: D. Van Nostrand,<br />
1946.<br />
[4] C. R. Paul, Introduction to Electromagnetic Compatibility,<br />
Hoboken (NJ): J. Wiley & Sons, 2006.<br />
[5] H. A. Haus, J. R. Melcher, Electromagnetic Fields and Energy,<br />
Englewood Cliffs, NJ: Prentice Hall, 1989.<br />
[6] J. Hanson, S. Hirshman, “Compact expressions for the Biot–<br />
Savart fields <strong>of</strong> a filamentary segment”, Phys. <strong>of</strong> Plasmas, vol. 9,<br />
pp.4410-4412, Oct. 2002.<br />
[7] D. Kirk, W. Hwu, Programming Massively Parallel Processors: A<br />
Hands-on Approach, Elsevier, 2010.<br />
[8] M. Garland et al., “Parallel computing experiences with CUDA”,<br />
IEEE Micro, vol. 28, 2008 pp. 13–27.<br />
[9] V. Volkov. “Better performance at lower occupancy”, Proceedins<br />
<strong>of</strong> NVIDIA GPU <strong>Technology</strong> Conference 2010, San Jose, USA,<br />
pp. 20-23, Sept. 2010.<br />
[10] R. Farber, CUDA Application Design and Development, Morgan<br />
Kaufmann, 2011.<br />
[11] F. Calvano, G. Rubinacci, A. Tamburrino, G. Vasilescu, S.<br />
Ventre, “Parallel MGS-QR sparsification for fast eddy current<br />
- 360 - 15th IGTE Symposium 2012<br />
NDT simulation” Studies in Applied Electromagnetics and<br />
Mechanics, vol. 36, pp. 29-36, 2012.<br />
[12] J. T. Conway, “Exact Solutions for the Mutual Inductance <strong>of</strong><br />
Circular Coils and Elliptic Coils”, IEEE Trans. on Magn., vol 48,<br />
pp. 81-94, 2012.<br />
[13] A. J. Donné et al., “Progress in ITER Physics basis, Chapetr 7:<br />
Diagnostics”, Nucl. Fusion, vol. 47, pp. S337-S384, (2007).<br />
[14] A. Formisano, J. Knaster J., R. Martone et al., “ITER nonaxisymmetric<br />
error fields induced by its magnet system”, Fusion<br />
Engineering and Design, vol. 86, pp. 1053-1056, 2011.<br />
APPENDIX A: GPU ARCHITECTURES<br />
Hardware used in computer Central Processing Unit<br />
(CPU) seems to be reaching the physical limits beyond<br />
which increase <strong>of</strong> clocking frequency or <strong>of</strong> integration<br />
scale is very hard with present technology. As a possible<br />
alternative, CPU manufactures are moving to multiplecores<br />
CPU’s, but the number <strong>of</strong> the cores is usually<br />
limited to a few tens. On the other hand, realistic<br />
treatment <strong>of</strong> real world applications gives rise to<br />
computationally demanding numerical models. In order<br />
to speed up the computations, different parallelization<br />
paradigms can be considered, a few examples being<br />
reported in [11].<br />
Recently, the Graphic Processing Units (GPUs),<br />
present on virtually all graphic cards <strong>of</strong> computers, have<br />
been proposed as data-parallel coprocessors, used to<br />
solve compute-intensive science and engineering<br />
problems, since it was observed that the mathematical<br />
processing in high resolution images are very similar to<br />
the computations usually required in numerical models <strong>of</strong><br />
physical phenomena.<br />
A modern GPU can have up to 1024 processor cores or<br />
Streaming Processors (SPs) grouped in Streaming<br />
Multiprocessors (SMs), each containing eight processor<br />
cores.<br />
In order to reduce the dimension <strong>of</strong> chip area dedicated<br />
to the control unit, each core in an SM use a parallel<br />
computation paradigm called Single Instruction, Multiple<br />
Data (SIMD), where concurrent processor execute the<br />
same code (called Kernel) on different data.<br />
The tasks submitted at each core are called threads; the<br />
threads are grouped in thread blocks (fig. 6b); the<br />
maximum dimension <strong>of</strong> a block is presently 512 threads;<br />
hence, a code need to launch a lot <strong>of</strong> thread blocks. For<br />
these reason the thread blocks are grouped into a grid <strong>of</strong><br />
thread blocks (fig. 6c). The threads in a block can be<br />
indexed using a 3D identifier, a block in a grid can be<br />
indexed using a 2D identifier.<br />
A block <strong>of</strong> threads is assigned at a SM; each SM can<br />
use 8,192 registers; the registers are the faster memory<br />
inside a GPU but they are dynamically partitioned among<br />
the threads inside the blocks; each thread can only access<br />
its own registers (fig. 6a). All threads inside a block can<br />
cooperate with the others sharing memory using an onchip<br />
low latency memory called shared memory (48 kB);<br />
the shared memory bandwidth is about 6x lower respect<br />
the register bandwidth.<br />
The GPU has a memory, called Global memory, where<br />
the CPU can upload the input data and download the<br />
result <strong>of</strong> the computation. The global memory is available<br />
at all the SMs; it is the largest memory inside a GPU, up<br />
to 6 GB in modern solutions.
APPENDIX B: GPU PROGRAMMING STRATEGY<br />
In order to obtain the best results, a programmer needs<br />
to take into account the peculiar hardware characteristic<br />
<strong>of</strong> the GPUs, in each step <strong>of</strong> the program [7, 10].<br />
The access to Global Memory are time consuming, in<br />
order to increase the access rate each time a location is<br />
accessed, many consecutive locations are accessed by the<br />
hardware.<br />
a) thread<br />
b) thread blocks<br />
…<br />
c) grid <strong>of</strong> thread blocks<br />
…<br />
Figure 6: GPU memory model<br />
A typical GPU program will follow the steps showed<br />
in Fig.7. In order to obtain the peak performance the<br />
programmer need to reorganize, in the host side (Fig.7<br />
step a), the input data in such a way that adjacent threads<br />
operate on adjacent data in global memory (Coalescing<br />
Access). In this case, the hardware combines, or<br />
coalesces, all <strong>of</strong> these accesses into a unique access to<br />
consecutive locations. An example related to geometric<br />
data is showed in figure 8. In order to allow coalescing<br />
accesses, duplication <strong>of</strong> data can be needed.<br />
Figure 7: Flowchart <strong>of</strong> a typical GPU code<br />
Register<br />
… …<br />
Shared<br />
memory<br />
a) Upload part <strong>of</strong> the input data from the<br />
CPU memory to the GPU global memory<br />
b) Use the thread and block index to select<br />
the data from the GPU global memory<br />
c) Compute the task and store the results<br />
in GPU global memory<br />
d) Download the result from the GPU global<br />
memory to the CPU memory<br />
Global memory<br />
- 361 - 15th IGTE Symposium 2012<br />
a)<br />
b)<br />
X1 Y1 Z1 X2 Y2 Z2 …. XN YN ZN<br />
Thread 1 (th1) th2 thN<br />
X1 X2 …XN Y1 Y2 … YN Z1 Z2 ZN<br />
th1 th2 thN<br />
Figure 8: a) Uncoalescing access pattern, b) Coalescing access pattern<br />
As already discussed, a thread can access three kinds<br />
<strong>of</strong> memory: global memory, shared memory and the<br />
registers. The registers are the fastest memory; hence, in<br />
order to achieve the best performance, the programmer<br />
has to use as many registers as possible.<br />
Of course, the actual speed up can be limited by the<br />
amount <strong>of</strong> on chip memory resources (as registers and<br />
shared memory) and by the shared memory bandwidth.<br />
The GPU has a sophisticated scheduler very effective<br />
in minimizing the performance loss due to access to<br />
global memory. If a sufficient number <strong>of</strong> threads is<br />
available, the scheduler can concurrently run the threads<br />
on the multiprocessor, masking the memory accesses.<br />
This approach is called in literature Thread Level<br />
Parallelism (TLP) approach [7-9].<br />
The larger is the number <strong>of</strong> threads, the fewer are the<br />
registers available per each thread (the registers are 8192<br />
and are partitioned among the threads inside a block).<br />
Unfortunately the actual availability <strong>of</strong> a limited number<br />
<strong>of</strong> registers per thread can be a bottleneck for the entire<br />
code.<br />
In order to increase the speed up, the scheduler is<br />
provided by the capability to analyze the instruction flow<br />
and evaluate how reliable could be to execute two<br />
instructions at the same time. If the answer is positive, the<br />
hardware localizes possible free units and increases the<br />
parallelism and executes more than one instruction during<br />
the same clock cycle. Then, a small number <strong>of</strong> threads<br />
per block usually are recommended (64 threads per block<br />
can be a good compromise) and, in addition, the thread<br />
have to be designed to present as many independent<br />
instructions as possible: in such a way the Instruction<br />
Level Parallelism (ILP) [7-9] increases the performance.
- 362 - 15th IGTE Symposium 2012<br />
The Reduced Basis Method Applied to<br />
Transport Equations <strong>of</strong> a Lithium-Ion Battery<br />
Stefan Volkwein∗ ∗ †<br />
, Andrea Wesche<br />
∗Universität Konstanz, Fachbereich Mathematik und Statistik , Universitätsstraße 10, D-78457 Konstanz,<br />
E-mail: stefan.volkwein@uni.konstanz.de<br />
† Adam Opel AG, Bahnh<strong>of</strong>splatz, D-65423 Rüsselsheim, E-mail: Andrea.Wesche@de.opel.com<br />
Abstract—In this paper we consider a coupled system <strong>of</strong> nonlinear parametrized partial differential equations (P 2 DEs),<br />
which models the concentrations and the potentials in lithium-ion batteries. The goal is to develop an efficient reduced<br />
basis approach for the fast and robust numerical solution <strong>of</strong> the P 2 DE system. Numerical examples illustrate the efficiency<br />
<strong>of</strong> the proposed approach.<br />
Index Terms—finite volume method, greedy algorithm, lithium-ion battery, reduced basis method<br />
I. INTRODUCTION<br />
The modelling <strong>of</strong> lithium-ion batteries has received an<br />
increasing amount <strong>of</strong> attention in the recent past. Several<br />
companies worldwide are developing such batteries<br />
for consumer electronic applications, in particular, for<br />
electric-vehicle applications. To achieve the performance<br />
and lifetime demands in this area, exact mathematical<br />
models <strong>of</strong> the battery are required. Moreover, the multiple<br />
evaluation <strong>of</strong> the battery model for different parameter<br />
settings involves a large amount <strong>of</strong> time and experimental<br />
effort. Here, the derivation <strong>of</strong> reliable mathematical<br />
models and their efficient numerical realization are very<br />
important issues in order to reduce both time and cost in<br />
the improvement <strong>of</strong> the performance <strong>of</strong> batteries.<br />
In the present work we consider a mathematical model<br />
for lithium-ion batteries which describes the transport<br />
processes by a partial differential equation system. This<br />
model is developed in the paper by Popov et al. [17]. The<br />
physical and chemical details can be found in [13] and<br />
[14]. The equation system models a physico-chemical<br />
micro-heterogeneous battery model.<br />
We discretize this by the finite volume method and the<br />
backward Euler method. The reduced basis methodology<br />
for by finite volumes discretized systems can be found<br />
in [10]. The discretized model is reduced by the reduced<br />
basis method [16]. Our numerical tests will illustrate the<br />
efficiency <strong>of</strong> this approach.<br />
A popular battery model is the one developed by Newman<br />
[4], [15], which was implemented and tested [5]. Let<br />
us also refer to the work [6], where a different battery<br />
model is derived. For an equation system which describes<br />
a physico-chemical macro-homogeneous battery model<br />
the well posedness is shown by Wu et al. [19].<br />
II. BATTERY MODEL<br />
PDE Model<br />
Let Ω ⊂ R be an open interval, which is divided in three<br />
disjunct open sub-intervals Ωc, Ωe, Ωa ⊂ R, see Figure<br />
2.1. For tend > 0 we define Q ∶= Ω ×(0,tend) and let<br />
c, φ ∶ Q → R and α, β, λ, κ ∶ R2 → R, the notations<br />
positive electrode<br />
electrolyte<br />
negative electrode<br />
Ωc Ωe Ωa<br />
BUTLER-VOLMER-equation<br />
Fig. 2.1. Structure <strong>of</strong> the considered battery domain<br />
can be found in Table 1.2 in the appendix. The transport<br />
processes in a battery, i.e. transport <strong>of</strong> mass and charge,<br />
are described by the equations [17]<br />
∂c<br />
−∇⋅(α (c, φ)∇c + β (c, φ)∇φ) =0<br />
∂t<br />
(2.1a)<br />
−∇ ⋅ (λ (c, φ)∇c + κ (c, φ)∇φ) =0 (2.1b)<br />
in Ωc ×(0,tend), Ωe ×(0,tend) and Ωa ×(0,tend), where<br />
“∇” denotes the gradient and “∇⋅” the divergence.<br />
The positive electrode is Ωc (cathode for discharge),<br />
the electrolyte Ωe and the anode Ωa (anode for discharge).<br />
Boundary/Interface Conditions<br />
The boundary conditions are<br />
∂c<br />
∂ν = 0, φ = 0 on (∂Ω ∩ ∂Ωc)×(0,tend) (2.2a)<br />
∂c<br />
= 0,<br />
∂ν<br />
∂φ I<br />
=−<br />
∂ν σa<br />
on (∂Ω ∩ ∂Ωa)×(0,tend) (2.2b)<br />
where ν is the outer unit normal vector, I ∶ R + → R<br />
(time-dependent current) and σa ∈ R + /{0} (electric conductivity<br />
multiplied with the cross section). The initial<br />
condition is given by<br />
c(x, t0 = 0) =c0 (x) ,x∈ Ω (2.3)
The interface conditions are given by<br />
−(α (c, φ)∇c + β (c, φ)∇φ)<br />
⎧⎪ I(cec,cc,φec,φc) ∣ in (∂Ωc ∩ ∂Ωe)<br />
(0,tend)<br />
= ⎨<br />
⎪⎩<br />
−I (cea,ca,φea,φa) ∣ in (∂Ωe ∩ ∂Ωa)<br />
(0,tend)<br />
−(λ (c, φ)∇c + κ (c, φ)∇φ)<br />
⎧⎪ J(cec,cc,φec,φc) ∣ in (∂Ωc ∩ ∂Ωe)<br />
(0,tend)<br />
= ⎨<br />
⎪⎩<br />
−J (cea,ca,φea,φa) ∣ in (∂Ωe ∩ ∂Ωa)<br />
(0,tend)<br />
(2.4a)<br />
(2.4b)<br />
where cea is the concentration in the electrolyte at the<br />
negative electrode interface:<br />
cea (t) =lim<br />
h→0 c (x∣ Ωe∩Ωa − h, t)<br />
and h > 0 is small enough, i.e. x∣ − h ∈ Ωe.<br />
Ωe∩Ωa<br />
Analogously<br />
cec (t) =lim<br />
h→0 c (x∣ Ωe∩Ωc + h, t)<br />
cc (t) =lim<br />
h→0 c (x∣ Ωe∩Ωc − h, t)<br />
ca (t) =lim<br />
h→0 c (x∣ Ωe∩Ωa + h, t)<br />
for sufficient small h > 0. Thevariablesφc, φa, φec, φea<br />
are defined in the same way. We write cs for the concentration<br />
in the solid part, i.e. in the negative and positive<br />
electrode, and ce for the concentration in the electrolyte,<br />
φs and φe are analogously denoted. The scalar functions<br />
I∶R 4 → R and J∶R 4 → R are defined by<br />
I(ce,cs,φe,φs) = J(ce,cs,φe,φs)<br />
√ √<br />
F<br />
√<br />
ce cs<br />
J(ce,cs,φe,φs) =k<br />
1 − cs<br />
c 0 e<br />
c 0 s<br />
cs,max<br />
⋅ 2sinh( F<br />
(φs − φe − U0 (cs)))<br />
2RT<br />
where F = 96486 A⋅s<br />
is the Faraday constant, R =<br />
mol<br />
A⋅V ⋅s<br />
8.314 is the gas constant and T > 0 [K] is the<br />
K⋅mol<br />
temperature. The function U0 ∶ R → R is the over<br />
potential and depends on the concentration c in the<br />
electrodes. The coefficient functions are defined as<br />
α (c, φ) ∶=De (c, φ)+ RT<br />
F 2<br />
(t+ (c, φ)) 2 κ (c, φ)<br />
c<br />
t+ (c, φ)<br />
β (c, φ) ∶=κ (c, φ)<br />
F<br />
λ (c, φ) ∶= RT<br />
F<br />
t+ (c, φ) κ (c, φ)<br />
c<br />
[ cm2<br />
s ]<br />
[ mol<br />
V ⋅ cm ⋅ s ]<br />
[ A ⋅ cm2<br />
mol ]<br />
where the transference number t+ is zero in the electrodes<br />
and larger than zero in the electrolyte and κ<br />
is the ionic/electric conductivity; κ, t+, De ∶ R 2 → R.<br />
To measure the battery parameters experimentally it is<br />
assumed, that they are polynomials in c and φ.<br />
The homogeneous Neumann boundary conditions for<br />
the concentration (2.2) mean that no flux <strong>of</strong> lithium(-ions)<br />
- 363 - 15th IGTE Symposium 2012<br />
can pass through. The inhomogeneous Neumann boundary<br />
condition for the potential is Ohm’s law, the homogeneous<br />
Dirichlet boundary condition have no physical<br />
meaning. It ensures the uniqueness <strong>of</strong> the solution if one<br />
exists.<br />
The interface conditions describe the exchange <strong>of</strong> the<br />
lithium-ions at the interfaces which are modeled by the<br />
Butler-Volmer-equation [1].<br />
For physical reasons we assume that<br />
c (x, t) ≥0 ∀x ∈ Ω, t∈(0,tend)<br />
We remark that the coefficient functions α and κ are<br />
larger than zero for physical reasons: the diffusivity De<br />
and the conductivity κ are larger than zero. Because <strong>of</strong><br />
the definition <strong>of</strong> the transference number t+ the coefficient<br />
functions β and λ are equal or larger than zero.<br />
Discretization <strong>of</strong> the Problem<br />
We discretize the partial differential equation system<br />
(2.1a)-(2.1b) with the appropriate boundary (2.2) and<br />
interface conditions (2.4) by the cell centered finite<br />
volume method. We divide therefore Ωc in Nc ∈ N, Ωe in<br />
Ne ∈ N and Ωa in Na ∈ N, ND = Nc+Ne+Na, equidistant<br />
control volumes <strong>of</strong> the width Δx. We use the method <strong>of</strong><br />
lines and solve the equation system for every time step.<br />
The time step size is Δt. The integrals over the spatial<br />
we approximate by the middle point rule, the integrals<br />
over the time by the backward Euler method, for details<br />
cf. [17]. These discretized equations are implemented in<br />
MATLAB 7.10.0 (R2010a).<br />
III. REDUCED BASIS METHOD<br />
Initial Point<br />
We consider a parametrized PDE which we want to<br />
solve for many parameter sets, e.g. for parametric studies.<br />
The better the numerical model approximates the physical<br />
phenomenon, the more expensive the computation<br />
gets. So in some cases e. g. a parameter analysis needs<br />
too much effort, because a single computation is too<br />
expensive. Therefore one has to develop a reduced model<br />
to get cheap solutions.<br />
The reduced basis method is based on the discretized<br />
model: the idea is to compute a “few” times an expensive<br />
solution to different parameter sets which are in the range<br />
<strong>of</strong> interest. With the knowledge <strong>of</strong> these so called “true”<br />
solutions basis vectors are computed. The approach is<br />
that the reduced solutions in the parameter set <strong>of</strong> interest<br />
are linear combinations <strong>of</strong> these basis vectors.<br />
An assumption is that the error between the “exact”<br />
analytical solution and the “true” numerical solution is<br />
small in contrast to the error between the “true” and the<br />
“reduced” solution.<br />
A big advantage <strong>of</strong> the present method is that you<br />
determine the error between the true and reduced solution<br />
during “developing” your reduced model (→ Greedy<br />
algorithm). A further property is that the method has<br />
two phases: the <strong>of</strong>fline computation in which the reduced
model is set which fulfills the given error tolerance and<br />
in which the needed true solutions are computed and<br />
the online phase in which the reduced solution(s) are<br />
computed. The <strong>of</strong>fline part is expensive and the online<br />
phase is cheap.<br />
Approach<br />
In the following we have to resolve how to choose the<br />
true solution and how to estimate the error between the<br />
reduced and true solution. The (POD-)Greedy algorithm<br />
ensures both issues.<br />
We now describe how to apply the reduced basis<br />
method on our battery model. The transport equations <strong>of</strong><br />
the battery (2.1a)-(2.1b) depend on many parameters: on<br />
geometrical parameters (e.g. the width <strong>of</strong> the electrode)<br />
on state parameters (e.g. temperature) and on battery<br />
parameters (e.g. the diffusion coefficient). We note these<br />
parameters with μ ∈Dand assume that all these different<br />
parameters are polynomials in c and φ, some are <strong>of</strong><br />
course constant and so polynomials <strong>of</strong> the degree zero.<br />
We write u N ∈X N for a piecewise linear functions,<br />
its coefficient are denoted with u N ∈ R N . The discretized<br />
problem is now the following: Find a u N ∈X N so that<br />
F N (u N ; μ) =0<br />
⇔⟨F N (u N ; μ) ,v N ⟩W =0 ∀v N ∈X N<br />
(3.1)<br />
where the mass matrix W is in the present case given by<br />
W = Δx ⋅ 1 ∈ R N×N .<br />
We approximate the finite volume space by a Ndimensional<br />
space X N which is spanned by “snapshots”.<br />
A snapshot is a true solution to a specific parameter set<br />
μ ∈D and time node t ∈(0,tend). We assume now, that<br />
we have a (orthonormalized) basis Ξ =(ξ1,...,ξN )∈<br />
R N×N <strong>of</strong> this space and that the reduced solution can be<br />
written as<br />
N<br />
u N (μ) = ∑ θ uN<br />
j (μ) ξj (3.2)<br />
j=1<br />
If we replace u N in equation (3.1) by u N <strong>of</strong> equation<br />
(3.2) and choose v N = ξi ∈ R N we get:<br />
Ξ T ⋅ W ⋅ F N (θ uN<br />
(μ)⋅Ξ; μ) =∶F N (θ uN<br />
(μ))<br />
=0 (3.3)<br />
The start vector for the coefficient vector for Newton’s<br />
method one can get by<br />
u start<br />
coeff (μ) =ΞT ⋅ W ⋅ u N (⋅,t0 = 0; μ)<br />
To find a basis we use for our time dependent problem<br />
the POD-Greedy algorithm, cf. algorithm 1 and [10]. It<br />
consists <strong>of</strong> two loops: the outer loop is the “standard”<br />
Greedy, cf. for instance [16], which finds the new parameter<br />
set to which the error between the reduced and<br />
- 364 - 15th IGTE Symposium 2012<br />
true solution is the largest. For this we need a problem<br />
specific error estimator; an error estimator for a linear<br />
by finite volumes discretized problem can be found in<br />
[10]. The inner loop reduces the trajectory u N (⋅,tn; μ ∗ )<br />
in time with the POD (proper orthogonal decomposition)<br />
algorithm. This algorithm returns for each snapshot matrix<br />
u N (μ) ℓ ∈ N eigen-/basis vectors. One can state the<br />
number <strong>of</strong> basis vectors or the projection error <strong>of</strong> the<br />
POD method. For details to the POD algorithm cf. for<br />
instance [11], [18].<br />
Usually we take an error estimator to estimate the error<br />
between the true and reduced solution to each parameter<br />
<strong>of</strong> the discretized parameter set. Until now we have no<br />
error estimator for the present problem, so we compare<br />
the true solution to a parameter set <strong>of</strong> the training set,<br />
to the reduced solution computed by the so far reduced<br />
basis vectors. The function u (i)<br />
RB (μj) is the reduced<br />
solution constructed by i basis functions evaluated at the<br />
parameter set μj ∈Dtrain ⊂D.<br />
Algorithm 1 POD-Greedy algorithm, c.f [10]<br />
Require: ● Limit the parameter range, discretize the<br />
parameter set Dtrain ={μ1,...,μNP }<br />
● Ξtrain = {u N (μ1) ,...,u N (μNP )}, uN (μi) ∈<br />
R Nx×Nt , ∀i ∈{1,...,NP }<br />
● Choose a tolerance for the Greedy: TOLGreedy<br />
● Choose the exactness for the POD basis per ∈<br />
[0, 1], it is just a measurement for the projection<br />
error (or directly choose the number <strong>of</strong> POD<br />
basis elements in each “Greedy step” ℓ).<br />
Ensure:<br />
1: Initializing:<br />
● Choose μ (1) ∈Dtrain → ˜ ξ (1) ∈ Ξtrain<br />
ξ (1) = POD( ˜ ξ (1) ) ∈ R Nx×ℓ (1), with the POD<br />
tolerance per<br />
● Set: i = 2, ɛ = 1<br />
2: while i ≤ NP and ɛGreedy > TOL do<br />
3: [μ (i) ,ɛ]=max j∈{1,...,NP } ∣u (i−1)<br />
RB (μj)−u N (μj)∣<br />
4: ˜ ξ (i) = POD(u N (μ (i) )) NT<br />
n=0 ∈ RNx×ℓ (i), where<br />
u N (μ (i) ) ∈ Ξtrain<br />
5: (ξ (1) ,...,ξ (i) )<br />
= Gram-Schmidt (ξ (1) ,...,ξ (i−1) , ˜ ξ (i) )<br />
6: end while<br />
An essential property <strong>of</strong> the reduced basis method is<br />
that you can decompose the computation into an <strong>of</strong>fline<br />
and online phase.<br />
Offline: After determination <strong>of</strong> the set <strong>of</strong> parameter<br />
sets and the accuracy <strong>of</strong> the reduced solutions,<br />
we start the Greedy algorithm to compute the<br />
basis vectors, the true solutions to the chosen<br />
parameters respectively. The <strong>of</strong>fline phase is<br />
computationally expensive and so the reduced<br />
basis method is only worth if you want to solve<br />
the equation system many times.<br />
Online: In the present case we have to compute the
coefficient vector for the basis to the different<br />
parameter sets. We get it by the damped Newton’s<br />
method. This phase is cheap.<br />
The big advantage <strong>of</strong> the reduced solution is that<br />
you know how good your reduced solution approximates<br />
the true solution. A big disadvantage is, that if you<br />
change your parameter ranges you usually have to do<br />
the expensive <strong>of</strong>fline computation again.<br />
RBM applied on the battery model<br />
In the following we explain how to apply the reduced<br />
basis method on our discretized problem (3.1).<br />
We denote by F N 1 (C, Φ) ∶R N × R N → R N equation<br />
(2.1a) with boundary and interface conditions discretized<br />
by the finite volume method, analogously F N 2 (C, Φ) ∶<br />
R N × R N → R N stands for the finite volume discretized<br />
equation (2.1b) with the boundary and interface conditions.<br />
We choose a parameter training set for C and Φ 1 :<br />
Ξ C<br />
train = (c N (μ1) ,...,c N (μNP ))<br />
Ξ Φ<br />
train = (φ N (μ1) ,...,φ N (μNP ))<br />
Let us assume that we have a basis matrix for C<br />
) ∈ RND×NBc and for Φ ΨΦ =<br />
ΨC = (ψc 1,...,ψc NBC (ψ φ<br />
1 ,...,ψφ )∈R NBφ ND×NBφ then the reduced models,<br />
reduced functions respectively, are given by<br />
(Ψ c ) T ⋅ W ⋅(F N<br />
1 (Ψ c Ccoeff , Ψ φ Φcoeff )) ! = 0 (3.4a)<br />
(Ψ φ ) T<br />
⋅ W ⋅(F N<br />
2 (Ψ c Ccoeff , Ψ φ Φcoeff )) ! = 0 (3.4b)<br />
We should add that the snapshots for Ψ c and Ψ φ are<br />
taken at the same parameter sets (outer POD-Greedy),<br />
but the number <strong>of</strong> the POD basis elements could differ<br />
to achieve the same accuracy (inner POD-Greedy).<br />
A further issue is how to choose the next parameter<br />
in the (POD-)Greedy algorithm. The error estimation we<br />
have to do for two functions c and φ. So we usually<br />
get two different parameter values μc and μφ, where<br />
the L 2 -errors <strong>of</strong> the concentration and <strong>of</strong> the potential,<br />
respectively, attain their maximum values. Then, we<br />
choose the parameter μ ∈{μc,μφ} corresponding to the<br />
greater L 2 -error <strong>of</strong> both.<br />
IV. NUMERICAL EXPERIMENTS<br />
In this section we apply the reduced basis method to<br />
the discretized equations describing the transport processes<br />
in a lithium-ion battery (3.1). The step size in<br />
spatial is Δx = 1μm, the time step size Δt = 5s and we<br />
compute Nt = 10 time steps. The Newton tolerance to<br />
compute the true solution is set 10−6 relatively and 10−9 absolutely. The discretization error <strong>of</strong> the finite volume<br />
solution is ɛFVM =O( 1<br />
100 ).<br />
1 One can also choose different training set, e.g. cf. [7]<br />
- 365 - 15th IGTE Symposium 2012<br />
pos. electrode electrolyte neg. electrode<br />
De, [ cm2<br />
s ] 1.0 ⋅ 10−9 7.5 ⋅ 10−7 3.9 ⋅ 10−10 κ, [ A<br />
]<br />
V ⋅cm<br />
c<br />
0.038 0.002 1.0<br />
0 , [ mol<br />
cm3 ]<br />
cmax, [<br />
0.020574 0.001 0.002639<br />
mol<br />
cm3 ]<br />
U0, [V ]<br />
t+, [−]<br />
k, [<br />
0.02286<br />
0.001<br />
0 0.2<br />
0.02639<br />
0<br />
0<br />
A<br />
cm2 ] 1.3716 ⋅ 10−4 5.2780 ⋅ 10−7 N⋅, [−]<br />
A⋅, [cm<br />
10 30 10<br />
2 ] (50 ⋅ 10−4 ) 2<br />
(50 ⋅ 10−4 ) 2<br />
TABLE 4.1<br />
BATTERY PARAMETERS, [17]<br />
The Newton tolerance for the reduced solution is 10 −5 .<br />
So the L 2 -error between the finite volume and reduced<br />
solution is at best less than<br />
ɛ L 2 = 10 −5 ⋅ Nx ⋅ Nt = 0.005 =∶ ɛGreedy<br />
The tolerance <strong>of</strong> the POD method is set 99%, cf. for the<br />
POD method for instance [18].<br />
Our “standard” battery parameter set is listed in table<br />
4.1, notations can be found in Table 1.2 in the appendix.<br />
We charge the battery with 1.5913 ⋅ 10 −8 A which corresponds<br />
to 1C-rate and set the temperature T = 300K.<br />
Test 1<br />
In this subsection we variate the open circuit voltage<br />
in the positive electrode: μ = Uc ∈[0.001, 4.501]. We<br />
discretize this parameter set with the equidistant step<br />
width ΔUc = 0.1V .Sowehavea46-dimensional training<br />
set for the parameter. All the other parameters are fixed,<br />
cf. Table 4.1.<br />
In Figure 4.1 and 4.2 the finite volumes solutions<br />
chosen by the first two iterations <strong>of</strong> algorithm 1 are<br />
presented. The associated parameters are Uc = 3.001V<br />
and Uc = 4.501V . The concentration seems to be less sensitive<br />
to the circuit voltage than the electrical potential.<br />
The L2-error for the concentration becomes worse after<br />
the second Greedy step, but already after the first Greedy<br />
step the error is smaller than the L2-error tolerance<br />
ɛL2 and stays smaller; the L2-error for the electrical<br />
potential gets denotative smaller with the information <strong>of</strong><br />
a second true solution, cf. Figure 4.3 and 4.4. The same<br />
observation can be done for the L∞-error which is not<br />
presented here. The basis functions are shown in Figure<br />
4.5: they have the same “structure” as the finite volume<br />
solutions for one fixed time step and there is just one<br />
basis function for each Greedy step.<br />
The speed up <strong>of</strong> the reduced solution in comparison<br />
to the true solution is 17.54.<br />
Test 2<br />
In this section we variate a few parameters:<br />
μ ={Dec,Dee,Dea,t+,kc,ka}<br />
The subscript c denotes the parameter in the positive<br />
electrode, e in the electrolyte and a in the
c [mol/cm 3 ]<br />
0.025<br />
0.02<br />
0.015<br />
0.01<br />
0.005<br />
0<br />
20 40 60<br />
x [μ m]<br />
0<br />
t [s]<br />
50<br />
U [V]<br />
0<br />
−1<br />
−2<br />
−3<br />
−4<br />
0 20 40 60<br />
x [μ m]<br />
Fig. 4.1. Test 1: Finite volume solution fort the concentration (left)<br />
and the potential (right) for Uc = 3.001V<br />
c [mol/cm 3 ]<br />
0.025<br />
0.02<br />
0.015<br />
0.01<br />
0.005<br />
0<br />
20 40 60<br />
x [μ m]<br />
0<br />
t [s]<br />
50<br />
U [V]<br />
0<br />
−2<br />
−4<br />
−6<br />
0 20 40 60<br />
x [μ m]<br />
Fig. 4.2. Test 1: Finite volume solution fort the concentration (left)<br />
and the potential (right) for Uc = 4.501V<br />
x 10−5<br />
6.5<br />
6<br />
5.5<br />
5<br />
4.5<br />
1 2 3 4<br />
U [V]<br />
0c<br />
0.08<br />
0.06<br />
0.04<br />
0.02<br />
0<br />
0<br />
0<br />
t [s]<br />
t [s]<br />
1 2 3 4<br />
U [V]<br />
0c<br />
Fig. 4.3. Test 1: L 2 -error after the first Greedy step for the<br />
concentration (left) and the potential (right).<br />
x 10−5<br />
7.019<br />
7.019<br />
7.019<br />
7.019<br />
7.019<br />
7.019<br />
1 2 3 4<br />
U [V]<br />
0c<br />
x 10−3<br />
3.6949<br />
3.6949<br />
3.6949<br />
3.6949<br />
3.6949<br />
1 2 3 4<br />
U [V]<br />
0c<br />
Fig. 4.4. Test 1: L 2 -error after the second Greedy step for the<br />
concentration (left) and the potential (right).<br />
40<br />
20<br />
0<br />
−20<br />
−40<br />
−60<br />
Basis functions for the concentration<br />
−80<br />
0 10 20 30 40 50<br />
Spatial<br />
Greedy step 1<br />
Greedy step 2<br />
30<br />
20<br />
10<br />
0<br />
−10<br />
Basis functions for the potential<br />
−20<br />
0 10 20 30 40 50<br />
Spatial<br />
50<br />
50<br />
Greedy step 1<br />
Greedy step 2<br />
Fig. 4.5. Test 1: Basis functions for the first Greedy step for the<br />
concentration (left) and the potential (right).<br />
negative electrode. We choose the following parameter<br />
set range: for the diffusion coefficients Dec ∈<br />
[1.0 ⋅ 10 −9 , 1.1 ⋅ 10 −9 ], Dee ∈ [7.5 ⋅ 10 −7 , 7.6 ⋅ 10 −7 ],<br />
- 366 - 15th IGTE Symposium 2012<br />
Dea ∈ [3.9 ⋅ 10 −10 , 4.0 ⋅ 10 −10 ], the transference number<br />
t+ ∈ [0.2, 0.3] and the reaction rates kc,ka ∈<br />
[0.02, 0.022]. We discretize the parameter set in the<br />
following way: for the diffusion coefficients we choose<br />
the boundary values, for the other parameters we also<br />
take the boundary values and a value in between. With<br />
this discretization we get a 216-dimensional trainings<br />
set. All the other parameters are fixed like in table 4.1<br />
noted, but in contrast to the previous subsection the POD<br />
tolerance is set 1 − 1 ⋅ 10 −8 %.<br />
The graphical results are listed in the Figures 4.6 and<br />
4.7: In Figure 4.6 the finite volume solutions for the first<br />
parameter set can be seen. The first parameter set is the<br />
one denoted in table 4.1. The L 2 -error <strong>of</strong> the reduced<br />
solutions in comparison to the finite volume solution<br />
is for the concentration smaller than 10 −5 and for the<br />
electrical potential 10 −4 to all 216 parameter sets. The<br />
L ∞ -error is smaller than 10 −3 for the concentration as<br />
well as for the potential after one Greedy step. The basis<br />
functions are plotted in Figure 4.7: there are four basis<br />
functions for the concentration and three for the potential.<br />
c [mol/cm 3 ]<br />
0.025<br />
0.02<br />
0.015<br />
0.01<br />
0.005<br />
0<br />
20 40 60<br />
x [μ m]<br />
0<br />
t [s]<br />
50<br />
−1<br />
−2<br />
−3<br />
−4<br />
x 10<br />
0<br />
−3<br />
−5<br />
0 20 40 60<br />
x [μ m]<br />
Fig. 4.6. Test 2: Finite volume solution fort the concentration (left)<br />
and the potential (right) for first parameter set<br />
80<br />
60<br />
40<br />
20<br />
0<br />
−20<br />
−40<br />
Basis functions for the concentration<br />
−60<br />
0 10 20 30 40 50<br />
Spatial<br />
U [V]<br />
30<br />
20<br />
10<br />
0<br />
−10<br />
−20<br />
−30<br />
0<br />
t [s]<br />
Basis functions for the potential<br />
−40<br />
0 10 20 30 40 50<br />
Spatial<br />
Fig. 4.7. Test 2: Basis functions after the first Greedy step for the<br />
concentration (left) and the potential (right).<br />
The L 2 -error between the reduced solution and the true<br />
solution is after one Greedy step smaller than the L 2 -error<br />
tolerance ɛ L 2 to all 216 parameter set. That means that<br />
just the information <strong>of</strong> one computational expensive true<br />
solution is needed to compute the reduced solutions to<br />
all 216 parameter sets with an acceptable error.<br />
The speed up <strong>of</strong> the reduced solution in comparison<br />
to the true solution is 12.37.<br />
V. DISCUSSION<br />
In the present document we do the same approach like<br />
in [10].<br />
The reduced basis approach works for the transport<br />
equation in a lithium-ion battery: in the above numerical<br />
50
Notation<br />
c, [ mol<br />
cm3 ] concentration <strong>of</strong> the lithium/lithium-ions<br />
φ, [V ] electrical potential<br />
α, [ cm2<br />
] coefficient function<br />
s<br />
β, [ mol<br />
] coefficient function<br />
V ⋅cm⋅s<br />
λ, [ A⋅cm2<br />
] coefficient function<br />
mol<br />
I, [A] current<br />
De, [ cm2<br />
]<br />
s<br />
κ, [<br />
diffusion coefficient<br />
A<br />
]<br />
V ⋅cm<br />
c<br />
electric/ionic conductivity<br />
0 , [ mol<br />
cm3 ] start concentration <strong>of</strong> lithium in the<br />
electrodes/electrolyte<br />
cmax, [ mol<br />
cm3 ]<br />
U0, [V ]<br />
t+, [−]<br />
k, [<br />
maximum <strong>of</strong> lithium the electrode can store<br />
open circuit potential<br />
transference number<br />
A<br />
cm2 ]<br />
N⋅, [−]<br />
A⋅, [cm<br />
reaction rates<br />
number <strong>of</strong> control volumes<br />
2 ] cross section<br />
TABLE 1.2<br />
NOTATIONS OF THE BATTERY PARAMETERS<br />
tests there are at most two Greedy steps needed to reach<br />
the L 2 -error tolerance ɛ L 2. If the knowledge <strong>of</strong> all finite<br />
volume solutions to the training set is needed, the method<br />
would not be sufficient. In the above numerical tests<br />
we see that the limiting factor is the potential: for the<br />
concentration one Greedy step is sufficient but for the<br />
potential we need in some cases an additional Greedy<br />
step.<br />
The application <strong>of</strong> the reduced basis method to this<br />
problem is not completed yet: We have to develop an<br />
a posteriori error estimator so that we do not have to<br />
compute all true solutions to the discretized parameter<br />
set. Further the computational time <strong>of</strong> the reduced<br />
solutions in comparison to the finite volume solutions<br />
for the presented numerical tests are fast but not so<br />
fast as it could be. In every Newton step we have to<br />
evaluate the nonlinearities completely. Also we have no<br />
affine parameter dependence. If you have a linear(ized)<br />
affine parameter dependent problem you can separate the<br />
parameter dependence from the bilinear form and from<br />
the linear form. To get an affine parameter dependent<br />
problem as well as a linearized problem we have to<br />
apply the (discrete) empirical interpolation method, cf.<br />
for instance [2], [3].<br />
VI. ACKNOWLEDGMENTS<br />
The authors gratefully acknowledge support by the<br />
Adam Opel AG. Besides Competence Center The Virtual<br />
Vehicle (<strong>Graz</strong>) supported the lecture by Mr. Volkwein<br />
within the scope <strong>of</strong> the IGTE Symposium.<br />
APPENDIX<br />
NOTATION<br />
In Table 1.2 one can find some notations for the battery<br />
parameters.<br />
- 367 - 15th IGTE Symposium 2012<br />
REFERENCES<br />
[1] P.W. Atkins, “Physikalische Chemie”, Wiley-VCH, 2., vollst.<br />
neubearb. A., 1996.<br />
[2] M. Barrault and N.C. Nguyen and Y. Maday and A.T. Patera,<br />
“An “Empirical Interpolation” Method: Application to Efficient<br />
Reduced-Basis Discretization <strong>of</strong> Partial Differential Equations”, C.<br />
R. Acad. Sci. Paris, Série I., pp. 667–672, 339, 2004.<br />
[3] S. Chaturantabut and D.C. Sorensen, “Nonlinear Model Reduction<br />
via Discrete Empirical Interpolation”, SIAM J. Sci. Comput., 32(5),<br />
pp. 2737–2764, 2010.<br />
[4] M. Doyle and T.F. Fuller and J. Newman, “Modeling <strong>of</strong> Galvanostatic<br />
Charge and Discharge <strong>of</strong> the Lithium/Polymer/Insertion Cell”,<br />
Journal <strong>of</strong> The Electrochemical Society, 140(6), pp. 1526–1533,<br />
1993.<br />
[5] C. M Doyle, “Design and Simulation <strong>of</strong> Lithium Rechargeable<br />
Batteries”, Ph.D. thesis, 1995.<br />
[6] W. Dreyer and M. Gaberscek and C. Guhlke and R. Huth and<br />
J. Jamnik, “Phase Transition and Hysteresis in a Recharchable<br />
Lithium Battery Revisited”, European J. Appl. Math., 22, pp. 267–<br />
290, 2011.<br />
[7] A.-L. Gerner and K. Veroy, “Certified reduced basis method for<br />
parameterized saddle point problems”, SIAM J. Sci. Comput.,<br />
(accepted Jul 2012).<br />
[8] M.A. Grepl and A.T. Patera, ”A Posteriori Error Bounds for<br />
Reduced-Basis Approximations <strong>of</strong> Parametrized Parabolic Partial<br />
Differential Equations”, Mathematical Modelling and Numerical<br />
Analysis, 2005, 39(1), pp. 157-181.<br />
[9] M.A. Grepl, Y. Maday, N.C. Nguyen, and A.T. Patera, ”Efficient<br />
Reduced-Basis Treatment <strong>of</strong> Nonaffine and Nonlinear Partial<br />
Differential Equations”, Mathematical Modelling and Numerical<br />
Analysis, 2007, 41(3), pp. 575-605.<br />
[10] B. Haasdonk and M. Ohlberger, “Reduced Basis Method for<br />
Finite Volume Approximations <strong>of</strong> Parametrized Linear Evolution<br />
Equations”, Math. Model. Numer. Anal., 42(2), 2008, pp. 277-302.<br />
[11] P. Holmes and J.L. Lumley and G. Berkooz and C. Rowley,<br />
“Turbulence, Coherent Structures, Dynamical Systems and Symmetry”,<br />
Cambridge Monographs on Mechanics, 2012, Cambridge<br />
<strong>University</strong> Press.<br />
[12] O. Lass and S. Volkwein, “POD Galerkin schemes for nonlinear<br />
elliptic-parabolic systems”, submitted, 2011.<br />
[13] A. Latz and J. Zausch and O. Iliev, “Modeling <strong>of</strong> Species and<br />
Charge Transport in Li–Ion Batteries Based on Non-Equilibrium<br />
Thermodynamics”, Lecture Notes in Computer Science 6046, 329–<br />
337, 2011.<br />
[14] A. Latz and J. Zausch, “Thermodynamic Consistent Transport<br />
Theory <strong>of</strong> Li-Ion Batteries”, Journal <strong>of</strong> Power Sources 196, 3296-<br />
3302, 2011.<br />
[15] J. S. Newman and K. E. Thomas-Alyea, “Electrochemical Systems”,<br />
Wiley John + Sons, 3rd ed., 2004.<br />
[16] A. T Patera and G. Rozza, Reduced Basis approximation and<br />
A Posteriori Error Estimation for Parametrized Partial Differential<br />
Equations, MIT, 2007.<br />
[17] P. Popov, Y. Vutov, S. Margenov and O. Iliev, ”Finite Volume<br />
Discretization <strong>of</strong> Equations Describing Nonlinear Diffusion in Li-<br />
Ion Batteries,” Fraunh<strong>of</strong>er ITWM report 191, 2010.<br />
[18] S. Volkwein, “Model Reduction Using Proper Orthogonal Decomposition”,<br />
lecture notes, Konstanz, 2011.<br />
[19] J. Wu and J. Xu and H. Zou, “On the Well-posedness <strong>of</strong> a<br />
Mathematical Model for Lithium-Ion Battery Systems, Methods<br />
and Applications <strong>of</strong> Analysis, 13(3), pp. 275–298, 2006.
- 368 - 15th IGTE Symposium 2012<br />
Surrogate Parameter Optimization based on<br />
Space Mapping for Lithium-Ion Cell Models<br />
Matthias K. Scharrer∗ , Bettina Suhr∗ , and Daniel Watzenig∗† ∗Kompetenzzentrum – Das Virtuelle Fahrzeug Forschungsgesellschaft mbH (ViF),<br />
Inffeldgasse 21/A/I, A-8010 <strong>Graz</strong>, Austria<br />
† Institute <strong>of</strong> Electrical Measurement and Measurement Signal Processing,<br />
Kopernikusgasse 24/4, A-8010 <strong>Graz</strong>, Austria<br />
E-mail: matthias.scharrer@v2c2.at<br />
Abstract—Optimizing batteries <strong>of</strong> electric cars is a complex and time consuming task. In order to reduce the number <strong>of</strong><br />
prototypes, development costs and time, reliable numerical models are highly required. But optimizing models reflecting the<br />
fundamental electrochemical processes is typically computationally expensive. In this paper we present a surrogate model<br />
optimization approach based on space mapping to reduce computation time. This technique is applied to the parameter<br />
estimation problem <strong>of</strong> an electrochemical cell model by linking a coarse linearized model to the accurate model. We<br />
present results <strong>of</strong> two synthetical fitting problems solved directly and by our surrogate optimization method to validate the<br />
approach. As a remarkable result 15% reduction <strong>of</strong> computation time for the one dimensional case and 25% for the two<br />
dimensional case are obtained. We discuss a simple measure that doubles the achieved reduction to 48% for the latter.<br />
The method can easily be adopted to speed up other gradient–based optimization problems. Since the used electrochemical<br />
model shows strong non–linear behaviour, the achieved speed up indicates even better performance in the case <strong>of</strong> relaxed<br />
conditions.<br />
Index Terms—multi physics, space mapping, surrogate optimization<br />
I. INTRODUCTION<br />
In terms <strong>of</strong> pollutant emissions during vehicle operation,<br />
battery–powered and hybrid vehicles are clearly<br />
more environmentally friendly than those purely based<br />
on combustion engines. In order to reduce the number <strong>of</strong><br />
expensive prototypes the fast and reliable simulations <strong>of</strong><br />
the electrical and chemical behavior <strong>of</strong> cells are becoming<br />
increasingly important. Also a more efficient operation <strong>of</strong><br />
a battery can be achieved, when simulation models are<br />
used to estimate the battery’s internal states – e.g. state–<br />
<strong>of</strong>–charge (SOC), state–<strong>of</strong>–function and state–<strong>of</strong>–health –<br />
from measurement data.<br />
Many internal variables and material properties are<br />
difficult to access or not measurable. Several approaches<br />
exist to get insight into the internal dynamic processes in<br />
a lithium–ion cell. The field was pioneered by Newman<br />
and co–workers [1], [2]. Overviews can be found in [3]<br />
and [4]. The authors focus on modeling the cell in terms<br />
<strong>of</strong> transport equations for lithium ions, chemical interaction<br />
and electronic field computations in active particles<br />
<strong>of</strong> anode and cathode. These are coupled by modeling<br />
electrode kinetics occuring on the particle surfaces <strong>of</strong><br />
such electrodes.<br />
Recently, much effort is put into estimating parameters<br />
for cell models to gain better knowledge about effects<br />
occuring during life time. In this work we focus on non–<br />
invasive methods only, i.e. methods that estimate parameters<br />
by matching predicted cell model voltages for a given<br />
current pr<strong>of</strong>ile to experimental measurements without<br />
the need to destructively open the cell. For example,<br />
Santhanagopalan and White [5] devised an online SOC–<br />
estimation method by applying an extended Kalman<br />
filter to a simplified ordinary differential equation model.<br />
In [6], [7] a gradient based method to parameter estimation<br />
is introduced – based on Levenberg–Marquardt<br />
optimization. Here Santhanagopalan et al. investigate<br />
both single and multi–particle systems and successfully<br />
identify five parameters for constant current charge and<br />
discharge cycles. Schmidt et al. estimate parameters<br />
<strong>of</strong> a single–particle model in a combined approach by<br />
performing Fisher–information based parameter analysis<br />
and applying a pattern search algorithm consecutively [8].<br />
In contrast to the usage <strong>of</strong> single–particle models (SPM)<br />
and deterministic estimation algorithms above, Forman et<br />
al. [9] focus on fitting the Doyle–Fuller–Newman (DFN)<br />
model [1] to battery cycling data by application <strong>of</strong> genetic<br />
algorithm.<br />
The above literature provides an overview <strong>of</strong> a range<br />
<strong>of</strong> different methods and models to estimate the intrinsic<br />
material properties and unknown states. In contrast to<br />
using a SPM, we try to find a mechanism that allows<br />
us to use the higher detailed DFN model, as suggested<br />
in [9]. But, as opposed to the latter, we try to further<br />
improve the speed <strong>of</strong> parameter estimation up to online<br />
application if possible.<br />
In this paper the application <strong>of</strong> the so called space<br />
mapping method – first mentioned in [10] – to the<br />
parameter estimation <strong>of</strong> the cell model is investigated.<br />
Thus, it is possible to apply a fast gradient based Gauss–<br />
Newton optimization method to the complex DFN model<br />
and use a simplified model as a surrogate to both, speed<br />
up direct model evaluation and gradient computation at<br />
once.
The remainder <strong>of</strong> this paper is structured as follows:<br />
Section II defines the simulation framework <strong>of</strong> the cell<br />
model and briefly summarizes the solution procedure. A<br />
mechanistic model describing the electrochemistry <strong>of</strong> a<br />
lithium–ion cell motivated by the DFN model [1] has<br />
been implemented as a system <strong>of</strong> coupled non–linear<br />
partial differential equations (PDEs) in one dimension<br />
[11]. In Section III the optimization problem and solution<br />
algorithm to estimate the parameters is defined.<br />
Section IV presents a framework how to replace many <strong>of</strong><br />
the time consuming evaluations <strong>of</strong> the complex forward<br />
model by a surrogate model – a linearization <strong>of</strong> the<br />
complex model in this case – and how a link between<br />
the two models is established. In Section V we present<br />
and discuss the results. Finally, Section VI summarizes<br />
and concludes the paper.<br />
II. FORMULATION OF THE PROBLEM<br />
In order to mathematically describe the internal dynamic<br />
processes in a lithium–ion cell, a mechanistic<br />
electrochemical model has been realized as a system<br />
<strong>of</strong> coupled non–linear partial differential equations in<br />
one dimension [11]. A lithium–ion cell with two porous<br />
intercalation electrodes (cathode in Ωc and anode in<br />
Ωa) and an electronically isolating separator in Ωs in<br />
between is considered. For homogenization purpose each<br />
electrode is assumed to consist <strong>of</strong> two phases. We assume<br />
spherical particles in both cathode (in Λc) and anode<br />
(in Λa), which line up continously in x direction. The<br />
liquid phase modeled in each electrode is electrolyte.<br />
In the separator Ωs we only consider electrolyte, as the<br />
solid phase in the separator does not participate in the<br />
reactions. In Figure 1 a schematic view <strong>of</strong> the modeled<br />
domain is given.<br />
Ri<br />
, a<br />
a<br />
a<br />
Ro,<br />
a<br />
a<br />
r r<br />
<br />
c <br />
s<br />
a,<br />
s c,<br />
s<br />
c<br />
<br />
<br />
<br />
Ro, c<br />
Fig. 1. Problem Domain: The spatial domains are defined as Ω=<br />
Ωa ∪ Ωs ∪ Ωc ⊂ R, Ω ′ =Ωa ∪ Ωc, Λa =Ωa × [0,Ra] ⊂ R 2 ,<br />
Λc =Ωc × [0,Rc] ⊂ R 2 , Λ=Λa ∪ Λc and Ra,Rc ∈ R.<br />
The implemented model is similar to the widely<br />
used DFN approach [1], extended to include additional<br />
aspects, e.g. from [2] and [12]. The full model will<br />
be described in [13]. It is a coupled system <strong>of</strong> non–<br />
linear partial differential equations in one dimension.<br />
The variables are potentials and concentrations for the<br />
electrolyte (ϕℓ,cℓ), for the cathode (ϕsc,csc) and for<br />
the anode (ϕsa,csa). The one–dimensional cell model<br />
considered is defined by the system (1).<br />
c<br />
Ri , c<br />
x<br />
- 369 - 15th IGTE Symposium 2012<br />
TABLE I<br />
LIST OF SYMBOLS<br />
Ai<br />
inner surface (m 2 m −3 )<br />
Dℓ<br />
diffusivity in electrolyte (m 2 s −1 )<br />
Ds diffusivity in solid (m 2 s −1 )<br />
Cdl double layer capacity (Fm −1 )<br />
F Faraday’s constant (= 96485C mol −1 )<br />
R universal gas constant (= 8.31447 Jmol −1 K −1 )<br />
T temperature (K)<br />
UOCV(cs) equilibrium potential function (V )<br />
cℓ<br />
Li + –concentration in electrolyte (mol m −3 )<br />
cℓ,0 initial Li + –concentration in electrolyte (mol m −3 )<br />
cs<br />
Li + –concentration in active particles (mol m −3 )<br />
cs,0 initial Li + –concentration in particles (mol m −3 )<br />
i(t) cell current density (Am −2 )<br />
j ∗<br />
BV Butler–Volmer current density (Am −2 k<br />
)<br />
exchange current density and reaction rate (mol m −2 s −1 )<br />
t +<br />
ℓ<br />
z<br />
transference number <strong>of</strong> cations (1)<br />
number <strong>of</strong> transferred electrons per unit (Li + : z =1)(1)<br />
αA,αK anodic/cathodic charge transfer coefficients (1)<br />
εℓ<br />
electrolyte volume fraction (1)<br />
κ (cℓ) ionic conductivity function (Sm −1 )<br />
μℓ<br />
migration coefficient<br />
ϕs<br />
electrochemical potential <strong>of</strong> the active material (V )<br />
ϕℓ<br />
electrochemical potential <strong>of</strong> the electrolyte (V )<br />
σs<br />
electronic conductivity (Sm −1 )<br />
−∇ · (σs∇ϕs) =−Aij ∗<br />
BV in Ω ′<br />
<br />
−∇ · κℓ(cℓ)∇ϕℓ + RT<br />
<br />
+ 1<br />
κℓ(cℓ)t ℓ ∇cℓ = Aij<br />
zF cℓ<br />
∗<br />
BV in Ω<br />
∂ (ɛℓcℓ)<br />
∂t<br />
−∇·<br />
<br />
Dℓ ∇cℓ + zF<br />
RT μℓcℓ∇ϕℓ<br />
<br />
= Ai<br />
zF j∗ BV in Ω<br />
∂cs 1<br />
−<br />
∂t r2 <br />
∂<br />
Dsr<br />
∂r<br />
2 <br />
∂cs<br />
=0 in Λ<br />
∂r<br />
j ∗<br />
BV =<br />
⎧ <br />
αAzF(ϕs−ϕℓ −UOCV(cs))<br />
<br />
zFk exp<br />
RT<br />
+<br />
⎪⎨<br />
<br />
−(1−αK)zF(ϕs−ϕℓ −UOCV(cs))<br />
<br />
−zFk exp<br />
RT<br />
+<br />
∂(ϕs−ϕℓ) ⎪⎩<br />
+Cdl ∂t<br />
in Ω ′<br />
(1)<br />
0 else<br />
where the system variables are defined as ·(t, x) at time<br />
t ∈ [0,T], T ∈ R and at space point x and (x, r),<br />
respectively. A comprehensive overview <strong>of</strong> symbols is<br />
given in Table I.<br />
Homogenous Neumann conditions are applied at the<br />
boundaries except for the outer boundaries <strong>of</strong> potentials<br />
and concentrations in solid phase:<br />
ϕs =0 on Γa × [0,T]<br />
−σs∇ϕs = −i (t) on Γc<br />
∂cs 1<br />
−Ds =<br />
∂r zF j∗<br />
BV on ΓRo,a ∪ ΓRo,c<br />
The concentrations are restricted by the following<br />
initial conditions:<br />
cℓ = cℓ,0<br />
cs = cs,0<br />
in Ω<br />
in Λ<br />
The potentials are consistently initialized at rest by<br />
the condition j (x, 0) = 0. The solution <strong>of</strong> this system <strong>of</strong><br />
four non–linearly coupled partial differential equations<br />
is done by application <strong>of</strong> the Finite Element Method<br />
with linear test functions for spatial discretization and<br />
Backwards Euler Method for time integration. The non–<br />
linearity is solved by a damped Newton method – see<br />
[11] for details.<br />
(2)<br />
(3)
III. PARAMETER ESTIMATION<br />
The system described in the previous section contains<br />
many parameters which cannot be measured directly. To<br />
formulate the parameter estimation problem in a general<br />
way, we merge the parameter set <strong>of</strong> interest into the<br />
parameter vector μ ∈ Pad ⊂ R m , where Pad is defined<br />
as the admissible parameter set. The basis optimization<br />
problem is introduced as<br />
μ ∗ =argminH (f (μ)) , (4)<br />
μ∈Pad where an optimal set <strong>of</strong> parameters μ ∗ ∈ Pad is sought,<br />
which minimizes a merit function H <strong>of</strong> a model response<br />
f (μ) depending on the parameters μ.<br />
Since we focus on parameter estimation based on cell<br />
voltages, we set H to compute the difference with respect<br />
to a predescribed function ˆy. We rewrite (4) to<br />
μ ∗ =argminwi (y(ti; μ) − ˆy(ti))<br />
μ∈Pad 2<br />
2<br />
where we want to minimize the difference between measured<br />
cell voltages, ˆy, and computed voltages, f (μ) =<br />
y (·; μ) =ϕs| Γc − ϕs| Γa , at predefined times, ti. Variations<br />
in time step sizes are taken into account by the<br />
weights wi.<br />
Classical optimization using this objective function<br />
yields unacceptable response times, since not only the<br />
solution <strong>of</strong> the system defined in Section II has to be<br />
computed, but additionally the derivative <strong>of</strong> the objective<br />
function with respect to every parameter in our set <strong>of</strong><br />
interest μ has to be estimated. Since this might be<br />
intractable for non–linear PDE constraint problems, we<br />
revert to numerical gradient estimation by finite differences.<br />
As execution time <strong>of</strong> a single simulation on current<br />
hardware varies between seconds and hours – depending<br />
on the prescribed input pr<strong>of</strong>ile – direct evaluation <strong>of</strong> (5)<br />
is not acceptable due to its enormous computation times.<br />
IV. PROPOSED FRAMEWORK<br />
To speed up parameter estimation, we introduce space<br />
mapping – first mentioned in [10] – to the cell model optimization.<br />
The idea behind is best described as follows:<br />
We have a very complex and accurate – fine – model<br />
that describes a process on basis <strong>of</strong> a couple <strong>of</strong> parameters.<br />
We search for an optimal set <strong>of</strong> parameters with<br />
respect to some cost function by repeatingly evaluating<br />
our model and computing model responses for intermediate<br />
sets <strong>of</strong> parameters. Since a single evaluation <strong>of</strong> the<br />
model is expensive, we replace the responses by results <strong>of</strong><br />
a much cheaper and less accurate – coarse – model (also<br />
known as surrogate model) describing the same process<br />
by using a similar parameter set. Thus we only get a<br />
vague idea <strong>of</strong> where the optimal parameters are with<br />
respect to the fine model. Finally, we link the results<br />
by evaluating the fine model and establish a mapping<br />
between the individual parameter spaces <strong>of</strong> both, the fine<br />
and the coarse model. Since this results in fewer calls <strong>of</strong><br />
the fine model, the optimization time can be reduced.<br />
- 370 - 15th IGTE Symposium 2012<br />
(5)<br />
In our case this means to substitute evaluations <strong>of</strong> the<br />
fine model u = F(μ), where u is the tuple representing<br />
the solutions to (1) and F(·) is the solution operator,<br />
by evaluations <strong>of</strong> a coarse model v = C(λ), where λ ∈<br />
Lad ⊂ R m is the parameter set <strong>of</strong> interest <strong>of</strong> the coarse<br />
model, v is the tuple representing the solutions <strong>of</strong> the<br />
coarse model and C(·) is the solution operator.<br />
A mapping function p : Pad → Lad enables us<br />
to establish a link between the two models such that<br />
the response <strong>of</strong> the coarse model c(p(μ)) is a good<br />
approximation for f(μ). Of course, the coarse model<br />
response c (·) has to be defined the same way as the<br />
fine model response f (·). Since directly evaluating the<br />
mapping function p is not possible, we introduce a new<br />
optimization problem:<br />
<br />
<br />
λ =argminc( ˜λ∈L ad<br />
˜ <br />
<br />
λ) − f(μ) 2<br />
2<br />
A problem with this approach is the computation <strong>of</strong> the<br />
Jacobian <strong>of</strong> the space mapping. Bandler suggested a time<br />
consuming way in [10]. This unnecessary big effort can<br />
be circumvented by applying Broyden’s formula [14], as<br />
discussed in [15]. The latter will be used in this paper.<br />
Using the coarse model response c (·), we reformulate<br />
the optimization problem in (4) as follows:<br />
˜μ ∗ =argminH (c (p (μ))) , (7)<br />
μ∈Pad where ˜μ ∗ is a coarse approximation <strong>of</strong> μ ∗ . By iteratively<br />
updating p(·), the solution <strong>of</strong> the surrogate problem ˜μ ∗<br />
is supposed to converge towards the real solution μ ∗ .<br />
The algorithm applied to indirectly solve the optimization<br />
problem defined in (5) by means <strong>of</strong> the space mapping<br />
is stated in Algorithm 1.<br />
Following the idea <strong>of</strong> [16], we obtain the simplified<br />
coarse model by linearizing the right hand side <strong>of</strong> the<br />
original system by approximation on the basis <strong>of</strong> Taylor<br />
series expansion:<br />
<br />
j (v; λ) ≈ ˆj (û; ˆμ)+ ∇uˆj T <br />
(û; ˆμ) (v − û)+ ∇μˆj T (û; ˆμ) (λ − ˆμ),<br />
(8)<br />
where û denotes the state <strong>of</strong> the original system for a<br />
reference parameter set ˆμ, ˆj (û;ˆμ) denotes the function<br />
j∗ BV in a working point – throughout the rest <strong>of</strong> the paper<br />
we write ˆj for a function ˆj (û;ˆμ). The parameters <strong>of</strong> the<br />
linearized model are denoted by v and λ, respectively.<br />
The non–linear factors on the left hand side, i.e. the<br />
ionic conductivity κℓ and direct occurrences <strong>of</strong> the Li + –<br />
concentrations in solution cℓ, are fixed to their initial<br />
values. In (11) the coarse model is stated as used. In<br />
addition, the boundary condition <strong>of</strong> the concentrations in<br />
solid phase cs changes to:<br />
<br />
∂cs 1 <br />
−Ds = ∇u<br />
∂r zF<br />
ˆj<br />
T <br />
v + ∇μˆj T λ + ˆ <br />
Jc on ΓRo,a ∪ ΓRo,c<br />
(9)<br />
where the constant parts <strong>of</strong> the linearization (8) are given<br />
as:<br />
<br />
ˆJc = ˆj − ∇uˆj T <br />
û − ∇μˆj T ˆμ (10)<br />
(6)
- 371 - 15th IGTE Symposium 2012<br />
<br />
∂ˆj<br />
−∇ · (σs∇ϕs) +Ai ϕs +<br />
∂ϕs<br />
∂ˆj<br />
ϕℓ +<br />
∂ϕℓ<br />
∂ˆj<br />
cℓ +<br />
∂cℓ<br />
∂ˆj<br />
cs = −Ai ∇μ<br />
∂cs<br />
ˆj<br />
T λ + ˆ <br />
Jc in Ω ′<br />
<br />
−∇ · κℓ(ĉℓ)∇ϕℓ + RT<br />
<br />
+ 1<br />
∂ˆj<br />
κℓ(ĉℓ)t ℓ ∇cℓ − Ai ϕs +<br />
zF ĉℓ<br />
∂ϕs<br />
∂ˆj<br />
ϕℓ +<br />
∂ϕℓ<br />
∂ˆj<br />
cℓ +<br />
∂cℓ<br />
∂ˆj<br />
cs = Ai ∇μ<br />
∂cs<br />
ˆj<br />
T λ + ˆ <br />
Jc in Ω<br />
∂ (ɛℓcℓ)<br />
∂t<br />
−∇·<br />
<br />
Dℓ ∇cℓ + zF<br />
RT μℓĉℓ∇ϕℓ<br />
<br />
+ Ai<br />
<br />
∂ˆj<br />
ϕs +<br />
zF ∂ϕs<br />
∂ˆj<br />
ϕℓ +<br />
∂ϕℓ<br />
∂ˆj<br />
cℓ +<br />
∂cℓ<br />
∂ˆj<br />
<br />
cs = −<br />
∂cs<br />
Ai<br />
∇μ<br />
zF<br />
ˆj<br />
T λ + ˆ <br />
Jc in Ω<br />
∂cs 1<br />
−<br />
∂t r2 <br />
∂<br />
Dsr<br />
∂r<br />
2 <br />
∂cs<br />
=0 in Λ<br />
∂r<br />
Algorithm 1 Space mapping surrogate optimization<br />
Input: Initial μ0 ∈ Pad; set i =0,λ0 = μ0, and B0 = I.<br />
1: Evaluate f (μ0) and H (f (μ0))<br />
2: repeat<br />
3: Define mapping function<br />
pi (μ) ← Bi (μ − μi) +λi<br />
4: Compute new candidate parameter<br />
˜μ ∗<br />
i+1 ← arg min H (c (pi (μ)))<br />
μ∈Pad 5: Evaluate f ˜μ ∗ ∗<br />
i+1 and H f ˜μ i+1<br />
6: if H f ˜μ ∗ <br />
i+1
implementation <strong>of</strong> Algorithm 1 reached its final residual<br />
<strong>of</strong> 4.110 −7 after 4 hours and 4 iterations which results in<br />
a reduction <strong>of</strong> runtime <strong>of</strong> 15%. This speed up is induced<br />
by the number <strong>of</strong> evaluations <strong>of</strong> the fine model being<br />
reduced from 20 to 5. But the additional 54 evaluations <strong>of</strong><br />
the coarse model in typically 189.6±2.4s during the two<br />
optimization processes undermines this large reduction.<br />
The differences in the curves resulting from inital and<br />
final parameters can be mainly related to the strong<br />
impact <strong>of</strong> the non–linearity <strong>of</strong> the equilibrium potential<br />
function U OCV(cs). Different inital Li + –concentrations<br />
cs,0 lead to a different working window <strong>of</strong> the curve,<br />
so that the shape changes. Additionally, by changing the<br />
initial amount <strong>of</strong> Li + , the available amount to move<br />
inside the system is changed. This intrinsic value can<br />
be estimated by the volume integral <strong>of</strong> the initial Li + –<br />
concentrations in the electrodes <br />
Ω εscs,0 dΩ. The limits<br />
<strong>of</strong> Li + –concentrations <strong>of</strong> each electrode – commonly<br />
referred to as full and empty, respectively – will result in<br />
the cell capacity. By modifying the cell capacity and the<br />
effective load, which can differ from the preset 0.2h−1 ,<br />
the lower cut–<strong>of</strong>f voltage is reached at different times.<br />
The model response and the reference data where padded<br />
with their final value, to match the length <strong>of</strong> one another.<br />
Because <strong>of</strong> the sharp voltage drop near the empty cell,<br />
the sensitivity <strong>of</strong> the objective function is very high.<br />
This simplifies finding the optimum and interpreting the<br />
results.<br />
The second task was to simulate a 100s short charge<br />
pulse <strong>of</strong> 0.5h−1 load to find the exchange current density<br />
and reaction rate μ = k = {ka,kc} in both anode<br />
Ωa and cathode Ωc starting from 50% SOC. As shown<br />
in [6] and confirmed by our own investigations, the<br />
exchange current density and reaction rate are showing<br />
high impact on the results. The reference value μ∗ in this<br />
case was set to 10−7 , 10−7 mol m−2s−1 , optimization<br />
was initialized at μ0 = 10−8 , 10−6 mol m−2s−1 –<br />
see Figure 3 for resulting voltage curves. In this case,<br />
stopping criteria were applied tighter as before, because<br />
<strong>of</strong> the smaller number <strong>of</strong> points in time <strong>of</strong> the problem:<br />
• absolute value <strong>of</strong> the function value<br />
H (f (μi))
TABLE II<br />
COARSE MODEL SPEED UP COMPARISON<br />
T model runtime model Total Speed<br />
evaluations runtime up<br />
1 2.95 ± 0.07s 549 (7) 1469s 1.3<br />
2 2.39 ± 0.05s 474 (7) 1241s 1.6<br />
5 2.04 ± 0.05s 474 (7) 1071s 1.8<br />
10 1.92 ± 0.04s 486 (7) 1038s 1.9<br />
20 1.87 ± 0.04s 549 (8) 1145s 1.7<br />
non–linear 11.8 ± 0.19s 165 1947s 1.0<br />
Speed up achieved by space mapping surrogate optimization <strong>of</strong><br />
kinetic rate parameters k for different reassembly periods T<br />
compared to the non–linear case (last line). Model evaluations in<br />
parentheses (·) are non–linear model evaluations.<br />
TABLE III<br />
PROGRESS OF RESIDUALS DURING OPTIMIZATION<br />
H (f(μk))<br />
i T =1 T =2 T =5 T =10 T =20<br />
0 9.566E-02<br />
1 5.159E-01 5.161E-01 5.159E-01 5.126E-01 4.261E-01<br />
2 5.276E-02 5.280E-02 5.287E-02 4.127E-02 4.179E-02<br />
3 2.414E-03 2.426E-03 2.439E-03 3.793E-03 3.802E-03<br />
4 5.399E-06 5.484E-06 5.648E-06 9.178E-06 1.049E-05<br />
5 3.229E-11 3.726E-11 4.589E-11 6.074E-11 3.258E-11<br />
6 7.749E-13 7.840E-13 7.796E-13 9.783E-13 1.035E-12<br />
7 — — — — 9.031E-13<br />
Residuals achieved by space mapping surrogate optimization <strong>of</strong><br />
kinetic rate parameters k at iterations i for different reassembly<br />
periods T .<br />
Additionally, an optimum exists for some reassembly<br />
period T , which appears in the decrease <strong>of</strong> the speed<br />
up factor at some point. This is related to the additional<br />
iteration necessary to reach the target residual threshold.<br />
Yet, the performance <strong>of</strong> each iteration <strong>of</strong> the optimization<br />
procedure shows similar progression, as can be seen in<br />
Table III for different reassembly periods which strengthens<br />
the before mentioned assumption <strong>of</strong> adaptivity <strong>of</strong> the<br />
algorithm.<br />
VI. CONCLUSION<br />
This paper shows the application <strong>of</strong> the space mapping<br />
approach to speed up estimation <strong>of</strong> the parameters <strong>of</strong> a<br />
DFN motivated battery model. This is done by substituting<br />
model evaluations by the response <strong>of</strong> a fast surrogate<br />
model. Afterwards, the obtained parameters are mapped<br />
into the original model’s parameter space by an iteratively<br />
refined mapping function.<br />
We have validated the algorithm by applying it to<br />
two synthetic fitting problems, where parameters <strong>of</strong> a<br />
simulation are recovered starting from a different point<br />
in parameter space. The one dimensional quasi–stationary<br />
case results in a reduction <strong>of</strong> 15%, whereas in the two<br />
dimensional case shows 25% as compared to the direct<br />
optimization computation times. Further simplification <strong>of</strong><br />
the coarse model increases the latter to 48%.<br />
There is another advantage that evolves out <strong>of</strong> the<br />
usage <strong>of</strong> a linear coarse model, which is the possibility<br />
to state the adjoint system, which can be used to estimate<br />
the cost functions’s exact gradient after only one<br />
- 373 - 15th IGTE Symposium 2012<br />
additional evaluation instead <strong>of</strong> one per parameter using<br />
finite differences to approximate the gradient. This way<br />
<strong>of</strong> optimization seems to be prospective, for example, for<br />
estimating parameters <strong>of</strong> a real system or optimization<br />
<strong>of</strong> the battery itself, with much less effort and increased<br />
efficiency than by using direct methods.<br />
VII. ACKNOWLEDGMENT<br />
The authors gratefully acknowledge<br />
financial support from Climate- and<br />
Energy Fund “Klima- und Energiefonds”<br />
as part <strong>of</strong> the program New Energy 2020<br />
“NEUE ENERGIEN 2020” <strong>of</strong> the Federal<br />
Province <strong>of</strong> Styria/Austria for the project in which the<br />
above presented research results were achieved.<br />
REFERENCES<br />
[1] M. Doyle, T. F. Fuller, and J. Newman, “Modeling <strong>of</strong> galvanostatic<br />
charge and discharge <strong>of</strong> the lithium/polymer/insertion cell,” J<strong>of</strong><br />
The Electrochemical Society, vol. 140, no. 6, pp. 1526–1533,<br />
1993.<br />
[2] J. Newman and K. E. Thomas-Alyea, Electrochemical Systems,<br />
3rd ed. John Wiley & Sons, Inc., Hoboken, New Jersey, 2004.<br />
[3] P. M. Gomadam et al., “Mathematical modeling <strong>of</strong> lithium-ion<br />
and nickel battery systems,” J <strong>of</strong> Power Sources, vol. 110, no. 2,<br />
pp. 267–284, 2002.<br />
[4] S. Santhanagopalan et al., “Review <strong>of</strong> models for predicting the<br />
cycling performance <strong>of</strong> lithium ion batteries,” J <strong>of</strong> Power Sources,<br />
vol. 156, no. 2, pp. 620–628, 2006.<br />
[5] S. Santhanagopalan and R. E. White, “Online estimation <strong>of</strong> the<br />
state <strong>of</strong> charge <strong>of</strong> a lithium ion cell,” J <strong>of</strong> Power Sources, vol.<br />
161, no. 2, pp. 1346–1355, 2006.<br />
[6] S. Santhanagopalan, Q. Guo, and R. E. White, “Parameter estimation<br />
and model discrimination for a lithium-ion cell,” J <strong>of</strong> The<br />
Electrochemical Society, vol. 154, no. 3, pp. A198–A206, 2007.<br />
[7] S. Santhanagopalan et al., “Parameter estimation and life modeling<br />
<strong>of</strong> lithium-ion cells,” J <strong>of</strong> The Electrochemical Society, vol.<br />
155, no. 4, pp. A345–A353, 2008.<br />
[8] A. P. Schmidt et al., “Experiment-driven electrochemical modeling<br />
and systematic parameterization for a lithium-ion battery cell,”<br />
J <strong>of</strong> Power Sources, vol. 195, no. 15, pp. 5071–5080, 2010.<br />
[9] J. C. Forman et al., “Genetic identification and fisher identifiability<br />
analysis <strong>of</strong> the doyle-fuller-newman model from experimental<br />
cycling <strong>of</strong> a lifepo4 cell,” J <strong>of</strong> Power Sources, vol. 210, no. 0, pp.<br />
263–275, 2012.<br />
[10] J. Bandler et al., “Space mapping technique for electromagnetic<br />
optimization,” IEEE Trans. on Microwave Theory and Techniques,<br />
vol. 42, no. 12, pp. 2536–2544, dec 1994.<br />
[11] F. Pichler, “Anwendung der Finite-Elemente Methode auf ein<br />
Litium-Ionen Batterie Modell,” Master Thesis, <strong>University</strong> <strong>of</strong> <strong>Graz</strong>,<br />
2011.<br />
[12] I. J. Ong and J. Newman, “Double-layer capacitance in a dual<br />
lithium ion insertion cell,” J <strong>of</strong> The Electrochemical Society, vol.<br />
146, no. 12, pp. 4360–4365, 1999.<br />
[13] M. Cifrain et al., “Elektrochemisches Zellmodell,” publication<br />
planned.<br />
[14] C. G. Broyden, “A class <strong>of</strong> methods for solving nonlinear simultaneous<br />
equations,” Mathematics <strong>of</strong> Computation, vol. 19, no. 92,<br />
pp. 577–593, Oct. 1965.<br />
[15] M. H. Bakr et al., “An introduction to the space mapping<br />
technique,” Optimization and Engineering, vol. 2, no. 4, pp. 369–<br />
384, 2001.<br />
[16] O. Lass et al., “Space mapping techniques for a structural optimization<br />
problem governed by the p-Laplace equation,” Optimization<br />
Methods and S<strong>of</strong>tware, vol. 26, no. 4-5, pp. 617–642,<br />
2011.<br />
[17] E. Jones et al., “SciPy: Open source scientific tools for Python,”<br />
2001–. [Online]. Available: http://www.scipy.org/<br />
[18] MATLAB, version 7.12 (R2011a). Natick, Massachusetts: The<br />
MathWorks Inc., 2011.
- 374 - 15th IGTE Symposium 2012<br />
Large Scale Energy Storage with Redox Flow Batteries<br />
Piergiorgio Alotto, Massimo Guarnieri, Federico Moro and Andrea Stella<br />
Dipartimento di Ingegneria Industriale, Università di Padova, Via Gradenigo 6/a, 35131 Padova, Italy<br />
E-mail: name.surname@unipd.it<br />
Abstract— The expected expansion <strong>of</strong> renewable energy sources is calling for large and efficient energy storage systems.<br />
Electrochemical ones are considered the solution <strong>of</strong> choice in most cases, since they present unique features <strong>of</strong> localization<br />
flexibility, efficiency and scalability. Among them, Redox Flow Batteries (RFBs) exhibit remarkably high potential for several<br />
reasons, including power/energy independent sizing, high efficiency, room temperature operation and extremely long life.<br />
The most developed RFBs are the all-vanadium based ones (VRB), but other research programs are underway in many<br />
countries. They aim at substantial improvements which can lead to more compact systems, capable <strong>of</strong> taking the technology<br />
to a real breakthrough in stationary grid-connected applications, but which can also prove suitable for powering electric<br />
vehicles. This paper gives an overview <strong>of</strong> the RFB technology state-<strong>of</strong>-the-art, highlights its pros and cons, and indicates<br />
current research challenges.<br />
Index Terms— Electrochemical storage, redox flow batteries, vanadium flow batteries.<br />
I. INTRODUCTION<br />
Presently, renewable sources except hydroelectric,<br />
particularly solar and wind, provide roughly 4% <strong>of</strong> the<br />
worldwide electricity production, but they are expected<br />
to grow substantially in the near future (up to 26% by<br />
2030 [1]).<br />
In contrast with conventional electrical power plants,<br />
wind, solar, and other primary renewable sources are<br />
intermittent, because the generated electrical power<br />
depends on the time <strong>of</strong> the day and on the climatic<br />
availability <strong>of</strong> resources. The integration into the grid <strong>of</strong><br />
such primary energy sources, each with different<br />
peculiarities, requires their careful design and control.<br />
Furthermore, traditional grids have not been designed for<br />
such operational conditions so that they are not always<br />
able to work satisfactorily when many renewable-source<br />
generators are connected. In fact, recent studies suggest<br />
that grids can become unstable if such sources provide<br />
more than 20% <strong>of</strong> the whole generated power without<br />
adequate energy storage.<br />
Thus, future power grids provided with relevant<br />
amounts <strong>of</strong> renewable sources call for adequate energy<br />
storage systems capable <strong>of</strong> storing production surplus<br />
during some periods and <strong>of</strong> contributing to face higher<br />
demand during others, while at the same time<br />
contributing to stabilizing the grid operation. Applied in<br />
this way, energy storage systems will allow to<br />
substantially undersize the primary power plants with<br />
respect to peak demand, since relevant quantity <strong>of</strong> power<br />
will be provided by the storage systems.<br />
Two main different application scenarios can be<br />
identified: i) “peak shaving” and “sag compensation”<br />
refer to charge/discharge cycles on short timescales (secmin)<br />
and are needed for grid stabilization; ii) “load<br />
leveling” concerns charge/discharge cycles on longer<br />
timescales (hour) and allow to improve load factor <strong>of</strong> the<br />
grid.<br />
Several recent surveys show that electrochemical<br />
storage systems will be the solution <strong>of</strong> choice for<br />
complementing intermittent photovoltaic and wind<br />
generation facilities with long-time-scale energy storage.<br />
In fact, such storage technologies feature site versatility,<br />
modularity, scalability, ease <strong>of</strong> operation, and no moving<br />
parts [2]. Worldwide important funding programs have<br />
been established for the scientific and technological<br />
Fig. 1: Discharge times vs. power for mainstream energy<br />
storage technologies<br />
development <strong>of</strong> innovative electrochemical storage<br />
systems.<br />
Among them, Redox Flow Batteries (RFBs) are<br />
particularly promising They have emerged in recent<br />
years as a very promising solution for stationary<br />
applications, in combinations with renewable sources,<br />
for applications such as peak shaving, sag compensation,<br />
and load leveling [3,4,5]. The reason for their potential<br />
success depends on the fact that, with respect to<br />
competing technologies, they cover a very wide range <strong>of</strong><br />
discharge times (energy) and powers, as shown in Fig. 1.<br />
RFBs exploit reduction and oxidation (redox) reactions<br />
<strong>of</strong> ion metals (i.e. electrochemical species) solved in<br />
aqueous or nonaqueous liquids. The storage <strong>of</strong> these<br />
solutions is performed in external tanks, potentially <strong>of</strong><br />
very high capacity, and they are circulated into the RFB<br />
battery when power exchange is required. The main<br />
appealing features <strong>of</strong> RFBs are: scalability and<br />
flexibility, independent sizing <strong>of</strong> power and energy, high<br />
round-trip efficiency, high depth <strong>of</strong> discharge (DOD),<br />
long durability, fast response times, reduced<br />
environmental impact, and absence <strong>of</strong> expansive noblemetal<br />
based catalyzers.<br />
In the rest <strong>of</strong> this paper, the most important features <strong>of</strong><br />
RFBs will be presented together with an overview <strong>of</strong> the<br />
current state-<strong>of</strong>-the art <strong>of</strong> commercial systems.<br />
Furthermore, current research and development issues <strong>of</strong><br />
RFB systems will be highlighted.
II. RFB BASICS AND FEATURES<br />
A. RFB basics<br />
RFB cells operate on the basis <strong>of</strong> electrochemical<br />
reduction and oxidation reactions <strong>of</strong> two liquid<br />
electrolytes containing ionized metals [6]. One electrode<br />
performs the reduction half-reaction <strong>of</strong> one electrolyte<br />
that releases one electron and one ion while the other<br />
electrode performs an oxidation half-reaction that<br />
recombines them into the other electrolyte (Fig. 2). Ions<br />
can then migrate from one electrode to the other (from<br />
anode to cathode) through a membrane which can not be<br />
crossed by electrons, which are instead forced to pass<br />
through the external circuit thus exchanging electric<br />
energy. Typical RFB cells must operate at room<br />
temperature in order to keep the solutions in the liquid<br />
phase. This condition implies that the ion-conducting<br />
membrane between the two electrodes is a polymeric<br />
one. Both half-cells are connected to external storage<br />
tanks where the solutions are stored and circulated by<br />
means <strong>of</strong> two suitable pumps. In order to design such a<br />
storage system based on a RFB, expertise in<br />
electrochemistry, chemistry, chemical engineering,<br />
electrical engineering, power electronics, and control<br />
engineering are required, thus calling for highly<br />
multidisciplinary research and development teams.<br />
B. RFB features<br />
RFBs can be considered as a particular type <strong>of</strong> Fuel<br />
Cell (FC), since they can generate electrical power as<br />
long as they are fed with fuel (in this case the electrolyte<br />
solutions), and indeed the cell structure is very similar to<br />
the one <strong>of</strong> Polymer Electrolyte Membrane Fuel Cells<br />
(PEMFCs). Another similarity between RFBs and FCs is<br />
that energy is stored in components, the tanks, which are<br />
physically separated from the cells themselves, were<br />
power conversion takes place.<br />
Independent sizing <strong>of</strong> power and energy in typical<br />
RFB systems is therefore possible and this feature allows<br />
for virtually unlimited capacity simply by using larger<br />
storage tanks, without altering the battery itself or<br />
Fig. 2: Schematic <strong>of</strong> a RFB energy storage system: RFB<br />
stack and electrolyte tanks are separated<br />
- 375 - 15th IGTE Symposium 2012<br />
Fig. 3: Schematic <strong>of</strong> a typical RFB cell structure with<br />
MEA (membrane-electrode assembly) and flow-by<br />
solution distribution in bipolar plates with parallel-channel<br />
layout (gaskets are not shown)<br />
power conversion devices. Compared to other<br />
electrochemical systems that incorporate energy and<br />
power in a single device, RFBs are usually more<br />
advantageous when generation for 4-6 hours or more at<br />
maximum power is required. Furthermore, RFBs can<br />
also be fully charged or discharged and left in such<br />
conditions for long periods with no negative effects.<br />
RFB cells consist <strong>of</strong> a sandwiched structure composed<br />
<strong>of</strong> electrodes and interposed proton conducting<br />
membranes that are very similar to the Membrane<br />
Electrode Assembly (MEA) typical <strong>of</strong> PEMFCs (Fig. 3).<br />
The electrolyte solutions reach the electroactive sites<br />
within the electrodes by flowing through a porous<br />
structure consisting <strong>of</strong> materials such as carbon felt. In<br />
contrast with FC storage systems, which require a<br />
specific device, i.e. the electrolyzer, for converting<br />
electrical energy into hydrogen and oxygen, RFBs can<br />
perform both conversions, from electrical to chemical<br />
and from chemical to electrical, in a single device.<br />
A second advantage <strong>of</strong> RFBs with respect to FCs is that<br />
their fuels are not hazardous gases such as hydrogen and<br />
oxygen, but much less dangerous electrolyte solutions,<br />
which makes handling and storage much simpler and<br />
cheaper. As shown in Fig. 2, only two tanks and two<br />
pumps are required for these functions.<br />
Moreover, RFBs operate by changing the metal ion<br />
valence, but the ions themselves are not consumed. This<br />
feature allows extremely long cyclic service with very<br />
low maintenance.<br />
The optimal electrolyte temperatures are in a narrow<br />
range, roughly between 15°C and 35°C, and outside this<br />
range unwanted side effects such as solution<br />
precipitation may occur. On the other hand, the<br />
temperature <strong>of</strong> the battery can be controlled rather easily<br />
by appropriately regulating the electrolyte flow. The<br />
control RFBs is also relatively easy: in fact, the cell<br />
voltage allows to monitor easily the state <strong>of</strong> charge<br />
(SOC) <strong>of</strong> the battery and, at the same time, very deep<br />
discharges are possible because no damage occurs to the<br />
morphology <strong>of</strong> the cell with such operations.<br />
Furthermore, self-discharge is prevented by the<br />
separation <strong>of</strong> the two electrolytes in two different
circuits. The very fast reaction kinetics provides very<br />
fast response times and high overloading is tolerable on<br />
short time scales.<br />
On the other hand, at present even the most advanced<br />
RFBs have relatively low power and energy densities<br />
compared to other competing electrochemical storage<br />
technologies. Consequently, RFBs tend to have large<br />
active areas and ion conducting membranes and<br />
therefore their overall size is usually bulky, making them<br />
unsuitable for mobile applications. Also, the large active<br />
areas cause high transverse gradients <strong>of</strong> the solutions<br />
that feed the electrochemically active sites, particularly<br />
when operating at high power and with high flows. This<br />
causes the current density to be far from uniform on the<br />
active areas, with average values quite lower than the<br />
maximum ones.<br />
III. RFB TECHNOLOGIES<br />
A. Fe-Cr systems<br />
The first commercial RFBs were <strong>of</strong> the Fe-Cr type,<br />
featuring open circuit voltages <strong>of</strong> about 1 V for the<br />
single cell. Test systems in the range <strong>of</strong> 10-60 kW were<br />
produced in the late 1980s by several Japanese<br />
companies including: Mitsui Engineering and<br />
Shipbuilding Co. Ltd, Kansai Electric Power Co. Inc,<br />
and Sumitomo Electric Industries Ltd (SEI). Beside<br />
relatively low energy density, the main drawbacks <strong>of</strong><br />
such systems included: slow reaction <strong>of</strong> the Cr ion,<br />
membrane aging and cell degradation due to the mixing<br />
<strong>of</strong> the two ions. Due to these problems, Fe-Cr cells are<br />
inferior to VRBs, so that they were abandoned after the<br />
emerging <strong>of</strong> the latter.<br />
B. VRB systems<br />
VRBs (Vanadium Redox Batteries), also called allvanadium<br />
RFBs, are currently the most successful RFB<br />
technology, the only one that has reached substantial<br />
quite commercial maturity. Such systems use only<br />
vanadium, dissolved in aqueous sulfuric acid (~5 M). A<br />
positive feature with respect to other RFBs is that, since<br />
they use the same metal on each electrode, the electrodes<br />
and membrane are not cross-contaminated, preventing<br />
capacity decrease and providing longer lifetimes.<br />
Exploiting the ability <strong>of</strong> vanadium to exist in solution<br />
in four different oxidation states, vanadium II-III<br />
(bivalent-trivalent) is used at one electrode while<br />
vanadium IV-V (tetravalent-pentavalent) is used at the<br />
other one. The electrochemical half-reactions are:<br />
positiveelectrode<br />
VO 2+ charge<br />
+ H2O↽⇀ + + −<br />
VO2 + 2H + e<br />
discharge<br />
negativeelectrode<br />
V 3+ + e − charge<br />
↽ ⇀V 2+<br />
discharge<br />
- 376 - 15th IGTE Symposium 2012<br />
(1)<br />
Fig. 4: Polarization curve <strong>of</strong> a RFB<br />
During charge, tetravalent vanadium ions VO2+ are<br />
oxidized to pentavalent vanadium ions VO2 + at the<br />
positive electrode, while trivalent ions V3+ are reduced<br />
to bivalent ions V2+ at the negative electrode. The<br />
hydrogen ions 2H + created at the positive electrode flow<br />
to the negative one through the membrane thus<br />
maintaining the electrical neutrality <strong>of</strong> the electrolytes.<br />
The theoretical open circuit voltage (OCV) <strong>of</strong> VRB cell<br />
is Eo =1.26 V at 25°C, but, in fact, real cells exhibit<br />
Eo =1.4 V in operating conditions. The cell voltage v in<br />
load operation differs from Eo due to the activation<br />
overpotentials η <strong>of</strong> the electrodes which are modeled by<br />
the Butler-Volmer equation:<br />
c<br />
j = j r (0,t)<br />
o exp<br />
cr *<br />
αF<br />
RT η<br />
⎛ ⎞<br />
⎝<br />
⎜<br />
⎠<br />
⎟ − c ⎡<br />
p(0,t) ⎛ (1− α )F ⎞ ⎤<br />
⎢<br />
exp η<br />
cp * ⎝<br />
⎜<br />
RT ⎠<br />
⎟ ⎥<br />
⎣⎢<br />
⎦⎥<br />
where j is the current density at the electrode, jo is the<br />
exchange current density, ci are the species<br />
concentrations at the electrochemical activity sites <strong>of</strong> the<br />
reagents and products indicated in (1) (i = r regents,<br />
i = p products), α is the transfer coefficient (with a value<br />
around 0.5), F is the Faraday constant, R is the gas<br />
constant, and T is the absolute temperature. The ci/ci<br />
fractions express the dynamic reduction <strong>of</strong> the<br />
concentrations normalized to steady state equilibrium<br />
values.<br />
According to (2) v =Eo – η is higher than Eo in the<br />
charge phase, i.e. where the current density is j 0 when electric power<br />
is released. jo is a parameter which depends on the<br />
reactions and on the physical-chemical structure <strong>of</strong> the<br />
electrodes, and is crucial for the cell operation, since the<br />
higher jo the lower η for a given j. In fact, the activation<br />
overpotentials are the major culprits for the cell’s<br />
internal losses at lower current densities (with ci/ci≅1,<br />
Fig. 4). Thus, increasing jo by means <strong>of</strong> appropriate<br />
electrode designs allows to get performance<br />
improvements and higher round trip efficiency. jo can be<br />
increased with higher concentrations, lower activation<br />
(2)
Fig. 5: Schematic <strong>of</strong> a RFB stack with side fluid feedings –<br />
series <strong>of</strong> about 100 cells with active areas as large as<br />
0.4x0.4 m 2 are usual<br />
barriers (i.e. higher activity provided by efficient<br />
catalysts), and larger effective active areas, achievable<br />
by means <strong>of</strong> highly porous electrodes (e.g.<br />
nanostructured materials).<br />
At medium current densities internal losses mainly<br />
depend on the ion conducting membrane that separates<br />
the electrodes (Fig. 3). The material <strong>of</strong> choice is a<br />
perfluorosulfonic acid polymer that allows, if properly<br />
hydrated, the transport <strong>of</strong> ions by binding cations to its<br />
sulfonic acid sites. It is a rather expensive material<br />
patented by DuPont and commercially available under<br />
the name Nafion. Electrically the membrane behaves as<br />
a linear resistor, if temperature and hydration are kept<br />
constant.<br />
At higher current densities the losses are dominated by<br />
transport phenomena in the electrode diffusion layers,<br />
which dramatically reduce the concentrations (ci/ci
system so far, intended for smoothing power output<br />
fluctuations at the Subaru Wind Villa Power Plant which<br />
is rated at 30.6 MW, is a 4 MW / 6 MWh installation<br />
built by Sumitomo Electric Industries (SEI), Japan, for J-<br />
Power in 2005. The system consists <strong>of</strong> 4 banks, each <strong>of</strong><br />
24 stacks and rated at 1 MW (which can be overloaded<br />
up to a maximum <strong>of</strong> 1.5 MW). Individual stacks consist<br />
<strong>of</strong> 108 cells, with a rated power <strong>of</strong> 45 kW each. During<br />
over 3 years <strong>of</strong> operation, the system completed more<br />
than 270,000 complete cycles, thus demonstrating its<br />
reliability.<br />
The abovementioned SEI is one <strong>of</strong> the largest<br />
manufacturers <strong>of</strong> VRB systems for the smoothing and<br />
leveling <strong>of</strong> the fluctuating power generated by wind<br />
farms. Most <strong>of</strong> such systems have been built by SEI and<br />
later by VRB Power Inc., based in Vancouver, CA,<br />
which acquired SEI patents around 2005. In 2009, all<br />
vanadium redox battery assets <strong>of</strong> VRB Power Inc. where<br />
acquired by Prudent Energy, controlled by investors<br />
from China and the U.S.A., in a plan <strong>of</strong> business<br />
expansion in China and abroad. Further important efforts<br />
in the development <strong>of</strong> commercial RFB technologies in<br />
China are those <strong>of</strong> the Chengde Wanlitong Industrial<br />
Group. The reason <strong>of</strong> this interest lies in Chinese plans<br />
to expand the exploitation <strong>of</strong> intermittent renewable<br />
energy sources, especially wind. In fact, wind power<br />
production in the country is expected to rise from about<br />
20 GW in 2010 to 100 GW in 2015, and almost $50<br />
billion per year are expected to be invested in power grid<br />
improvements in the next decade to handle this<br />
increasing amount <strong>of</strong> energy production from<br />
intermittent sources.<br />
Significant developments are also taking place in other<br />
Asian countries, e.g. Cellennium Company Ltd. <strong>of</strong><br />
Thailand produces VRB systems under license, while<br />
Samsung Electronics Co. Ltd. in South Korea is engaged<br />
in the development <strong>of</strong> RFBs with nonaqueous<br />
electrolytes.<br />
Further interesting developments are taking place in<br />
Australia, where V-Fuel Pty Ltd is pursuing innovative<br />
V-Br technology in cooperation with the <strong>University</strong> <strong>of</strong><br />
New South Wales (UNSW). Other Australian companies<br />
working on RFBs, are ZBB Energy Corp. and Redflow<br />
Ltd., both involved in the development and installation<br />
<strong>of</strong> Zn/Br2 batteries.<br />
In the U.S., the Department <strong>of</strong> Energy (DoE) launched<br />
an RFB development program which identified Ashlawn<br />
Energy, LLC for the design <strong>of</strong> a 1 MW / 8 MWh VRB<br />
test plant while Primus Power Corp. was funded to<br />
develop a 25 MW / 75 MWh system based on Zn/Cl2<br />
RFBs. Premium Power Corp. is also developing Zn/Br2<br />
batteries.<br />
In Europe, Renewable Energy Dynamics (RED-T),<br />
Ireland, Cellstrom GmbH, Austria, and RE-Fuel<br />
<strong>Technology</strong> Ltd., UK, are some <strong>of</strong> the most active<br />
companies developing and producing VRB systems.<br />
High-energy density innovative RFBs are also being<br />
investigated in Germany, where the Fraunh<strong>of</strong>er-<br />
Gesellschaft is researching nonaqueous electrolytes, and<br />
in the UK where Plurion Ltd is working on Zn-Ce<br />
systems.<br />
Overall, since the market for smart grid technologies is<br />
expected to grow significantly worldwide in the near<br />
- 378 - 15th IGTE Symposium 2012<br />
future, the market for VRB systems, which is already<br />
starting to flourish, is also expected to expand<br />
vigorously.<br />
V. RESEARCH ISSUES<br />
In spite <strong>of</strong> the previously described initial commercial<br />
successes, RFB technology has yet to witness a complete<br />
technical and commercial breakthrough and substantial<br />
R&D programs are still required in order to fully unleash<br />
its industrial potential. The next generation <strong>of</strong> systems,<br />
expected within the next 5 years, will be even more<br />
economically competitive and will be able to provide the<br />
capital and lifecycle cost reductions that are essential for<br />
widespread commercial success.<br />
The basis for more compact and efficient systems,<br />
exhibiting higher power and energy densities will be<br />
provided by non-aqueous electrolytic solutions and by<br />
improved electrode activity. Improved electrolytes will<br />
also allow to expand the operation temperature range.<br />
For example, the nonaqueous 2MW/20MWh RFB<br />
system under development at the Fraunh<strong>of</strong>er Institute<br />
will consists <strong>of</strong> 8 blocks <strong>of</strong> 7 stacks, with 100-cell<br />
stacks, and will have an output <strong>of</strong> 2 kV, 1 kA, while<br />
being fed from 2x300 m3 tanks. Further improvements<br />
will come also from nanostructured electrodes, currently<br />
under development, with increased effective surface area<br />
and hence improved exchange current density.<br />
In next generation systems, the currently common and<br />
expensive Nafion ion conducting membrane will be<br />
substituted with alternative ones having significantly<br />
reduced cost and, at the same time, lower ohmic losses<br />
in the cells. Incidentally, further material cost reduction<br />
will also be provided by higher power densities, through<br />
more compact designs.<br />
Apart from the above mentioned developments which<br />
involve mainly materials science and basic chemistry,<br />
important engineering efforts are being aimed at system<br />
scale-up and at the structural and operational<br />
optimization <strong>of</strong> flow geometries, state-<strong>of</strong>-charge<br />
monitoring and supervisor systems. Numerical modeling<br />
and simulation are instrumental in improving the current<br />
systems which are far from optimal in many respects.<br />
Multi-scale, multidimensional, multi-physic, both<br />
steady-state and dynamic, models can accurately<br />
simulate the behavior <strong>of</strong> the whole system and its<br />
components and thus speedup the development <strong>of</strong> more<br />
efficient components and systems. Many modeling<br />
problems encountered in RFB systems are similar to<br />
those posed by direct alcohol fuel cells which also<br />
consist <strong>of</strong> the same basic building blocks (MEA-based<br />
cells, bipolar plates and stacks) and are also fed with<br />
liquid solutions instead <strong>of</strong> gases, so that some <strong>of</strong> the<br />
numerical tools developed in that context [10] may be<br />
adapted to the simulation <strong>of</strong> RFB systems. Sophisticated<br />
modeling tools are also aimed at designing advanced<br />
bipolar plates with either flow-by or flow-through<br />
diffusion <strong>of</strong> the electrolytic solutions, were the aim is to<br />
minimize transverse gradients and, at the same time, to<br />
reduce longitudinal conductance for lowering the shunt<br />
currents. Advanced computational techniques are needed<br />
to deal with the very challenging numerical problems<br />
arising from cell elements which exhibit multi-physic
material behavior and high aspect ratio geometries<br />
[11,12].<br />
In the area <strong>of</strong> controls engineering, advanced control<br />
systems will provide automatic electrolyte rebalancing<br />
and capacity correction and will possibly allow the<br />
remote operation <strong>of</strong> large RFB systems. Optimized<br />
electrolyte flow-rates will also minimize pumping<br />
energy requirements, which are one <strong>of</strong> the main factors<br />
affecting the overall efficiency (together with shunt<br />
currents and internal cell losses). Such control systems<br />
will eventually cope with the conflicting requirements<br />
arising from the strong dependence <strong>of</strong> the cell voltage<br />
vs. current polarization curve on the solution flow-rates.<br />
As far as the electrical interface <strong>of</strong> RFB systems is<br />
concerned, modeling, simulation and optimization are<br />
aimed at designing supervisor and control sub-systems<br />
with proper feed-back loops and reduced response times<br />
which are required to assure improved performance for<br />
peak shaving, sag compensation and load leveling in the<br />
smart-grid context. Flexible solutions for interfacing<br />
both the DC renewable energy sources and AC grid and<br />
load can be obtained with DC/DC converters coupled to<br />
inverters. Non-linear control techniques <strong>of</strong> the inverter<br />
can allow RFB systems to provide active as well as<br />
reactive power to the smart-grid connected loads. The<br />
success in designing such power management subsystems,<br />
including both the DC/DC converter and the<br />
inverter, strongly depends on the accuracy in modeling<br />
the various components and the whole system.<br />
Further research is also needed for optimizing the<br />
technological solutions from the economical (operating<br />
earning and savings arising from the RFBs operation)<br />
and environmental (primary energy savings, carbon<br />
dioxide emission reductions) point <strong>of</strong> view. The results<br />
<strong>of</strong> these analyses will allow assessing the viability <strong>of</strong><br />
RFB technologies within the context <strong>of</strong> modern energy<br />
hubs.<br />
All the above described scientific challenges raised by<br />
RFBs require strongly interdisciplinary development<br />
programs and collaborative efforts among researchers<br />
with different and complementary expertise. If such<br />
efforts are successful the next generation <strong>of</strong> RFB<br />
systems will be low cost, highly efficiency and durable,<br />
and thus be suitable for large-scale industrial<br />
exploitation, overcoming the limitations <strong>of</strong> more<br />
conventional systems.<br />
Finally, more compact and more flexible RFB systems,<br />
such as the ones mentioned above, may one day become<br />
suitable for powering some classes <strong>of</strong> electric vehicles.<br />
VI. CONCLUSIONS<br />
Redox flow batteries (RFBs) are already a promising<br />
energy storage technology and first generation systems,<br />
based on all-vanadium solutions, have already been<br />
successfully demonstrated in test installations<br />
worldwide, and their commercial exploitation is<br />
undergoing. The next generation <strong>of</strong> systems, with<br />
increased power and energy densities, are currently<br />
under development, but further progresses in<br />
electrochemical materials and system engineering are<br />
expected to produce the final technical and commercial<br />
breakthrough. RFB systems are expected to become a<br />
- 379 - 15th IGTE Symposium 2012<br />
key technology for stationary smart-grid-oriented<br />
applications supporting the load leveling and peak<br />
shaving <strong>of</strong> intermittent renewable energy sources. Future<br />
high-density systems may also become suitable for some<br />
automotive applications.<br />
REFERENCES<br />
[1] European Commission, “Proposal for a COUNCIL<br />
DECISION<br />
establishing the Specific Programme Implementing Horizon<br />
2020 - The Framework Programme for Research and<br />
Innovation (2014-2020), COM(2011) 811 final, 2011/0402<br />
(CNS).<br />
[2] B. Dunn, H. Kamath, J.Tarascon, “Electrical Energy<br />
Storage for the Grid: A Battery <strong>of</strong> Choices”, Science, 334, pp.<br />
928-935, 2011.<br />
[3] Z. Weber, M. M. Mench, J. P. Meyers, P. N. Ross, J. T.<br />
Gostick, Q. Liu, “Redox flow batteries: a review”, J. Appl.<br />
Electrochem. 41, pp. 1137-1164, 2011.<br />
[4] T. Shigematsu, “Redox Flow Batteries for Energy Storage”,<br />
SEI Technical Review, 73, pp. 4-13, 2011.<br />
[5] C. Ponce de León, A. Frías-Ferrer, J. González-García,<br />
D.A. Szánto, F. C. Walsh, “Redox flow cells for energy<br />
conversions”, J. Power Sources, 160, pp. 716-732, 2006.<br />
[6] C. Menictas, M. Skyllas-Kazacos, “Performance <strong>of</strong><br />
vanadium-oxigen redox fuel cell”, J. Appl. Electrochem., 41,<br />
pp. 1223-1232, 2011.<br />
[7] M. Skyllas-Kazacos, G. Kazacos, G. Poon, H. Verseema,<br />
“Recent advances with UNSW vanadium-based redox flow<br />
batteries”, Int. J. Energ. Res., 34, pp. 182-189, 2010.<br />
[8] Kaneko H, Negishi A, Nozaki K, Sato K, Nakajima M<br />
(1992) Redox battery. US Patent 5318865.<br />
[9] C. Menictas, M. Skyllas-Kazacos, “Vanadium-oxygen<br />
redox fuel cell”, Final report. SERDF Grant, NSW Department<br />
<strong>of</strong> Energy, 1997.<br />
[10] V. Di Noto, M. Guarnieri, F. Moro: “A Dynamic Circuit<br />
Model <strong>of</strong> a Small Direct Methanol Fuel Cell for Portable<br />
Electronic Devices”, IEEE Tran.s Ind. Electronics, Vol. 57, N.<br />
6, pp. 1865-1873, 2010.<br />
[11] P. Alotto, M. Guarnieri, F. Moro, A. Stella: “A Proper<br />
Generalized Decomposition Approach for Fuel Cell Polymeric<br />
Membrane Modelling”, IEEE Trans. Mag., Vol. 47 No. 5, pp.<br />
1462-1465, 2011.<br />
[12] P. Alotto, M. Guarnieri, F. Moro, A. Stella: “Multi-physic<br />
3D dynamic modelling <strong>of</strong> polymer membranes with a proper<br />
generalized decomposition model reduction approach”,<br />
Electrochimica Acta, pp. 250-256, 2011.<br />
[1]
- 380 - 15th IGTE Symposium 2012<br />
Model Order Reduction via Proper Orthogonal<br />
Decomposition for a Lithium-Ion Cell<br />
B. Suhr∗ , J. Rubeˇsa∗ ∗Kompetenzzentrum - Das Virtuelle Fahrzeug Forschunggesellschaft mbH (VIF), <strong>Graz</strong>, Austria<br />
E-mail: bettina.suhr@v2c2.at<br />
Abstract—The simulation <strong>of</strong> lithium-ion batteries is a challenging research topic. Since there are many electrochemical<br />
processes involved in dis-/charging, models which aim to include these processes are in general complex and therefore slow.<br />
For many tasks, e.g. in optimization, a repeated solution <strong>of</strong> a model is necessary. In this paper a speed up in simulations,<br />
with acceptable error in results, is obtained by combining proper orthogonal decomposition with empirical interpolation<br />
method. We report a speed up factor between 10 and 15.<br />
Index Terms—electrochemical model, empirical interpolation method, model reduction, proper orthogonal decomposition<br />
I. INTRODUCTION<br />
The accurate and fast simulation <strong>of</strong> lithium-ion batteries<br />
is <strong>of</strong> a growing interest in the automotive industry.<br />
As fossil fuels are limited, more and more research is<br />
conducted on electric, especially on hybrid cars. Here, the<br />
quality and the speed <strong>of</strong> the battery simulation is a crucial<br />
point. Often battery models are simplified strongly, in<br />
a physical meaning, to obey the need for speed <strong>of</strong> on<br />
board usage or optimization purposes. In contrary, here a<br />
speed up in simulation will be gained by using model<br />
reduction via proper orthogonal decomposition (POD)<br />
combined with a fast evaluation <strong>of</strong> nonlinearities, the<br />
empirical interpolation method (EIM).<br />
Cai and White in [4] applied POD method to a battery<br />
model, but the mayor nonlinearity <strong>of</strong> the system was<br />
assumed to be constant. Starting from the full model and<br />
very fine discretization in space, a speed up factor <strong>of</strong> 4<br />
was obtained. In their work a comparison between full<br />
and reduced for only constant discharge simulations were<br />
done.<br />
We follow the work introduced by Lass and Volkwein<br />
in [7] where POD and EIM were applied to the battery<br />
model <strong>of</strong> Wu-Xu-Zou [10]. We use this procedure and<br />
apply it on more general but more complex battery model<br />
<strong>of</strong> Cifrain [5]. In our work, as in the work <strong>of</strong> Lass,<br />
no simplifications <strong>of</strong> the battery model, as mentioned<br />
previously, are necessary and a speed up factor <strong>of</strong> 15<br />
was obtained for constant discharge simulations.<br />
The paper is organized in the following manner: In<br />
Section II the nonlinear parabolic dynamical system that<br />
describes considered battery model is formulated. Section<br />
III is devoted to the reduced order model (ROM) utilizing<br />
proper orthogonal decomposition (POD) method.<br />
We describe the method <strong>of</strong> POD in general and its<br />
application to the battery system. Moreover, the empirical<br />
interpolation is introduced. In Section IV numerical<br />
results are presented. Finally, in Section V conclusions<br />
are drawn and an outlook on future work is given.<br />
II. BATTERY MODEL<br />
The battery cell consists <strong>of</strong> two electrodes, an anode<br />
and a cathode, and a separator between them. Each<br />
electrode consists <strong>of</strong> particles and an electrolyte, while<br />
in the separator we consider only the electrolyte.<br />
The mathematical model described in [5] and used<br />
here is an electrochemical model similar to the well<br />
known model <strong>of</strong> Newman [6]. It is a coupled dynamical<br />
system <strong>of</strong> four nonlinear partial differential equations.<br />
The system variables are potentials and concentrations<br />
for the electrolyte, φl,cl, for the cathode, φsc,csc, and<br />
for the anode φsa,csa. All state variables describing the<br />
potential and the liquid concentration variable are one<br />
dimensional system variables. Those four variables are<br />
time t ∈ [0,T], T ∈ R, and space x ∈ Ω dependent,<br />
where Ω ⊂ R. Two remaining variables, the variables<br />
for the solid concentration are two dimensional variable,<br />
i.e., cs := cs(x, r, t) ∈ Λ × (0,T) where Λ ⊂ R 2 . Such<br />
model is also referred to as the pseudo-two-dimensional<br />
model; see Figure 1. Considered battery model is given<br />
in the following way:<br />
∂ (φs − φl)<br />
CDSAi<br />
−∇·(σs∇φs) =−AiθjBV in Ω<br />
∂t<br />
′<br />
∂ (φs − φl)<br />
−CDSAi<br />
−∇ ·<br />
RT<br />
zF<br />
∂ (ɛlcl)<br />
∂t<br />
∂t<br />
κl(cl)t +<br />
l<br />
<br />
−∇ · Dl ∇cl + zF<br />
∂cs<br />
∂t<br />
= 1<br />
r 2<br />
−∇·(κL(cl)∇φl)+<br />
(1a)<br />
<br />
1<br />
∇cl = AiθjBV in Ω (1b)<br />
cl<br />
∂ (φs − φl)<br />
− CDSAi<br />
+<br />
∂t<br />
RT μlcl∇φl<br />
<br />
= Aiθ<br />
F jBV in Ω (1c)<br />
<br />
∂<br />
Dsr<br />
∂r<br />
2 fRK(cs) ∂cs<br />
<br />
in Λ (1d)<br />
∂r<br />
strongly coupled with<br />
⎧<br />
⎪⎨<br />
⎪⎩<br />
<br />
αAzF(φs−φ<br />
zFkθ exp<br />
l−UOCV (cs))<br />
+<br />
RT<br />
<br />
−(1−αK )zF(φs−φ<br />
−zFkθ exp<br />
l−UOCV (cs))<br />
RT<br />
in Ω ′<br />
0 in Ωs<br />
jBV =
where the spatial sub-domains are introduced<br />
as Ω = Ωc∪ Ωs ∪ Ωa, Ω ′ Λa<br />
= Ωc∪ Ωa,<br />
=Ωa × [0,Ra] ⊂ R2 , Λc =Ωc × [0,Rc] ⊂ R2 ,<br />
Λ=Λa∪ Λc and Ra,Rc ∈ R. See Section VI for a<br />
list <strong>of</strong> symbols. The system is initialized with constant<br />
values which correspond to an equilibrium state, i.e.,<br />
jBV =0. The boundary conditions are all homogeneous<br />
Neumann conditions at the external boundaries and<br />
continuous flux conditions at internal boundaries<br />
(between electrodes and separator). Exceptions are<br />
the solid potential where either a current is specified<br />
−σS ∂φS<br />
<br />
<br />
∂x = − Γa<br />
I(t)<br />
<br />
or a voltage φS<br />
= U(t). Also<br />
Acell Γa<br />
the solid concentration has a non zero boundary condition<br />
DSfRK(cS) ∂cS<br />
<br />
∂r = − j∗ BV 1 ∂(φS−φL)<br />
F − F<br />
CDS ∂t .<br />
r=Rp<br />
The nonlinear initial value problem (1) is discretized<br />
in space with the Finite Element (FE) method, in time<br />
using the implicit Euler method and linearized with the<br />
damped Newton method. The details on numerics, initial<br />
values and boundary conditions one can find in [8]. After<br />
obtaining the numerical solution <strong>of</strong> the model, the need<br />
for model reduction and speed up has occurred.<br />
<br />
<br />
<br />
Fig. 1. Pseudo-two-dimensional model.<br />
III. MODEL REDUCTION<br />
The reduction <strong>of</strong> linear dynamical systems is a classical<br />
research topic and there exist several well known<br />
methods for this task, e.g. balanced truncation, Krylov<br />
methods, reduced basis and POD. Detailed information<br />
one can found in the standard literature or in [1], [3]<br />
and [9]. A relatively new area in research is the model<br />
reduction <strong>of</strong> nonlinear dynamical systems <strong>of</strong> equations.<br />
A. The POD method.<br />
POD is a commonly used model order reduction technique,<br />
when repeated simulations are to be conducted.<br />
Relevant information is extracted from snapshots generated<br />
with the full model and saved into the POD basis.<br />
Using the POD basis a ROM will be build and for all<br />
further simulations the ROM is used.<br />
Computation <strong>of</strong> the POD basis: The solutions yj ∈<br />
Rm , j =1,...,n, <strong>of</strong> the full system we refer to as<br />
snapshots and we define the following:<br />
Y := [y 1 , ..., y n ] ∈ R m×n .<br />
<br />
- 381 - 15th IGTE Symposium 2012<br />
The goal <strong>of</strong> the POD is to find l ≤ d = dim span (Y ) ≤ n<br />
orthonormal vectors {Ψi} l i=1 in Rm that minimize the<br />
cost function and does the best approximation <strong>of</strong> Y , i.e.,<br />
J(Ψ1,...,Ψl) =<br />
n<br />
αjyj −<br />
j=1<br />
l<br />
〈yj, Ψi〉XΨi 2 , (2)<br />
i=1<br />
where x = √ x T x is the Euclidean norm, αj ≥ 0 are<br />
the weights and 〈x, y〉 L 2 is the inner product. Further,<br />
from the Lagrange functional:<br />
L(Ψ1,...,Ψl,λ11,...,λll) =J(Ψ1,...,Ψl)+<br />
l<br />
+ λij(〈Ψi, Ψj〉X − δij) ,<br />
i,j=1<br />
with the Kronecker symbol δij, we obtain two necessary<br />
optimality conditions:<br />
n<br />
1) αjyj〈yj, Ψi〉X = λiiΨi ,λij =0for i = j<br />
j=1<br />
2) 〈Ψi, Ψj〉X = δij ,i,j=1,...,l.<br />
The second condition is satisfied under the assumption<br />
that the vectors {Ψi} l i=1 are orthonormal and the first<br />
condition is equivalent with<br />
YBY T MΨi = λiΨi , i =1,...,l, (3)<br />
where M ∈ Rm×m is the mass matrix from the FE<br />
simulation, λi = λii, B := diag(α1,...,αn), α1 = δt1<br />
2 ,<br />
αj = δtj+δtj−1<br />
2 ,j=2,...,n− 1 and αn = δtn<br />
2 . There<br />
exist more possibilities to solve (3) and we will consider<br />
the following approach.<br />
a) K-Ansatz (Covariance matrix): By setting ¯ Y :=<br />
M 1<br />
2 YB 1<br />
2 we can define the symmetric matrix K, i.e.,<br />
K := ¯ Y T ¯ Y = B 1<br />
2 Y T MYB 1<br />
2 ∈ R n×n .<br />
After the application <strong>of</strong> the eigenvalue decomposition on<br />
K we obtain the following relation:<br />
Kvj = λjvj , j =1,...,n.<br />
The n eigenvalues <strong>of</strong> the matrix K we denote by λj<br />
and the eigenvectors by vj. We sort the eigenvalues in<br />
decreasing order, i.e., λ1 ≥ λ2 ≥ ... ≥ λl ≥ ... ≥ λn,<br />
and we cut the first l eigenvalues such that following<br />
holds:<br />
n<br />
λj ≤ tolerance ,<br />
j=l+1<br />
since the error <strong>of</strong> the cost function (2) can be calculated<br />
by J(Ψ1,...,Ψl) = n j=l+1 λj. Hence, the choice <strong>of</strong><br />
l strictly depends on the eigenvalues λj, j =1,...,n<br />
and how fast same decay. In Figure 2, the computed<br />
eigenvalues are shown which are scaled by trace <strong>of</strong> matrix<br />
K.<br />
The POD basis consists <strong>of</strong> l functions:<br />
Ψi(x) = 1<br />
n<br />
√ α<br />
λi<br />
1<br />
2<br />
j (¯vi)jy j (x) ∈ R m , i =1,...,l.<br />
j=1
Fig. 2. Example plot <strong>of</strong> decaying eigenvalues.<br />
Denoting the FE basis functions as ϕ, the snapshots<br />
yj have the standard Galerkin representation: yj <br />
=<br />
m<br />
k=1 yj<br />
kϕ(x). This directly yields:<br />
m<br />
Ψi(x) = ψ i kϕ(x), ψ i k = 1<br />
n m<br />
√ α<br />
λi<br />
1<br />
2 j<br />
j (¯vi)jy k .<br />
k=1<br />
j=1 k=1<br />
For later use we define (Ψi)k := ψ i k , Ψi ∈ R m ,i =<br />
1,...,l, and the matrix ˆ Ψ:=[Ψ1, ..., Ψl] ∈ R m×l .<br />
B. Application <strong>of</strong> POD to the battery model.<br />
The POD basis for the battery model is calculated<br />
for each variable separately. The snapshots <strong>of</strong> the full<br />
FE solutions are split up in the data for each <strong>of</strong> the<br />
six variables φl,cl,φsc,φsa,csc,csa and the six matrices<br />
<strong>of</strong> snapshots, Yi ∈ RNi×n , i =1,...,6, are obtained.<br />
Here n is the number <strong>of</strong> time steps and Ni is the<br />
number <strong>of</strong> degrees <strong>of</strong> freedom in the FE Ansatz for the<br />
corresponding variable, i.e., m = 6 i=1 Ni. With these<br />
prerequisites the computation <strong>of</strong> the POD basis for each<br />
<strong>of</strong> the six variables can be carried out, as described above.<br />
The differences in the dimension <strong>of</strong> the domains <strong>of</strong> the<br />
six variables, cause no further changes.<br />
Before we describe how the reduced order model can<br />
be build we need to give a quick overview on how the<br />
full FE system is solved and to introduce some notations.<br />
As the FE method is well known we will not give details<br />
about how one can transform the battery system (1) in<br />
its weak form and apply the Galerkin method on it.<br />
The system, which is discretized in space, is still<br />
continuous in time and has the following form:<br />
M ˙u + K(u)u = f(u) , (4)<br />
where u := [φl,cl,φsc,φsa,csc,csa] T , M is the mass<br />
matrix, K(u) is the stiffness matrix and f(u) is the right<br />
hand side <strong>of</strong> the system. Please note that we solve all<br />
six equations simultaneously, therefore all matrices are<br />
block matrices. For time discretization the implicit Euler<br />
method is applied on (4) and we use<br />
<br />
1<br />
M + K(uk+1) uk+1 − f(uk+1) =<br />
Δt 1<br />
Δt Muk , (5)<br />
- 382 - 15th IGTE Symposium 2012<br />
in every time step k = 1, 2, ..., n. As this system is<br />
fully discretized but still nonlinear, we apply now the<br />
Newton method in order to linearize the system (5). After<br />
introducing the following notation:<br />
<br />
1<br />
A(uk+1) := M + K(uk+1)<br />
Δt<br />
we can rewrite equation (5) and define the operator G as<br />
and F := 1<br />
Δt Muk ,<br />
G(uk+1) :=A(uk+1)uk+1 − f(uk+1) − F =0. (6)<br />
Applying the Newton method we end up calculating the<br />
Newton step<br />
Δu = −[JG(u i k+1)] −1 G(u i k+1) , (7)<br />
in every time step k until the predefined stopping criterion<br />
is met for the sequence {ui k }∞i=0 , ui+1<br />
k+1 = ui k+1 +Δu.<br />
The derivative JG is calculated using the derivatives <strong>of</strong><br />
K and f such that<br />
JG(uk+1) = 1<br />
Δt M + JK(uk+1) − Jf(uk+1). (8)<br />
1) Reduced order model for the battery: Next step<br />
is to build the reduced order model using the POD<br />
basis. We point out that it is necessary to formulate<br />
non homogeneous Dirichlet boundary conditions with a<br />
penalty method.<br />
To obtain the reduced order model the Galerkin method<br />
is used with the POD basis functions, instead <strong>of</strong> the FE<br />
basis functions as before. We denote our calculated POD<br />
basis for each <strong>of</strong> the six variables with:<br />
ˆΨi ∈ R Ni×li 6<br />
, i =1,...,6 , lPOD = li.<br />
i=1<br />
In the case <strong>of</strong> one single equation the mass matrix M <strong>of</strong><br />
the full system and the mass matrix ˆ M <strong>of</strong> the reduced<br />
system are connected via the POD basis ˆ Ψ as follows:<br />
ˆM = ˆ Ψ T M ˆ Ψ , M ∈ R m×m , Mˆ l×l<br />
∈ R . (9)<br />
This can be adapted to a system <strong>of</strong> equations using the<br />
block structure <strong>of</strong> the mass matrix. The mass matrix<br />
M ∈ R m×m in (4) is a block matrix with the following<br />
structure: Mij ∈ R Ni×Nj , i,j =1,...,6. The corresponding<br />
matrix ˆ M ∈ R lPOD×lPOD , in the reduced order<br />
model, can be obtained as follows:<br />
ˆMij = ˆ Ψ T i Mij ˆ Ψj , i,j =1,...,6 ,<br />
where ˆ Mij ∈ R li×lj for i, j =1,...,6. For the right<br />
hand side in (5) it holds analogously:<br />
ˆFi = ˆ Ψ T i Fi ∈ R li , i,j =1,...,6 ,<br />
where Fi ∈ R Ni , F =[F1, ..., F6] T ∈ R m and ˆ F =<br />
[ ˆ F1, ..., ˆ F6] T ∈ R lPOD . To improve readability we use<br />
from now on L(M) := ˆ M and L(F ):= ˆ F .<br />
There are many hidden nonlinearities in equations (7)<br />
which have to be evaluated for each time and Newton<br />
step in the full FE dimension m. This can be accessed<br />
via the relation (uk+1)i = ˆ Ψi (ûk+1)i.
The corresponding equations to (6) and (8), for the<br />
reduced order model, are<br />
L(G(ûk+1)) :=L(A( ˆ Ψûk+1))ûk+1 −L(f( ˆ Ψûk+1))<br />
−L(F ) (10a)<br />
L(JG(ûk+1)) := 1<br />
Δt L(M)+L(JK( ˆ Ψûk+1))<br />
−L(Jf( ˆ Ψûk+1)) (10b)<br />
then the Newton step and the iterations are defined by<br />
Δû = −[L(JG(ûk+1))] −1 L(G(ûk+1)) , (11a)<br />
ûk+1 = Luk +Δû. (11b)<br />
C. Application <strong>of</strong> EIM<br />
The evaluation <strong>of</strong> nonlinearities in (10) is very slow.<br />
EIM can be used to reduce number <strong>of</strong> necessary evaluations<br />
<strong>of</strong> these nonlinearities, see [2] for details on<br />
the method. Let g be a given nonlinear function, which<br />
can be evaluated point wise. Needed are snapshots <strong>of</strong><br />
this function with fast decaying eigenvalues. For the<br />
basis construction a greedy algorithm for the biggest<br />
residual is used, which stops at a given tolerance. The<br />
returned basis, Υ=[υ1,...,υlEIM ], has lEIM entries<br />
and also the indices for the evaluation <strong>of</strong> the nonlinearity<br />
p =[p1,...,plEIM ] are given. The nonlinear function g<br />
is approximated as follows:<br />
g(y) =Υc(y) =<br />
lEIM <br />
i=1<br />
υici(y). (12)<br />
To compute the coefficients c the above equality can be<br />
transformed to:<br />
P T Υc(y) =P T g(y) ⇔ c(y) =(P T Υ) −1 P T g(y) (13)<br />
with the matrix P =[ep1,...,epl ] where epi ∈ Rm<br />
EIM<br />
is the pith unit vector.<br />
Since the singular values <strong>of</strong> the nonlinearity on the<br />
right hand side <strong>of</strong> the system decay slowly, the use <strong>of</strong><br />
EIM is not useful here. Luckily, the eigenvalues on the<br />
left hand side <strong>of</strong> the system decay fast and EIM can be<br />
used. This substitutes the matrix assembly.<br />
In equation (10) the nonlinearities are part <strong>of</strong> the<br />
stiffness matrix K and its derivative JK. As both cases<br />
are handled analogously we explain the case for the<br />
stiffness matrix only. The nonlinearities in the different<br />
blocks <strong>of</strong> the stiffness matrix are dealt separately and<br />
therefore we drop one pair <strong>of</strong> indices and denote the<br />
entries <strong>of</strong> an arbitrary block <strong>of</strong> the stiffness matrix as<br />
<br />
Kij = g(y)∇ϕi∇ϕjdx, i, j =1,...,m, (14)<br />
Ω<br />
where g(y) is the nonlinear function to be approximated.<br />
When we now insert equation (12) in (14) and combine<br />
this with the reduced order stiffness matrix ˆ K = ˆ Ψ T K ˆ Ψ<br />
- 383 - 15th IGTE Symposium 2012<br />
we obtain:<br />
ˆK = ˆ Ψ T<br />
=<br />
lEIM<br />
<br />
lEIM <br />
k=1<br />
k=1<br />
<br />
<br />
ck(y) υk∇ϕi∇ϕjdx<br />
Ω<br />
ck(y) ˆ Ψ T<br />
<br />
<br />
υk∇ϕi∇ϕjdx ˆΨ . (15)<br />
<br />
Ω<br />
<br />
ij<br />
<br />
The underlined expressions can be precomputed, these<br />
are lEIM matrices <strong>of</strong> dimension lPOD×lPOD. Instead <strong>of</strong><br />
the assembly <strong>of</strong> the stiffness matrix the coefficients c have<br />
to be calculated using (13), this includes lEIM function<br />
evaluations. These coefficients are then used in the above<br />
formula where a linear combination <strong>of</strong> the precomputed<br />
matrices is calculated. When lEIM is small a massive<br />
speed up can be gained by this method.<br />
IV. RESULTS<br />
In general, our goal is to replace a slow FE simulation<br />
with a fast ROM simulation by allowing a small relative<br />
error between those two simulations. In this section we<br />
will compare the FE solutions and the ROM solutions<br />
for accuracy and speed archived by three different approaches.<br />
We will start with one very simple example.<br />
A. Simple simulation <strong>of</strong> battery.<br />
We consider a simulation where the battery is charged,<br />
discharged and charged again between 3.8V and 2.5V (all<br />
three times with 0.2C); see the solid blue line in Figure 3.<br />
To solve the full model the following spatial discretization,<br />
given in degrees <strong>of</strong> freedom (DOFs), for the single<br />
variables is used:<br />
φl φsc φsa cl csc csa total<br />
241 101 101 241 5151 5151 10986<br />
The simulated time is 61586s and 1451 time steps are<br />
used due to the adaptive time stepping algorithm. The<br />
solutions <strong>of</strong> the full FE system is plotted in Figure 4.<br />
These solutions were taken as snapshots for the POD<br />
basis computation for each <strong>of</strong> the six variables. When we<br />
cut the scaled eigenvalues, at the tolerance <strong>of</strong> 10−8 , the<br />
number <strong>of</strong> POD basis functions for the single variables<br />
is as follows:<br />
Fig. 3. Simple simulation <strong>of</strong> battery.<br />
ij<br />
ˆΨ
Fig. 4. 3d plots <strong>of</strong> full FE solution for first example.<br />
φl φsc φsa cl csc csa total<br />
5 3 3 3 14 12 42<br />
The total dimension <strong>of</strong> the ROM is therefore 42. Also<br />
EIM bases for the different nonlinearities are computed.<br />
The ROM is then solved using these POD and EIM bases.<br />
For a comparison between the FE and the ROM solution<br />
we consider the L2-error. For voltage and current the<br />
absolute error is 6.6 · 10−06 and 1.0 · 10−04 , respectively.<br />
A difference between the voltage <strong>of</strong> the FE solution and<br />
the ROM solution is shown with asterisk line in Figure 3.<br />
Also we consider the relative error <strong>of</strong> each variable which<br />
is defined for φl as:<br />
eφl :=<br />
<br />
<br />
<br />
<br />
<br />
||φFE l − φROM<br />
l || L2 (Ω)<br />
||φFE l || L2 (Ω)<br />
<br />
<br />
<br />
<br />
<br />
L 2 (0,T )<br />
, (16)<br />
and for all other variables analogously. The resulting<br />
relative errors for each variable are:<br />
φl φsc φsa<br />
2.2 · 10 −6<br />
1.3 · 10 −6<br />
cl csc csa<br />
1.7 · 10 −7<br />
4.2 · 10 −6<br />
2.6 · 10 −5<br />
5.5 · 10 −6<br />
All <strong>of</strong> these relative errors are acceptably small. Now<br />
that we checked the accuracy <strong>of</strong> the ROM solution we<br />
are interested in the speed <strong>of</strong> the computation. The full<br />
FE solution took 2828s in CPU time while the ROM<br />
solution lasted 189s, which means that a speed up <strong>of</strong><br />
factor 15 was achieved.<br />
B. More general POD and EIM bases.<br />
We need POD and EIM bases which are more general<br />
and applicable for a broader range <strong>of</strong> use cases, concerning<br />
the battery behavior. This is achieved this by building<br />
one basis out <strong>of</strong> a set <strong>of</strong> similar simulations.<br />
In battery simulation a natural choice for this set <strong>of</strong><br />
simulations is the discharge <strong>of</strong> the battery at different<br />
C-rates. To be more precise: we discharge the battery<br />
from 3.8V until 2.5V at the C-rates 0.1, 0.2, 0.5, 1,<br />
2, 3, 4 and 5C. From these set <strong>of</strong> snapshots we build<br />
the POD and EIM bases by using again the tolerance <strong>of</strong><br />
10−8 . The ROM is <strong>of</strong> a dimension <strong>of</strong> 85, compared to<br />
the full dimension <strong>of</strong> 10986 DOFs. To make sure that<br />
- 384 - 15th IGTE Symposium 2012<br />
Fig. 5. Voltage <strong>of</strong> simulations at different C-rates.<br />
Fig. 6. Voltage <strong>of</strong> simulations not used for POD basis calculation.<br />
the dynamics <strong>of</strong> the different C-rates are captured well,<br />
we repeat all discharge simulations used for building<br />
the basis with the ROM. In Figure 5 we plotted the<br />
calculated voltage <strong>of</strong> the full FE solution and the ROM<br />
solution using the new POD and EIM bases. We detected<br />
a extremely small L 2 relative error and we conclude that<br />
the results are in good agreement for all used C-rates.<br />
Next we want to find out whether C-rates not used<br />
for the basis computation, are also simulated well by the<br />
ROM. For this purpose we simulate the C-rates: 0.15,<br />
0.3 and 3.5 with the full FE system and the ROM. The<br />
results are in good accordance as can be seen in Figure 6.<br />
We conclude that all C-rates between 0.1 and 5 can be<br />
simulated well with the ROM using the improved, more<br />
general, basis.<br />
C. POD basis switching.<br />
With the single POD basis we can simulate well<br />
different discharges in the 0.1 - 5 C-rate range and now<br />
we want to use the ROM for a simulation that involves<br />
charge and discharge processes. We have the set <strong>of</strong> POD<br />
and EIM bases for a discharge. Equivalently using the<br />
same C-rates, we calculate the POD and the EIM basis for<br />
the charge processes. Using the same tolerance, 10−8 ,we
Fig. 7. Voltage <strong>of</strong> simulations where bases for ROM computation<br />
were switched.<br />
obtain 110 POD basis functions. In the ROM simulation<br />
we can then switch between these two sets <strong>of</strong> bases.<br />
This is demonstrated through the simulation <strong>of</strong><br />
charge/discharge processes were charge and discharge are<br />
simulated at C-rates not included in the bases computation.<br />
The voltage plot <strong>of</strong> the full FE solution and the<br />
ROM solution can be seen in Figure 7.<br />
The voltage curves are in good accordance, only at the<br />
switching points there occurs an error. Both the discharge<br />
and the charge POD basis span a subspace in the space <strong>of</strong><br />
all possible solutions. When switching between discharge<br />
to the charge basis (or vice versa) the ROM solution is<br />
projected from one subspace to the other and an error<br />
occurs.<br />
Below the relative errors for each variable are given,<br />
see (16) for the definition <strong>of</strong> the error norm.<br />
φl φsc φsa<br />
1.4 · 10 −5<br />
3.4 · 10 −6<br />
cl csc csa<br />
4.1 · 10 −6<br />
1.3 · 10 −5<br />
3.8 · 10 −5<br />
3.0 · 10 −5<br />
Naturally the error in this simulation is bigger than in<br />
the first example, nevertheless the results are acceptable.<br />
This method seems to be very suitable to allow a fast<br />
simulation <strong>of</strong> different processes, as pulses, rest steps,<br />
cycling etc. To find appropriate POD and EIM bases for<br />
these processes, which also give a small projection error,<br />
remains our next task.<br />
V. CONCLUSION<br />
For speeding up the numerical simulation <strong>of</strong> the<br />
lithium-ion battery model, a reduced order model was<br />
build utilizing proper orthogonal decomposition method<br />
and empirical interpolation method.<br />
Speed up factors <strong>of</strong> 10-15 (depending on the considered<br />
case) with acceptable error was obtained. The<br />
developed method <strong>of</strong> switching between POD and EIM<br />
bases for different purposes, shows good results for<br />
charging/discharging processes at different C-rates. This<br />
- 385 - 15th IGTE Symposium 2012<br />
method is very promising to allow a fast simulation <strong>of</strong><br />
different processes, e.g. pulses, rest steps, cycling etc.<br />
Our future research will aim for the construction <strong>of</strong><br />
appropriate POD bases for these processes. Also the<br />
question whether the projection error can be minimized<br />
will be considered.<br />
Acknowledgment:<br />
The authors gratefully acknowledge financial support<br />
from “Zukunftsfonds des Landes Steiermark” <strong>of</strong> the Federal<br />
Province <strong>of</strong> Styria/Austria for the project in which<br />
the above presented research results were achieved.<br />
VI. LIST OF SYMBOLS<br />
cs<br />
concentration <strong>of</strong> Li + in active material<br />
cl<br />
concentration <strong>of</strong> Li + in electrolyte<br />
Φs<br />
electrochemical potential <strong>of</strong> active material<br />
Φl<br />
electrochemical potential <strong>of</strong> electrolyte<br />
Ai<br />
inner active surface<br />
αA,αK anodic/cathodic charge transfer coefficients<br />
CDS double layer capacity<br />
Dl<br />
solution diffusivity<br />
Ds<br />
solid diffusivity<br />
εl<br />
electrolyte volume fraction<br />
jBV<br />
Butler-Volmer current density<br />
F Faraday’s constant (= 96485Cmol −1 )<br />
k exchange current density and reaction rate<br />
κl(cl) ionic conductivity function<br />
μl<br />
migration coefficient<br />
σs<br />
electronic conductivity<br />
R universal gas constant (= 8,31447 Jmol −1 K −1 )<br />
T temperature<br />
t time<br />
t +<br />
transference number<br />
UOCV (cs) equilibrium potential function<br />
USEI ohmic loss <strong>of</strong> potential due to solid electrolyte interface at<br />
ΩA<br />
z number <strong>of</strong> transfered electrons (for Li + : z =1)<br />
REFERENCES<br />
[1] A.C. Antoulas, “Approximation <strong>of</strong> Large-Scale Dynamical Systems.”<br />
in Siam, 2005.<br />
[2] M. Barrault, Y. Maday, N. Nguyen, A. Patera, “An empirical<br />
interpolation method: Application to efficient reduced basis<br />
discretization <strong>of</strong> partial differential equations.” Comptes Rendus<br />
Mathematique, 339: 667-672, 2004.<br />
[3] P. Benner, V. Mehrmann, D.C. Sorensen et. al., “Dimension Reductin<br />
<strong>of</strong> Large-Scale Systems.” Springer, 2003.<br />
[4] L. Cai and R.E. White, “An Efficient Electrochemical–Thermal<br />
Model for a Lithium-Ion Cell by Using the Proper Orthogonal<br />
Decomposition Method.” in J. Electrochem. Soc., vol. 157, pp.<br />
A1188-A1195, 2010<br />
[5] M. Cifrain et. al., “Elektrochemisches Zellmodell.” publication in<br />
preparation, 2012.<br />
[6] M. Doyle, T.F. Fuller, J. Newman, “Modeling <strong>of</strong> Galvanostatic<br />
Charge and Discharge <strong>of</strong> the Lithium/Polymer/Insertion Cell.” in<br />
J. Electrochem. Soc., vol. 140 (7), pp. 1526–1533, 1993.<br />
[7] O. Lass and S. Volkwein, “POD Galerkin schemes for nonlinear<br />
elliptic-parabolic systems.” submitted for publication in 2011<br />
[8] F. Pichler, “Anwendung der Finite-Elemente Methode auf ein<br />
Litium-Ionen Batterie Modell.” Master Thesis, <strong>University</strong> <strong>of</strong> <strong>Graz</strong>,<br />
2011.<br />
[9] S. Volkwein, “Proper orthogonal decomposition (POD) for nonlinear<br />
systems.” PhD program in Mathematics for <strong>Technology</strong><br />
Catania, 2007.<br />
[10] J. Wu, J. Xu, H. Zou, “On the well-posedness <strong>of</strong> a mathematical<br />
model for lithium-ion battery systems.” Methods and Applications<br />
<strong>of</strong> Analysis, 13:275-298, 2006.
- 386 - 15th IGTE Symposium 2012<br />
Automatic domain detection for a meshfree postprocessing<br />
in boundary element methods<br />
André Buchau, Matthias Jüttner, and Wolfgang M. Rucker<br />
Universität Stuttgart, Institut für Theorie der Elektrotechnik, Pfaffenwaldring 47, 70569 Stuttgart, Germany<br />
E-mail: andre.buchau@ite.uni-stuttgart.de<br />
Abstract—Modern advanced visualization techniques for three-dimensional electromagnetic fields evaluate field values in<br />
some points in space, which are determined only during the computation <strong>of</strong> visual objects like streamlines. Furthermore, a<br />
meshfree post-processing in boundary element methods along with a bidirectional coupling <strong>of</strong> numerical field computations<br />
with a visualization tool are advisable to reduce significantly computational costs and the total amount <strong>of</strong> stored data. However,<br />
a completely automatic domain detection method is then required. Domain data like material values are not explicitly<br />
defined due to the lack <strong>of</strong> a volume mesh but are necessary for a correct computation <strong>of</strong> field values in arbitrary points. Here,<br />
a robust and fast octree-based method is presented to determine the domain data <strong>of</strong> an evaluation point efficiently even for<br />
large and complex field problems. The computational costs <strong>of</strong> position detection <strong>of</strong> a single evaluation point are kept small by<br />
filtering <strong>of</strong> relevant boundary elements. Furthermore, position data <strong>of</strong> other evaluation points is used if possible.<br />
Index Terms—boundary element methods, domain detection methods, meshfree post-processing, octree-based schemes<br />
I. INTRODUCTION<br />
A very important step in numerical field computations<br />
is an extensive post-processing including a vivid visualization<br />
<strong>of</strong> the obtained results. Today, visualization tools<br />
like virtual and augmented reality along with modern<br />
visualization techniques enable even experienced engineers<br />
a deep insight into the physical properties <strong>of</strong> the<br />
studied problem [1, 2]. However, an expressive visualization<br />
<strong>of</strong> three-dimensional fields is still a challenge. One<br />
possibility is to use volume rendering in the case <strong>of</strong> scalar<br />
data [3]. An interesting approach, which has been presented<br />
for two-dimensional magnetic fields, is to compute<br />
visual objects that represent the topology <strong>of</strong> the vector<br />
field [4]. Further techniques, which are more commonly<br />
used, are to filter three-dimensional data first and to visualize<br />
three-dimensional fields in slices or to compute<br />
streamlines or isosurfaces. Data exchange between the<br />
numerical field computation tool and the visualization<br />
tool is normally done with the help <strong>of</strong> volume meshes <strong>of</strong><br />
all considered domains. The field values are precomputed<br />
in all nodes <strong>of</strong> this mesh and transferred to the<br />
visualization tool that performs the post-processing independently<br />
<strong>of</strong> the numerical field computation tool.<br />
The boundary element method (BEM) is a very attractive<br />
method for the solution <strong>of</strong> three-dimensional electromagnetic<br />
field problems, which consist <strong>of</strong> multiple,<br />
piece-wise homogeneous, linear media. Then, a modeling<br />
and discretization <strong>of</strong> domain surfaces suffices. Hence, the<br />
discretized model is much smaller than in volume-based<br />
methods like the finite element method (FEM). Furthermore,<br />
well-established compression techniques for the<br />
linear system <strong>of</strong> equations exist to enable a fast and efficient<br />
solution <strong>of</strong> large and complex field problems [5].<br />
However, an additional volume mesh is normally created<br />
for the post-processing [6]. Domain data like material<br />
values are assigned to the auxiliary mesh and are available<br />
for the computation <strong>of</strong> field values in the mesh nodes.<br />
The application <strong>of</strong> the fast multipole method (FMM)<br />
enables the computation <strong>of</strong> field values in a huge number<br />
<strong>of</strong> points at acceptable computational costs [7], but the<br />
amount <strong>of</strong> data, which must be stored and transferred to<br />
the visualization tool, is relatively large and a bottleneck.<br />
A better approach, which fits much more the basic<br />
concept <strong>of</strong> a BEM, is a meshfree post-processing [8].<br />
There, field values are only computed at points, which are<br />
necessary for visualization. Hence, the number <strong>of</strong> evaluation<br />
points is dramatically reduced in comparison to the<br />
classical approach, which uses an auxiliary volume mesh.<br />
Furthermore, the meshfree approach is much more flexible.<br />
Post-processing domains and visualization techniques<br />
are defined completely after the solution <strong>of</strong> the<br />
problem. The creation <strong>of</strong> an expensive volume mesh is<br />
unnecessary and the BEM is integrated into the visualization<br />
tool. However, automatic domain detection is required<br />
to make a meshfree post-processing applicable for<br />
problems, which consist <strong>of</strong> multiple domains.<br />
Here, a novel method is presented that completely automatically<br />
detects the position <strong>of</strong> an arbitrary evaluation<br />
point directly from the given boundary element mesh.<br />
The octree-based method is fast and efficient to enable<br />
extensive post-processing <strong>of</strong> large and complex field<br />
problems. A very small number <strong>of</strong> boundary elements are<br />
extracted from the total model to perform the position<br />
detection efficiently with a method that is similar to the<br />
well-known ray tracing method [9]. Furthermore, domain<br />
data <strong>of</strong> other evaluation points is used if possible.<br />
The paper is structured as follows. First, the problem<br />
<strong>of</strong> position detection directly from boundary elements is<br />
formulated and the concept <strong>of</strong> the presented method is<br />
shown. Then, a flexible octree-based scheme is introduced<br />
to enable a grouping <strong>of</strong> all boundary elements<br />
regarding their position in three-dimensional space. It is<br />
applied to determine the position <strong>of</strong> an evaluation point<br />
with the help <strong>of</strong> other evaluation points <strong>of</strong> the same domain<br />
or to filter candidate boundary elements for the<br />
actual position detection, which is described afterwards.<br />
There, a method similar to ray tracing is presented to find<br />
a boundary element that is the closest boundary element<br />
to the given evaluation point with the help <strong>of</strong> a small list<br />
<strong>of</strong> candidates. Two numerical examples have been studied<br />
to demonstrate robustness and efficiency <strong>of</strong> the presented<br />
automatic domain detection method. Finally, significance<br />
<strong>of</strong> the method to BEM and an outlook to future<br />
work are given in the conclusions.
II. NUMERICAL METHOD<br />
In general, automatic domain detection is required for<br />
each evaluation point in a meshfree BEM postprocessing.<br />
Since both the number <strong>of</strong> evaluation points<br />
and the number <strong>of</strong> boundary elements are <strong>of</strong>ten very large<br />
in practical three-dimensional problems, a fast algorithm,<br />
which exploits properties <strong>of</strong> BEM, is advisable.<br />
The concept <strong>of</strong> the presented approach is presented in<br />
the first sub-section. There, a technical formulation <strong>of</strong> the<br />
problem is given along with a comparison to ray tracing<br />
methods. The second sub-section is about the novel fast<br />
octree-based scheme. It is the key to a successful application<br />
<strong>of</strong> meshfree post-processing in large BEM problems.<br />
Finally, an efficient and general method <strong>of</strong> ray tests based<br />
on gradient search is given in the third sub-section. Note,<br />
the presented approach has been developed to achieve<br />
two goals, an efficient and robust automatic domain detection<br />
method and a flexible octree for fast and efficient<br />
post-processing computations using the fast multipole<br />
method (FMM).<br />
A. Concept <strong>of</strong> automatic position detection<br />
The aim <strong>of</strong> the presented automatic domain detection<br />
method is to determine the domain <strong>of</strong> an arbitrary<br />
evaluation point, which is defined by its global Cartesian<br />
coordinates, e. g. during the computation <strong>of</strong> a streamline,<br />
=<br />
. (1)<br />
The complete BEM problem is discretized by in total<br />
boundary elements. The shape <strong>of</strong> boundary elements<br />
and the order <strong>of</strong> their shape functions can be arbitrarily<br />
chosen for the presented domain detection method. The<br />
direction <strong>of</strong> the normal vector <strong>of</strong> a boundary element<br />
is known. Furthermore, the domain , which lies in<br />
direction <strong>of</strong> , and the domain , which lies in<br />
direction <strong>of</strong> , are stored for each boundary element.<br />
The concept <strong>of</strong> the presented approach is to construct a<br />
single ray, which starts at the given evaluation point<br />
= + , (2)<br />
where is the direction <strong>of</strong> the ray and 0 is a parameter.<br />
Then, the domain is determined with the<br />
help <strong>of</strong> the first boundary element, which is intersected<br />
by the ray (2). An example configuration is given in<br />
Fig. 1. The red point is the given evaluation point, the<br />
blue line is the ray, and the green and yellow lines are<br />
two boundary elements. Here, the domain <strong>of</strong> the evaluation<br />
point is = <strong>of</strong> the green boundary element.<br />
Fig. 1: Example <strong>of</strong> domain detection with the help <strong>of</strong> a ray<br />
The concept <strong>of</strong> domain detection is similar to the wellknown<br />
ray tracing method [9]. The main difference is that<br />
the direction <strong>of</strong> the ray is unknown. In ray tracing, the<br />
direction corresponds to the view direction. Furthermore,<br />
a single ray suffices for successful domain detection. In<br />
ray tracing <strong>of</strong> computer graphics, a ray for each pixel <strong>of</strong><br />
the screen is constructed. Hence, an adapted octree-based<br />
algorithm for BEM is presented in the next sub-section.<br />
- 387 - 15th IGTE Symposium 2012<br />
B. Fast octree-based algorithm<br />
A fast algorithm is necessary to enable an application<br />
<strong>of</strong> the automatic domain detection method to large BEM<br />
problems. The costs <strong>of</strong> a standard implementation <strong>of</strong> the<br />
concept, which has been presented in the previous subsection,<br />
are proportional to . Hence, an efficient filtering<br />
<strong>of</strong> relevant boundary elements is required to reduce<br />
computation time. Furthermore, the computational costs<br />
are proportional to the number <strong>of</strong> evaluation points , which are given by the visualization tool. However, an<br />
evaluation point is <strong>of</strong>ten close to a previous evaluation<br />
point, for instance points on a streamline. An approach to<br />
reduce the computational costs is to use domain data <strong>of</strong><br />
previous evaluation points if possible.<br />
Trees are a common method to group and select<br />
boundary elements in three-dimensional space. Here, an<br />
octree has been chosen, since the FMM is used to accelerate<br />
BEM computations and the FMM is based on an<br />
octree, too [10]. An implementation <strong>of</strong> the octree, which<br />
uses modern s<strong>of</strong>tware techniques, enables the application<br />
<strong>of</strong> the same code foundations for both domain detection<br />
and post-processing computations. Then, code is more<br />
reliable and the complete method works more robust.<br />
The first step is to initialize the octree using all boundary elements. The so-called root cube at octree<br />
level 0 is the smallest cube, which encloses all boundary<br />
elements. Its edges are parallel to the axes <strong>of</strong> the global<br />
Cartesian coordinate system. Then, the root cube is subdivided<br />
into eight equal sized cubes and the boundary<br />
elements are assigned to these so-called child cubes. Each<br />
child cube is again subdivided into child cubes. The subdivision<br />
is continued while the boundary elements <strong>of</strong> a<br />
cube can be assigned to its child cubes or the total number<br />
<strong>of</strong> octree levels is smaller than a given limit.<br />
Boundary elements are assigned to a cube, if their centroid<br />
lies inside the cube. Of course, the position <strong>of</strong><br />
boundary elements and their real dimensions must be<br />
considered. The bounding box <strong>of</strong> a cube is determined<br />
including all its boundary elements. This bounding box<br />
must completely lie inside a test cube with the same center<br />
as the considered cube and an edge length <strong>of</strong><br />
The domain detection starts with the addition <strong>of</strong> the<br />
evaluation point to the octree. If is outside the<br />
root cube <strong>of</strong> the octree, the spatial domain <strong>of</strong> the octree is<br />
enlarged by creating a new root cube. Of course, is<br />
increased in that case. The evaluation point is assigned to<br />
a cube in the same way as the boundary elements during<br />
initialization. Here, the subdivision <strong>of</strong> a cube with an<br />
evaluation point is aborted, if no boundary elements are<br />
assigned to the cube <strong>of</strong> the evaluation point. The number<br />
<strong>of</strong> evaluation points in a cube is not taken into account.<br />
Hence, the cubes <strong>of</strong> evaluation points are chosen as large<br />
as possible.<br />
An example is given in Fig. 2. The black lines with<br />
black points represent some boundary elements. The red<br />
point is an evaluation point. The thick black lines are the<br />
octree cubes, which have been created according to the<br />
above-described rules. One boundary element is assigned<br />
to one cube and the boundary elements stick slightly out<br />
<strong>of</strong> the cubes. In contrast, the cube <strong>of</strong> the evaluation point<br />
is relatively large.<br />
Fig. 2: Example <strong>of</strong> an octree for position detection<br />
A strategy is to reduce the number <strong>of</strong> actual domain<br />
detections. Hence, a goal is to determine the domain not<br />
only for the given evaluation point but also for the cube<br />
<strong>of</strong> the evaluation point if possible. Consequently, an initial<br />
cube is searched that fulfills the following conditions.<br />
The octree level <strong>of</strong> a cube <strong>of</strong> an evaluation point is<br />
maximal the finest octree level <strong>of</strong> cubes <strong>of</strong> boundary<br />
elements. While the octree level criterion is satisfied, the<br />
cube <strong>of</strong> the evaluation point is refined until no boundary<br />
elements are assigned to that cube or no boundary elements<br />
stick into this cube. To avoid expensive tests based<br />
on the real position <strong>of</strong> boundary elements, first a cube is<br />
searched that has no neighbors with boundary elements.<br />
If the cube <strong>of</strong> the evaluation point already has no neighbors<br />
with boundary elements, its parent cube is tested for<br />
the above-described rules.<br />
The black cube <strong>of</strong> the evaluation point in Fig. 2 has<br />
neighbor cubes with boundary elements. Hence, the black<br />
cube is refined and the blue cube is obtained. Since the<br />
blue cube has also neighbor cubes with boundary elements,<br />
it is refined as well and the green cube is the initial<br />
cube for domain detection.<br />
If no elements are lying inside the cube , the domain<br />
is determined for including all its child cubes. The<br />
domain data <strong>of</strong> the cube is then used for all evaluation<br />
points, which are assigned to the cube at a later moment.<br />
Neighbor cubes <strong>of</strong> the cube are at several octree levels<br />
due to the adaptive octree rules <strong>of</strong> octree initialization.<br />
Furthermore, octree structure is changing during the post-<br />
- 388 - 15th IGTE Symposium 2012<br />
processing. Hence, it is not possible to determine cube<br />
neighbors in advance. Neighbor cubes, or cubes in general,<br />
are searched by the position <strong>of</strong> the cube center. Here,<br />
the centers <strong>of</strong> possible neighbor cubes are computed and<br />
the cubes are searched starting from the root cube by<br />
simple and fast comparison <strong>of</strong> Cartesian coordinates.<br />
If the domain data <strong>of</strong> a neighbor cube <strong>of</strong> cube has<br />
been already determined, it can be used for the cube ,<br />
too. Otherwise, a cube with boundary elements <strong>of</strong> the<br />
second neighbors <strong>of</strong> cube has to be chosen for domain<br />
detection. Since no elements are assigned to the cube <br />
and its neighbor cubes, one second neighbor cube, which<br />
adjoins a neighbor cube <strong>of</strong> , suffices for domain detection.<br />
At least one second direct neighbor cube with<br />
boundary elements exists, because the cube is chosen<br />
as large as possible as described above.<br />
After a cube has been chosen, the neighbor cube<br />
<strong>of</strong> , which lies between and , is determined.<br />
First, all boundary elements, which stick into , are<br />
searched. Note, most neighbors <strong>of</strong> are without<br />
boundary elements and only neighbors, which adjoin<br />
, have to be considered. In practice, the number <strong>of</strong><br />
relevant cubes is small. Furthermore, the boundary elements,<br />
which are assigned to and which stick into<br />
, are determined. In total, a list with relevant<br />
boundary elements is obtained. To improve robustness <strong>of</strong><br />
the domain detection method, a new point for domain<br />
detection is defined inside and close to .<br />
Some examples <strong>of</strong> typical situations <strong>of</strong> domain detection<br />
are depicted in Fig. 3.<br />
Fig. 3: Some typical situations <strong>of</strong> domain detection<br />
First, the domain <strong>of</strong> the red evaluation point is determined.<br />
Since no boundary elements are assigned to the<br />
red cube, the domain <strong>of</strong> the red cube is evaluated. The<br />
black cube with the red lines is chosen as cube . The<br />
orange evaluation point inside is used for the domain<br />
detection <strong>of</strong> the read evaluation point. Next, the<br />
domain <strong>of</strong> the blue evaluation point is detected. Since the<br />
blue evaluation point lies within a cube with boundary<br />
elements, these boundary elements are used for domain<br />
detection. Finally, the domain <strong>of</strong> the green evaluation<br />
point is determined with the help <strong>of</strong> domain data <strong>of</strong> its red<br />
neighbor cube.<br />
As already mentioned, the list <strong>of</strong> relevant<br />
boundary elements for domain detection includes all<br />
boundary elements, which are assigned to a relevant cube<br />
and all boundary elements <strong>of</strong> neighbor cubes, which stick<br />
into the relevant cube. A fast method to test whether a<br />
boundary element sticks into a cube is to test an intersection<br />
<strong>of</strong> the bounding box <strong>of</strong> the cube and <strong>of</strong> the bounding
ox <strong>of</strong> the boundary element. Two examples <strong>of</strong> boundary<br />
elements inside a cube are given in Fig. 4. Although no<br />
boundary elements are assigned to the blue cube, the blue<br />
boundary element <strong>of</strong> its neighbor cube must be taken into<br />
account, since it sticks into the blue cube. In the case <strong>of</strong><br />
the red cube, one red boundary element, which is assigned<br />
to the red cube, and one red boundary element <strong>of</strong><br />
its neighbor cube have to be considered.<br />
Fig. 4: Examples <strong>of</strong> elements inside a cube<br />
In total, the presented octree-based method is fast,<br />
since is approximately independent <strong>of</strong> , at least<br />
in the case <strong>of</strong> large problems. If possible, domain data for<br />
cubes is determined. The domain detection with the help<br />
<strong>of</strong> the filtered boundary elements is described in<br />
the following sub-section.<br />
C. Domain detection from boundary elements<br />
Starting position <strong>of</strong> a domain detection from boundary<br />
elements is a very small list <strong>of</strong> boundary elements,<br />
which is determined using the octree-based method <strong>of</strong> the<br />
previous sub-section, and the given evaluation point (1).<br />
Furthermore, domain detection from boundary elements<br />
is only necessary, if domain data <strong>of</strong> octree cubes cannot<br />
be used. Hence, general applicability and extension possibilities<br />
<strong>of</strong> the following method are more important than<br />
pure efficiency considerations.<br />
The initial step is to sort the boundary elements<br />
by their distance to the evaluation point<br />
= , 0
III. NUMERICAL EXAMPLES<br />
The presented automatic domain detection method for<br />
BEM has been tested on two numerical examples. The<br />
first example is a capacitor, which can be simply discretized<br />
with different sizes <strong>of</strong> boundary elements. Hence, it<br />
is well suited for fundamental tests <strong>of</strong> the automatic domain<br />
detection method. The second example is an inductor<br />
<strong>of</strong> a micro-electro-mechanical system. It represents a<br />
typical configuration in an application <strong>of</strong> BEM with <strong>of</strong>ten<br />
changing domains in a slice. There, domain data <strong>of</strong> evaluation<br />
points cannot be easily defined and the power <strong>of</strong><br />
the presented automatic domain detection method is<br />
clearly demonstrated.<br />
The domain detection method has been implemented in<br />
C# using the .NET framework 4.0 [11]. C# is a managed<br />
language and it supports very well necessary data handling<br />
<strong>of</strong> the domain detection method. Furthermore, the<br />
interface <strong>of</strong> C# to native C++ enables the use <strong>of</strong> existing<br />
high-performance code <strong>of</strong> numerical methods, for instance<br />
the used BEM and FMM implementation. Bounding<br />
boxes including intersection tests or vector operations<br />
are standard methods <strong>of</strong> the Windows Presentation<br />
Framework (WPF), which is part <strong>of</strong> the .NET framework.<br />
Furthermore, the Windows Communication Foundation<br />
(WCF) <strong>of</strong> the .NET framework enables an interactive<br />
data exchange between different processes based on extensible<br />
markup language (XML) over hypertext<br />
transport protocol (http). Here, WCF is applied to couple<br />
the process <strong>of</strong> the visualization tool HLRS COVISE,<br />
which is developed at the High Performance Computing<br />
Center at the <strong>University</strong> <strong>of</strong> Stuttgart, with the implementation<br />
<strong>of</strong> the domain detection method. HLRS COVISE<br />
visualizes three-dimensional data with the help <strong>of</strong> virtual<br />
and augmented reality techniques [1].<br />
The surfaces <strong>of</strong> both numerical examples have been<br />
discretized with second order, quadrilateral boundary<br />
elements. The Galerkin method has been applied to indirect<br />
and direct BEM formulations. The matrix <strong>of</strong> the<br />
linear system <strong>of</strong> equations has been compressed with the<br />
help <strong>of</strong> the fast multipole method [12]. The matrix has<br />
been assembled in parallel using the OpenMP standard.<br />
The system <strong>of</strong> linear equations has been solved in parallel,<br />
too. An implementation <strong>of</strong> the BEM in combination<br />
with the FMM in C++ has been executed on a workstation<br />
with two six-core Intel Xeon E5649 2.53 GHz<br />
processors.<br />
Although the presented domain detection method supports<br />
all kinds <strong>of</strong> boundary elements, the second order,<br />
quadrilateral boundary elements have been converted into<br />
linear, triangular boundary elements for the postprocessing.<br />
The reason is that in computer graphics only<br />
linear elements, <strong>of</strong>ten only linear triangles, are well supported.<br />
Rendering and graphics processing on linear triangles<br />
is much faster than on other types <strong>of</strong> elements and<br />
linear triangles are supported by modern graphics processors.<br />
The implementation <strong>of</strong> the domain detection method<br />
has been executed on an Intel Core 2 Duo T9900<br />
3.06 GHz laptop processor using a single core. Graphical<br />
objects have been rendered on a NVIDIA Quadro FX<br />
770M graphic card.<br />
- 390 - 15th IGTE Symposium 2012<br />
A. Capacitor<br />
The electric field <strong>of</strong> a capacitor has been studied as<br />
first example. The capacitor consists <strong>of</strong> two quadratic<br />
electrodes and a homogeneous, linear, isotropic dielectric<br />
between the electrodes. The potential <strong>of</strong> the electrodes<br />
has been set to 0.5 V and -0.5 V respectively. The relative<br />
permittivity <strong>of</strong> the dielectric is 10.<br />
The capacitor has been discretized with 9600 second<br />
order, quadrilateral elements (Fig. 5). An indirect BEM<br />
formulation is applied. The Dirichlet boundary conditions<br />
are the potential at the two electrodes. The Neumann<br />
boundary condition is the continuity <strong>of</strong> the electric flux<br />
density at the surface between the dielectric and the surrounding<br />
free space domain. The corresponding linear<br />
system <strong>of</strong> equations with in total 29442 unknowns has<br />
been solved iteratively using generalized minimal residual<br />
method (GMRES) along with a Jacobi preconditioner<br />
within 91 iteration steps in approximately 3 minutes.<br />
Fig. 5: Discretized BEM model <strong>of</strong> a capacitor<br />
The original second order boundary elements have<br />
been converted into 19200 first order, triangular elements<br />
for the post-processing including the domain detection.<br />
The boundary elements are grouped by an octree, which<br />
consists <strong>of</strong> 9 octree levels and 2 elements assigned to a<br />
cube in average. The maximum number <strong>of</strong> elements <strong>of</strong> a<br />
cube is 6. The domain data in a slice in 40000 evaluation<br />
points has been determined in 23 s (Fig. 6). The red color<br />
represents the domain inside the dielectric and the green<br />
color represents the air domain. Furthermore, the two<br />
electrodes are depicted. The color at the electrodes displays<br />
the surface charge density, which equals the solution<br />
<strong>of</strong> the linear system <strong>of</strong> equations. The surface <strong>of</strong> the<br />
dielectric has been omitted for graphical reasons.<br />
The presented octree-based scheme reduces the number<br />
<strong>of</strong> boundary elements, which must be considered for<br />
correct domain detection, from 19200 to maximal 39. The<br />
domain data <strong>of</strong> 93 % <strong>of</strong> the given evaluation points could<br />
be obtained from position data <strong>of</strong> the octree cubes without<br />
expensive ray hit tests.
Fig. 6: Detected domains in a slice through the capacitor<br />
B. Inductor in micro-electro-mechanical systems<br />
The electric current inside an inductor <strong>of</strong> a microelectro-mechanical<br />
system (MEMS) has been studied as<br />
second example. The inductor has been discretized using<br />
9168 second order, quadrilateral elements (Fig. 7). The<br />
potential at the ports <strong>of</strong> the inductor has been set as Dirichlet<br />
boundary condition <strong>of</strong> a direct BEM formulation.<br />
The linear system <strong>of</strong> equations with in total 27594 unknowns<br />
has been solved within 166 iteration steps <strong>of</strong><br />
GMRES in approximately 4 minutes.<br />
Fig. 7: Discretized BEM model <strong>of</strong> a inductor in MEMS<br />
The boundary elements have been converted into<br />
18336 first order, triangular elements for the postprocessing.<br />
The domain in 40000 evaluation points,<br />
which are lying in a slice through the inductor (Fig. 8),<br />
has been detected in 106 s. The yellow color represents<br />
the domain inside the conductor <strong>of</strong> the inductor and the<br />
blue color represents the surrounding free space domain.<br />
Although the slice is <strong>of</strong>ten intersected by boundaries <strong>of</strong><br />
the inductor, the number <strong>of</strong> actual domain detections is<br />
reduced by 45 % by using domain data <strong>of</strong> the octree cubes.<br />
is maximal 26.<br />
Fig. 8: Detected domains in a slice through the inductor<br />
- 391 - 15th IGTE Symposium 2012<br />
IV. CONCLUSION<br />
A fast and efficient automatic domain detection method<br />
for a meshfree post-processing in three-dimensional<br />
boundary element methods has been presented. Relevant<br />
boundary elements are filtered with the help <strong>of</strong> an adaptive<br />
octree-based method. Hence, the number <strong>of</strong> boundary<br />
elements, which have to be considered for domain detection,<br />
is extremely reduced and approximately independent<br />
<strong>of</strong> the total number <strong>of</strong> boundary elements for large problems.<br />
The application <strong>of</strong> the octree, bounding boxes, and<br />
optimized standard libraries results in very low computational<br />
costs. As a result, the shown domain detection<br />
method enables an efficient post-processing in a large<br />
number <strong>of</strong> evaluation points even for large and complex<br />
BEM problems. Furthermore, the complete method including<br />
its implementation is very flexible and supports<br />
all types <strong>of</strong> elements. Hence, it is not only restricted to<br />
pure BEM applications, but volume elements <strong>of</strong> a volume<br />
integral equation can be used, too. Finally, the numerical<br />
examples show that domain data is detected in arbitrary<br />
chosen evaluation points reliably and efficiently.<br />
The shown method is a very important step towards<br />
flexible and powerful post-processing in boundary element<br />
methods. A direct coupling <strong>of</strong> visualization tools<br />
with a boundary element method is enabled. Postprocessing<br />
objects as streamlines or isosurfaces can be<br />
computed totally meshfree even in the case <strong>of</strong> multiple<br />
domains. The octree, which is used here, is the basis <strong>of</strong><br />
fast and efficient field computations using the fast multipole<br />
method, too.<br />
[1]<br />
REFERENCES<br />
U. Lang and U. Wössner, “Virtual and augmented reality developments<br />
for engineering applications”, <strong>Proceedings</strong> <strong>of</strong><br />
[2]<br />
ECCOMAS 2004, Jyväskylä, July 24-28, pp. 24-8., 2004<br />
A. Buchau, W. M. Rucker, U. Wössner, and M. Becker, “Augemented<br />
reality in teaching <strong>of</strong> electrodynamics”, COMPEL, vol.<br />
28, no. 4, pp. 948-963, 2009<br />
[3] D. Weiskopf, „GPU-Based Interactive Visualization Techniques“,<br />
Springer, 2006<br />
[4] S. Bachthaler, F. Sadlo, R. Weeber, S. Kantorovich, Ch. Holm,<br />
and D. Weiskopf, “Magnetic Flux Topology <strong>of</strong> 2D Point Dipoles”,<br />
Eurographics Conference on Visualization (EuroVis) 2012, vol.<br />
31, no. 3, 2012<br />
[5] A. Buchau, W. M. Rucker, O. Rain, V. Rischmüller, S. Kurz, S.<br />
Rjasanow, “Comparison Between Different Approaches for Fast<br />
and Efficient 3D BEM Computations”, IEEE Transactions on<br />
Magnetics, vol. 39, no. 3, pp. 1107-1110, 2003<br />
[6] W. Hafla, A. Weinläder, A. Bardakcioglu, A. Buchau, and W. M.<br />
Rucker, “Efficient Post-Processing with the Integral Equation<br />
Method”, COMPEL, vol. 26, no. 3, pp. 873-887, 2007<br />
[7] A. Buchau, W. Rieger, and W. M. Rucker, “Fast Field Computations<br />
with the Fast Multipole Method”, COMPEL, vol. 20, no. 2,<br />
pp. 547-561, 2001<br />
[8] A. Buchau and W. M. Rucker, “Meshfree Visualization <strong>of</strong> Field<br />
Lines in 3D”, 14 th IGTE Symposium, pp. 172-177, <strong>Graz</strong>, 2010<br />
[9] J. Goldsmith, J. Salmon, “Automatic Creation <strong>of</strong> Object Hierarchies<br />
for Ray Tracing”, IEEE Computer Graphics and Applications,<br />
vol. 7, no. 5, pp. 14-20, 1987<br />
[10] A. Buchau, Ch. J. Huber, W. Rieger, W. M. Rucker, ”Fast BEM<br />
Computations with the Adaptive Multilevel Fast Multipole Method”,<br />
IEEE Transactions on Magnetics, vol. 36, no. 4, pp. 680-684,<br />
2000<br />
[11] “.NET Framework Developer Center”, Micros<strong>of</strong>t Corporation<br />
[12] A. Buchau, W. Hafla, F. Groh, and W. M. Rucker, ”Parallelized<br />
Computation <strong>of</strong> Compressed BEM Matrices on Multiprocessor<br />
Computer Clusters”, COMPEL, vol. 24, no. 2, pp. 468-479, 2005
- 392 - 15th IGTE Symposium 2012<br />
Efficient modeling <strong>of</strong> coil filament losses in 2D<br />
L. Lehti∗ ,J.Keränen †∗ ,S.Suuriniemi∗ , T. Tarhasaari∗ , and L. Kettunen∗ ∗Tampere <strong>University</strong> <strong>of</strong> <strong>Technology</strong> - Electromagnetics, P.O. Box 692, FI-33101 Tampere, Finland<br />
† VTT Technical Research Centre <strong>of</strong> Finland, P.O. Box 1300, FI-33101 Tampere, Finland<br />
E-mail: leena.lehti@tut.fi<br />
Abstract—Practical estimates for losses in coil filaments <strong>of</strong> a FEM model are sought for. A low-dimensional function space<br />
is introduced on the filament-air interface and then suitably extended into the filament to significantly reduce the number<br />
<strong>of</strong> unknowns per filament. Careful choice <strong>of</strong> extensions enables good loss estimate accuracy. The result is a system matrix<br />
assembly block that can be used verbatim for all filaments, further reducing the cost. Both net current and voltage per<br />
length <strong>of</strong> the filament are readily available in the problem formulation.<br />
Index Terms—coil modeling, FEM, winding loss estimate<br />
I. INTRODUCTION<br />
Improving the efficiency <strong>of</strong> electrical machines is an<br />
important aspect <strong>of</strong> machine design. Modeling methods<br />
are required to be as fast and accurate as possible to<br />
help the designer optimize the machines. One important<br />
aspect <strong>of</strong> machine design are Ohmic coil loss estimates.<br />
They enable the designer to choose the placement <strong>of</strong> the<br />
conductors such that the losses are minimized.<br />
Solving for the conductor losses is not a straightforward<br />
task. The losses depend on the conductivity and<br />
the current density. The current density is affected by<br />
numerous factors, which cannot be separately solved for.<br />
The different elements include the feeding current, fields<br />
generated by neighboring conductors, placement <strong>of</strong> the<br />
permeable materials, and—depending on the frequency—<br />
skin effect.<br />
Different methods have been developed to overcome<br />
these difficulties. One method is to replace the conductivity<br />
<strong>of</strong> the material by other parameters which<br />
transform the eddy-current losses to hysteresis losses<br />
[1]. In this approach the parameters for resistance and<br />
inductance are sought for. A separate mesh for magnetic<br />
and electric problems are introduced in [2]. The results<br />
are acceptable, but the method has computationally slow<br />
segments. In [3] a cell-model is used to reduce the<br />
computational effort and to extract the resistance <strong>of</strong><br />
windings. A homogenization technique in [4] derives<br />
parameters to characterize skin and proximity effects in<br />
windings. Surface impedance methods have also been<br />
used [5], where it is assumed that the magnetic flux does<br />
not penetrate into the conducting material. Therefore the<br />
conducting material can be approximated on the surface<br />
only and the interior <strong>of</strong> the material can be ignored. This<br />
can be used for conductors with low curvature and small<br />
skin depth.<br />
We aim at a good trade-<strong>of</strong>f between moderate calculation<br />
time and accuracy <strong>of</strong> loss estimates while maintaining<br />
an explicit connection to the exterior circuit.<br />
In addition, the conductors can be placed freely, i.e. a<br />
periodical spacing is not required, as in [3] and [4].<br />
Previously [6], magnetic flux was not allowed to enter the<br />
filaments and a separable problem was achieved, i.e. the<br />
field problem was divided into the filament interiors and<br />
the exterior. The exterior and interior had only limited<br />
interaction through net current and constant stream function<br />
values on the boundary. To improve the accuracy,<br />
it is necessary to admit some magnetic flux into the<br />
filaments while solving for the exterior problem. Ignoring<br />
the magnetic energy and losses inside the filaments on<br />
the exterior problem enables subsequent solving for the<br />
interior problem, but it leads to a very small reluctance<br />
inside the filaments. When the filaments are close to each<br />
other, this is detrimental.<br />
Consequently, the filament interiors have to be coupled<br />
with the exterior problem. To save computational effort,<br />
the function space on the filament interface is significantly<br />
limited. The basis representing the result inside<br />
the filaments is spanned by solutions <strong>of</strong> eddy current<br />
problems that use the functions from the interface as<br />
boundary conditions. The use <strong>of</strong> these solutions improves<br />
the loss estimates significantly compared to [6] and the<br />
same solutions can be used for all filaments with similar<br />
cross-section. The resulting method is called an interface<br />
technique.<br />
II. EXTERIOR FORMULATION<br />
Here, we concentrate on two-dimensional cases, since<br />
they are widely used in industry. Solving for 2D problems<br />
is simple, and they provide enough accuracy for many industrial<br />
applications. Since we look for a time-harmonic<br />
solution, the materials used are assumed to be linear.<br />
However, the method could be extended to nonlinear<br />
time-domain problems with convolution techniques if the<br />
material <strong>of</strong> the filaments stays linear.<br />
A few terms are represented in Figure 1 for convenience.<br />
Exterior problem refers to Ωe and interior<br />
problem to Ωin = Ωj. The geometry contains K<br />
conducting filaments, Ωj, and the relative permeability <strong>of</strong><br />
the core is 1000. The whole domain is Ω= Ωj ∪ Ωe<br />
and on the boundary <strong>of</strong> Ω the stream function is set to
zero. This geometry, with K =25and f =50Hz, is<br />
also used as an example in the computations. The radius,<br />
r, for each filament is 0.01m to produce a challenging<br />
modeling problem (skin depth/radius ≈ 1).<br />
symmetry axis<br />
∂Ωj<br />
Ω=Ωin ∪ Ωe<br />
Ωj<br />
Ωe<br />
b · n =0<br />
Fig. 1. An example geometry for a transformer with an E-shaped core.<br />
There are 25 conductors in the coil and their net current is set to one.<br />
The core material’s relative permeability is 1000 and f =50Hz. Ω is<br />
the whole domain that consists <strong>of</strong> Ωin = Ωj and Ωe.<br />
In our previous work [6], the floating potential approach<br />
with a constant but unknown stream function on<br />
each filament boundary was used. This prevented the flux<br />
from entering the filament and the eddy currents were<br />
similar in all filaments. Now, we replace the constant<br />
potential with a low-dimensional subspace <strong>of</strong> functions,<br />
L, that enables the magnetic flux to enter the filament.<br />
The function space comprises <strong>of</strong> a constant function and<br />
trigonometric functions. Let ˆ L be the space <strong>of</strong> Whitney<br />
nodal basis function interpolations <strong>of</strong> L. We construct the<br />
following formulation for the exterior problem<br />
div 1<br />
μ grad a =0 in Ωe, (1)<br />
a =0 on ∂Ω, (2)<br />
<br />
∂Ωj<br />
a =<br />
M−1 <br />
i=0<br />
cijfi ,fi ∈ ˆ L on ∂Ωj ∀ j, (3)<br />
h · dl = Ij on ∂Ωj ∀ j. (4)<br />
Here μ is the permeability, a the stream function,<br />
M the number <strong>of</strong> functions on the interface, cij are<br />
complex scalar coefficients, h the magnetic field, and Ij<br />
the net current in j th filament. There can be multiple<br />
trigonometric functions, i.e., (3) can be written<br />
a = c0 +<br />
N<br />
(c2n−1 sin nα + c2n cos nα), (5)<br />
n=1<br />
- 393 - 15th IGTE Symposium 2012<br />
where N is the number <strong>of</strong> trigonometric functions and<br />
α an angle parameter. 1 For N =0this reduces to floating<br />
potential. On the interface, other than trigonometric<br />
functions could be used.<br />
The solution for (1)–(4) is sought for in the form<br />
a = <br />
diλi + <br />
cij ˆ fi, (6)<br />
i<br />
where each λi is a Whitney nodal basis function associated<br />
to the interior nodes <strong>of</strong> Ωe, and ˆ fi are approximated<br />
by Whitney interpolants on the boundary and the extension<br />
<strong>of</strong> these interpolants into the domain Ωe is done<br />
canonically.<br />
In practice, the function space L on the interface is<br />
reduced to ˆ L by using a projection matrix Q. InQ we<br />
have a row for each basis function <strong>of</strong> ˆ L for each filament<br />
and columns for all nodes on the boundary. The i th row<br />
<strong>of</strong> Q consists <strong>of</strong> the values <strong>of</strong> fi in the boundary nodes<br />
<strong>of</strong> one filament. In a standard system, the system matrix<br />
is formed from blocks<br />
i,j<br />
AΩΩ AΩΓ<br />
AΓΩ AΓΓ<br />
<br />
, (7)<br />
where Ω refers to the exterior <strong>of</strong> the filaments and Γ<br />
refers to the filament boundary. We use Q as follows<br />
ÃΩΩ = AΩΩ<br />
ÃΩΓ = AΩΓQ T<br />
ÃΓΩ = QAΓΩ<br />
ÃΓΓ = QAΓΓQ T<br />
(8)<br />
(9)<br />
(10)<br />
(11)<br />
to build the new system matrix à in the same format as<br />
(7). Note that the block with most nodes, i.e. ÃΩΩ, is not<br />
transformed. The dimensions <strong>of</strong> the projection matrix are<br />
(KM) × (boundary nodes).<br />
Remark 1. If the energy stored and dissipated in the<br />
filaments is neglected the exterior problem can be independently<br />
solved for. However, if we use equations (1)–<br />
(4) without including the effects <strong>of</strong> the filament interiors’,<br />
the results are not satisfactory. As an example, we have<br />
the E-magnet from Figure 1. The model lacks the reluctance<br />
and eddy current effects from the filament interiors,<br />
and thus the filaments <strong>of</strong>fer no reluctance to the flux. This<br />
effect gets more pronounced when the filaments are close<br />
to each other. In a tightly wound coil the stored energy<br />
inside the filaments is considerable and, in addition, eddy<br />
current losses inside the filaments have an effect on<br />
the exterior magnetic field. In Figure 2, flux lines are<br />
shown for an exterior solution with a constant, sine, and<br />
cosine functions. In Figure 3 a reference result with A-<br />
V-formulation and a finely discretized mesh in the whole<br />
domain is shown for comparison. The A-V-formulation<br />
provides an accurate solution, but the computational cost<br />
is high.<br />
1 Here sin and cos are to be understood as the Whitney nodal basis<br />
function interpolations <strong>of</strong> the trigonometric functions.
Fig. 2. The solution to E-magnet problem <strong>of</strong> Fig.1 with a constant,<br />
sine and cosine on the boundary. Since the interior energy is ignored,<br />
the filaments <strong>of</strong>fer a zero reluctance path for the flux and the flux gets<br />
attracted into the filaments.<br />
Fig. 3. The E-magnet problem solved in Ω with an A-V-formulation as<br />
a reference result. The solution is very accurate, but the computational<br />
cost is high.<br />
III. INTERIOR FORMULATION<br />
A. Expansion <strong>of</strong> the Basis Functions to the Interior<br />
Because <strong>of</strong> the stored and dissipated energy inside<br />
the filaments and its effect on the exterior, the interiors<br />
cannot be separated from the exterior completely. Thus,<br />
we need to have a model for the interior part and solve<br />
for it simultaneously with the exterior. We take the lowdimensional<br />
function space ˆ L on the filament interface<br />
as the starting point.<br />
For the interior problem we have<br />
div 1<br />
Vz<br />
grad a = jωσ(a +<br />
μ jω ) in Ωj,<br />
<br />
(12)<br />
h · dl = Ij on ∂Ωj ∀ j, (13)<br />
∂Ωj<br />
aj = ae on ∂Ωj ∀ j, (14)<br />
where Vz the filamentwise constant potential gradient and<br />
ae is the value <strong>of</strong> the exterior problem solution on the<br />
- 394 - 15th IGTE Symposium 2012<br />
filament boundary. 2 We want to represent the solution to<br />
this eddy-current problem inside the filament by using<br />
only the unknowns cij related to the functions <strong>of</strong> ˆ L plus<br />
one unknown associated to the net current condition (13).<br />
Note that computational effort is saved in the interiorexterior<br />
coupling, since we restrict ae to be spanned by<br />
the functions <strong>of</strong> ˆ L.<br />
Once we have extensions <strong>of</strong> ˆ L into the filament, we<br />
can assemble a FEM assembly block and corresponding<br />
excitation for the problem (12)–(14). The single block<br />
can then be efficiently used for all filaments <strong>of</strong> identical<br />
cross section. It is important to notice that even though<br />
we have to solve for one boundary value problem (BVP)<br />
in the filament per basis element <strong>of</strong> ˆ L, we only have<br />
to solve them once for filaments with the same crosssection.<br />
Usually the number <strong>of</strong> BVPs is much smaller<br />
than the number <strong>of</strong> similar filaments.<br />
To produce accurate loss estimates, we choose the<br />
extensions to be the solution to (12)–(14) with the basis<br />
<strong>of</strong> ˆ L as boundary conditions. Hence, we decompose the<br />
problem into M +1 separately solvable problems and<br />
the solution to the eddy-current problem (12)–(14) is a<br />
linear combination <strong>of</strong> these solutions. 3 These solutions<br />
are then used to extend the basis { ˆ fj} (<strong>of</strong> (6)) inside<br />
the filaments and to add K extra unknowns for the net<br />
currents to the exterior problem. Note that we are not<br />
restricted to a specific geometry, because these solutions<br />
can be obtained by any means.<br />
The first M problems to solve are<br />
div 1<br />
μ grad ai − jωσai =0 (15)<br />
with boundary condition ai = fi on ∂Ωj, wherefi∈ˆ L.<br />
The remaining one problem is used to account for the<br />
filamentwise constant potential Vz with the following<br />
div 1<br />
μ grad aξ − jωσaξ = jωσ1 (16)<br />
with aξ =0on ∂Ωj and Vz<br />
jω is spanned by the constant 1.<br />
As the term (aξ +1) equals 1 on the interface, we can use<br />
this term to impose the net current <strong>of</strong> the filament with a<br />
circulation <strong>of</strong> h, wherehis the magnetic field. Note that<br />
the restriction <strong>of</strong> a0 into Ωj qualifies as (aξ +1), so that<br />
it involves no extra cost. The solution within the filament<br />
for the electric field (divided by jω) is expressed by the<br />
space <strong>of</strong> linear combinations<br />
a + Vz<br />
jω = cξ(aξ +1)+<br />
M<br />
ciai, (17)<br />
i=0<br />
where a is the magnetic vector potential inside the<br />
filament and cξ = Vz<br />
jω .<br />
2 At first glance, it might seem like the problem is overdetermined,<br />
because (14) states a Dirichlet condition on all <strong>of</strong> the boundary.<br />
However, we have as many extra conditions (13) as we have constants<br />
Vz in (12), namely K.<br />
3 This requires the material to be linear inside the filaments.
B. Assembly Block in Entire Problem<br />
Let us see how the system matrix <strong>of</strong> the exterior<br />
problem is modified when these extended basis functions<br />
(ai’s and aξ’s) are used. We form an assembly block that<br />
is added to the system matrix from the problem<br />
div 1<br />
μ<br />
Vz<br />
grad a = jωσ(a + ) in Ω, (18)<br />
jω<br />
because we combine the solution from the interior to the<br />
exterior problem. The variational formulation for this is<br />
<br />
Ω<br />
w(div 1<br />
μ<br />
Vz<br />
grad a − jωσ(a + )) dΩ =0, (19)<br />
jω<br />
where w is a test function. After integration by parts,<br />
(19) becomes<br />
<br />
grad w 1<br />
grad a dΩ =<br />
μ<br />
<br />
Ω<br />
∂Ω<br />
w 1<br />
<br />
grad a · n dl −<br />
μ<br />
Ω<br />
wjωσ(a + Vz<br />
) dΩ. (20)<br />
jω<br />
Because <strong>of</strong> homogenous Dirichlet condition for a everywhere<br />
on ∂Ω, the boundary integrals are zero for w = ai<br />
<br />
Ω<br />
1<br />
grad ai<br />
μ grad a + jωσai(a + Vz<br />
) dΩ =0. (21)<br />
jω<br />
With weight w = aξ +1, the net current condition is<br />
imposed for each filament. Now, aξ +1 is one at the<br />
boundary <strong>of</strong> its support Ωj, and (20) becomes<br />
<br />
<br />
Ωj<br />
∂Ωj<br />
grad (aξ +1) 1<br />
grad a+<br />
μ<br />
jωσ(aξ +1)(a + Vz<br />
) dΩ =<br />
jω<br />
1 1<br />
<br />
grad a · ndl = h · dl = Ij, (22)<br />
μ<br />
∂Ωj<br />
where Ij is the imposed net current.<br />
After substituting (17) into (21) and (22), we can<br />
reduce most <strong>of</strong> the assembly block elements <strong>of</strong> (21) and<br />
(22) to an integral on the boundary <strong>of</strong> the filament. For<br />
solutions ai and aj <strong>of</strong> (15), consider an integral equation<br />
<br />
Ωj<br />
ai( div 1<br />
μ grad aj − jωσaj) dΩ =0, (23)<br />
which holds because aj is a solution <strong>of</strong> (15). After<br />
integration by parts we get<br />
<br />
<br />
∂Ωj<br />
Ωj<br />
1<br />
ai<br />
μ grad aj · n dl =<br />
grad ai<br />
1<br />
μ grad aj + jωaiσaj dΩ, (24)<br />
which occurs as a basic building block in (21) and (22).<br />
- 395 - 15th IGTE Symposium 2012<br />
∂Ωj Ωj<br />
∂Ωj<br />
Fig. 4. Extension <strong>of</strong> the basis function inside the filament with the<br />
constant boundary condition in the E-magnet example, real part in solid<br />
line and imaginary part in dashed line.<br />
∂Ωj<br />
Ωj<br />
∂Ωj<br />
Fig. 5. Extension <strong>of</strong> the basis function inside the filament with the<br />
sine boundary condition in the E-magnet example, real part in solid<br />
line and imaginary part in dashed line.<br />
C. Example: Circular conductors<br />
Circular conductors <strong>of</strong> radius a admit exact solutions<br />
for (15) in closed form for a constant, sines and cosines:<br />
a0 = J0( 1<br />
−jωσμr)<br />
J0( √ (25)<br />
−jωσμa)<br />
a2n−1 = Jn( 1<br />
−jωσμr)<br />
Jn( √ cos nφ (26)<br />
−jωσμa)<br />
a2n = Jn( 1<br />
−jωσμr)<br />
Jn( √ sin nφ (27)<br />
−jωσμa)<br />
where Jn are Bessel functions <strong>of</strong> first kind with order<br />
n. In Figure 4, we see a cross-sectional view <strong>of</strong> the<br />
extension <strong>of</strong> the constant function (a0) into the filament<br />
and in Figure 5 the extension <strong>of</strong> the sine function (a2).<br />
In Figure 6, flux lines are shown for the interface<br />
technique with one sine and cosine with the extended<br />
basis functions. The comparison to A-V-formulated case<br />
(Fig. 3) shows the flux to be very similar. The figures<br />
cannot be identical, since the function space <strong>of</strong> the interface<br />
technique is severely restricted from the interface<br />
function space <strong>of</strong> A-V-formulation and also the basis<br />
functions inside the filaments differ.
Fig. 6. The flux lines obtained with the interface technique for the<br />
E-magnet are shown. Here we have taken into account the effect <strong>of</strong><br />
the interior <strong>of</strong> the filaments to the exterior solution and have a good<br />
correlation with the reference solution.<br />
<br />
<br />
<br />
Fig. 7. Filament numbers used in Table I for the E-magnet.<br />
IV. LOSS ESTIMATES<br />
By using the interior solution (17), we compute in Ωj<br />
the time average <strong>of</strong> the losses<br />
P = 1<br />
<br />
Re{e · j<br />
2<br />
∗ } da, (28)<br />
where e = −jω(a + Vz/jω) and j = σe.<br />
Losses for selected filaments (see Figure 7) <strong>of</strong> the<br />
example in Fig. 1 are shown in Table I. Losses are<br />
computed for five different filaments with the interface<br />
technique and A-V-formulation throughout for comparison.<br />
For the interface technique, the assembly block was<br />
produced with (25)–(27). The loss estimates were also<br />
computed analytically. The reference results for the A-<br />
V-formulation were obtained with GetDP [8] to verify<br />
our loss estimated from MATLAB R○ . The greatest error<br />
is in filament 2, where the error is -4.0%.<br />
In Table II some computationally relevant figures for<br />
the reference solution and the interface technique are<br />
shown. The mesh outside the filaments is <strong>of</strong> an equal<br />
density in both methods. The system <strong>of</strong> equations was<br />
solved using the backslash-operator in MATLAB R○ and<br />
- 396 - 15th IGTE Symposium 2012<br />
Filament A-V [mW/m] Interface<br />
technique [mW/m]<br />
Error %<br />
1 1.012 1.009 0.2<br />
2 0.7185 0.7470 -4.0<br />
3 0.2634 0.2703 -2.6<br />
4 0.04252 0.04200 1.2<br />
5 0.03497 0.03480 0.5<br />
TABLE I<br />
LOSSES IN NUMBERED FILAMENTS WITH UNIT CURRENT AND<br />
f =50HZ. A-VRESULTS FROM GETDP AND INTERFACE<br />
TECHNIQUE FROM MATLAB R○ . δ/r =0.92, WHERE<br />
δ = 2/(ωσμ) AND r THE RADIUS OF THE FILAMENT.<br />
A-V Interface technique Difference (%)<br />
Nodes 153 796 55 206 -64.1<br />
DoFs 153 435 49 645 -67.6<br />
Time [s] 3.928 0.5230 -86.7<br />
nnz<br />
No. <strong>of</strong> DoFs<br />
1 280 535 367 240 -71.3<br />
in filaments 98 525 100<br />
TABLE II<br />
-99.9<br />
SOME PERFORMANCE INDICATORS FOR COMPUTATIONS.NNZ IS<br />
THE NUMBER OF NONZERO ELEMENTS.<br />
the number <strong>of</strong> nonzero elements (nnz) in the system<br />
matrix is much lower than in the A-V-formulation. Most<br />
<strong>of</strong> the saved elements are in the conducting regions and<br />
this saves computation time. Additionally, for the A-Vformulation,<br />
the number <strong>of</strong> nodes required to maintain<br />
accuracy inside the filaments has to increase significantly<br />
with increasing frequency.<br />
V. CONCLUSION<br />
An approach to model coil filament losses was proposed.<br />
We expanded the function space on the filament<br />
boundaries from the floating potential approach with<br />
trigonometric functions. We observed that the filament<br />
interiors need to be considered as well due to the<br />
significant effect <strong>of</strong> the magnetic energy stored and<br />
dissipated inside them. When interface basis functions<br />
were extended into filaments with solutions <strong>of</strong> magnetoquasi-static<br />
problems, and these were used as FEM basis<br />
functions, the loss estimates are at the most 4% away<br />
from A-V-formulated estimates. The computation time is<br />
significantly reduced in a small problem consisting <strong>of</strong> 25<br />
filaments.<br />
ACKNOWLEDGEMENTS<br />
The authors thank Pr<strong>of</strong>essor Stefan Kurz for discussion<br />
and comments.<br />
REFERENCES<br />
[1] O. Moreau, L. Popiel and J. Pages, ”Proximity Losses Computation<br />
with a 2D Complex Permeability Modelling,” IEEE Trans. Magn.,<br />
vol. 34, pp. 3616-3619, 1998.<br />
[2] H. de Gersem and K. Hameyer, ”A Multiconductor Model for<br />
Finite-Element Eddy-Current Simulation,” IEEE Trans. Magn., vol.<br />
38, pp. 533-536, 2002.<br />
[3] A. Podoltsev, I. Kucheryavaya and B. Lebedev, ”Analysis <strong>of</strong><br />
effective resistance and eddy-current losses in multiturn winding<br />
<strong>of</strong> high-frequency magnetic components,” IEEE Trans. Magn., vol.<br />
36, pp. 539-548, 2003.
[4] J. Gyselinck, R. Sabariego and P. Dular, ”Time-Domain homogenization<br />
<strong>of</strong> windings in 2-D finite element models,” IEEE Trans.<br />
Magn., vol. 43, pp. 1297-1300, 2007.<br />
[5] T. Le-Duc, G. Meunier, O. Chadebec and J.-M. Guichon, ”A<br />
new integral formulation for eddy current computation in thin<br />
conductive shells,” IEEE Trans. Magn., vol. 48, pp. 427-430, 2012.<br />
[6] L. Lehti, J. Keränen, S. Suuriniemi, and L. Kettunen, ”Subsystem<br />
separation by flux linkage in coil filament modelling,” ACOMEN<br />
2011, Liège.<br />
[7] P. Dular, W. Legros, H. De Gersem, and K. Hameyer, ”Floating<br />
potentials in various electromagnetic problems using the finite<br />
element method,” Proc. <strong>of</strong> the 4th int. workshop on electric and<br />
magnetic fields, 1998, Marseille.<br />
[8] P. Dular and C. Geuzaine, GetDP: a General Environment for the<br />
Treatment <strong>of</strong> Discrete Problems, available: http://geuz.org/getdp/<br />
- 397 - 15th IGTE Symposium 2012
- 398 - 15th IGTE Symposium 2012<br />
Optimization <strong>of</strong> Energy Storage Usage<br />
Arnel Glotic 1 , Peter Kitak 1 , Igor Ticar 1 , Adnan Glotic 2<br />
1 <strong>University</strong> <strong>of</strong> Maribor, Faculty <strong>of</strong> electrical engineering and computer science, Smetanova 17, SI-2000 Maribor,<br />
Slovenia<br />
2 Holding Slovenske elektrarne Group, Koprska ulica 92, SI-1000 Ljubljana, Slovenia<br />
Abstract — Energy storage is a physical storage for energy, like Batteries, Flywheels, Compressed Air Storages, Pumped<br />
Storages, etc. This paper presents the use <strong>of</strong> the optimization algorithm in order to achieve the optimal usage <strong>of</strong> Energy Storage.<br />
Reservoirs <strong>of</strong> cascade Hydro Power Plants have been used as model <strong>of</strong> Energy Storage, and these are known as complex<br />
optimization problems. Optimization algorithm used in this paper was the adapted differential evolution algorithm.<br />
Index Terms — Differential evolution, energy storage, optimization, hydro power plants.<br />
I. INTRODUCTION<br />
Energy storage [1] is a physical storage for energy and<br />
can be found in different types. Authors’ research has<br />
been focused to cascade hydro power plants (HPP), where<br />
each individual plant has its own reservoir and energy<br />
storage, respectively.<br />
Various combinations <strong>of</strong> reservoirs’ charging and<br />
discharging produces different amount <strong>of</strong> electricity. In<br />
order to achieve optimal production, several methods can<br />
be implemented [2], such as Lagrangian relaxion and<br />
Benders decomposition-based methods, Mixed-integer<br />
programming, Dynamic programming, Evolutionary<br />
Computing Methods, Artificial intelligence methods and<br />
Interior-point methods.<br />
Differential Evolution (DE) Algorithm [3] is an<br />
efficient and robust global optimization algorithm and<br />
therefore it has been selected in this paper as an<br />
appropriate optimization technique.<br />
Short-term optimization using DE with self-adaptive<br />
parameter settings authors in [4] has been used on four<br />
cascades HPP, where the best objective value has reached<br />
after 2000 generations. The modified DE presented in [5]<br />
includes penalty factor during the objective function<br />
evaluation, which preserves the satisfied final reservoirs<br />
levels <strong>of</strong> four cascades HPP. In [6] authors combined<br />
advantages <strong>of</strong> the two modified DE algorithms, where the<br />
grouping and shuffling operation is carried out over the<br />
population periodically.<br />
Optimization <strong>of</strong> reservoirs scheduling HPP is known as<br />
a complex problem, where large number <strong>of</strong> HPP in<br />
cascade, means much larger number <strong>of</strong> reservoirs<br />
scheduling combinations and convergence time,<br />
respectively. The main goal <strong>of</strong> this paper was to modify<br />
DE in order to be capable <strong>of</strong> reaching the global optimal<br />
solution with fast convergence. This means the adequate<br />
distribution <strong>of</strong> individual HPP electrical energy<br />
production by scheduling reservoirs in order to satisfy the<br />
demand for 24 hours. Besides satisfying the demand, the<br />
decreased usage <strong>of</strong> water quantity per electrical energy<br />
unit (m 3 /MWh) has to be also achieved. Also, the<br />
optimization results must be feasible in range <strong>of</strong> couple<br />
minutes.<br />
Mathematical model <strong>of</strong> cascade hydro power plants is<br />
E-mail: arnel.glotic@uni-mb.si<br />
described in section II, standard and modified differential<br />
evolution algorithm in section III, results in section IV,<br />
and conclusion in section V.<br />
II. MATHEMATICAL MODEL OF CASCADE HYDRO POWER<br />
PLANTS<br />
The mathematical model describes cascade HPP on<br />
Drava River in Slovenia, owned by Dravske elektrarne<br />
Maribor (DEM). DEM is a subsidiary company <strong>of</strong><br />
Holding Slovenske elektrarne (HSE), which is the biggest<br />
producer and trader with electricity in Slovenia. DEM<br />
provides approximately 25.5% <strong>of</strong> produced energy in<br />
Slovenia, with maximum output <strong>of</strong> 587 MW.<br />
The mathematical model consists <strong>of</strong> eight cascades,<br />
t<br />
where i-th HPP has natural inflow Qi ,NI in the observed<br />
hour t <strong>of</strong> the day. The first HPP in decade structure has<br />
t<br />
the inflow Qi,I <strong>of</strong> Drava River coming from Austria. The<br />
source <strong>of</strong> Drava River lies in Italy, near Austrian-Italian<br />
border.<br />
The total inflow for the first HPP in the observed hour t<br />
is,<br />
t t t<br />
Qi,TI Qi,I Qi,NI<br />
, (1)<br />
i 1, t 1,2,...24<br />
t<br />
where Qi ,TI is the sum <strong>of</strong> inflows. The total inflow for the<br />
following seven HPP is expressed as<br />
t t t<br />
Qi,TI Q( i1) Qi,NI<br />
i 2,3,...6, t 1,2,...24 , (2)<br />
t<br />
Q is the outflow <strong>of</strong> the upper HPP, expressed as<br />
where ( i 1)<br />
t t t<br />
i i,T i,O<br />
Q Q Q<br />
i 1,2...8, t 1,2,...24 , (3)<br />
which represents the sum <strong>of</strong> the flow through the turbine<br />
and the overflow in the observed hour t. The last two<br />
HPP’s, HPP 7 and HPP 8, are canal based type HPP’s<br />
where flows merge with the riverbed at the end <strong>of</strong> the<br />
canal. Both <strong>of</strong> these HPP’s have the required biological<br />
minimum flow Q i,B<br />
, which must be provided to the<br />
riverbed.
t<br />
Q1,TI<br />
V<br />
t<br />
HPP1<br />
t<br />
Q1,O<br />
HPP 1<br />
dam<br />
t 1 <br />
H V<br />
t<br />
Q1,T<br />
t<br />
Q1<br />
V2,min<br />
Total inflow for the last two HPP in chain is expressed as<br />
t t t<br />
Qi,TI Q( i1) Qi,NI Qi,B<br />
(4)<br />
i 7,8, t 1,2,...24 .<br />
Inflow water can be used for charging reservoir up to the<br />
maximal reservoir height V i,max<br />
or used in combination<br />
with flow gained from discharging reservoirs. However it<br />
must be considered that in the observed hour t the<br />
reservoirs values<br />
t<br />
V i must be between minimal or<br />
maximal allowed value <strong>of</strong> the individual reservoir. All the<br />
reservoirs also have the prescribed maximal discharging<br />
value.<br />
The hydro generator output power is expressed as<br />
<br />
t<br />
i i,1 <br />
t<br />
i<br />
2<br />
i,2 <br />
t<br />
i<br />
2<br />
i,3 <br />
t<br />
i <br />
t<br />
i i,4 <br />
t<br />
i<br />
ci,5 t<br />
Qi ci,6<br />
P c V c Q c V Q c V<br />
,(5)<br />
<br />
where c represents the hydropower generation<br />
t<br />
coefficient. In cases where the inflow Q i,TI<br />
is larger than<br />
the maximal allowed flow through the turbines <strong>of</strong> the i-th<br />
HPP and the reservoir level t<br />
V i reaches the maximal<br />
t<br />
value allowed, then the overflow Q i,O<br />
is unavoidable and<br />
it can be expressed as<br />
t t t t<br />
Qi,O Qi,TI Qi<br />
Pi,max ,<br />
i 1,2,...8, t 1,2,...24<br />
(6)<br />
t t<br />
where i i,max<br />
<br />
Q P is the flow throughout the turbines,<br />
which provides the maximal output power. Power<br />
generation consider also the head effect,<br />
t t t<br />
Hi HVi Hi,O<br />
i 1,2...8, t 1,2,...24<br />
,<br />
(7)<br />
t<br />
where H i is the difference between the inlet and outlet<br />
t<br />
H V i<br />
t<br />
is the level <strong>of</strong> reservoir at volume V i and<br />
head, <br />
t<br />
i,O<br />
H is the level <strong>of</strong> the outlet. Both levels are expressed<br />
with the polynomial <strong>of</strong> the sixth degree:<br />
- 399 - 15th IGTE Symposium 2012<br />
V2,max<br />
t<br />
t<br />
V Q<br />
2<br />
2,TI<br />
Figure 1: Layout <strong>of</strong> two hydropower plants<br />
t<br />
Q2,O<br />
HPP 2<br />
dam<br />
t 2 <br />
H V<br />
t<br />
Q2,T<br />
<br />
<br />
3<br />
t <br />
2<br />
t <br />
1<br />
t<br />
6 5 4<br />
t t t t<br />
i,O i,1 i i,2 i i,3 i<br />
H k Q k Q k Q<br />
k Q k Q k Q k<br />
i,4 i i,5 i i,6 i i,7<br />
t<br />
Q2<br />
, (8)<br />
where ki are the coefficients <strong>of</strong> the polynomial obtained<br />
by experimental measurements <strong>of</strong> each reservoirs and<br />
provided by DEM personnel.<br />
III. OPTIMIZATION ALGORITHM<br />
Differential evolution (DE) algorithm has been used as<br />
effective global optimizer and was proposed by R. Storn<br />
and K. Price [3]. The main steps <strong>of</strong> DE algorithm are<br />
initialization, mutation, crossover, evaluation and<br />
selection. The initialization step is defined as a randomly<br />
chosen population. Each individual xi <strong>of</strong> the initial<br />
population is composed <strong>of</strong> j variables:<br />
x jG , x j,upp rand(0,1) ( x j,upp xj,low<br />
)<br />
(9)<br />
j 1,2..., D<br />
xiG<br />
, x1, x2,... xD<br />
(10)<br />
i 1,... NP<br />
where UPP x j,upp<br />
and LOW j,low<br />
x are upper and lower<br />
bounds defined for each variable x j , G denotes<br />
generation, NP number <strong>of</strong> population, D number <strong>of</strong><br />
parameters or problem dimension and i the number <strong>of</strong> the<br />
population member and individual, respectively. The<br />
population size depends on number <strong>of</strong> the problem<br />
variables D and parameters <strong>of</strong> the objective function,<br />
respectively.<br />
For the proposed mathematical model the optimization<br />
algorithm has upper and lower bounds defined as minimal<br />
and maximal value <strong>of</strong> the individual reservoirs.<br />
Therefore, after the initialization, the population is<br />
composed <strong>of</strong> NP D-dimensional vectors:<br />
1 24 1 24<br />
xiG<br />
, Vi,1 ,... Vi,1 ,... Vi,8 ,... V <br />
i,8<br />
<br />
(11)<br />
i 1,2... NP<br />
where V is the volume <strong>of</strong> individual HPP reservoir in<br />
time t. At the initialization step <strong>of</strong> DE the volumes are<br />
randomly chosen for each individual HPP and for each<br />
individual hour in 24 hour period. Therefore the
dimension <strong>of</strong> the problem D is 192 and the population<br />
size is five times larger. Therefore the population size is<br />
960.<br />
The mutation stem if followed after the initialization<br />
step. For each target individual and sometimes referred to<br />
as vector x iG , , the mutant vector is created according to<br />
the selected strategy. The applied strategy in this paper is<br />
formulated as<br />
viG , xiG , Fxbest, GxiG , Fxr 1, Gxr2, G<br />
,<br />
i 1,2..., NP<br />
(12)<br />
where xr and x<br />
1 r are randomly chosen individuals from<br />
2<br />
interval [1,NP], x best,G represents the best individual <strong>of</strong><br />
the generation G and F is the weight.<br />
The following step is crossover, where for the each<br />
mutant vector a new trial vector u iG , is produced via<br />
“binary” crossover:<br />
vi, j, G if rand(0,1) CR or j jrand<br />
<br />
ui,<br />
j, G<br />
<br />
xi, j, G if rand(0,1) CR or j jrand<br />
<br />
i 1,2..., NP, j 1,2,...,<br />
D<br />
(13)<br />
where CR is crossover constant selected by the user. The<br />
j rand is a randomly chosen integer from interval [1,…D],<br />
which ensures that the trial vector obtains at least one <strong>of</strong><br />
the parameters from the mutant vector.<br />
In the last step, known as selection, DE evaluates trial and<br />
target vector, commonly referred to as parent vector:<br />
<br />
iG , if f iG , f , <br />
<br />
u u xiG<br />
<br />
xiG<br />
, 1<br />
<br />
xiG , if f uiG , f x (14)<br />
iG , <br />
i 1,2..., NP<br />
where the lower objective function value occupies the<br />
position in next generation (G+1). This comparison is<br />
made for each <strong>of</strong> NP individuals and the new population<br />
in generation G+1 is selected and steps <strong>of</strong> DE algorithm<br />
start once again in the following order; mutation,<br />
crossover, evaluation and selection. The algorithm repeats<br />
all steps until one <strong>of</strong> the stopping criterions is reached.<br />
DE control parameters F, CR and strategy are selected<br />
by the user and have an important influence on the<br />
convergence time, global or local search and manner <strong>of</strong><br />
creating new mutants. Use <strong>of</strong> the standard DE for solving<br />
the presented optimization problem may not always lead<br />
towards the global solution, regardless <strong>of</strong> the effort given<br />
in order to choose the adequate control parameters. The<br />
algorithm can be easily trapped into local optimum and<br />
also the convergence time can be drastically increased. In<br />
order to overcome these problems the modified algorithm<br />
uses self-adaptive F and CR. For the initial generation<br />
both <strong>of</strong> the control parameter are selected by the user and<br />
vary along with the iteration number according to (15)<br />
and (16):<br />
FRif f( xbest, G1) f( xbest,<br />
G)<br />
<br />
FiG<br />
, 1<br />
<br />
FiG , Otherwise.<br />
(15)<br />
<br />
i 1,2..., NP<br />
If the algorithm finds a better solution in generation G<br />
- 400 - 15th IGTE Symposium 2012<br />
compared to generation G - 1, then a randomly selected<br />
FR is employed in generation G + 1.<br />
CRR if FiG , 1<br />
FiG , and rand(0,1)<br />
<br />
CRiG<br />
, 1<br />
<br />
CRiG , Otherwise.<br />
(16)<br />
<br />
i 1,2..., NP<br />
A random CRR in generation G+1 is also provided if a<br />
FR is previously employed and at the same time a<br />
randomly selected value from interval [0, 1] is lower than<br />
0.1 . The described modification loads toward the<br />
global solution and improves the convergence time. A<br />
further improvement in convergence time can be achieved<br />
by parallel computation.<br />
The presented optimization problem is a multiobjective<br />
problem [7], where three different objectives<br />
are merged into a single one by using the weighted sum<br />
method [8]. The first goal <strong>of</strong> the optimization process is<br />
the satisfied demand for 24 hours by scheduling<br />
reservoirs <strong>of</strong> cascade HPP. The satisfied demand should<br />
be followed by the decreased usage <strong>of</strong> water quantity per<br />
3<br />
electrical energy unit ( m MWh) which represents the<br />
second objective. The third objective represents the<br />
decreased and eliminated overflow, respectively. The<br />
objective function for each individual objective is<br />
expressed as:<br />
2<br />
24 8 <br />
<br />
<br />
t t<br />
1 demand i,opt<br />
<br />
<br />
<br />
<br />
<br />
t1 i1<br />
<br />
1<br />
f W W <br />
<br />
24<br />
<br />
8 24 <br />
t Qi,T<br />
<br />
<br />
i1 t1<br />
<br />
2 <br />
<br />
8 24 <br />
t Wi,opt<br />
<br />
<br />
i1 t1<br />
<br />
8 24 <br />
t<br />
3 i,O<br />
<br />
i1 t1<br />
f<br />
f Q<br />
<br />
(17)<br />
t<br />
Demand energy Wdemand and optimal production energy<br />
t<br />
i,opt<br />
W is formulated as a product <strong>of</strong> power P and time t<br />
W PtWh (18)<br />
The unified objective function f is defined as<br />
f f1w1 f2w2 f3w3 (19)<br />
where each individual objective is normalized and<br />
weights are set according to the selected priority <strong>of</strong> the<br />
individual objective. The values selected for a given<br />
problem were 0.6, 0.15 and 0.25, respectively.<br />
IV. RESULTS<br />
The proposed modified DE algorithm has been used in<br />
order to achieve globally optimal production <strong>of</strong> the<br />
cascade <strong>of</strong> the HPP and to satisfy the demand,<br />
respectively. The test data used was a real 24 hours<br />
demand plan from SCADA. It has been shown in Table I<br />
and it is valid for the observed day in the past and<br />
practically realized by scheduling reservoirs.
Time<br />
- 401 - 15th IGTE Symposium 2012<br />
Table I: The satisfied demand by scheduling reservoirs for the dispatcher, standard and modified DE<br />
Demand scheduling the<br />
reservoirs by dispatcher<br />
(real data from SCADA )<br />
Energy<br />
( MWh )<br />
Satisfied demand by scheduling<br />
reservoirs - standard DE<br />
Satisfied demand by scheduling<br />
reservoirs - modified DE<br />
Water discharge Energy Water discharge Energy Water discharge<br />
( /h ) ( MWh ) ( / h ) ( MWh ) ( / h )<br />
1 19.8 828000 0 0 19.0 738446<br />
2 6.8 349200 6.0 284205 7.0 218980<br />
3 0 0 0 0 0 0<br />
4 0 0 0 0 0 0<br />
5 0 0 0 0 0 0<br />
6 0 0 3.0 154185 0 0<br />
7 89.2 2080800 39.0 1372293 89.0 2106432<br />
8 368.6 8355600 368.1 9151060 368.6 8696690<br />
9 417.5 10018800 416.4 10288591 417.5 10553592<br />
10 315.5 7592400 314.7 6764925 315.5 8525579<br />
11 244.2 5968800 244.0 5853691 244.3 5827843<br />
12 313.3 7408800 312.7 7426771 313.3 7453704<br />
13 322.4 7740000 321.9 7392661 322.4 7548595<br />
14 332.7 8089200 332.4 8707231 332.6 7625784<br />
15 320.1 7776000 319.6 7223638 320.2 8137026<br />
16 306.7 7315200 306.4 6806393 306.7 6557976<br />
17 403.3 9518400 403.2 9281914 403.3 9743537<br />
18 397.2 9378000 396.7 9344417 397.1 9186977<br />
19 391.9 9381600 391.4 9250750 391.9 9260050<br />
20 306.5 7452000 306.6 6604082 306.5 7079875<br />
21 290.1 7185600 289.6 6610318 290.0 7440151<br />
22 272.3 6494400 272.2 5624925 272.3 5849220<br />
23 265.2 6091200 265.1 6813626 265.2 5808105<br />
24 249.0 5763600 248.6 5207242 248.9 5301717<br />
Total 5632.2 134787600 5557.4 130162918 5631.3 133660279<br />
The scheduling has been made by the dispatch personnel<br />
<strong>of</strong> the DEM Company. This data has been used as a<br />
reference followed by optimization algorithm – the DE<br />
and the modified DE – with the objective to satisfy the<br />
given demand by determining the optimal production <strong>of</strong><br />
individual HPP during the 24 hour period. According to<br />
the results from Table I, the modified DE compared to<br />
manual dispatch saved approximately 1.12 million<br />
3<br />
m <strong>of</strong><br />
water, which equals to approximately 50 MWh less <strong>of</strong><br />
potential energy used.<br />
Authors [8] showed DE’s control parameters impact on<br />
convergence and global optimization performance. By<br />
using smaller F values, a local optimum can be reached<br />
faster, while a global one can be reached by choosing<br />
larger values. Selection <strong>of</strong> larger CR values can reduce a<br />
convergence time. In order to improve algorithm’s<br />
performance on a given optimization problem, a search<br />
for suitable DE control parameters was not successful,<br />
although the best result have been obtained by using the<br />
following parameters F = 0.5, CR = 0.8 and strategy = 3.<br />
However, the standard DE was not able to achieve the<br />
global optimum and to minimize the difference between<br />
the demand and the production down to zero,<br />
respectively.<br />
Authors [9] have shown the benefits <strong>of</strong> self-adaptive<br />
parameters control. This was a solid research direction in<br />
this paper and in order to achieve the global optimum, the<br />
modified DE with self-adjusting F and CR has been<br />
proposed. The modified DE (Fig. 2) stopped the<br />
evolution process after 200 generations with the<br />
convergence time <strong>of</strong> 280 seconds, while the standard DE<br />
(Fig. 3) stopped after 1000 generations and 1050 seconds.<br />
The stopping criterion in this case was 500 generations<br />
without <strong>of</strong> any change in objective function value.<br />
The final result at the end <strong>of</strong> the optimization process by<br />
using the modified DE algorithm is an optimal 24 h<br />
production <strong>of</strong> each individual HPP. Such a production<br />
completely satisfies the demand. The optimal 24 hours<br />
production <strong>of</strong> individual HPP is shown in Fig.4 and the<br />
corresponding reservoir volumes during the 24 hours<br />
period are shown in Fig. 5.
Objectives<br />
1.4<br />
1.2<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
0 20 40 60 80 100 120 140 160 180 200<br />
Generation<br />
Figure 2: Convergence <strong>of</strong> the unified and three individual<br />
objective functions values by using the modified DE<br />
Electrical Enegy Production [MWh]<br />
450<br />
400<br />
350<br />
300<br />
250<br />
200<br />
150<br />
100<br />
50<br />
HPP1<br />
HPP2<br />
HPP3<br />
HPP4<br />
HPP5<br />
HPP6<br />
HPP7<br />
HPP8<br />
Demand<br />
0<br />
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24<br />
Time [h]<br />
Figure 4: Optimal production <strong>of</strong> individual HPP proposed<br />
by the modified DE<br />
Vmax<br />
Volume [m 3 ]<br />
HPP 1<br />
HPP 2<br />
HPP 3<br />
HPP 4<br />
HPP 5<br />
HPP 6<br />
HPP 7<br />
HPP 8<br />
Vmin<br />
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24<br />
Time [h]<br />
Figure 5: Charging and discharging <strong>of</strong> reservoirs during<br />
the optimal production <strong>of</strong> individual HPP<br />
V. CONCLUSION<br />
The modified DE algorithm in this paper was capable<br />
<strong>of</strong> solving complex optimization problem. As shown on<br />
the presented optimization problem, the algorithm was<br />
f<br />
f<br />
1<br />
f 2<br />
f 3<br />
- 402 - 15th IGTE Symposium 2012<br />
Objectives<br />
1.4<br />
1.2<br />
1<br />
0.8<br />
0.6<br />
0.4<br />
0.2<br />
0<br />
0 100 200 300 400 500 600 700 800 900 1000<br />
Generation<br />
Figure 3: Convergence <strong>of</strong> the unified and three individual<br />
objective functions values by using the standard DE<br />
able to satisfy the demand along with the fast<br />
convergence speed. The optimization problem was<br />
observed in period <strong>of</strong> 24 hours and includes 8 HPP’s.<br />
Therefore it has 192 variables that need to be identified.<br />
Despite this fact, the algorithm ensures the exploration <strong>of</strong><br />
large solution space in time span <strong>of</strong> several minutes and<br />
provides the global solution. The dispatch personnel is<br />
unable to explore such a large solution space and the<br />
production <strong>of</strong> individual HPP is determined on the basis<br />
<strong>of</strong> previous experiences and therefore, non-optimally. By<br />
using the modified DE algorithm, the dispatch personnel<br />
obtain the guide which improves the overall production<br />
efficiency. The algorithm can also be used to indicate<br />
whether a given demand plan is feasible.<br />
REFERENCES<br />
[1] S. Vazquez, S. M. Lukic, E. Galvan, L. G. Franquelo and J. M.<br />
Carrasco “Energy Storage System for Transport and Grid<br />
Applications”, IEEE Transactions on industrial electronics, Vol.<br />
57, pp. 3881-3895, December 2010.<br />
[2] I. A. Farhat, M.E. El-Hawary, “Optimization methods applied for<br />
solving the short-term hydrothermal coordination problem,”<br />
Electric Power System Research, pp. 1308-1320, 2009.<br />
[3] R. Storn, K. Price, “Differential Evolution – A simple and<br />
efficient adaptive scheme for global optimization over continuous<br />
spaces,” Journal <strong>of</strong> Global Optimization, pp. 341-359, 1997.<br />
[4] X. Yuan, Y. Zhang, L. Wang, Y. Yuan, “An enhanced differential<br />
evolution algorithm for daily optimal hydro generation<br />
[5]<br />
scheduling,” Computers and Mathematics with Applications, pp.<br />
2458-2468, 2008.<br />
L. Lakshminarasimman, S. Subramanian, “Short-term scheduling<br />
<strong>of</strong> hydrothermal power system cascaded reservoirs by using<br />
modified differential evolution,” IEEE Proc.-Gener. Transm.<br />
Distrib., Vol. 153, pp. 693-700, 2006.<br />
[6] Y. Li, J. Zuo, “Optimal Scheduling <strong>of</strong> Cascade Hydropower<br />
System Using Grouping Differential Evolution Algorithm,”<br />
International Conference on Computer Science and Electronic<br />
Engineering, pp. 625-629, 2012.<br />
[7] J. Grobler, A.P. Engelbrecht, V.S.S. Yadavalli, “Multi-objective<br />
DE and PSO Strategies for Production Scheduling,” IEEE<br />
Congres on Evolutionary Computation, pp. 1154-1161, 2008.<br />
[8] R. Gamperle, S.D. Muller, P. Koumoutsakos, “A parameter Study<br />
for Differential Evolution,” Conf. on Adances in Intelligent<br />
System, Fuzzy Systems, pp. 293-298,2002.<br />
[9] J. Brest, V. Zumer, M.S. Maucec, “Self-Adaptive Differential<br />
Evolution Algorithm in Constrained Real-Parameter<br />
Optimization,” IEEE Conges on Evolutionary Computation, pp.<br />
215-222, 2006.<br />
f<br />
f<br />
1<br />
f 2<br />
f 3
- 403 - 15th IGTE Symposium 2012<br />
Adaptive Surrogate Approach for Bayesian<br />
Inference in Inverse Problems<br />
M. Neumayer∗ ,H.R.B.Orlande ‡ ,M.J.Colaço ‡ , D. Watzenig∗ ,G.Steiner∗ , B. Brandstätter † ,andG.S.<br />
Dulikravich §<br />
∗Institute <strong>of</strong> Electrical Measurement and Measurement Signal Processing, <strong>Graz</strong> <strong>University</strong> <strong>of</strong> <strong>Technology</strong>, <strong>Graz</strong>,<br />
Austria, † Elin Motoren GmbH, Elinmotorenstrasse 1, A-8160 Preding/Weiz, Austria, ‡ Department <strong>of</strong> Mechanical<br />
Engineering, Federal <strong>University</strong> <strong>of</strong> Rio de Janeiro, UFRJ Rio de Janeiro, RJ, Brazil, § Department <strong>of</strong> Mechanical<br />
and Materials Engineering, Florida International <strong>University</strong> Miami, Florida, U.S.A.<br />
E-mail: neumayer@TU<strong>Graz</strong>.at<br />
Abstract—Bayesian inference forms a flexible and versatile solution strategy for inverse problems. Its advantage lies in the<br />
straight forward formulation <strong>of</strong> the solution process, the ability to incorporate any existing knowledge, as well as in the<br />
output <strong>of</strong> the method itself, which provides statistical knowledge about unknown parameters. The costs <strong>of</strong> the mentioned<br />
benefits are <strong>of</strong>ten largely increased numerical efforts due to the use <strong>of</strong> sampling methods. This especially holds if the<br />
underlying physical problem requires the solution <strong>of</strong> a partial differential equation. In this paper we present a simple, yet<br />
versatile and effective strategy to accelerate Bayesian inference using an adaptive surrogate approach.<br />
Index Terms—surrogate technique, adaptive, Bayesian inference, MCMC<br />
I. INTRODUCTION<br />
Inverse problems and parameter estimation problems<br />
belong to the class <strong>of</strong> indirect measurement problems<br />
where one tries to estimate a parameter vector x ∈ R N<br />
from observations ˜ d ∈ R M [1]. They arise in many<br />
disciplines <strong>of</strong> engineering and science. The term inverse<br />
problem is most <strong>of</strong>ten associated with imaging techniques<br />
like electrical capacitance/impedance tomography (ECT<br />
and EIT), or computed tomography, but their mathematical<br />
common is their inherent ill-posed nature. Parameter<br />
estimation has not that massive association with imaging<br />
like inverse problems, but is in many ways <strong>of</strong> even larger<br />
importance in engineering. Such an example is given by<br />
the determination <strong>of</strong> material parameters from ”simple”<br />
a simple measurements setup.<br />
Formally the physical measurement process P can be<br />
denoted by P : x ↦→ ˜ d. Hereby ˜ d is the corrupted version<br />
<strong>of</strong> the otherwise noise free measurements d. Formost<br />
practical examples an additive noise model <strong>of</strong> form ˜ d =<br />
d + v is valid, where v ∈ R M follows a certain noise<br />
distribution described by a probability density function<br />
(pdf). The modern model based approach to estimate x<br />
from ˜ d maintains a model F : x ↦→ y (y ∈ R M )whichis<br />
referred to as forward map. For most real world problems<br />
this is a computer model solving the underlying partial<br />
differential equations (PDEs) for P in a numerical way.<br />
In this paper we will assume that P = F holds.<br />
Classical deterministic inversion methods then manipulate<br />
the vector x in order to minimize some useful norm<br />
<strong>of</strong> the residual vector e = y − ˜ d. In addition ill-posed<br />
problems require a regularization term for the numerical<br />
stabilization <strong>of</strong> such an optimization problem. The single<br />
result <strong>of</strong> the approach is referred to as point estimate,<br />
which we will denote by xMAP (maximum a posteriori)<br />
as the value <strong>of</strong> x which provides the smallest misfit.<br />
A contrastable approach to solve inverse problems is<br />
provided by the framework <strong>of</strong> Bayesian inference [2].<br />
Rather than providing a single result, Bayesian inference<br />
approaches provide the summary distribution π(x| ˜ d) (the<br />
posterior distribution, MAP). Out <strong>of</strong> this any statistics,<br />
like mean, variance, correlations, MAP estimates, etc.<br />
about x can be computed. The cost for this gain in<br />
information are the increased computational costs, as<br />
the approach requires numerous evaluations <strong>of</strong> F .This<br />
especially holds for the case that Markov chain Monte<br />
Carlo (MCMC) methods are applied. Thus, the practicability<br />
for the application <strong>of</strong> Bayesian methods is limited<br />
if the evaluation <strong>of</strong> F requires computational expensive<br />
operations like the numerical solution <strong>of</strong> PDEs.<br />
In this paper we will present a simple and versatile<br />
strategy to speed up Bayesian inference for inverse problems<br />
and parameter estimation problems. The speed up<br />
is provided by the use <strong>of</strong> an approximation or surrogate<br />
model [3]. The paper is structured as follows. In section<br />
II we will introduce the theory about Bayesian inversion<br />
and the exploration <strong>of</strong> the posterior distribution by the<br />
Metropolis Hastings (MH) algorithm. In section III we<br />
explain an acceleration approach for the MH which is<br />
based on the use <strong>of</strong> approximations. Finally we will<br />
present a numerical example where we estimate thermal<br />
material parameters from a heated slab.<br />
II. BAYESIAN INFERENCE AND MARKOV CHAIN<br />
MONTE CARLO<br />
In this section we will present the framework <strong>of</strong><br />
Bayesian inversion for the solution <strong>of</strong> inverse problems.<br />
Having measurements ˜ d from a measurement process P<br />
and a model F to simulate P , the solution process is
marked by the use <strong>of</strong> Bayes law [1]<br />
π(x| ˜ d)= π(˜ d|x)π(x)<br />
π( ˜ ∝ π(<br />
d)<br />
˜ d|x)π(x). (1)<br />
The law connects the so called likelihood function<br />
π( ˜ d|x) and the prior π(x) to formulate the posterior<br />
distribution π(x| ˜ d). π( ˜ d) is termed the evidence and<br />
has the role <strong>of</strong> a normalization constant to ensure the<br />
property <br />
RN π(x| ˜ d)dx =1<strong>of</strong> a pdf. Hence, it can be<br />
skipped leading to the right hand formula in equation (1).<br />
The likelihood function π( ˜ d|x) provides the probability<br />
measure for x causing the data ˜ d given the model and<br />
statistical knowledge about the measurement noise. For<br />
an additive noise model ˜ d = d + v, the likelihood is<br />
given by π( ˜ d|x) =πv(y − ˜ d), whereyisthe output<br />
<strong>of</strong> the forward map. For many practical problems zero<br />
mean white Gaussian noise, i.e. v ∝N(0, Σv), where<br />
Σv is the covariance matrix, can be assumed. Then the<br />
likelihood function becomes<br />
π( ˜ <br />
d|x) ∝ exp − 1<br />
<br />
y −<br />
2<br />
˜ T <br />
−1<br />
d Σ y − ˜ <br />
d<br />
<br />
, (2)<br />
where Σ is set to Σv.<br />
The prior π(x) provides a probability measure about x<br />
being the solution. While the design <strong>of</strong> the likelihood has<br />
to follow strict mathematical rules due to its definition<br />
the prior provides a very flexible way to incorporate<br />
expert knowledge about x. I.e. if we know that the ith<br />
component <strong>of</strong> x has a lower and an upper bound the<br />
corresponding prior is given by the uniform distribution<br />
xi ∝U(xi,min,xi,max). For the case that a mean value<br />
about the j-th component is known a Gaussian distribution<br />
xj ∝N(μxj ,σxj ) can be used to express the prior<br />
where σxj controls the deviation.<br />
The posterior π(x| ˜ d) expresses the probability for x<br />
being the solution given the data ˜ d, the model and the<br />
prior. Rather than a single result, the posterior covers<br />
all possible solutions. For a post analysis <strong>of</strong> π(x| ˜ d) one<br />
could look at a specific realization <strong>of</strong> x and evaluate its<br />
probability. However, as this procedure is not <strong>of</strong> big use<br />
some meaningful point measures have become popular.<br />
One <strong>of</strong> them is the maximum a posteriori (MAP) estimate<br />
xMAP =argmaxx π( ˜ d|x), which is the mode <strong>of</strong> the<br />
posterior. The other one is the conditional mean (CM)<br />
estimate<br />
<br />
xCM = xπ( ˜ d|x)dx, (3)<br />
R N<br />
which summarizes the complete distribution. It can be<br />
easily seen, that the MAP estimate can be found by<br />
solving an optimization problem by either maximizing<br />
the posterior, or minimizing its logarithm. This corresponds<br />
to classical regularized approaches except, that<br />
the likelihood introduces statistical knowledge about the<br />
noise. This fact results in generally higher modeling<br />
efforts when using Bayesian methods. The CM estimate<br />
requires the evaluation <strong>of</strong> a high dimensional integral.<br />
An analytic solution <strong>of</strong> the integral is <strong>of</strong>ten not possible,<br />
- 404 - 15th IGTE Symposium 2012<br />
as the integral is <strong>of</strong> high dimension and also because<br />
<strong>of</strong> the complicated interaction <strong>of</strong> the forward map. Also<br />
standard numerical schemes like the well known Gauss<br />
quadrature cannot be applied for such integrals, due to<br />
the lack <strong>of</strong> knowledge about the support. The numerical<br />
tool to solve such integrals is known as Monte Carlo<br />
integration. Hereby a set <strong>of</strong> samples x (N) from the posterior<br />
is generated, where the frequency <strong>of</strong> the samples<br />
follows the target distribution. Then the CM integral can<br />
be approximated by<br />
xCM =<br />
<br />
R N<br />
xπ( ˜ d|x)dx ≈<br />
N<br />
i=1<br />
x (N)<br />
i . (4)<br />
In the same way any other integral (also about functions<br />
<strong>of</strong> x) can be solved. The generation <strong>of</strong> samples from<br />
a distribution belongs to the discipline <strong>of</strong> computational<br />
Bayesian inference and will be discussed in the following<br />
subsection.<br />
One important aspect about the Bayesian framework<br />
which was not stated so far is the possibility to treat<br />
nuisance parameters ν in the same way as the state vector<br />
x. I.e.iftheforwardmapF is in fact a function Fν(x)<br />
it is possible to do inference about both, x and ν in<br />
the same natural way. This can be used if parameters<br />
<strong>of</strong> a measurement system are unknown or provide an<br />
uncertain factor.<br />
A. The Metropolis Hastings (MH) Algorithm<br />
Algorithms for practical computational Bayesian inference<br />
are typically sampling algorithms [4]. They can be<br />
seen as random number generators which compute independent<br />
samples from an arbitrary target distribution. For<br />
inverse problems the target distribution is the posterior.<br />
On a discrete state space the frequency <strong>of</strong> certain samples<br />
corresponds to the probability <strong>of</strong> the sample, enabling the<br />
powerful tool <strong>of</strong> Monte Carlo integration. As can be seen<br />
by equation (2), the evaluation <strong>of</strong> π( ˜ d|x) requires one<br />
evaluation <strong>of</strong> the forward map F . This already indicates<br />
the fact, that sampling methods result in generally higher<br />
computational cost. For computational inference a class<br />
<strong>of</strong> algorithms termed MCMC methods were developed,<br />
as they rely on an underlying Markov chain X. TheMH<br />
algorithm [5] is one prominent example out <strong>of</strong> this class<br />
<strong>of</strong> methods. The algorithm works as the following:<br />
1) Pick the current state x = Xn from the Markov<br />
chain.<br />
2) With proposal density q(x, x ′ ) generate a new<br />
state x ′ . <br />
3) Compute α = min 1, π(x′ | ˜ d)q(x ′ ,x)<br />
π(x| ˜ d)q(x,x ′ <br />
.<br />
)<br />
4) With probability α accept x ′ and set Xn+1 = x ′ ,<br />
otherwise reject x ′ and set Xn+1 = x.<br />
Starting from the current state x <strong>of</strong> the Markov chain<br />
(line 1) X the MH algorithm generates a proposal<br />
candidate x ′ (line 2) using the proposal kernel q(x, x ′ ).<br />
Then the acceptance ration α is evaluated in line 3 for the<br />
proposal x ′ , which requires one evaluation <strong>of</strong> the forward
map. If the proposal is accepted it becomes the new state<br />
<strong>of</strong> the Markov Chain, otherwise it gets rejected. The<br />
rejection <strong>of</strong> proposal candidates is critical with respect<br />
to the computational efficiency <strong>of</strong> the MH algorithm, as<br />
a high rejection rate, leads to a large number <strong>of</strong> forward<br />
map evaluations without generating a new state. This is<br />
strongly affected by the proposal kernel q(x, x ′ ) which<br />
drives the exploration <strong>of</strong> the posterior distribution.<br />
III. ACCELERATION OF THE MH USING SURROGATES<br />
The strategy we use to speed up the classical MH<br />
algorithm is based on the use <strong>of</strong> an approximation or<br />
surrogate F ∗ [6]. An approximation F ∗ has a considerable<br />
lower runtime with respect to F but at the cost<br />
<strong>of</strong> an approximation error e = y − y∗ . Subsequently<br />
we introduce the likelihood function π∗ ( ˜ d|x) to indicate<br />
the use <strong>of</strong> F ∗ . Then the delayed acceptance Metropolis<br />
Hastings (DAMH) algorithm [7] is given by<br />
1) Pick the current state x = Xn from the Markov<br />
chain.<br />
2) With proposal density q(x, x ′ ) generate a new<br />
state x ′ . <br />
3) Compute α = min 1, π∗ (x ′ | ˜ d)q(x ′ ,x)<br />
π∗ (x| ˜ d)q(x,x ′ <br />
.<br />
)<br />
4) With probability α accept x ′ to be a proposal for<br />
the standard MH algorithm. Otherwise set x ′ = x<br />
and return to 2. <br />
5) Compute β = min 1, π(x′ | ˜ d)q(x ′ ,x)<br />
π(x| ˜ d)q(x,x ′ <br />
.<br />
)<br />
6) With probability β accept x ′ and Xn+1 = x ′ ,<br />
otherwise reject x ′ and set Xn+1 = x.<br />
As can be seen, the DAMH algorithm consists <strong>of</strong> two<br />
nested MH algorithms (in the original MH algorithm<br />
step 3 and 4 do not exist). The DAMH tries to gain<br />
its advantage from a pre-evaluation <strong>of</strong> the proposal candidates<br />
x ′ on the distribution π∗ (x ′ | ˜ d). An evaluation<br />
<strong>of</strong> π(x ′ | ˜ d) in the inner MH is only performed if the<br />
proposal is accepted in the outer MH. In this sense the<br />
outer MH algorithm <strong>of</strong> the DAMH can be seen as a filter<br />
for bad proposals or as an improved proposal generator.<br />
It is obvious that the gain in performance gain strongly<br />
depends on the difference between π∗ (x ′ | ˜ d) and π(x ′ | ˜ d).<br />
An interesting point about the DAMH is the availability<br />
<strong>of</strong> the deterministic approximation error e = y − y∗ in line 5. This knowledge can be used to improve the<br />
algorithm by two points:<br />
• Learn about the approximation error to adapt the<br />
likelihood π∗ (x ′ | ˜ d).<br />
• Adapt the approximation F ∗ to improve the quality.<br />
The approach to incorporate knowledge about the approximation<br />
error is referred to as enhanced error model<br />
(EEM) [1]. Hereby the deterministic approximation error<br />
e is treated as a random variable. Mostly a Gaussian<br />
distribution about e is assumed, describing the error as<br />
e ∝N(μe, Σe). Then the likelihood function π∗ (x ′ | ˜ d)<br />
becomes π∗ ( ˜ d|x) ∝<br />
<br />
exp − 1<br />
<br />
y<br />
2<br />
∗ + μe − ˜ T <br />
−1<br />
d Σ y ∗ + μe − ˜ <br />
d<br />
<br />
,<br />
(5)<br />
- 405 - 15th IGTE Symposium 2012<br />
where Σ is the sum <strong>of</strong> Σv and Σe. In its original idea the<br />
distribution N (μe, Σe) is computed using samples over<br />
the prior π(x). However, with the availability <strong>of</strong> current<br />
value <strong>of</strong> en in the DAMH an adaptive approximation<br />
error model can be built by [8]<br />
μe,n = 1 <br />
(n − 1)μe,n−1 + en , (6)<br />
n<br />
Ce,n = Ce,n−1 + ene T n , (7)<br />
1 <br />
Σe,n = (n − 1)Ce,n − nμ<br />
n − 1<br />
e,nμ T <br />
e,n . (8)<br />
Due to this the likelihood π∗ ( ˜ d|x) adapts to the posterior<br />
during the runtime, which means that no sampling <strong>of</strong><br />
π(x) is necessary to built N (μe, Σe) in the priming <strong>of</strong><br />
the solution process for the data ˜ d.<br />
The second point addresses the possibility to use<br />
the knowledge about e to improve the quality <strong>of</strong> the<br />
approximation F ∗ during the runtime. A considerable<br />
simple update is possible if F ∗ is <strong>of</strong> form y ∗ = Pxa.<br />
Hereby xa denotes the augmented state vector, which<br />
holds x in an adequate form, i.e. arbitrary functions<br />
<strong>of</strong> the components <strong>of</strong> x or additional variables like the<br />
simulation time for transient problems.<br />
Thus, the approximation can be turned nonlinear with<br />
respect to x but it is linear with respect to the elements <strong>of</strong><br />
P . This is important, as for this class <strong>of</strong> approximations<br />
a number <strong>of</strong> update algorithms exist. In this work we use<br />
the least mean squares (LMS) algorithm given by [9]<br />
P n+1 = P n + γenx T a,n , (9)<br />
where γ is a step width parameter known as adaptation<br />
coefficient. The simpleness <strong>of</strong> the LMS update provides<br />
almost no computational costs and helps to improve<br />
the quality <strong>of</strong> the the approximation F ∗ for the<br />
posterior distribution. Again the initial matrix P can<br />
be determined by samples over the prior distribution<br />
π(x). For this the overdetermined equation system<br />
X aP T = Y has to be assembled and solved, where the<br />
matrix X a holds the augmented state vectors from the<br />
samples, and Y contains the exact solutions evaluated<br />
by F . There is also the possibility to run the standard<br />
MH for some time to learn about P and then switch<br />
to the DAMH. The choice <strong>of</strong> the adaptation parameter<br />
γ affects the learning speed <strong>of</strong> the LMS algorithm. For<br />
stability reasons γ has an upper limit which depends on<br />
the problem and can only be derived under restrictive<br />
conditions. However, as an MCMC algorithm provides a<br />
enormous number <strong>of</strong> evaluations it is less critical to set<br />
μ to a small value, as even this provides an improvement<br />
(although slower) to the approximation P and the LMS<br />
algorithm operates in a stable state.<br />
To use both, the update <strong>of</strong> the approximation y ∗ =<br />
Pxa and the adaptive error model, the approximation<br />
should be reevaluated for the current state vector x.<br />
This requires a second evaluation <strong>of</strong> F ∗ , but this is<br />
computational cheap due to the design <strong>of</strong> F ∗ .
IV. A NUMERICAL EXAMPLE<br />
To demonstrate our approach on a numerical example<br />
we consider an indirect measurement problem where we<br />
want to estimate thermophysical properties <strong>of</strong> a slab<br />
from a transient heat transfer experiment. We consider<br />
a slab <strong>of</strong> length L which we model by means <strong>of</strong> a<br />
1D simulation in the domain Ω : 0 ≤ x ≤ L. The<br />
slab is initially at the uniform temperature ϑ0. Onthe<br />
left side (x = 0) a uniform heat flux J is applied<br />
by an electric heater. On the right side at x = L the<br />
temperature ϑ(L, t) is measured over time. The heat on<br />
this side is exchanged by convection with the surrounding<br />
media at the temperature ϑ0. This exchange depends on<br />
a heat transfer coefficient α in Wm −2 K −1 .Thereareno<br />
heat sources within the medium and the thermophysical<br />
properties are supposed constant in the first assumption.<br />
The mathematical formulation for this heat conduction<br />
problem is given by:<br />
1 dϑ<br />
k dt = ∂2ϑ ∂x2 −λ<br />
in 0 0 (11)<br />
∂ϑ<br />
∂x + αϑ = αϑ0 at x = L, fort>0 (12)<br />
ϑ = ϑ0 for t =0,in0
σ e<br />
- 407 - 15th IGTE Symposium 2012<br />
TABLE I<br />
SUMMARY OF THE RESULTS FOR THE LINEAR CASE.<br />
Nr. Experiment σv<br />
K<br />
μλ<br />
W<br />
mK<br />
σλ<br />
W<br />
mK<br />
μα<br />
W<br />
m<br />
σα μk1<br />
σk1<br />
Tsim,r<br />
2K W<br />
m2K Ω<br />
m2 Ω<br />
m2 true 0.12 11 4.5 × 10<br />
%<br />
−3<br />
1 F 0.1 0.12 6.6 × 10−3 11.4 0.68 4.6 × 10−3 2.3 × 10−4 100<br />
2 F ∗ 1 0.1 0.13 6.3 × 10−3 10.7 0.59 4.3 × 10−3 1.7 × 10−4 3 F<br />
75<br />
∗ 2 0.1 0.12 8.4 × 10−3 10.2 0.23 4.3 × 10−3 6.0 × 10−4 4 F<br />
12<br />
∗ 3 0.1 0.12 4.3 × 10−3 12.1 0.29 4.6 × 10−3 2.3 × 10−5 5 F 0.5 0.13 7.0 × 10<br />
12<br />
−3 12.7 1.72 4.7 × 10−3 4.2 × 10−4 100<br />
6 F ∗ 1 0.5 0.13 5.9 × 10−3 9.8 1.42 4.2 × 10−3 3.5 × 10−4 7 F<br />
110<br />
∗ 2 0.5 0.13 3.1 × 10−3 10.5 1.10 4.4 × 10−3 2.9 × 10−4 8 F<br />
44<br />
∗ 3 0.5 0.13 3.3 × 10−3 10.5 1.05 4.4 × 10−3 2.6 × 10−4 44<br />
TABLE II<br />
SUMMARY OF THE RESULTS FOR THE NONLINEAR CASE.<br />
Nr. Experiment σv<br />
K<br />
μλ<br />
W<br />
mK<br />
σλ<br />
W<br />
mK<br />
μα<br />
W<br />
m<br />
σα μk1<br />
σk1<br />
μk2<br />
σk2<br />
Tsim,rel<br />
2K W<br />
m2K Ω<br />
m2 Ω<br />
m2 W<br />
mK2 W<br />
mK2 true 0.12 11 4.5 × 10<br />
%<br />
−3 0.02<br />
1 F 0.1 0.123 5.5 × 10−3 12.0 0.31 4.8 × 10−3 1.1 × 10−4 0.013 4.6 × 10−3 100<br />
2 F 0.05 0.127 3.9 × 10−3 11.3 0.19 4.6 × 10−3 7.1 × 10−5 0.015 3 × 10−3 100<br />
3 F 0.5 0.130 9.8 × 10−3 13.4 1.42 5.2 × 10−3 4.3 × 10−4 0.018 11 × 10−3 100<br />
4 F ∗ 1 0.1 0.129 4.4 × 10−3 10.7 0.27 4.4 × 10−3 1.1 × 10−4 0.016 5.6 × 10−3 5 F<br />
97.6<br />
∗ 1 0.5 0.128 5.8 × 10−3 11.9 1.24 4.8 × 10−3 3.7 × 10−4 0.012 9.4 × 10−3 6 F<br />
140<br />
∗ 3 0.1 0.122 3.1 × 10−3 11.8 0.22 4.7 × 10−3 8.1 × 10−5 0.019 4.6 × 10−3 7 F<br />
20.9<br />
∗ 3 0.5 0.128 5 × 10−3 10.8 1.29 4.6 × 10−3 4.2 × 10−4 0.017 9.4 × 10−3 72.5<br />
0.12<br />
0.1<br />
0.08<br />
0.06<br />
0.04<br />
0.02<br />
0<br />
20<br />
15<br />
10<br />
dt (s)<br />
5<br />
10<br />
8<br />
6<br />
# FE<br />
(a) Standard deviation <strong>of</strong> e over<br />
the discretization for F ∗ 1 .<br />
4<br />
2<br />
18000<br />
16000<br />
14000<br />
12000<br />
10000<br />
Fig. 1. Approximation error e for F ∗ 1 and F ∗ 2<br />
8000<br />
6000<br />
4000<br />
2000<br />
0<br />
−1.5 −1 −0.5 0 0.5 1 1.5<br />
e<br />
(b) Distribution (pdf) <strong>of</strong> e for F ∗ 2 .<br />
over the prior.<br />
how to analyze the results we refer to [4]. For<br />
the simulation we used the following parameters:<br />
L = 0.1 m, ρ = 1040 kgm −3 , c = 1350 Jkg −1 K −1 .<br />
The ambient temperature ϑ0 was set to ϑ0 = 20 ◦ C,<br />
the electrical current I was set to I = 100 A. We<br />
assumed that ϑ(L, t) is measured every 20 seconds for<br />
3000 s. The priors for the state vector are given by a<br />
Gaussian distribution with μλ = 0.13 Wm −1 K −1 and<br />
σλ =0.01 Wm −1 K −1 for λ and uniform distributions<br />
with the boundaries 1Wm −2 K −1 ≤ α ≤ 15 Wm −2 K −1 ,<br />
0.001 Ωm −2 ≤ k1 ≤ 0.01 Ωm −2 , and<br />
0Wm −1 K −2 ≤ k2 ≤ 0.04 Wm −1 K −2 (nonlinear case)<br />
for the remaining variables. The proposal generation is<br />
done by randomly selecting a component <strong>of</strong> the state<br />
vector x. Forλ the proposal is generated from the prior<br />
about λ. Forα and k1 an additive Gaussian distributed<br />
random variable with a standard deviation being 4% <strong>of</strong><br />
the range given by the prior is added to the current state.<br />
In the nonlinear case we only use the approximations<br />
F ∗ 1 and F ∗ 3 .<br />
Figure 2 depicts the output <strong>of</strong> the Markov chain for<br />
TABLE III<br />
BEHAVIOR OF THE CHAINS FOR THE LINEAR CASE.<br />
Nr. Experiment σv Acα Ac β|α Acβ τIACT<br />
K % % %<br />
1 F 0.1 16.4 X X 392<br />
2 F ∗ 1 0.1 18.0 75.8 13.7 1620<br />
3 F ∗ 2 0.1 10.3 65.8 6.8 420<br />
4 F ∗ 3 0.1 10.2 68.0 6.9 228<br />
5 F 0.5 56.3 X X 263<br />
6 F ∗ 1 0.5 54.4 82.9 45.1 110<br />
7 F ∗ 2 0.5 38.8 78.6 30.5 44<br />
8 F ∗ 3 0.5 38.2 78.6 29.8 37<br />
TABLE IV<br />
BEHAVIOR OF THE CHAINS FOR THE NONLINEAR CASE.<br />
Nr. Experiment σv Acα Ac β|α Acβ τIACT<br />
K % % %<br />
1 F 0.1 30.5 X X 57<br />
2 F 0.05 17.2 X X 63<br />
3 F 0.5 67.9 X X 132<br />
4 F ∗ 1 0.1 33.4 79.8 26.6 38<br />
5 F ∗ 1 0.5 64.6 87.1 56.3 72<br />
6 F ∗ 3 0.1 18.8 70.9 13.4 509<br />
7 F ∗ 3 0.5 56.0 86.1 48.3 121<br />
λ. From the histogram in figure 2(b) we can see the<br />
distribution. Table I and II summarize the results for the<br />
linear and the nonlinear case including the true values for<br />
the state vector x for different standard deviations σv <strong>of</strong><br />
the additive measurement noise. As it can be observed, all<br />
estimates meet the true values with reasonable accuracy.<br />
This especially holds for the linear case. For the nonlinear<br />
case some deviations occur, but that can be linked to<br />
the complexity <strong>of</strong> the problem. An interesting effect can<br />
be seen in the standard deviations <strong>of</strong> table I. Increased<br />
noise levels have a stronger effect on σk1 with respect to<br />
the other standard deviations. In the linear case a speed<br />
improvement <strong>of</strong> up to a factor <strong>of</strong> 10 for the linear, and 5
λ (Wm −1 K −1 )<br />
0.16<br />
0.15<br />
0.14<br />
0.13<br />
0.12<br />
0.11<br />
0 1 2 3 4 5 6<br />
x 10 4<br />
0.1<br />
# MCMC<br />
(a) MCMC output for λ.<br />
Fig. 2. MCMC output and analysis for λ.<br />
1000<br />
900<br />
800<br />
700<br />
600<br />
500<br />
400<br />
300<br />
200<br />
100<br />
0<br />
0.1 0.11 0.12 0.13<br />
λ (Wm<br />
0.14 0.15 0.16<br />
−1 K −1 )<br />
(b) Histogram plot for λ.<br />
for the nonlinear case could be achieved. The speed up<br />
also depends on the noise level, as the noise has direct<br />
influence on π(x| ˜ d) and thus affects the proposal kernel<br />
is important. Table III and IV provide statistics about the<br />
behavior <strong>of</strong> the algorithms. For the MH, the ratio Acα<br />
states the percentage <strong>of</strong> accepted proposal candidates. For<br />
the DAMH it states the ratio <strong>of</strong> accepted proposals in<br />
the first step and Acβ states the overall acceptance in<br />
the second step. The value Ac β|α states the acceptance<br />
in <strong>of</strong> a proposal in the second step, given an acceptance<br />
in the first step. Hence, this number provides a quality<br />
measure for the approximation F ∗ . The almost same<br />
level <strong>of</strong> Ac β|α in line 3 and 4, and line 7 and 8 in<br />
table III indicates, that the approximation already has<br />
a high quality, and that no further improvement could<br />
be achieved by the adaption. This corresponds to the<br />
observations <strong>of</strong> figure 1. The tables III and IV also<br />
explain the larger computation times for some DAMH<br />
variants with respect to the MH. In this case a high value<br />
<strong>of</strong> Acα results in a large number <strong>of</strong> evaluations <strong>of</strong> F .<br />
Thus, the sum <strong>of</strong> all evaluations <strong>of</strong> F and F ∗ increases<br />
the pure evaluation <strong>of</strong> F only. This also indicates, that<br />
the proposal kernel is yet not optimal for sampling from<br />
the posterior. The value τIACT is referred to as integrated<br />
auto correlation time (IACT). It was computed with the<br />
methods explained in [10] and provide a measure about<br />
the statistical efficiency <strong>of</strong> an MCMC algorithm by the<br />
distance between independent samples in the Markov<br />
chain. As can be seen, the DAMH variants have a lower<br />
IACT τIACT and thus are statistically more efficient.<br />
In section IV we stated the linear dependence <strong>of</strong> the<br />
parameters due to the equations (11) and (12) and that<br />
this circumstance can be observed in the results. Figure<br />
3 depicts a scatter plot for α and k1. The correlation<br />
factor can be computed with 0.98. From this we can<br />
conclude, that the measurement system is inappropriate<br />
and a second spatial distributed measurement would be<br />
required. This is a direct conclusion from the Bayesian<br />
analysis - an optimization based solution is not able to<br />
provide such inside information.<br />
VI. CONCLUSION<br />
In this work a general approach for accelerating<br />
Bayesian inference for indirect measurement problems<br />
is presented. The approach features the use <strong>of</strong> simple<br />
approximations by incorporating error knowledge and<br />
- 408 - 15th IGTE Symposium 2012<br />
k 1 (Ωm −2 )<br />
4.9<br />
4.8<br />
4.7<br />
4.6<br />
4.5<br />
4.4<br />
4.3<br />
4.2<br />
4.1<br />
x 10−3<br />
5<br />
4<br />
9.5 10 10.5 11 11.5 12 12.5 13<br />
α (Wm −2 K −1 )<br />
Fig. 3. The scatter plot <strong>of</strong> α and k1 indicates indicates a strong<br />
correlation between the variables. This indicates, that more spatial<br />
distributed measurements are required.<br />
can even be used to update approximation models during<br />
the runtime. The presented framework can be easily<br />
applied to different problem types, e.g. electrical capacitance<br />
tomography, to perform Bayesian inference for the<br />
solution <strong>of</strong> indirect measurement problems.<br />
REFERENCES<br />
[1] J. Kaipio and E. Somersalo, Statistical and Computational Inverse<br />
Problems, ser. Applied Mathematical Sciences. Springer, 2005,<br />
vol. 160.<br />
[2] J.M.BernardoandA.F.M.Smith,Bayesian Theory. New York:<br />
John Wiley & Sons, 1994 (ISBN: 0-471-92416-4).<br />
[3] A. Forrester, A. Sobester, and A. Keane, Engineering Design Via<br />
Surrogate Modelling: A Practical Guide. Wiley, 2008.<br />
[4] L. Tierney, “Markov chains for exploring posterior distributions,”<br />
Annals <strong>of</strong> Statistics, vol. 22, pp. 1701–1762, 1994.<br />
[5] W. Hastings, “Monte Carlo sampling using Markov chains and<br />
their applications,” Biometrica, vol. 57, no. 1, pp. pp. 97–109,<br />
1970.<br />
[6] H.R.B.Orlande,M.J.Colaço, and G. S. Dulikravich, “Approximation<br />
<strong>of</strong> the likelihood function in the bayesian technique for<br />
the solution <strong>of</strong> inverse problems,” Inverse Problems in Science &<br />
Engineering, vol. 16, pp. 677–692, 2008.<br />
[7] J. A. Christen and C. Fox, “Markov chain Monte Carlo Using<br />
an Approximation,” Journal <strong>of</strong> Computational and Graphical<br />
Statistics, vol. 14, no. 4, pp. 795–810, 2005. [Online]. Available:<br />
http://pubs.amstat.org/doi/abs/10.1198/106186005X76983<br />
[8] T. Cui, “Bayesian Calibration <strong>of</strong> Geothermal Reservoir Models<br />
via Markov Chain Monte Carlo,” Ph.D. dissertation, <strong>University</strong><br />
<strong>of</strong> Auckland, 2010.<br />
[9] S. Haykin, Adaptive Filter Theory (4th Edition). Prentice Hall,<br />
Sep.<br />
[10] U. Wolff, “Monte Carlo errors with less errors,” Computer Physics<br />
Communications, vol. 156, no. 2, pp. 143 – 153, 2004. [Online].<br />
Available: http://www.sciencedirect.com/science/article/B6TJ5-<br />
4B3NPMC-3/2/94bd1b60aba9b7a9ea69ac39d7372fc5
A<br />
Aleksić, Slavoljub, 73, 300<br />
Alotto, Piergiorgio, 267, 374<br />
Anastasiadis, Ioannis, 271<br />
Andjelic, Zoran, 167<br />
Arkkio, Antero, 214<br />
B<br />
Balabozov, Iosko, 59<br />
Bardi, Istvan, 1<br />
Bauernfeind, Thomas, 327, 337<br />
Bavastro, Davide, 101<br />
Belahcen, Anouar, 214<br />
Bellwald, Lukas, 271<br />
Benabou, Abdelkader, 95<br />
Besser, Bruno, 7<br />
Bielby, Steven, 248<br />
Bilicz, Sandor, 346<br />
Bíró, Oszkár, 31, 41, 144, 190, 232, 327,<br />
337<br />
Brandstätter, Bernhard, 67, 403<br />
Brochet, Pascal, 78<br />
Buchau, André, 89, 386<br />
Buchinger, Andreas, 271<br />
Burgard, Stefan, 13<br />
C<br />
Calvano, Flavio, 208<br />
Campana, Luca Giovanni, 171<br />
Canova, Aldo, 101<br />
Cardoso Bora, Teodoro, 267<br />
Chiariello, Andrea Gaetano, 357<br />
Ciric, Ioan R., 352<br />
Clénet, Stéphane, 95<br />
Coenen, Isabel, 198, 305<br />
Colaco, Marcello J., 403<br />
Cvetkovic, Nenad, 294<br />
D<br />
Dal Mut, Giorgio, 208<br />
Dessoude, Maxime, 78<br />
Di Barba, Paolo, 171<br />
Diwoky, Franz, 232<br />
dos Santos Coelho, Leandro, 267<br />
Duca, Anton, 262<br />
Düzgün, Bilal, 154<br />
Dughiero, Fabrizio, 171<br />
Dulikravich, George S., 403<br />
Dyczij-Edlinger, Romanus, 13, 19<br />
E<br />
Ebrahimi, Bashir Mahdi, 125, 131, 315<br />
Eidenberger, Norbert, 186<br />
Elistratova, Vera, 78<br />
Ellermann, Katrin, 144<br />
- 409 - 15th IGTE Symposium 2012<br />
Author Index<br />
Ertl, Michael, 181, 226<br />
F<br />
Faiz, Jawad, 125, 131, 220, 315<br />
Farle, Ortwin, 13, 19<br />
Farnleitner, Ernst, 31, 41<br />
Ferraioli, Fabrizio, 208<br />
Figueiredo, William, 175<br />
Fonteyn, Katarzyna, 214<br />
Formisano, Alessandro, 108, 208, 357<br />
Fornieles, Jesús, 7<br />
Fujita, Yoshihisa, 53<br />
Fujiwara, Koji, 113<br />
Fulmek, Paul, 331<br />
G<br />
Gavrila, Horia, 352<br />
Gergely, Koczka, 337<br />
Ghorbanian, Vahid, 125<br />
Giaccone, Luca, 101<br />
Gigov, Georgi, 63<br />
Gjonaj, Erion, 204<br />
Glotic, Adnan, 398<br />
Glotic, Arnel, 238, 398<br />
Göhner, Peter, 89<br />
Guarnieri, Massimo, 374<br />
Gueorgiev, Vultchan, 59<br />
Guimaraes, Frederico, 160<br />
Gyimóthy, Szabolcs, 242, 346<br />
H<br />
Hameyer, Kay, 198, 305<br />
Handgruber, Paul, 190<br />
Hantila, Florea I., 352<br />
Hauck, Andreas, 226<br />
Hecquet, Michel, 78<br />
Herold, Thomas, 198<br />
Hinov, Krastio, 59<br />
I<br />
Iatcheva, Ilona, 63<br />
Igarashi, Hajime, 276, 340<br />
Ikuno, Soichiro, 47, 53<br />
Ilić, Saša, 73, 300<br />
Iovine, Renato, 25<br />
Itoh, Taku, 47, 53<br />
J<br />
Janousek, Ladislav, 262<br />
Jorks, Hai Van, 204<br />
Jüttner, Matthias, 89, 386<br />
K<br />
Kaimori, Hiroyuki, 84<br />
Kaltenbacher, Manfred, 181, 226<br />
Kamitani, Atsushi, 47, 53
Karastoyanov, Dimitar, 59<br />
Kastner, Gebhard, 31, 41<br />
Katsumi, Ryuichi, 242<br />
Keränen, Janne, 392<br />
Kettunen, Lauri, 392<br />
Kiss, Péter, 242<br />
Kitak, Peter, 238, 398<br />
Klomberg, Stephan, 41<br />
Koczka, Gergely, 327<br />
Kömürgöz, Güven, 154<br />
Kotlan, Vaclav, 321<br />
Kouhia, Reijo, 214<br />
Kraiger, Markus, 310<br />
Krstic, Dejan, 294<br />
Kunov, Georgi, 63<br />
L<br />
La Spada, Luigi, 25<br />
Lambert, Nancy, 1<br />
Lehti, Leena, 392<br />
Li, Min, 160<br />
Lichtenegger, Herbert I. M., 7<br />
Lowther, David, 160, 175, 248, 254<br />
M<br />
Magele, Christian, 37<br />
Mair, Mathias, 144<br />
Manca, Michele, 101<br />
Maricaru, Mihai, 352<br />
Marignetti, Fabrizio, 208<br />
Martone, Raffaele, 108, 208, 357<br />
Metzker, Isabela, 175<br />
Miyagi, Daisuke, 84<br />
Moghnieh, Hussein, 254<br />
Mohr, Martin, 232<br />
Moro, Federico, 374<br />
N<br />
Nagano, Takumi, 288<br />
Nakata, Susumu, 53<br />
Nandi, Subhasis, 315<br />
Neumayer, Markus, 67, 403<br />
O<br />
Offermann, Peter, 305<br />
Ofner, Georg, 190<br />
Ojaghi, Mansour, 220<br />
Okamoto, Yoshifumi, 113, 282, 288<br />
Orlande, Helcio R.B., 403<br />
P<br />
Pávó, József, 242, 346<br />
Perić, Mirjana, 73<br />
Petersson, Rickard, 1<br />
Piantsop Mboo, Christelle, 198<br />
Portí, Jorge, 7<br />
Preda, Gabriel, 262<br />
Preis, Kurt, 271, 327, 337<br />
- 410 - 15th IGTE Symposium 2012<br />
R<br />
Raicevic, Nebojsa, 73<br />
Rainer, Siegfried, 144<br />
Ramarotafika, Rindra, 95<br />
Ramirez, Jaime, 160, 175<br />
Rasilo, Paavo, 214<br />
Rauscher, Michael, 89<br />
Rebican, Mihai, 262<br />
Recheis, Manes, 331<br />
Renhart, Werner, 37<br />
Rossi, Carlo Riccardo, 171<br />
Rubesa, Jelena, 380<br />
Rubinacci, Guglielmo, 208<br />
Rucker, Wolfgang M., 89, 386<br />
Ruela, Andre, 160<br />
S<br />
Sabouri, Mahdi, 220<br />
Sadovic, Salih, 167<br />
Salinas, Alfonso, 7<br />
Santos, Rafael, 175<br />
Sato, Shuji, 113, 282<br />
Sato, Yuki, 340<br />
Scharrer, Matthias, 368<br />
Schnizer, Bernhard, 310<br />
Schöberl, Joachim, 226<br />
Schrittwieser, Maximilian, 31<br />
Schweigh<strong>of</strong>er, Bernhard, 331<br />
Shimoyama, Kouske, 84<br />
Sieni, Elisabetta, 171<br />
Silva, Elizabeth, 175<br />
Silvestro, John, 1<br />
Simioli, Marco, 101<br />
Smetana, Milan, 262<br />
Sommer, Alexander, 19<br />
Sonmez, Oluş, 154<br />
Stancheva, Rumena, 63<br />
Steiner, Gerald, 67, 403<br />
Stella, Andrea, 374<br />
Stermecki, Andrej, 190, 232<br />
Štih, Željko, 137<br />
Stojanovic, Miodrag, 294<br />
Strapacova, Tatiana, 262<br />
Suhr, Bettina, 368, 380<br />
Suuriniemi, Saku, 392<br />
Szabo, Zsolt, 119<br />
T<br />
Takahashi, Norio, 84<br />
Takbash, Amir Masoud, 131, 315<br />
Tamburrino, Antonello, 208<br />
Tarhasaari, Timo, 392<br />
Ticar, Igor, 238, 398<br />
Toledo-Redondo, Sergio, 7<br />
Toratani, Tomoaki, 242<br />
Trkulja, Bojan, 137<br />
Tsuburaya, Tomonori, 113<br />
Tuerk, Christian, 37
U<br />
Ulrych, Bohus, 321<br />
V<br />
Vale, Joao Francisco, 175<br />
Varga, Gábor, 242<br />
Vasilescu, George-Marian, 352<br />
Vegni, Lucio, 25<br />
Ventre, Salvatore, 208<br />
Vizireanu, Darius, 78<br />
Volk, Adrian, 181<br />
Volkwein, Stefan, 362<br />
Voracek, Lukas, 321<br />
Vuckovic, Ana, 300<br />
Vuckovic, Dragan, 294<br />
W<br />
Wakao, Shinji, 288<br />
Watanabe, Yuta, 276<br />
Watzenig, Daniel, 67, 368, 403<br />
Wegleiter, Hannes, 331<br />
Weiland, Thomas, 204<br />
Weilharter, Bernhard, 144<br />
Werth, Tobias, 271<br />
Wesche, Andrea, 362<br />
Y<br />
Yasukawa, Shogo, 288<br />
Yatchev, Ivan, 59<br />
Z<br />
Zagar, Bernhard G., 186<br />
Zhao, Kezhong, 1<br />
Župan, Tomislav, 137<br />
- 411 - 15th IGTE Symposium 2012